Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The training phase converges quickly (acc>0.95), but the validate result is very bad (acc<0.3) #15

Open
UESTC-Liuxin opened this issue Mar 12, 2021 · 4 comments

Comments

@UESTC-Liuxin
Copy link

您好,我用我自己的数据集(汉英,真实场景160k数据量)进行实验,发现训练很快收敛,但是验证结果很差,您出现过这种情况吗?我想的话,这是不是因为这种结构和输入方式,相当于设置了teaching_forcing = 1,很容易就导致过拟合了。

@charlesmindee
Copy link

Hi, I have the same issue

@jiangxiluning
Copy link
Owner

@UESTC-Liuxin 你好我没有出现过具体问题,会不会跟你文字长度有关吶

@charlesmindee
Copy link

charlesmindee commented Jul 1, 2021

Actually I had made a mistake in the loss, I fixed it shifting the ground-truth sequences to the right! (I changed a bit the loss function in my implementation, now the model is working well when predicting)

@jiangxiluning
Copy link
Owner

@charlesmindee great!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants