We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
师兄,请教个问题。 采用和Qwen2一样的模型架构,调了一下参数,模型规模在1.1B左右 。8卡训练了10天,训练5000万行数据了,但现在模型训练的Loss一直在2.8左右徘徊,根据您之前训练的经验,有什么解决方案吗?
The text was updated successfully, but these errors were encountered:
精简到1000条数据,训练10个epochs,看loss变化和模型效果。
Sorry, something went wrong.
No branches or pull requests
师兄,请教个问题。
采用和Qwen2一样的模型架构,调了一下参数,模型规模在1.1B左右 。8卡训练了10天,训练5000万行数据了,但现在模型训练的Loss一直在2.8左右徘徊,根据您之前训练的经验,有什么解决方案吗?
The text was updated successfully, but these errors were encountered: