Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

support infinite loop over alpaca dataset #92

Merged
merged 1 commit into from
Feb 27, 2024

Conversation

tianyu-l
Copy link
Contributor

@tianyu-l tianyu-l commented Feb 26, 2024

Stack from ghstack (oldest at bottom):

Previously, alpaca dataset is consumed up after only ~50 iterations with 8 data parallel ranks and 8 batch size. This PR adds the (default) option to loop infinitely on the dataset, so that we can unblock integrating other functionalities. Note that loss-related metrics should be read with caution as this will cause overfit.

(This is a replicated PR from #66 as the migration to pytorch/ confused ghstack.)

tianyu-l added a commit that referenced this pull request Feb 26, 2024
ghstack-source-id: 38cbc277e2a177bc0baf35450a661835b97a7f22
Pull Request resolved: #92
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Feb 26, 2024
Copy link
Contributor

@lessw2020 lessw2020 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good!
Thanks for adding this, I hit this out of data issue quite a bit so will be great to have a resolution when we don't care about actual training loss but need to check perf/scale.

@tianyu-l tianyu-l merged commit db4cf53 into gh/tianyu-l/2/base Feb 27, 2024
4 checks passed
tianyu-l added a commit that referenced this pull request Feb 27, 2024
ghstack-source-id: 38cbc277e2a177bc0baf35450a661835b97a7f22
Pull Request resolved: #92
@tianyu-l tianyu-l deleted the gh/tianyu-l/2/head branch February 27, 2024 00:25
lessw2020 pushed a commit that referenced this pull request Apr 18, 2024
ghstack-source-id: 38cbc277e2a177bc0baf35450a661835b97a7f22
Pull Request resolved: #92
philippguevorguian pushed a commit to YerevaNN/YNNtitan that referenced this pull request Aug 17, 2024
ghstack-source-id: 38cbc277e2a177bc0baf35450a661835b97a7f22
Pull Request resolved: pytorch#92
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Meta Open Source bot.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants