support infinite loop over alpaca dataset #92

tianyu-l · 2024-02-26T23:57:04Z

Stack from ghstack (oldest at bottom):

-> support infinite loop over alpaca dataset #92

Previously, alpaca dataset is consumed up after only ~50 iterations with 8 data parallel ranks and 8 batch size. This PR adds the (default) option to loop infinitely on the dataset, so that we can unblock integrating other functionalities. Note that loss-related metrics should be read with caution as this will cause overfit.

(This is a replicated PR from #66 as the migration to pytorch/ confused ghstack.)

[ghstack-poisoned]

ghstack-source-id: 38cbc277e2a177bc0baf35450a661835b97a7f22 Pull Request resolved: #92

lessw2020

looks good!
Thanks for adding this, I hit this out of data issue quite a bit so will be great to have a resolution when we don't care about actual training loss but need to check perf/scale.

ghstack-source-id: 38cbc277e2a177bc0baf35450a661835b97a7f22 Pull Request resolved: #92

ghstack-source-id: 38cbc277e2a177bc0baf35450a661835b97a7f22 Pull Request resolved: pytorch#92

support infinite loop over alpaca dataset

db4cf53

[ghstack-poisoned]

tianyu-l added a commit that referenced this pull request Feb 26, 2024

support infinite loop over alpaca dataset

bd6fe55

ghstack-source-id: 38cbc277e2a177bc0baf35450a661835b97a7f22 Pull Request resolved: #92

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Feb 26, 2024

tianyu-l mentioned this pull request Feb 27, 2024

support infinite loop over alpaca dataset #66

Closed

tianyu-l requested review from wanchaol, XilunWu and lessw2020 February 27, 2024 00:03

lessw2020 approved these changes Feb 27, 2024

View reviewed changes

tianyu-l merged commit db4cf53 into gh/tianyu-l/2/base Feb 27, 2024
4 checks passed

tianyu-l added a commit that referenced this pull request Feb 27, 2024

support infinite loop over alpaca dataset

5dec536

ghstack-source-id: 38cbc277e2a177bc0baf35450a661835b97a7f22 Pull Request resolved: #92

tianyu-l deleted the gh/tianyu-l/2/head branch February 27, 2024 00:25

lessw2020 pushed a commit that referenced this pull request Apr 18, 2024

support infinite loop over alpaca dataset

325951f

ghstack-source-id: 38cbc277e2a177bc0baf35450a661835b97a7f22 Pull Request resolved: #92

philippguevorguian pushed a commit to YerevaNN/YNNtitan that referenced this pull request Aug 17, 2024

support infinite loop over alpaca dataset

78a1643

ghstack-source-id: 38cbc277e2a177bc0baf35450a661835b97a7f22 Pull Request resolved: pytorch#92

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support infinite loop over alpaca dataset #92

support infinite loop over alpaca dataset #92

tianyu-l commented Feb 26, 2024 •

edited

Loading

lessw2020 left a comment

support infinite loop over alpaca dataset #92

support infinite loop over alpaca dataset #92

Conversation

tianyu-l commented Feb 26, 2024 • edited Loading

lessw2020 left a comment

Choose a reason for hiding this comment

tianyu-l commented Feb 26, 2024 •

edited

Loading