[Feature request] Multi-node training #140

Open
yjzhong111 opened this issue Dec 18, 2023 · 2 comments

Comments

@yjzhong111

Hi,
I have two questions:

  1. Can it be used for multi-node training?
  2. When will the trainer support DeepSpeed? I noticed that DeepSpeed integration is on the to-do list. Is there a planned date or schedule for it?

Thank you!

@erogol
Member

erogol commented Dec 18, 2023

  1. It should work for multi-node training.
  2. No timeline for DeepSpeed. Why do you need DeepSpeed for training?

@yjzhong111
Author

> 1. It should work for multi-node training.
> 2. No timeline for DeepSpeed. Why do you need DeepSpeed for training?

Thanks! But how do I train on multiple nodes? Are there any instructions for it? As for DeepSpeed, I would like to use some of its optimizations to improve training performance.
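
The thread does not say how this trainer is launched across nodes, so the following is only a general sketch of how multi-node PyTorch jobs are commonly wired up, not this project's documented procedure. It assumes the job is started with `torchrun`, which sets `RANK`, `LOCAL_RANK`, `WORLD_SIZE`, `MASTER_ADDR` and `MASTER_PORT` for every worker process. The script name `train.py`, the address `10.0.0.1`, and the helper `read_dist_env` are hypothetical, and whether the trainer reads these variables is an assumption.

```python
import os
from dataclasses import dataclass

# Hypothetical launch, run once per node (node_rank 0 on the first, 1 on the second):
#   torchrun --nnodes=2 --nproc_per_node=4 --node_rank=0 \
#            --master_addr=10.0.0.1 --master_port=29500 train.py
# train.py and the address are placeholders, not taken from the thread.


@dataclass(frozen=True)
class DistEnv:
    rank: int         # global index of this process across all nodes
    local_rank: int   # index of this process on its own node (usually the GPU id)
    world_size: int   # total number of processes in the job
    master_addr: str  # host that coordinates the rendezvous
    master_port: int  # port on that host


def read_dist_env(env=None) -> DistEnv:
    """Read the variables torchrun exports; fall back to single-process defaults."""
    env = os.environ if env is None else env
    return DistEnv(
        rank=int(env.get("RANK", 0)),
        local_rank=int(env.get("LOCAL_RANK", 0)),
        world_size=int(env.get("WORLD_SIZE", 1)),
        master_addr=env.get("MASTER_ADDR", "127.0.0.1"),
        master_port=int(env.get("MASTER_PORT", 29500)),
    )


if __name__ == "__main__":
    # Printing this from each worker is a quick check that all nodes joined the job.
    print(read_dist_env())
```

With two nodes and four processes per node, each worker should report `world_size=8` and a distinct `rank` from 0 to 7; a worker that reports `world_size=1` was started outside the launcher.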
