Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请问是否支持异构设备共享 #514

Open
wJunjie-1995 opened this issue Sep 25, 2024 · 1 comment
Open

请问是否支持异构设备共享 #514

wJunjie-1995 opened this issue Sep 25, 2024 · 1 comment

Comments

@wJunjie-1995
Copy link

是否支持异构设备共享,即

  1. 单个任务使用相同厂商但不同芯片的显卡进行训练/推理(如同时使用1张V100和1张H100进行训练)?
  2. 或者单个任务使用不同厂商的卡进行训练/推理(如使用1张V100+1张910B进行训练)?
@Nimbus318
Copy link
Contributor

单个任务使用相同厂商但不同芯片的显卡进行训练/推理(如同时使用1张V100和1张H100进行训练)?

If both NVIDIA GPUs are on the same node, then it's supported.

单个任务使用不同厂商的卡进行训练/推理(如使用1张V100+1张910B进行训练)?

Not supported.

Usually, frameworks like TensorFlow and PyTorch, along with their related Python libraries, are designed to work in homogeneous environments where at least the hardware vendors are the same to function properly. I'm a bit confused about the second scenario.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants