We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Related Issue
training with --gpus 0, --gpus 0,1 hangs at initializing ddp ...
--gpus 0,
--gpus 0,1
I've tried, change ddp to dp, change nccl to gloo, upgrade lightning or torch version... still not work.
ddp
dp
nccl
gloo
(only cpu works)
The text was updated successfully, but these errors were encountered:
@shyakocat I encountered the same issue, have you resolved it?
Sorry, something went wrong.
try running export NCCL_P2P_DISABLE=1 before running the script it worked for me
export NCCL_P2P_DISABLE=1
No branches or pull requests
Related Issue
training with
--gpus 0,
--gpus 0,1
hangs at initializing ddp ...I've tried, change
ddp
todp
, changenccl
togloo
, upgrade lightning or torch version... still not work.(only cpu works)
The text was updated successfully, but these errors were encountered: