Multi-GPU segmentation fault (gslib) #578
Unanswered
abettini99 asked this question in Q&A
-
Did you try to run without GPU MPI support (set env-var |
-
Hi,
I am attempting to run a small test case on two GPUs on a single node with nekRS 23.0. I am facing an issue similar to #568, and I am unsure whether the FAQ addresses it. The test case segfaults right after meshParallelGatherScatterSetup, but runs completely fine on a single GPU on a single node. It executes up to:
Which then directly segfaults:
Based on the backtrace and #568, I presume that this is also related to gslib testing GPU-aware MPI. I would appreciate some insight or confirmation. If it is about GPU-aware MPI, then I can speed up the hotfix with the HPC peeps :)
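For reference, one quick way to check whether an MPI build advertises GPU awareness is to inspect what the MPI installation reports about itself. The sketch below is only an illustration and rests on assumptions not stated in this thread: that the cluster uses Open MPI and NVIDIA GPUs, in which case `ompi_info --parsable --all` prints a `mpi_built_with_cuda_support` line. The helper names `built_with_cuda_support` and `query_ompi_info` are hypothetical. The flag only reflects build-time support, so a `true` here does not rule out a runtime or transport-layer problem.

```python
import shutil
import subprocess

# Assumption: Open MPI. Its parsable info output contains a line such as
#   mca:mpi:base:param:mpi_built_with_cuda_support:value:true
KEY = "mpi_built_with_cuda_support:value:"


def built_with_cuda_support(ompi_info_output):
    """Return True/False if the flag is present in the text, None if absent."""
    for line in ompi_info_output.splitlines():
        if KEY in line:
            # The value is the last colon-separated field on the line.
            return line.strip().rsplit(":", 1)[-1].lower() == "true"
    return None


def query_ompi_info():
    """Run ompi_info if it is on PATH; return None when it is not installed."""
    exe = shutil.which("ompi_info")
    if exe is None:
        return None
    result = subprocess.run(
        [exe, "--parsable", "--all"],
        capture_output=True,
        text=True,
        check=False,
    )
    return built_with_cuda_support(result.stdout)
```

Calling `query_ompi_info()` on a compute node (with the same modules loaded as for the nekRS run) gives `True`, `False`, or `None` when the flag or the tool is not found; other MPI implementations need a different check.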
Thanks a lot for the support!