Add documentation about --shm error #31

Closed
chillenzer opened this issue Jun 9, 2023 · 5 comments

Comments

@chillenzer
Collaborator

The resolution of my --shm error was that I was parallelising over the wrong axes: I was using --mpi=1.1.2.2 instead of --mpi=2.2.1.1. While that seems stupid in hindsight, it somehow seemed like a good idea at the time. So I feel we should guard against others running into this by, e.g., adding a note to the documentation or README, or by tracking down the error message and adding a hint that the parallelisation might be the problem.

@chillenzer
Collaborator Author

PS: The error message in question was:

 ShmBufferMalloc exceeded shared heap size -- try increasing with --shm <MB> flag
 Parameter specified in units of MB (megabytes) 
 Current alloc is 192MB
 Current bytes is 1152MB
 Current heap  is 1024MB
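
To make the numbers in this report easier to read, here is a minimal Python sketch that extracts the three figures and computes how far the request overshot the heap. The helper name `parse_shm_report` is hypothetical, and the reading of "Current bytes" as the running total including the failing allocation is an assumption, not something confirmed in this thread.

```python
import re


def parse_shm_report(text):
    """Extract the MB figures from a ShmBufferMalloc error report.

    Hypothetical helper for illustration; it only pattern-matches the
    three "Current ... is <N>MB" lines quoted in this issue.
    """
    fields = {}
    for key in ("alloc", "bytes", "heap"):
        # \s+ tolerates the double space in "Current heap  is".
        match = re.search(rf"Current {key}\s+is\s+(\d+)MB", text)
        if match:
            fields[key] = int(match.group(1))
    return fields


report = """
 ShmBufferMalloc exceeded shared heap size -- try increasing with --shm <MB> flag
 Parameter specified in units of MB (megabytes)
 Current alloc is 192MB
 Current bytes is 1152MB
 Current heap  is 1024MB
"""

sizes = parse_shm_report(report)

# Assumption: "bytes" is the running total including the failing allocation,
# so the difference to "heap" is the amount by which the heap was exceeded.
overshoot_mb = sizes["bytes"] - sizes["heap"]

print(sizes)
print(f"heap exceeded by {overshoot_mb} MB")
```

Under that reading, the shared heap was exceeded by 128 MB. In this case, though, the fix turned out to be the choice of --mpi decomposition rather than a larger --shm value.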

@LupoA
Owner

LupoA commented Jun 13, 2023

What was the global volume of the lattice (the --grid argument)?

@chillenzer
Collaborator Author

It was Benchmark_ITT; the error appeared right after this output:

Grid : Message : 21.362239 s : ==================================================================================
Grid : Message : 21.550668 s : ==================================================================================
Grid : Message : 21.550701 s : Benchmark DWF on 32^4 local volume 
Grid : Message : 21.550703 s : * Nc             : 4
Grid : Message : 21.550705 s : * Global volume  : 32 32 64 64 
Grid : Message : 21.550717 s : * Ls             : 12
Grid : Message : 21.550723 s : * ranks          : 4
Grid : Message : 21.550725 s : * nodes          : 1
Grid : Message : 21.550726 s : * ranks/node     : 4
Grid : Message : 21.550728 s : * ranks geom     : 1 1 2 2 
Grid : Message : 21.550731 s : * Using 4 threads
Grid : Message : 21.550732 s : ==================================================================================
Grid : Message : 24.469923 s : Initialised RNGs
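
As a quick consistency check on the log above, the following Python sketch divides the global volume by the rank geometry to recover the per-rank local volume. The helper names `parse_geometry` and `local_volume` are hypothetical and are not part of Grid. The sketch only does the arithmetic and does not model Grid's shared-memory allocation.

```python
from math import prod


def parse_geometry(spec):
    """Turn a dotted spec such as "1.1.2.2" into a list of ints."""
    return [int(part) for part in spec.split(".")]


def local_volume(global_volume, ranks_geom):
    """Per-rank lattice extent: global extent divided by ranks along each axis."""
    if len(global_volume) != len(ranks_geom):
        raise ValueError("global volume and rank geometry must have the same rank")
    for extent, ranks in zip(global_volume, ranks_geom):
        if extent % ranks != 0:
            raise ValueError(f"extent {extent} is not divisible by {ranks} ranks")
    return [extent // ranks for extent, ranks in zip(global_volume, ranks_geom)]


# Values taken from the log: "Global volume : 32 32 64 64", "ranks geom : 1 1 2 2".
global_volume = [32, 32, 64, 64]
used_geom = parse_geometry("1.1.2.2")
other_geom = parse_geometry("2.2.1.1")

local = local_volume(global_volume, used_geom)
print(local)  # matches the "32^4 local volume" reported in the log

# Both decompositions use the same number of ranks; they differ only in
# which axes are split.
print(prod(used_geom), prod(other_geom))
```

Both --mpi settings mentioned in this issue occupy 4 ranks, so the rank count alone does not reveal which axes are being parallelised.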

@chillenzer
Collaborator Author

Discussed with Alessandro on Friday: We should open a corresponding issue on Peter's repo.

@chillenzer closed this as not planned on Jun 18, 2023
@chillenzer
Collaborator Author

PS: paboyle#438 (comment)
