-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to set the number of parallel threads? #66
Comments
This interface needs to be both improved and better documented, sorry about that. This is what it currently does: Note that it probably isn't going to use 56 threads on a problem of that size (or if it does it's not going to be scaling well). For most of the internal kernels you can observe what's happening with |
This works well, thanks! |
I'm going to leave this open until I find a better interface |
In a similar vein, do you know if multi-threading works on Apple ARM chips? Changing the number of threads with |
The benchmark section shows faster execution with more threads. However, I cannot reproduce such parallel scaling.
The benchmark script:
Run with:
Then, by changing any of the
*_NUM_THREADS
variables, the execution time always stays the same (~180ms on my 112-core machine). It is 7x faster than the built-in sparse matmul (~1.2s). However I don't really know how many threads it is using, and doesn't seem to be able to change it.The text was updated successfully, but these errors were encountered: