[LAPACK][CUSOLVER] Inefficient conversion from 32-bit ints to 64-bit ints. #230

Open
AidanBeltonS opened this issue Sep 23, 2022 · 0 comments
Labels
help wanted Tasks, issues or features that could be implemented and contributed to the project

Comments

@AidanBeltonS
Contributor

Summary

The legacy cuSOLVER API accepts only 32-bit integers, which is a limitation of the library itself. oneMKL uses 64-bit integers, so for routines with ipiv vectors we use SYCL kernels to convert the 32-bit values to 64-bit. This is inefficient: it dispatches extra work to the GPU and requires an additional memory allocation. It could be improved by expanding the values in place, reusing the memory that is already allocated for the 64-bit output.
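To make the in-place idea concrete, here is a minimal host-side sketch in plain C++. It is not the oneMKL implementation, and `expand_int32_to_int64_inplace` is a hypothetical helper name. It assumes the 32-bit results are packed at the start of a buffer already sized for the same number of 64-bit values. It walks from the last element to the first so that no 32-bit value is overwritten before it has been read.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical helper, not part of oneMKL or cuSOLVER.
// Assumption: `buf` has room for n 64-bit integers, and its first
// n * sizeof(int32_t) bytes currently hold n packed 32-bit integers.
inline void expand_int32_to_int64_inplace(std::int64_t* buf, std::size_t n) {
    assert(buf != nullptr || n == 0);
    unsigned char* bytes = reinterpret_cast<unsigned char*>(buf);

    // Iterate backwards: the 64-bit slot i covers the 32-bit slots 2i and
    // 2i + 1, which for i >= 1 have already been read and widened.
    for (std::size_t i = n; i-- > 0;) {
        std::int32_t narrow;
        // memcpy avoids aliasing issues when reinterpreting the storage.
        std::memcpy(&narrow, bytes + i * sizeof(std::int32_t), sizeof(narrow));
        const std::int64_t wide = narrow;  // sign-extending conversion
        std::memcpy(bytes + i * sizeof(std::int64_t), &wide, sizeof(wide));
    }
}
```

This serial loop only illustrates the indexing order. A device-side version would have to preserve the same read-before-overwrite guarantee across work-items, which a naive one-element-per-work-item kernel does not provide.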

The problem was discussed on #209. See #209 (comment)

This issue is to track the problem and its solution.

@mkrainiuk mkrainiuk added the help wanted Tasks, issues or features that could be implemented and contributed to the project label Sep 4, 2024

2 participants