-
Notifications
You must be signed in to change notification settings - Fork 462
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Are there any performance bugs that prevent the usage of shuffle! in the AVX2 backend? #181
Comments
Duh... the |
Thanks for pointing this out! There's no reason other than that when I wrote the original code last November, the intrinsics were using the packed vector types (which were later moved to packed_simd), and I thought these were on track for stabilization, while the Later on, the intrinsics were changed to use the bag-of-bits types Now, there's no reason not to use the Using the |
Looking at the code, the shuffle macro just takes a constant vector of indices. I think (not sure, haven't tried) that you might be able to keep using the const AAAA: [i32; 4] = [0, 0, 0, 0];
shuffle!(vec, AAAA); |
Closing this for now just because I don't have any plans to refactor the AVX2 backend right now; it could be reopened later. |
I just read a blog post about this library and was wondering why does the AVX2 backend do not use the
shuffle!
macro instead of the many other intrinsics for re-ordering vector elements.If there are any performance issues with it, it would really help if bugs could be filled in packed_simd upstream.
The text was updated successfully, but these errors were encountered: