Improved Memory Operations #174
Commits on Sep 27, 2024
- d04993d
- 172bf93
- 432fb3d Fix: Missing, but documented `partition(':')`
  Closes #172
  Co-authored-by: Takuya Hashimoto <[email protected]>
- a0e9be7
- b5fcc62
- ee6f754
- 97cf753
- 224a3a0
- 97535bc
- e4e138c
Commits on Sep 28, 2024
- affebc0 Improve: Faster `memcpy` in AVX-512
  On the Leipzig1M dataset, LibC vs SZ:
  - ~128b lines, aligned: 2.3 vs 2.6 GB/s
  - ~128b lines, unaligned: 2.34 vs 2.53 GB/s
  - ~5b tokens, aligned: 0.1 vs 0.1 GB/s
  - ~5b tokens, unaligned: 0.1 vs 0.1 GB/s
  - ~124 MB, aligned: 19.6 vs 20.3 GB/s
  - ~124 MB, unaligned: 19.6 vs 20.3 GB/s
- a265d3b
Commits on Sep 30, 2024
- 36df73d
- 5d522cf Make: Lighter debugging in VS Code
  Previously SZ would build too many targets for each debugging session.
- 5388ab4
Commits on Oct 1, 2024
- cef29c9 Add: Faster memory ops in AVX2
  This commit accelerates `sz_fill_avx2` and `sz_copy_avx2` by avoiding unaligned writes. It also adds `sz_equal_avx2` to help validate large files with matching checksums faster, and a placeholder for `sz_order_avx2`, discouraging further optimizations. A C++ API with a matching argument order was added to mimic `std::memcpy`, `std::memset`, and `std::memmove`. The matching `test_memory_utilities` tests were extended.
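The "avoiding unaligned writes" idea from the "Add: Faster memory ops in AVX2" commit above can be sketched in portable C++. This is not the code from the commit: `fill_aligned_sketch` is a hypothetical name, and the 32-byte block stands in for one AVX2 register. The argument order follows `std::memset`. The sketch only shows the head/body/tail split that keeps every wide store on an aligned address.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical helper, not part of the library: fills `length` bytes at
// `target` with `value`, placing every 32-byte store on an aligned address.
inline void fill_aligned_sketch(void *target, int value, std::size_t length) {
    constexpr std::size_t block = 32; // width of one AVX2 register in bytes
    auto *out = static_cast<unsigned char *>(target);

    // Head: the 0..31 bytes before the first 32-byte boundary.
    std::size_t misalignment = reinterpret_cast<std::uintptr_t>(out) % block;
    std::size_t head = misalignment ? block - misalignment : 0;
    if (head > length) head = length;
    std::memset(out, value, head);
    out += head;
    length -= head;

    // Body: each iteration starts on a 32-byte boundary. An AVX2 build would
    // issue one aligned 256-bit store here (for example `_mm256_store_si256`).
    for (; length >= block; out += block, length -= block) std::memset(out, value, block);

    // Tail: the 0..31 bytes left after the last full block.
    std::memset(out, value, length);
}
```

Calling `fill_aligned_sketch(buffer + 3, 0xAB, 150)` writes the same bytes as `std::memset(buffer + 3, 0xAB, 150)`. Only the order and alignment of the stores differ.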
Commits on Oct 3, 2024
- 6d326d9
- a383e9e
- 69060ac
- 696797d Improve: Using more registers for small moves
  In AVX-512, similar to glibc, we should use the register space to load more data at once, avoiding loops and data dependencies between iterations.
- e2f8cc7
- 02b9d68
- bba72a6
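The "Using more registers for small moves" commit above describes loading everything up front instead of looping. A minimal portable sketch of that pattern follows, under stated assumptions: `small_move_sketch` is a hypothetical name, 16-byte chunks stand in for vector registers, and only lengths from 16 to 64 bytes are handled. Because all loads finish before the first store, overlapping buffers are safe, as with `std::memmove`.

```cpp
#include <cstddef>
#include <cstring>

// Hypothetical helper, not the library's code: moves `length` bytes, where
// 16 <= length <= 64, between buffers that may overlap. No loop is needed:
// head and tail chunks overlap in the middle to cover odd lengths.
inline void small_move_sketch(void *target, void const *source, std::size_t length) {
    struct chunk { unsigned char bytes[16]; }; // stand-in for one register
    auto *out = static_cast<unsigned char *>(target);
    auto const *in = static_cast<unsigned char const *>(source);

    if (length <= 32) {
        // Two "registers": the first 16 and the last 16 bytes.
        chunk first, last;
        std::memcpy(&first, in, 16);
        std::memcpy(&last, in + length - 16, 16);
        std::memcpy(out, &first, 16);
        std::memcpy(out + length - 16, &last, 16);
    } else {
        // Four "registers": the first 32 and the last 32 bytes.
        chunk a, b, c, d;
        std::memcpy(&a, in, 16);
        std::memcpy(&b, in + 16, 16);
        std::memcpy(&c, in + length - 32, 16);
        std::memcpy(&d, in + length - 16, 16);
        std::memcpy(out, &a, 16);
        std::memcpy(out + 16, &b, 16);
        std::memcpy(out + length - 32, &c, 16);
        std::memcpy(out + length - 16, &d, 16);
    }
}
```

The loads are independent of one another, so the CPU can issue them back to back. A byte-by-byte or chunk-by-chunk loop would instead need a branch per iteration.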
Commits on Oct 11, 2024
- 850e4e8
  The new `sz_look_up_transform` API applies a 256-byte lookup table, implemented in both serial code and AVX-512, which can significantly accelerate text and image processing. The AVX-512 implementation reaches 18 GB/s on an Intel Sapphire Rapids CPU, while the serial code stays around 3 GB/s for large files.
Commits on Oct 12, 2024
- 014bcf2
- 26a0fea
- 423ad99
- 11272e5
- 4d8ac78
- 82146b0
- 165986f
- be6c93b
- 3898481
- 1baa3a9
- 1db702a
- 5c1426f
- 78937f9
- c0c1dcb