Combine decomposability check and combining class lookup #41
Comments
If needed, in the reorder buffer we store the char + combining class in the higher order bits as a
Sounds like a good idea. We can store "is decomposable" as combining class 255, so that it all boils down to a single long byte array, ~128 KB. Still fits in the CPU cache, I believe.
Do you want to try this out? It will be exciting to see where we can go with this.
I can probably migrate
I can try that. You can push your changes to a branch in this repo, and we can collaborate on that.
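To make the "combining class 255" suggestion concrete, here is a minimal sketch in Python rather than the project's own code. It assumes that 255 is never assigned as a real canonical combining class, that a 128 KiB table covers U+0000..U+1FFFF, and that Hangul syllables count as decomposable. The names `build_table`, `classify`, and `slow_classify` are hypothetical.

```python
import unicodedata

DECOMPOSABLE = 255      # sentinel; assumed unused as a real combining class
TABLE_SIZE = 0x20000    # 128 KiB, one byte per code point (assumption)


def slow_classify(cp):
    """Compute the table entry for one code point from the Unicode database."""
    ch = chr(cp)
    decomp = unicodedata.decomposition(ch)
    # Canonical decompositions have no "<tag>" prefix; Hangul syllables
    # decompose algorithmically, so they are flagged by range here.
    is_hangul = 0xAC00 <= cp <= 0xD7A3
    if is_hangul or (decomp and not decomp.startswith("<")):
        return DECOMPOSABLE
    ccc = unicodedata.combining(ch)
    assert ccc != DECOMPOSABLE, "sentinel collides with a real combining class"
    return ccc


def build_table(size=TABLE_SIZE):
    """Build a flat byte array: 255 = decomposable, otherwise combining class."""
    return bytearray(slow_classify(cp) for cp in range(size))


TABLE = build_table()


def classify(cp):
    """One array access for code points inside the table, slow path otherwise."""
    if cp < len(TABLE):
        return TABLE[cp]
    return slow_classify(cp)
```

With this layout, `classify(0x00E9)` (é) yields the sentinel and `classify(0x0301)` yields the combining class 230. A decomposable character loses its own combining class in the table, which is acceptable only if the classes are looked up again for the decomposed components.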
Currently we need to do three lookups:
We can have a single lookup table storing decomposability and combining class. This will get us all the information in one memory access. We may have to store the combining class in the buffer along with the char for later use when reordering is actually done.
It can potentially speed up both NFD and NFC normalizations. Whether it actually will, and by how much, has to be determined by experimenting.
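Storing the combining class in the buffer along with the char could look roughly like the sketch below. It is written in Python for illustration and is not the project's implementation. It assumes the code point occupies the low 21 bits and the combining class sits in the bits above them. The names `pack`, `unpack`, and `reorder` are hypothetical.

```python
CP_BITS = 21                    # code points go up to U+10FFFF, which fits in 21 bits
CP_MASK = (1 << CP_BITS) - 1


def pack(cp, ccc):
    """Pack a code point and its combining class into one integer."""
    return (ccc << CP_BITS) | cp


def unpack(packed):
    """Return (code point, combining class) from a packed buffer entry."""
    return packed & CP_MASK, packed >> CP_BITS


def reorder(buffer):
    """Stable sort of packed combining marks by combining class.

    The class is read from the high bits, so no second table lookup is
    needed when the reordering is actually done.
    """
    return sorted(buffer, key=lambda packed: packed >> CP_BITS)
```

For example, a buffer holding U+0301 (class 230) followed by U+0323 (class 220) comes back with U+0323 first, while marks of equal class keep their original order because the sort is stable.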