-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[c++] Reindexer overrides and fast COO/CSR #1728
Conversation
Codecov Report
Additional details and impacted files@@ Coverage Diff @@
## main #1728 +/- ##
==========================================
+ Coverage 75.79% 75.85% +0.06%
==========================================
Files 126 135 +9
Lines 10083 10407 +324
Branches 161 200 +39
==========================================
+ Hits 7642 7894 +252
- Misses 2366 2420 +54
- Partials 75 93 +18
Flags with carried forward coverage won't be shown. Click here to find out more.
|
439f82b
to
854c3b8
Compare
e35c92e
to
a6cd2aa
Compare
a6cd2aa
to
3474759
Compare
Reading this now, I have a good idea of how we can add an abstraction in somacore to make this change much less invasive and more straightforward. I have some more work to do here, but this is the start: single-cell-data/SOMA@main...build-index-abstraction |
06d16b6
to
5f8442f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for adding this and making the changes. I have one last minor change request, but I think it's good to go.
libtiledbsoma/test/test_indexer.cc
Outdated
* | ||
* @section DESCRIPTION | ||
* | ||
* This file manages unit tests for column buffers |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
* This file manages unit tests for column buffers | |
* This file manages unit tests for the reindexer |
27ed768
to
85f8740
Compare
-It inclcudes khash files -It also includes a unitests trying different combinations of hash and lookup keys
Merge issues
85f8740
to
c8e1b51
Compare
somacore v1.0.7 is fully compatible with v1.0.6 and does not introduce any behavior changes. It does *prepare* us for the reindexer change: #1728 Co-authored-by: John Kerl <[email protected]>
Issue and/or context:
#1611
#1614
#1590
#718
#1610
Changes:
Notes for Reviewer:
Implement a re-indexer similar to Pandas reindexer
Provide pybind11 for the re-indexer (currently only numpy and soon Arrow interface) for python
Expose AxisQuery and fast_csr to use the re-indexer to speedup COO to CSR conversion
For the time being switching between the two indexer (IntIndexer and pandas indexer) is possible using two variables (named use_int_indexer) in _query and fast_csr. As a part of this PR review, I'd like to figure out the best way to have this new feature easily enabled or disabled.