-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pattern Search on Sequence Data #167
base: main
Are you sure you want to change the base?
Conversation
0ac902a
to
4a43c9e
Compare
4a43c9e
to
4ec6464
Compare
src/silo/storage/sequence_store.cpp
Outdated
const auto* negated_reference_bitmap = | ||
this->getBitmap(genome_pos, this->reference_genome[genome_pos]); | ||
const auto* n_symbol_bitmap = this->getBitmap(genome_pos, NUCLEOTIDE_SYMBOL::N); | ||
auto pure_mutation = std::make_unique<roaring::Roaring>(roaring_bitmap_andnot( | ||
&negated_reference_bitmap->roaring, &n_symbol_bitmap->roaring | ||
)); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@Taepper I did not find where the reference genome bitmap gets flipped. Does SILO still do it? Anyways, this part is not yet correct. Should we do a union over all bitmaps except the reference bitmap to get the ones that are actually mutated to a symbol other than N
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh, it should definitely still happen! For now you can do that, but the bitmap flipping must have gotten lost in refactoring somewhere .. it will be reenabled. But yes, you can Save those bitmaps. I guess "mutated to other than - or N" in order to not need a workaround for that anymore!
38cd759
to
568909f
Compare
4ec6464
to
9bd0288
Compare
9bd0288
to
5ab6a05
Compare
970a73b
to
4d783ad
Compare
4d783ad
to
97c5732
Compare
Preparation of pattern search #165!
Most relevant parts of the edits should happen in the TODO marked areas. Of course, the additional data structures that need to be added in the sequence store which are not yet added, will make up a large part of this PR