Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable prefix-grouping for one-to-one filtering #66

Merged
merged 2 commits into from
Jan 2, 2024

Conversation

bkille
Copy link
Contributor

@bkille bkille commented Dec 24, 2023

Previously, the -f one-to-one filter was applied to all mappings at the same time. In cases where users are mapping multiple query genomes to one or more target sequences with the --skipPrefix # flag, the one-to-one filter would treat all query sequences as part of the same genome, even if they had unique prefixes.

This patch makes it so that the one-to-one plane-sweep filter is applied to each pair of query and reference groups independently, ensuring that -n mappings are retained for each pair. A "group" of sequences is the set of sequences which contain the same prefix up until the last occurrence of the character c, where --skipPrefix c is specified.

@bkille bkille merged commit 1a07d0e into master Jan 2, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant