Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

question on split pairs #43

Open
caonetto opened this issue Apr 17, 2023 · 3 comments
Open

question on split pairs #43

caonetto opened this issue Apr 17, 2023 · 3 comments

Comments

@caonetto
Copy link

When running split pairs on pod5s and the basecalled sam, duplex tools generates a new folder with split pod5 files and associated read ids. Does this folder contain all the reads that where split, including non-duplex reads, or just the ones that where identified as duplex?

@ollenordesjo
Copy link
Contributor

Hi @caonetto, It will only contain the reads which were identified as being duplex. The split point has to be somewhere in the middle (~45-55% into the read, counting in bases) for it to be identified as duplex (and for it to be split). You can change these thresholds:
https://github.com/nanoporetech/duplex-tools/blob/master/duplex_tools/split_pairs.py#L18

It doesn't work as a generic read splitter though, so you will not be getting non-duplex reads from it. Hope that answers the question!

@caonetto
Copy link
Author

Thank you for your quick response. Do you have a recommendation on how to split chimeric reads with midstrand adapters from dorado basecalled reads?

Thanks!

@ollenordesjo
Copy link
Contributor

Yes, if you are ok with splitting reads in base-space (having input fastq & output fastq), then this tool should work for that: https://github.com/nanoporetech/duplex-tools/blob/master/duplex_tools/split_on_adapter.py. Feel free to give it a go and let me know if that is sufficient for the use case.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants