Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MT DNA freq guesstimate on each variant #5047

Open
dnil opened this issue Nov 19, 2024 · 1 comment
Open

MT DNA freq guesstimate on each variant #5047

dnil opened this issue Nov 19, 2024 · 1 comment

Comments

@dnil
Copy link
Collaborator

dnil commented Nov 19, 2024

Is your feature request related to a problem in the current program to new available techology or software? Please describe and add links/citations if appropriate.
We currently export AD for ref and alt. It would be convenient for the end users to have a frequency guesstimate based on thes.

Describe the solution you'd like
Divide AD alt with sum of AD alt and ref. Present as a percentage on the mtDNA report.

Additional context
It may be necessary to deal with multiallelics, discussion ongoing.

@dnil
Copy link
Collaborator Author

dnil commented Nov 19, 2024

In summary after a good discussion with @ramprasadn, we may indeed have multiallelics on the MT side, to be accounted for. Currently (MIP) we have a step with bcftools norm splitting, but that will produce poor AD values. We should investigate if we can prioritise AF in parsing for MTs as well as for Balsamic. However, DeepVariant does not produce a FORMAT.AF currently - it is assumed that GT is sufficient and better reflecting ground truth in an ordinary, non-mosaic germline case here, so we can't just use it for everything.

raredisease will going forward use --keep-sum for bcftools norm so the AD counts should be better in a bit.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant