Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request to Query Alleles #13

Open
katiecardone26 opened this issue Oct 23, 2024 · 4 comments
Open

Request to Query Alleles #13

katiecardone26 opened this issue Oct 23, 2024 · 4 comments
Labels
enhancement New feature or request

Comments

@katiecardone26
Copy link

I think that biofilter should return reference and alternate alleles in addition to the chromosome, position, RSID, etc. This would be really useful. For example, when annotating GWAS results, I sometimes get multiple RSIDs for a particular chromosome and position. This is because there can be different types of variations at a particular position (SNP, insertion, deletion, multi-nucleotide variant). Having an allele will help to parse through these duplicates and identify what RSID matches my variant of interest. I believe these are available in the databases that LOKI pulls from.

@AndreRico
Copy link
Collaborator

Hi @katiecardone26. I understand that this would be an improvement. What do you think—should we implement it in the current version 2.4.3, or add it as a new feature for version 3?

@katiecardone26
Copy link
Author

Hi @AndreRico, I think this should be implemented in the current version (2.4.3). Thanks!

@AndreRico
Copy link
Collaborator

Hi @katiecardone26, could you provide me the command for Biofilter and any example input data you are using, please?

@AndreRico AndreRico added the enhancement New feature or request label Oct 25, 2024
@katiecardone26
Copy link
Author

example command:

module load biofilter/2.4.3
biofilter.py \
--verbose \
--knowledge /project/ritchie00/datasets/loki/loki.db \
--position-file input_filename \
 --annotate position_label snp position gene upstream downstream \
--report-invalid-input \
--overwrite \
--prefix output_prefix \
--ucsc-build-version 38

You can make example data from any genomic data. I use a three column input with the following columns (in order): chromosome number, variant id, position. Here is a screenshot of an example input file.
Screenshot 2024-10-28 at 10 25 10 AM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants