Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using DisGeNET SQLite to access more data #58

Open
colleenXu opened this issue Feb 23, 2024 · 0 comments
Open

Using DisGeNET SQLite to access more data #58

colleenXu opened this issue Feb 23, 2024 · 0 comments

Comments

@colleenXu
Copy link
Contributor

@DylanWelzel and I discussed today (2/22) how MyDisease's DisGeNET info lacks some of the useful info that we see on DisGeNET's website (example: https://www.disgenet.org/browser/0/1/1/C0001080/_a/_b./)

We looked at the gene-disease associations (GDA), but the info below probably applies to the variant-disease associations (VDA) as well.

  • the "Evidence for GDAs" tab has "association type" (type of gene-disease relationship) and supporting sentence info for PMIDs ➡️ this looks particularly interesting!
  • the "Summary of GDAs" tab has the EL (evidence level), and EI (evidence index) for each gene-disease relationship, and a pLI measure.

MyDisease is using the tsv downloads, which lack this info (look at the README). However, we realized that the SQLite file has this info! So it may be a good idea to rewrite the parser to use the SQLite data and provide more info. This may involve changing the structure of the documents/objects.

(Side note: we do have DSI/DPI in MyDisease right now, but these are actually gene-level properties...not directly related to the gene-disease relationship)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant