-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
finalize data model #1
Comments
I think you would also want to know why the identifier died. Things like "gene turns out not to be coding" or "green turns out to be two separate genes" are really different things. (And in both cases we might actually want to keep a record even though the identifier was removed from the most recent database since you could still do things like: "map variants mapped to the now longer gene area to the chromosome location", decide that gene expression for a cluster should belong to one of two genes and possibly you could decide which one based on reporter sequence) |
Yes, indeed. Once we have one or more data sources that mention this, we can start modelling this. WikiPathways has similar things, like "merged content into". Ensembl does not seem to provide this information. |
Would be nice to add the curator ORCID identifier to keep track of attributions (or when content is imported, use like the wikidata identifier for the database itself) |
@egonw can I take a look at the current overall data model? This seems to be all very related to ontology-metadata and mapping metadata efforts we are trying to reconcile across the board.. |
@matentzn the data model is currently 3 columns in a TSV file - see https://github.com/bridgedb/tiwid/blob/main/data/hgnc.symbol.tsv as an example |
Thank you @cthoyt! I thought this was related to the more broad birdgedb data models, not just tiwid, but makes sense now. |
@matentzn, let's start a discussion here: bridgedb/BridgeDb#216 |
Oh, and with SSSOM in mind, this format may have mappings but these are optional: there is no clear mapping necessarily. |
Ok, got it :) I remember looking at bridgedb in the past, and talking to @Chris-Evelo about adopting sssom for any mappings, but I did not really follow up with that! |
At this moment I do not have clear yet what the data model is going to be. These things come to mind at this moment:
The text was updated successfully, but these errors were encountered: