finalize data model #1

egonw · 2021-01-29T11:08:58Z

At this moment, it is not yet clear to me what the data model is going to be. These things come to mind:

the identifier that died
when the identifier died
type of identifier: dead, zombie, ghost
next of kin
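
To make the four fields above concrete, here is a minimal sketch of how one record could be represented. It is only an illustration under assumptions: the class name `DeadIdentifier`, the enum `DeathType`, the field names, and the example identifiers are all hypothetical and not part of any agreed model.

```python
from dataclasses import dataclass
from datetime import date
from enum import Enum
from typing import Optional


class DeathType(Enum):
    """The three identifier types listed above (hypothetical enum name)."""
    DEAD = "dead"
    ZOMBIE = "zombie"
    GHOST = "ghost"


@dataclass(frozen=True)
class DeadIdentifier:
    """One record of the draft model; all field names are assumptions."""
    identifier: str                    # the identifier that died
    died_on: Optional[date]            # when it died, if known
    death_type: DeathType              # dead, zombie, or ghost
    next_of_kin: Optional[str] = None  # replacement identifier, if any


# Made-up example values, not taken from any real database.
record = DeadIdentifier(
    identifier="EXAMPLE:0001",
    died_on=date(2021, 1, 29),
    death_type=DeathType.DEAD,
    next_of_kin="EXAMPLE:0002",
)
```

Keeping `next_of_kin` optional reflects that an identifier may die without a successor.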

Chris-Evelo · 2021-01-31T21:55:54Z

I think you would also want to know why the identifier died. Things like "gene turns out not to be coding" or "gene turns out to be two separate genes" are really different things. (In both cases we might actually want to keep a record even though the identifier was removed from the most recent database, since you could still do things like map variants that were mapped to the former gene area to the chromosome location, or decide that gene expression for a cluster should belong to one of the two genes, possibly deciding which one based on the reporter sequence.)

egonw · 2021-02-01T06:36:37Z

Yes, indeed. Once we have one or more data sources that mention this, we can start modelling this. WikiPathways has similar things, like "merged content into". Ensembl does not seem to provide this information.

cthoyt · 2022-10-24T08:07:49Z

It would be nice to add the curator's ORCID identifier to keep track of attributions (or, when content is imported, to use something like the Wikidata identifier for the database itself).

matentzn · 2022-10-25T10:20:10Z

@egonw can I take a look at the current overall data model? This all seems very related to the ontology-metadata and mapping metadata efforts we are trying to reconcile across the board.

cthoyt · 2022-10-25T10:21:25Z

@matentzn the data model is currently 3 columns in a TSV file - see https://github.com/bridgedb/tiwid/blob/main/data/hgnc.symbol.tsv as an example
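
As a rough illustration of working with such a three-column TSV, the sketch below parses tab-separated rows with Python's standard `csv` module. The column names (`identifier`, `when`, `next_of_kin`), the sample rows, and the function name `read_dead_identifiers` are assumptions made for this example; the header in the linked file may differ.

```python
import csv
import io

# Made-up sample data; the second row has no next of kin.
SAMPLE_TSV = (
    "identifier\twhen\tnext_of_kin\n"
    "OLD1\t2020-01-01\tNEW1\n"
    "OLD2\t2021-06-15\t\n"
)


def read_dead_identifiers(text):
    """Parse three-column TSV text into a list of dicts (hypothetical helper)."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    rows = []
    for row in reader:
        rows.append({
            "identifier": row["identifier"],
            "when": row["when"],
            # Treat an empty third column as "no known replacement".
            "next_of_kin": row["next_of_kin"] or None,
        })
    return rows


rows = read_dead_identifiers(SAMPLE_TSV)
```

Mapping an empty third column to `None` keeps the "no replacement" case explicit instead of passing an empty string around.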

matentzn · 2022-10-25T10:26:35Z

Thank you @cthoyt! I thought this was related to the broader BridgeDb data models, not just tiwid, but it makes sense now.

egonw · 2022-10-26T10:48:51Z

@matentzn, let's start a discussion here: bridgedb/BridgeDb#216

egonw · 2022-10-26T10:50:07Z

Oh, and with SSSOM in mind: this format may have mappings, but these are optional, because there is not necessarily a clear mapping.

matentzn · 2022-10-27T13:23:17Z

Ok, got it :) I remember looking at BridgeDb in the past, and talking to @Chris-Evelo about adopting SSSOM for any mappings, but I did not really follow up on that!
