Skip to content

Question about GbifID #605

Answered by muttcg
niconoe asked this question in Q&A
Discussion options

You must be logged in to vote

HI @niconoe, apologies for the waiting.

There are 2 main variations gbifIDs for DWCA:

1) DwcTerm.occurrenceID and triplet (a combination of DwcTerm.institutionCode, DwcTerm.collectionCode and DwcTerm.catalogNumber).

If the dataset contains unique occurrenceIDs and unique triplets data, we create pairs in key-value database:

datasetKey(registry.gbif.org) + occurrenceID -> gbifID
datasetKey(registry.gbif.org) + triplet (institutionCode| collectionCode|catalogNumber) -> the same gbifID

If data provider changes one of 4 fields we get a collision and drop that records from the index until the provider fix data or we break that pair and tell what to use as the main key (occurrenceID, triplet o…

Replies: 4 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by muttcg
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
2 participants
Converted from issue

This discussion was converted from issue #604 on October 15, 2021 08:47.