remove duplicates from export formats #67
Comments
Could you give a more concrete hint or example?
Sure, here is an example: the transformation to tei_all (and all other export formats based on it) now converts the duplicates to "proper" TEI files, yet without any
(Lines 240 to 277 in 51a662c)
This struck me during the eXist workshop at the Edirom Summer School that I was giving with @martinascholger, where we were using the WeGA-data package from Zenodo. Right now I'm thinking of removing those duplicates from the export, or can you think of any benefit of keeping them?
No, I also think that this is a way to resolve the issue. The duplicates are necessary for internal reasons (forwarding). Or might this also be important for the export? Should we keep this information, e.g., by adding the "old" fileID to the header? Something like an "alternative (former) identifier".
Duplicates are propagated to the export formats (and thus published to Zenodo) without a proper ID at the root element. Either this should be fixed or, even better, those duplicates should be removed from the export completely.
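For illustration only, here is a minimal Python sketch of how such files could be spotted in an already exported data package. It assumes, as described above, that a duplicate can be recognized by a missing `xml:id` on the root element. The function names `has_root_id` and `filter_export` and the sample file names are hypothetical, and this is not the project's actual export code.

```python
import xml.etree.ElementTree as ET

# Clark-notation name of the xml:id attribute as ElementTree exposes it
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"


def has_root_id(xml_text: str) -> bool:
    """Return True if the root element carries a non-empty xml:id."""
    root = ET.fromstring(xml_text)
    return bool(root.get(XML_ID, "").strip())


def filter_export(docs: dict[str, str]) -> dict[str, str]:
    """Keep only documents whose root element has an xml:id.

    Assumption: files without a root xml:id are the duplicates
    discussed in this issue.
    """
    return {name: text for name, text in docs.items() if has_root_id(text)}


# Hypothetical sample data: one regular file, one duplicate without root ID
sample = {
    "regular.xml": '<TEI xmlns="http://www.tei-c.org/ns/1.0" xml:id="regular"/>',
    "duplicate.xml": '<TEI xmlns="http://www.tei-c.org/ns/1.0"/>',
}
kept = filter_export(sample)
```

A check like this could also serve as a sanity test on the published package, whichever of the two fixes is chosen.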