-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Data loss on citeproc import #20
Comments
thanks for this @bwiernik - I definitely want to improve the citeproc reader/writer.
can you explain what you mean here. i'm not sure I follow. what is csl_note and csl_medium? |
By the way, I'm working on a package The list of CSL variables is given here: https://aurimasv.github.io/z2csl/typeMap.xml For example: [
{"id":"CuttsHappiness2017",
"abstract":"[truncated]",
"accessed":{
"date-parts":[[2019,10,26]]
},
"dimensions":"PT00H04M16S",
"director":[{"family":"Cutts","given":"Steve"}],
"issued":{
"date-parts":[[2017,11,24]]
},
"medium":"Video",
"publisher":"Vimeo",
"source":"Vimeo",
"title":"Happiness",
"type":"motion_picture",
"URL":"https://vimeo.com/244405542"}
] Here, A similar argument could be made for fields that are also specific biblatex, bibtex, or other formats and not represented in the Crosscite schema. In general, I think it would make sense to create a table that cross-references the fields for each data format (e.g., I am happy to help create such a table for the formats |
looks like you forgot to finish a thought:
|
Sorry, fixed that. |
thanks for the fix.
As you've probably seen, we do have some named lists, e.g, https://github.com/ropensci/handlr/blob/master/R/translations.R as converters between formats. A table would be good though. I agree about not dropping fields, and assigning them a |
@bwiernik Are you still interested in making that table? |
Yes, I'm hoping to get to it in the next week or two. |
Okay, thanks |
notes:
google spreadsheet started in https://docs.google.com/spreadsheets/d/1p1XaEtTBU_CmZba0P8nGpIlqAS2A8r4ZUs-WJarKUxo/edit#gid=0 - then move to the package when more stable |
|
thanks! Do you know where to get a complete list of JATS types? |
There is the full list in the JATS spec https://groups.niso.org/apps/group_public/download.php/21030/ANSI-NISO-Z39.96-2019.pdf |
i don't see a full list in there for
|
Oh I see what you mean. Hmm. I’m not sure there is a formal list anywhere. Probably the best option would be to compile the converter programs, such as those listed in the Wiki article here https://en.wikipedia.org/wiki/Journal_Article_Tag_Suite, and see what conventions have emerged. |
Okay, thanks - not sure we need to include JATS, but if its easy enough to do seems worth it |
there's better support for citeproc now. im sure could be better, but need to submit a new version for other reasons, so moving this to the next milestone - still need to finish the crosswalk between all formats spreadsheet linked above and then implement using that here |
When reading the Citeproc/CSL JSON format, handlr currently discards any valid CSL variables that are not part of its internal Crosscite format. This seems quite suboptimal, because it means that handlr can really only properly work with Citeproc data for a small number of item types (pretty much just
article-journal
andwebpage
). For example, thegenre
andmedium
variables that are used to indicate the category for a report or thesis are discarded. The variableeditor
is used for books and book chapters. In the example data I provide above, the variablereferences
is discarded.If I were to generate a reference for this item using the American Psychological Association CSL style, it would be:
Joly, J. F., Stapel, D. A., & Lindenberg, S. M. (2008). Silence and table manners: When environments activate norms. Personality and Social Psychology Bulletin, 34(8), 1047–1056. https://doi.org/10.1177/0146167208318401 (Retraction published 2012, Personality and Social Psychology Bulletin, 38[10], 1378, https://doi.org/10.1177/0146167212462821)
However, if I import the item to handlr, export to CSL JSON again, and render the citation, it's:
Joly, J. F., Stapel, D. A., & Lindenberg, S. M. (2008). Silence and table manners: When environments activate norms. Personality and Social Psychology Bulletin, 34(8), 1047–1056. https://doi.org/10.1177/0146167208318401
The retraction information has been lost.
Other variables, such as
annote
, ,genre
,note
,medium
,collection-title
,number
, andillustrator
are also all discarded on importFor item types and fields that don't have a Crosscite analogue, it seems like it would be wise to store these in the item data (e.g., as
csl_note
,csl_medium
) and map them to other formats at translation time as needed.The text was updated successfully, but these errors were encountered: