-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add a feature to update the XML file based on the CSV information #10
Comments
After looking at this for a couple hours, the biggest stumbling block to this is that the XML file gets format converted to a DataFrame: and all other processing is done in that format: This makes working backwards tricky, because you can't directly look at the relevant publication in the XML when you find a mistake. In fact the XML object has been thrown away by that point. In https://github.com/jcohenadad/bibeasy/pull/11/commits/f261436463334efc02644c977a952098c7575a2e I make the DataFrame retain the IDs used in the XML, so at least there's a chance of going from one to the other. As a note to seld, some avenues that could be explored:
|
Some questions:
(I could do this feature without touching those features but it would be easier if I had a freeer hand to rearrange the existing code) |
I know, this is a bit of a pain, which is why i reached out to a smart software engineer 😅
yes, it could be used. Typical scenario: i write a grant on sept 2021 and i format all references using CCV's ref. On March 2022 I write another grant and I want to reuse the text from sept 2021: I will need to update the references by matching CCV's sept 2021 with CCV's march 2022.
yes, it could also be used, although it is more rare. Scenario: the grant i wrote in sept 2021 using CCV's refs should now be exported using CSV's gsheet references system in order to "standardize" it for future use. is anything besides scripts/bibeasy_cli.py ever used? Can I remove the rest of the scripts? hum, i'm not sure, i would have to dig but i don't have the time right now. I know we had some scripts to format the references into a dokuwiki to have them display on our website-- but we are not on dokuwiki anymore. Also, when i refactored bibeasy_cli, it should now be able to output a formatted document for our markdown new website. So i think it is safe to remove, but again, i'm not 100% sure... @alexfoias can you pls chip in? |
I was using Ideally we would use bibeasy directly on the gitbook to fetch the latest version of publications. |
Some reverse engineering: I made a test account on https://ccv-cvc.ca/ and stripped down @jcohenadad's CV to a sample of 3 publications and used it to generate a sample PDF CV. The PDFs it outputs have publications numbered, and I've convinced myself from this that these numbers are ordered not by the order in the XML file, but by the order of With I get this: CCV-TestyTesterton.pdf which also displays in the same order in their UI: but with editing the XML so that I have I get CCV-TestyTesterton-6.pdf and if you re-export the XML you get So what does this tell me? It tells me that the "CCV IDs" that
(and notice that it counts from 1). This also means that the IDs are not fixed, but can potentially change if an XML file is uploaded that adds an ID number that happens to fall in between previous ones; or perhaps it can happen just upon adding a new entry with the UI. Which probably explains a large part of why |
Some deeper reverse engineering: This will add 10000 Publications, titled "c0001" through "c10000"
And for completeness, this will delete the
I ran this for a while; it only got to 89, not 10000, but I think that's enough to demonstrate that indeed the CCV order is not robust: So my conclusion from this is: you should always be using your own, robust, ID numbers, those that you keep in the gsheet, when writing anything manually, and you should be keeping those files around as source documents, clearly demarcated different than the Also this is sort of off-topic. Sorry about that. |
amazing investigations @kousu !!!
yup!
I also came to that realization, and this is what i've been trying to do for the past 2 years. |
Now that I know about the way CCV handles Do we need to pick an unused In my test account, I exported a current copy of my records, then made it usable with
Here is it: CCV-10206959.xml I added a section to it, without adding the kousu@ail:~/src/neuropoly/bibeasy$ diff -u --color CCV-10206959.xml CCV-10206959-addition.xml
--- CCV-10206959.xml 2021-10-19 01:41:10.153386693 -0400
+++ CCV-10206959-addition.xml 2021-10-19 01:42:26.811823964 -0400
@@ -154,6 +154,71 @@
</bilingual>
</field>
</section>
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
+ <field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
+ <value type="String">ADDITION</value>
+ </field>
+ <field id="5c04ea4dae464499807d0b40b4cad049" label="Journal">
+ <value type="String">ADDITION</value>
+ </field>
+ <field id="0a826c656ff34e579dfcbfb373771260" label="Volume">
+ <value type="String">77</value>
+ </field>
+ <field id="cc1d9e14945b4e8496641dbe22b3448a" label="Issue">
+ <value type="String">5</value>
+ </field>
+ <field id="00ba1799ece344dc8d0779a3f05a4df8" label="Page Range">
+ <value type="String">1703-1713</value>
+ </field>
+ <field id="3b56e4362d6a495aa5d22a1de5914741" label="Publishing Status">
+ <lov id="00000000000000000000000100001704">Published</lov>
+ </field>
+ <field id="6fafe258e19e49a7884428cb49d75424" label="Year">
+ <value format="yyyy" type="Year">2021</value>
+ </field>
+ <field id="4ad593960aba4a21bf154fa8daf37f9f" label="Publisher">
+ <value type="String"/>
+ </field>
+ <field id="4c3bc805ceaa42259f014514fc4905f8" label="Publication Location"/>
+ <field id="1167905d079c4400ae7a4a76a203a445" label="Description / Contribution Value">
+ <value type="Bilingual"/>
+ <bilingual>
+ <french/>
+ <english/>
+ </bilingual>
+ </field>
+ <field id="478545acac5340c0a73b7e0d2a4bee06" label="URL">
+ <value type="String">https://pubmed.ncbi.nlm.nih.gov/33775122/</value>
+ </field>
+ <field id="2089ff1a86844b6c9a10fc63469f9a9d" label="Refereed?">
+ <lov id="00000000000000000000000000000400">Yes</lov>
+ </field>
+ <field id="51b7eaff05444990af823b9d80924f5b" label="Open Access?"/>
+ <field id="b779cc6478bd4b09b516c6d55e938583" label="Synthesis?"/>
+ <field id="289c8814fff141d89b12569d49aa2cb3" label="Contribution Role">
+ <lov id="00000000000000000000000100002102">Co-Author</lov>
+ </field>
+ <field id="dc7922dfa04348a3a83c9afb5bbaa24a" label="Number of Contributors">
+ <value type="Number">11</value>
+ </field>
+ <field id="bc3b428d99384b04bb749311bb804e1d" label="Authors">
+ <value type="String">Noriega de la Colina A, Your Favourite Place, Robitaille-Grou MC, Gagnon C, Boshkovski T, Lamarre-Cliche M, Joubert S, Gauthier C, Bherer L, Cohen-Adad J, Girouard H</value>
+ </field>
+ <field id="707a6e0ca58341a5a82fb923b2842530" label="Editors">
+ <value type="String"/>
+ </field>
+ <field id="375a0e2ea0914291b05b0529c4755aa7" label="DOI">
+ <value type="String"/>
+ </field>
+ <field id="9afd9e28df47464faf3f9ee2c4809e25" label="Contribution Percentage"/>
+ <field id="9f2e163dfcbf4abdb73e9d5c4daf03c4" label="Description of Contribution Role">
+ <value type="Bilingual"/>
+ <bilingual>
+ <french/>
+ <english/>
+ </bilingual>
+ </field>
+ </section>
</section>
</section>
</generic-cv:generic-cv> Then I uploaded it to the site, and redownloaded it, and cleaned it up (same as before:)
Here that is: CCV-10206959-reexported.xml kousu@ail:~/src/neuropoly/bibeasy$ diff -u --color CCV-10206959-addition.xml CCV-10206959-reexported.xml
--- CCV-10206959-addition.xml 2021-10-19 01:47:47.520030354 -0400
+++ CCV-10206959-reexported.xml 2021-10-19 01:52:17.787744264 -0400
@@ -1,5 +1,5 @@
<?xml version="1.0" ?>
-<generic-cv:generic-cv dateTimeGenerated="2021-10-19 01:39:37" lang="en" xmlns:generic-cv="http://www.cihr-irsc.gc.ca/generic-cv/1.0.0">
+<generic-cv:generic-cv dateTimeGenerated="2021-10-19 01:52:02" lang="en" xmlns:generic-cv="http://www.cihr-irsc.gc.ca/generic-cv/1.0.0">
<section id="f589cbc028c64fdaa783da01647e5e3c" label="Personal Information">
<section id="2687e70e5d45487c93a8a02626543f64" label="Identification" recordId="801f624b32b348f0bfb8cb9514083c7d">
<field id="ee8beaea41f049d8bcfadfbfa89ac09e" label="Title">
@@ -154,7 +154,7 @@
</bilingual>
</field>
</section>
- <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles" recordId="b781869115894f409ad525e796a448e0">
<field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
<value type="String">ADDITION</value>
</field> So it's pretty clear: we can trigger an |
This begs a question for me: should we be editing the publications at all? Maybe To find out, I dropped --- CCV-10206959-reexported.xml 2021-10-19 01:52:17.787744264 -0400
+++ CCV-10206959-step3.xml 2021-10-19 01:59:12.513040443 -0400
@@ -24,7 +24,7 @@
</section>
<section id="047ec63e32fe450e943cb678339e8102" label="Contributions">
<section id="46e8f57e67db48b29d84dda77cf0ef51" label="Publications">
- <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles" recordId="38d1815fcf6143919814e40f1ce76b92">
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
<field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
<value type="String">Cortico-spinal imaging to study pain</value>
</field>
@@ -89,7 +89,7 @@
</bilingual>
</field>
</section>
- <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles" recordId="4a66bf5d63cd4c37a848d3df602832d6">
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
<field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
<value type="String">Associations between relative morning blood pressure, cerebral blood flow, and memory in older adults treated and controlled for hypertension</value>
</field>
@@ -154,7 +154,7 @@
</bilingual>
</field>
</section>
- <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles" recordId="b781869115894f409ad525e796a448e0">
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
<field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
<value type="String">ADDITION</value>
</field> Before uploading this, the UI showed: After uploading, the UI looks like and the XML (after cleaning:
) now looks like CCV-10206959-step3-reexported.xml kousu@ail:~/src/neuropoly/bibeasy$ diff -u --color CCV-10206959-step3.xml CCV-10206959-step3-reexported.xml
--- CCV-10206959-step3.xml 2021-10-19 01:59:12.513040443 -0400
+++ CCV-10206959-step3-reexported.xml 2021-10-19 02:02:41.443312315 -0400
@@ -1,5 +1,5 @@
<?xml version="1.0" ?>
-<generic-cv:generic-cv dateTimeGenerated="2021-10-19 01:52:02" lang="en" xmlns:generic-cv="http://www.cihr-irsc.gc.ca/generic-cv/1.0.0">
+<generic-cv:generic-cv dateTimeGenerated="2021-10-19 02:02:06" lang="en" xmlns:generic-cv="http://www.cihr-irsc.gc.ca/generic-cv/1.0.0">
<section id="f589cbc028c64fdaa783da01647e5e3c" label="Personal Information">
<section id="2687e70e5d45487c93a8a02626543f64" label="Identification" recordId="801f624b32b348f0bfb8cb9514083c7d">
<field id="ee8beaea41f049d8bcfadfbfa89ac09e" label="Title">
@@ -24,7 +24,7 @@
</section>
<section id="047ec63e32fe450e943cb678339e8102" label="Contributions">
<section id="46e8f57e67db48b29d84dda77cf0ef51" label="Publications">
- <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles" recordId="bb4549c3252f4be8b288879825bcbc39">
<field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
<value type="String">Cortico-spinal imaging to study pain</value>
</field>
@@ -89,7 +89,7 @@
</bilingual>
</field>
</section>
- <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles" recordId="c36c62b23a3d42e6ae415331c68b7cd0">
<field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
<value type="String">Associations between relative morning blood pressure, cerebral blood flow, and memory in older adults treated and controlled for hypertension</value>
</field>
@@ -154,7 +154,7 @@
</bilingual>
</field>
</section>
- <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles" recordId="c995b089b0534658965187202626d529">
<field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
<value type="String">ADDITION</value>
</field> So it worked, and maybe there's no good point trying to match (#12) and patch publications, at least not when we know we can just overwrite all of them. The one weird quirk of doing this is that one "Submit" checkbox got toggled; because I guess it's on by default for new publications and I just tossed out the pre-existing ones rows and the XML file doesn't have a way to control that checkbox. Ah but here's a good reason not to throw out the pre-existing records: it disturbs the sort order. I got lucky in my example above, but I tried uploading a second time and got this order: and a third and got It's not a deal breaker -- afterall, the whole problem |
The Does that mean that to do "create" I need to extract the Here is is that table, or part of it anyway, extracted from a sample publication, in case we need it;
To answer this question:
and edited it so that kousu@ail:~/src/neuropoly/bibeasy$ diff -u --color CCV-10206959-step3-reexported.xml CCV-10206959-create-missing-fields.xml
--- CCV-10206959-step3-reexported.xml 2021-10-19 02:02:41.443312315 -0400
+++ CCV-10206959-create-missing-fields.xml 2021-10-19 12:51:47.477394216 -0400
@@ -219,6 +219,21 @@
</bilingual>
</field>
</section>
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
+ <field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
+ <value type="String">MISSING FIELDS</value>
+ </field>
+ <field id="1167905d079c4400ae7a4a76a203a445" label="Description / Contribution Value">
+ <value type="Bilingual"/>
+ <bilingual>
+ <french/>
+ <english/>
+ </bilingual>
+ </field>
+ <field id="2089ff1a86844b6c9a10fc63469f9a9d" label="Refereed?">
+ <lov id="00000000000000000000000000000400">Yes</lov>
+ </field>
+ </section>
</section>
</section>
</generic-cv:generic-cv> CCV-10206959-create-missing-fields.xml I uploaded it and downloaded it, and got:
CCV-10206959-create-missing-fields-reexported.xml kousu@ail:~/src/neuropoly/bibeasy$ diff -u --color CCV-10206959-create-missing-fields.xml CCV-10206959-create-missing-fields-reexported.xml
--- CCV-10206959-create-missing-fields.xml 2021-10-19 12:52:28.064164492 -0400
+++ CCV-10206959-create-missing-fields-reexported.xml 2021-10-19 12:55:45.891059874 -0400
@@ -1,5 +1,5 @@
<?xml version="1.0" ?>
-<generic-cv:generic-cv dateTimeGenerated="2021-10-19 02:02:06" lang="en" xmlns:generic-cv="http://www.cihr-irsc.gc.ca/generic-cv/1.0.0">
+<generic-cv:generic-cv dateTimeGenerated="2021-10-19 12:54:45" lang="en" xmlns:generic-cv="http://www.cihr-irsc.gc.ca/generic-cv/1.0.0">
<section id="f589cbc028c64fdaa783da01647e5e3c" label="Personal Information">
<section id="2687e70e5d45487c93a8a02626543f64" label="Identification" recordId="801f624b32b348f0bfb8cb9514083c7d">
<field id="ee8beaea41f049d8bcfadfbfa89ac09e" label="Title">
@@ -24,6 +24,21 @@
</section>
<section id="047ec63e32fe450e943cb678339e8102" label="Contributions">
<section id="46e8f57e67db48b29d84dda77cf0ef51" label="Publications">
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles" recordId="8e73cefdd1ee47a58563ff099d4d6958">
+ <field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
+ <value type="String">MISSING FIELDS</value>
+ </field>
+ <field id="1167905d079c4400ae7a4a76a203a445" label="Description / Contribution Value">
+ <value type="Bilingual"/>
+ <bilingual>
+ <french/>
+ <english/>
+ </bilingual>
+ </field>
+ <field id="2089ff1a86844b6c9a10fc63469f9a9d" label="Refereed?">
+ <lov id="00000000000000000000000000000400">Yes</lov>
+ </field>
+ </section>
<section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles" recordId="bb4549c3252f4be8b288879825bcbc39">
<field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
<value type="String">Cortico-spinal imaging to study pain</value>
@@ -219,21 +234,6 @@
</bilingual>
</field>
</section>
- <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
- <field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
- <value type="String">MISSING FIELDS</value>
- </field>
- <field id="1167905d079c4400ae7a4a76a203a445" label="Description / Contribution Value">
- <value type="Bilingual"/>
- <bilingual>
- <french/>
- <english/>
- </bilingual>
- </field>
- <field id="2089ff1a86844b6c9a10fc63469f9a9d" label="Refereed?">
- <lov id="00000000000000000000000000000400">Yes</lov>
- </field>
- </section>
</section>
</section>
</generic-cv:generic-cv> Huh, so it didn't fill in the missing fields. It did fill in the In the UI, the new publication looks like However after clicking "Done" on the UI and reexporting:
the fields have been filled in. So the upshot of this is: CCV is tolerant of missing data, and we only need to fill in the fields we're actually using. And to find out if its tolerant of missing kousu@ail:~/src/neuropoly/bibeasy$ cp CCV-10206959.xml CCV-10206959-create-with-missing-field-ids.xml
kousu@ail:~/src/neuropoly/bibeasy$ vi CCV-10206959-create-with-missing-field-ids.xml kousu@ail:~/src/neuropoly/bibeasy$ diff -u CCV-10206959.xml CCV-10206959-create-with-missing-field-ids.xml
--- CCV-10206959.xml 2021-10-19 14:40:21.516399254 -0400
+++ CCV-10206959-create-with-missing-field-ids.xml 2021-10-19 14:41:40.876257303 -0400
@@ -280,6 +280,20 @@
</bilingual>
</field>
</section>
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
+ <field label="Article Title">
+ <value type="String">ADDITION</value>
+ </field>
+ <field label="Journal">
+ <value type="String">ADDITION</value>
+ </field>
+ <field label="Volume">
+ <value type="String">77</value>
+ </field>
+ <field label="Issue">
+ <value type="String">5</value>
+ </field>
+ </section>
</section>
</section>
</generic-cv:generic-cv> This file produced an error I tried kousu@ail:~/src/neuropoly/bibeasy$ diff -u CCV-10206959.xml CCV-10206959-create-with-missing-field-ids.xml
--- CCV-10206959.xml 2021-10-19 14:40:21.516399254 -0400
+++ CCV-10206959-create-with-missing-field-ids.xml 2021-10-19 14:44:44.343638379 -0400
@@ -280,6 +280,20 @@
</bilingual>
</field>
</section>
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
+ <field id="707a6e0ca58341a5a82fb923b2842530" label="Article Title">
+ <value type="String">ADDITION</value>
+ </field>
+ <field id="375a0e2ea0914291b05b0529c4755aa7" label="Journal">
+ <value type="String">ADDITION</value>
+ </field>
+ <field label="Volume">
+ <value type="String">77</value>
+ </field>
+ <field label="Issue">
+ <value type="String">5</value>
+ </field>
+ </section>
</section>
</section>
</generic-cv:generic-cv> where I've intentionally mismatched the Same error This one: kousu@ail:~/src/neuropoly/bibeasy$ diff -u CCV-10206959.xml CCV-10206959-create-with-missing-field-ids.xml
--- CCV-10206959.xml 2021-10-19 14:40:21.516399254 -0400
+++ CCV-10206959-create-with-missing-field-ids.xml 2021-10-19 14:46:37.214501693 -0400
@@ -280,6 +280,14 @@
</bilingual>
</field>
</section>
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
+ <field id="707a6e0ca58341a5a82fb923b2842530" label="Article Title">
+ <value type="String">ADDITION</value>
+ </field>
+ <field id="375a0e2ea0914291b05b0529c4755aa7" label="Journal">
+ <value type="String">ADDITION</value>
+ </field>
+ </section>
</section>
</section>
</generic-cv:generic-cv> This one was accepted; and upon re-exporting: kousu@ail:~/src/neuropoly/bibeasy$ diff -u CCV-10206959-create-with-missing-field-ids.xml CCV-10206959.xml
--- CCV-10206959-create-with-missing-field-ids.xml 2021-10-19 14:46:37.214501693 -0400
+++ CCV-10206959.xml 2021-10-19 14:48:27.840302582 -0400
@@ -1,5 +1,5 @@
<?xml version="1.0" ?>
-<generic-cv:generic-cv dateTimeGenerated="2021-10-19 14:40:08" lang="en" xmlns:generic-cv="http://www.cihr-irsc.gc.ca/generic-cv/1.0.0">
+<generic-cv:generic-cv dateTimeGenerated="2021-10-19 14:48:19" lang="en" xmlns:generic-cv="http://www.cihr-irsc.gc.ca/generic-cv/1.0.0">
<section id="f589cbc028c64fdaa783da01647e5e3c" label="Personal Information">
<section id="2687e70e5d45487c93a8a02626543f64" label="Identification" recordId="801f624b32b348f0bfb8cb9514083c7d">
<field id="ee8beaea41f049d8bcfadfbfa89ac09e" label="Title">
@@ -24,6 +24,14 @@
</section>
<section id="047ec63e32fe450e943cb678339e8102" label="Contributions">
<section id="46e8f57e67db48b29d84dda77cf0ef51" label="Publications">
+ <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles" recordId="550ef73512ae4ade9b5bd7b9c062b454">
+ <field id="707a6e0ca58341a5a82fb923b2842530" label="Editors">
+ <value type="String">ADDITION</value>
+ </field>
+ <field id="375a0e2ea0914291b05b0529c4755aa7" label="DOI">
+ <value type="String">ADDITION</value>
+ </field>
+ </section>
<section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles" recordId="8e73cefdd1ee47a58563ff099d4d6958">
<field id="f3fd4878d47c4e83aef6959620ba4870" label="Article Title">
<value type="String">MISSING FIELDS</value>
@@ -280,14 +288,6 @@
</bilingual>
</field>
</section>
- <section id="9a34d6b273914f18b2273e8de7c48fd6" label="Journal Articles">
- <field id="707a6e0ca58341a5a82fb923b2842530" label="Article Title">
- <value type="String">ADDITION</value>
- </field>
- <field id="375a0e2ea0914291b05b0529c4755aa7" label="Journal">
- <value type="String">ADDITION</value>
- </field>
- </section>
</section>
</section>
</generic-cv:generic-cv> It's gone and rewritten the btw this is super annoying isn't it? |
In doing 'create' I need to understand the type schema used in the XML, since I can't just edit a pre-existing data structure. Most
but some contain what I assume are enums:
numbers
or even years
This is not how XML is meant to be used. You should have to parse a 'kind' field, the type should be stored in the tag types itself along with a schema specifying all the types for child nodes that implies. So e.g. there should be
I think what's going on here is it's saying this field can have two values simultaneously: an french version and an english version. There's a "Show Bilingual Fields" button on the UI which must let you do data entry. And here they've done the reasonable thing: the french content goes between But I'll deal with it because we have to. |
Looking at the fields in https://docs.google.com/spreadsheets/d/1dEUBYf17hNM22dqV4zx1gsh3Q-d97STnRB4q7p9nQ54/edit#gid=566297787, I think we can sync these:
Currently |
I also notice: a lot of the conference titles in the gsheet include their location; but CCV has a separate "Location" field we could try to fill in. For example, here I would move "Berlin" to "City" and "Germany" to "Location": <field id="b3c8a60c053a405597b92899d95765a3" label="Conference Name">
<value type="String">4 Jahrestagung der Deutschen Gesellschaft für Computer-und Roboter-Assistierte Chirurgie, Berlin, Germany</value>
</field>
<field id="5813833859a64bb58ee55e4f55aff29b" label="Conference Location"/>
<field id="c2efd9725588489b8df73467c5597c32" label="City">
<value type="String"/>
</field> |
@namgo Do you have an update on this? I'm about to submit a grant in a few weeks and this feature would be really useful. Many thanks! |
I don't have any updates sorry, I had started following Kousu's steps and then other tasks came up! I had signed up for ccv-cvc, and have some example code somewhere (I think), so I think I have everything I need to get started on it again. I'll re-prioritize this issue. |
(edit: I was complaining about lack of access to our cluster for these sorts of projects, but nevermind my laptop finished building pandas, I was being impatient) I'm struggling to understand if the mismatched UUID Nick saw is important or not, but, I tried to get myself back on track by looking at existing works by others: It looks like a lot of the constants we depend on were also found independently by https://github.com/sylvainhalle/CCCVTK , this seems like a great reference moving forward if we weren't aware of it yet. More recently, https://ahemnason.notion.site/ORCID-to-CCV-7cfb24c9f13c4d869cd2beb950e9e2e2 looks to me like it's not directly relevant to our needs... but is it, so far as I know we don't use ORCID? |
Okay I had the sense of what I was doing... partially right and partially wrong: In this case an asterisk specifically needs to be added to HQP (students of Julien as denoted by Julien) which means overwriting existing names. The asterisk needs to be beside student names in the name field. (side-note: what if we add asterisk'd student names purely from an exported xml to immediately re-upload? This would mean the iteratively written document is correct but wheww) I have a small test xml set Which I got from removing sections regarding the Testy Mc Testerson account (the first xml attrs) and reuploading. nameless-initialization.xml.txt I modified the initial xml test set to have asterisks and reuploaded: initial-export_modified-for-asterisk.xml.txt Which transforms: Into: :) asterisks!! However none of this addresses the automation section of this issue, but I'm figuring it out. |
Currently, when there is a mismatch, fields are manually updated on the CCV website.
Example of output
What would be useful is if bibeasy could update the XML, so that I would then import the XML into CCV to update the publication records.
The text was updated successfully, but these errors were encountered: