The following test runs are documented:
- Datasets from the Greek National Aggregator (EKT)
- Dataset from the Dutch Digital Heritage Network (NDE)
- Dataset from the National Library of Portugal (BNP)
- Dataset from the Finnish National Library (FINNA)
- Dataset from the Swedish National Heritage Board (SOCH)
Quantitative test results:
Provider | dataset name | crawl type | result EDM file (NDE server) | visible in Europeana | # triples | size | crawling time (sec) | # crawled resources | mapping time (sec) |
---|---|---|---|---|---|---|---|---|---|
EKT | ecc-books | dump | ecc-books-edm.zip (30K) | preview Metis | 1416 | 420K | 26.87 | 1? | 0.387 |
EKT | ecc-sculptures | dump | ecc-sculptures-edm.zip (35K) | preview Metis | 1152 | 366K | 25.98 | 1? | 0.367 |
EKT | ecc-photographes | dump | ecc-photographs-edm.zip (30K) | preview Metis | 1113 | 296K | 26.09 | 1? | 0.414 |
EKT | ecc-paintings | dump | ecc-paintings-edm.zip (37K) | preview Metis | 1136 | 370K | 25.75 | 1? | 0.372 |
NDE | kb-centsprenten | links | centsprenten-edm.zip (1.7M) | - | 41977 | 5.4M | 633.15 | 1255 | 3.44 |
NDE | nmvw | dump | nmvw-edm.zip (870M) | - | 14,945,723 | 2.0G | 108.28 | 1 | 531.4 |
BNP | rnod | dump | rnod-edm.zip (118M) | - | 3,030,649 | 390M | 175.7 | 1 | no conversion needed |
FINNA | fennica | sparql | fennica-edm.zip (56M) | preview Metis (part) | 33,967,718 | 4.4G | 24646 | 48216 | 281.13 |
SOCH | LSH | dump | soch-lsh-edm.zip (48M) | - | 3,491,551 | 528M | 99.7 | 2 | 112.6 |
- tests run on a laptop with an i7-8550U CPU / 1.80GHz / 8-core | 16 GB memory; JVM run with the `-Xmx24G` option
- # triples measured with `wc -l` on the .nt file
- crawling time measured with the bash `time` prefix
- mapping time measured through the jena sparql `-time` option
- due to time and capacity constraints, only 5 datasets have been uploaded to the Metis test environment
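
The measurement notes above can be illustrated with a minimal shell sketch. It is not the script used for the test runs: the file `sample.nt` and its three triples are hypothetical, and `sleep 1` merely stands in for the crawl command, which is not documented here. The sketch assumes bash and an N-Triples file without blank or comment lines, so that each line is exactly one triple.

```shell
# Create a tiny N-Triples sample so the sketch runs on its own.
# (sample.nt and its contents are hypothetical, not one of the test datasets.)
cat > sample.nt <<'EOF'
<http://example.org/a> <http://purl.org/dc/elements/1.1/title> "A" .
<http://example.org/b> <http://purl.org/dc/elements/1.1/title> "B" .
<http://example.org/c> <http://purl.org/dc/elements/1.1/title> "C" .
EOF

# N-Triples stores one triple per line, so the line count is the triple count
# (assuming no blank or comment lines in the file).
triples=$(wc -l < sample.nt | tr -d ' ')
echo "triples: $triples"

# Human-readable file size, comparable to the "size" column.
du -h sample.nt

# Wall-clock time of a command via the bash `time` prefix.
# `sleep 1` is a placeholder for the actual crawl command.
time sleep 1

# With Apache Jena installed, the mapping step could presumably be timed in a
# similar way with the sparql command and its -time option, for example:
#   sparql --data=sample.nt --query=mapping.rq -time
# (mapping.rq is a hypothetical query file name.)
```

Counting lines is only a reliable triple count for line-based serialisations such as N-Triples; for Turtle or RDF/XML the file would first have to be converted.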