Description: Datasets can be thought of in many ways. So that the concept can be applied across multiple data sources, and so that the size of a dataset stays limited, the following definition of a dataset is used here:
"A dataset is defined as all the data published in a research paper that is about the same chemical system."
This means that a ThermoML file that contains many PureOrMixtureData sections may contain one dataset, if all the sections are data for the same chemical system, or multiple datasets if the chemical systems are different.
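The grouping rule above can be illustrated with a minimal Python sketch. It is not the script used to build the data model: the `components` key, the compound names, and the choice to number datasets in order of first appearance are all assumptions made for illustration.

```python
def group_into_datasets(sections):
    """Group PureOrMixtureData-like sections by chemical system.

    Each section is a dict with a hypothetical "components" key listing
    the compounds it covers. Sections with the same set of components
    (in any order) are placed in the same dataset.
    """
    groups = {}
    for section in sections:
        # A frozenset ignores component order, so water + ethanol and
        # ethanol + water count as the same chemical system.
        system = frozenset(section["components"])
        groups.setdefault(system, []).append(section)

    # Assumption: setnum follows the order in which each system first appears.
    return [
        {"setnum": setnum, "system": sorted(system), "sections": members}
        for setnum, (system, members) in enumerate(groups.items(), start=1)
    ]


# Three sections from one hypothetical file, covering two chemical systems.
sections = [
    {"id": 1, "components": ["water", "ethanol"]},
    {"id": 2, "components": ["ethanol", "water"]},
    {"id": 3, "components": ["benzene"]},
]
datasets = group_into_datasets(sections)
for dataset in datasets:
    print(dataset["setnum"], dataset["system"], len(dataset["sections"]))
```

Here the first two sections fall into one dataset and the third into another, so a single file yields two datasets.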
- id: datasets primary key (auto-generated and unique)
- title: autogenerated title based on the setnum and the DOI of the reference
- setnum: index of the dataset in the ThermoML file inferred by script
- file_id: foreign key (files table) of the file the dataset is part of
- report_id: foreign key (reports table) of the report the dataset is part of
- system_id: foreign key (systems table) of the chemical system under study
- reference_id: foreign key (references table) of the reference the dataset belongs to
- trcrefset_id: unique id created by using the data in the ThermoML <TRCRefID> section concatenated with setnum
- points: number of datapoints in a dataset
- updated: datetime last updated
In this data model the datasets table is the central linking point: its foreign keys connect the files, reports, systems, and references tables.
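As a rough illustration of how the fields listed above link the tables together, the following Python sketch builds an in-memory SQLite schema. The column types, the single-column parent tables, and the sample values are assumptions; the actual database engine and full table definitions are not given here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite does not enforce foreign keys by default

# Parent tables are reduced to an id column; their real columns are not described here.
# "references" is quoted because REFERENCES is an SQL keyword.
conn.executescript("""
CREATE TABLE files (id INTEGER PRIMARY KEY);
CREATE TABLE reports (id INTEGER PRIMARY KEY);
CREATE TABLE systems (id INTEGER PRIMARY KEY);
CREATE TABLE "references" (id INTEGER PRIMARY KEY);
CREATE TABLE datasets (
    id INTEGER PRIMARY KEY AUTOINCREMENT,
    title TEXT,
    setnum INTEGER,
    file_id INTEGER REFERENCES files(id),
    report_id INTEGER REFERENCES reports(id),
    system_id INTEGER REFERENCES systems(id),
    reference_id INTEGER REFERENCES "references"(id),
    trcrefset_id TEXT UNIQUE,
    points INTEGER,
    updated TEXT DEFAULT CURRENT_TIMESTAMP
);
""")

# One row in each parent table, then a dataset row that links them all.
for table in ("files", "reports", "systems", '"references"'):
    conn.execute(f"INSERT INTO {table} (id) VALUES (1)")
conn.execute(
    "INSERT INTO datasets (title, setnum, file_id, report_id, system_id,"
    " reference_id, trcrefset_id, points) VALUES (?, ?, 1, 1, 1, 1, ?, ?)",
    ("placeholder title", 1, "placeholder-trcrefset-id", 12),  # hypothetical values
)

# Starting from a dataset, the linked file and chemical system are one join away.
row = conn.execute("""
    SELECT d.setnum, d.points, f.id, s.id
    FROM datasets AS d
    JOIN files AS f ON f.id = d.file_id
    JOIN systems AS s ON s.id = d.system_id
""").fetchone()
print(row)
```

With foreign key enforcement switched on, inserting a dataset whose `file_id` has no matching row in `files` raises `sqlite3.IntegrityError`, which is one way to keep the links consistent.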