Skip to content

Latest commit

 

History

History
22 lines (16 loc) · 3.62 KB

mapping_readme.md

File metadata and controls

22 lines (16 loc) · 3.62 KB

Alignment of KG Concepts

As the data from various sources follow different nomenclature and data structure, we propose Common Metadata Ontology(CMO) to unify them. The mapping of each data source to CMO can be found below.

Nodes in Common Metadata Ontology Papers-with-Code OpenML Huggingface
Pipeline: AI pipeline executed to solve a Task Extracted from Paper Flow Extracted from Models
Report: Any published text document regarding the pipeline implementation Paper Extracted from runs Extracted from Model Cards
Task: The AI Task for which the pipeline is implemented. Example: image classification Task Task type + Dataset Pipeline Tag
Framework: The framework used to implement the pipeline and their code repository Extracted from Git Repository Extracted from Runs Library
Stage: Various stages of the pipeline such as data preprocessing, training, testing or evaluation Generated Generated Generated
Execution: Multiple executions of a given stage in a pipeline Generated Runs Generated
Artifact: Artifacts such as model, dataset and metric generated at the end of each execution Generated Generated Generated
Metric: Subclass of artifact. The evaluation result of each execution Evaluation and Results Extracted from runs N/A
Dataset: Subclass of artifact. The dataset used in each execution. Dataset Dataset Dataset
Model: Subclass of artifact. The model used in each execution or produced as a result of an execution Method Extracted from Flow Model
Hyperparameters: Parameter setting using for each execution of a stage N/A Extracted from runs N/A

The current data source do not provide a detailed information on the stage (data preprocessing, training or testing) of the pipeline excution. On the other hand, we know that the pipeline metadata exposed are for train/test stages. Hence we generate the information for stage and so execution. The properties of each node can be found at common-metadata-ontology.