Skip to content

Latest commit

 

History

History
115 lines (68 loc) · 4.22 KB

module-idea.md

File metadata and controls

115 lines (68 loc) · 4.22 KB

Ideas for Modules (M) and Lessons (L) for Infusing FAIR Principles into Materials Science

M+: Software Carpentry and Data Management Related Tools

programming, version control, unix shell for automating tasks, database tools for programmatic data access interaction and labbook for documenting science

Potential resources:

M0: World beyond spreadsheets and metadata in filenames

L1: data formats for machine-readable data (CSV, JSON, XML for materials data)

Motivating examples:

L2: Introduction to knowledge representation: data + metadata

introducing structure and syntax into the data or how to make the data machine-actionable through self-describing key/value data structure (e.g., XML, JSON with semantically relevant key names)

Potential resources:

L3: Prepare your first dataset and code repository

(git or zenodo, jupyterlab vs notebook) and pay attention to documentation (https://www.writethedocs.org/)

L4: Verify if data is well structured for correctness - schema and schema validators

M1: Data reuse (use prior data from last year's course/literature/database)

L1: Work with data from prior work or last year course (CSV, JSON, XML)

(load data, and visualize it using the script (python/matplotlib, gnuplot) – write notebook to capture the workflow)

Potential resources:

L2: Use REST API (Materials Project, Citirination, AWFLOW)

L3: Data scraping and data restructuring

(e.g., Beautiful Soup for XML and HTML documents in Python, rvest and xml2 in R)

M2: Data integration (taking data from other groups/work, integrating experimental and computational work)

L1: Data cleaning

Potential resources:

L2: Data integration: metadata standardization challenges

(dealing with synonymous and homonymous terms, singular/plural word forms, lexical/dialectical variants, etc.)

Potential resources:

L3: Knowledge representation

data + metadata + rules - adding semantics to structure and syntax: Resource Description Framework (RDF) and schema (RDFS), database and ontology

Potential resources:

Manufacturing ontologies:

Mechanical testing ontologies:

M3: FAIR data generation

L1: Generate data with metadata

L2: Generate data with metadata and annotate with existing ontology/vocabulary

L3: Prepare FAIR publication

(publication space: zenodo, MDF), or plan for effortless and reproducible work: describe a computational/experimental environment, provide notebook as an essay/journal article, plan for incremental work.

Other resources