-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DOC: better document in the data catalogs if datasets were pre-processed #356
Comments
Part of the solution is found in updating the meta data section in deltares_data.yml & documentation according to: meta:
source_url: zenodo.org/my_dataset # should point to processed data OR original in combi with processing_notes/script
source_license: CC-BY-3.0
source_version: vX.X
paper_ref: Author et al. (year)
paper_doi: doi
processing_notes: <description of process in script OR simple processing steps (e.g. filter / gdalbuildvrt)>
processing_script: <url to script>
category: category It should be checked case by case what is required for reproducibility. there are several options:
|
In this issue we add the |
FYI: This issue is split into #537 (to identify and make notes on datasets with preprocessing) and more (to be created) issues. |
HydroMT version checks
main
hereLocation of the documentation
In the docs here: https://deltares.github.io/hydromt/latest/user_guide/data_existing_cat.html
Maybe also here (meta section) depending on implementation: https://deltares.github.io/hydromt/latest/user_guide/data_prepare_cat.html
Documentation problem
Some of the datasets in the pre-defined catalogs are actually not original but pre-processed data (eg. modis_lai, merit_hydro for some of the layers etc).
Maybe we should find a standard way of letting the user know about this?
See also issue in hydromt-wflow:#157
Known issues:
Possibly related:
Suggested fix for documentation
I think so far we tried to use source_url and notes in meta to say if processing was done. For some data sources it's missing but I also wonder if this way is clear to the user or if we should do it differently ?
For example only add source_url if no processing was done.
In case of processing, use new keywords processing_from_url, processing_from_doi, processing_steps?
The text was updated successfully, but these errors were encountered: