Skip to content

Commit

Permalink
insert hugginface data hosting
Browse files Browse the repository at this point in the history
  • Loading branch information
geronimocharlie authored Feb 21, 2024
1 parent 2c7358c commit 45b078a
Showing 1 changed file with 8 additions and 1 deletion.
9 changes: 8 additions & 1 deletion README_emulator.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,8 +5,15 @@ Here we provide a quick documentation on installation, setup and a quickstart gu

## Getting started
### Getting the data
To download and store the preprocessed dataset ready for training locally, execute the following command:
The preprocessed dataset is available on [hugginface](https://huggingface.co/datasets/climateset/causalpaca). You can opt to download the entire dataset or pick only specific climate models for targets. Please note that the core dataset only entails two variables (pr & tas) and is only available in a single resouliton that was used for the benchmarking. We will release code to preprocess other variables and other resolutions shortly and will also update the data repository in the future.

To download the entire dataset, you can make use of the provided python script:
```python
python download_climateset.py
```
If you wish to download only specific climate model data, please refer to the instructions on [hugginface](https://huggingface.co/datasets/climateset/causalpaca/blob/main/README.md).

If you happen to be inside of Canada and wish to download the data in a different way, you can also make use of the provided bash script. Please note that this option is very slow for users located outside of Canada.
```bash
bash download_climateset.sh
```
Expand Down

0 comments on commit 45b078a

Please sign in to comment.