EHN: Support multiple timeseries files for creating a GeoDataset object #453

Tjalling-dejong · 2023-07-20T14:55:51Z

Kind of request

Adding new functionality

Enhancement Description

This issue was raised in #372. But the change is quite big, so a separate issue is the right thing to do.

The fn_data argument can be multiple (csv) files: for now geodataset require one file per variable with all IDs but for some data this could be organized as one file per ID (with all variables)

This issue depends on how issue #372 is solved.

Use case

@hboisgon Could you elaborate a bit more on the use case of this issue?

Additional Context

No response

hboisgon · 2023-07-21T00:35:35Z

Sure! But I think you explained it yourself : for now geodataset require one file per variable with all IDs but for some data this could be organized as one file per ID (with all variables)

My current use case is for lakes/reservoirs where classically you have one file per lake ID with rating curve tables (ie A=f(H) and Q=f(H)). But it can also happens for meteo/water quality observation stations where they have sometimes an excel file with one tab per station (and pandas can only open one tab at a time).

DirkEilander · 2023-07-21T08:46:30Z

I think what we need to progress on this is a detailed description of how the csv/excel files look that we want to support. In your case each file (or tab) has data (can be multiple variables?) for a single location. But do we also want to support files with a single variables for multiple locations (i.e. one file per variable)? How is the file linked to the geospatial index? Through file naming conventions of the csv file? Or a filename in the attribute table of the locations file? How do we set the names for dimensions and variables if we translate the files to a xarray.Dataset?

In the light of the discussion in #372 I would also argue against including this in the io.open_geodaset method. Instead my suggestion would be to make this a separate driver in the DataFrameAdapter (return a multiindex DataFrame) or a driver in a new DatasetAdapter.

hboisgon · 2023-08-04T08:07:11Z

Need to make a decision what we do want to support and see if this can be part of pre-processing or added in the data adapter

Tjalling-dejong added Enhancement New feature or request Needs refinement issue still needs refinement labels Jul 20, 2023

Tjalling-dejong changed the title ~~support multiple timeseries files for creating a GeoDataset~~ EHN: Support multiple timeseries files for creating a GeoDataset object Jul 20, 2023

Tjalling-dejong mentioned this issue Jul 20, 2023

ENH: improve adapter for GeoDataset #372

Closed

3 tasks

savente93 mentioned this issue Aug 28, 2023

Add support for multiple csv segmented by var #489

Merged

5 tasks

savente93 linked a pull request Sep 20, 2023 that will close this issue

Add support for multiple csv segmented by var #489

Merged

5 tasks

hboisgon added this to the Q3 milestone Sep 21, 2023

hboisgon removed the Needs refinement issue still needs refinement label Sep 21, 2023

savente93 added the Blocked An issue that cannot be progressed right now label Oct 5, 2023

savente93 self-assigned this Oct 5, 2023

savente93 modified the milestones: Q3, Q4 Oct 5, 2023

savente93 closed this as completed in #489 Oct 6, 2023

savente93 removed the Blocked An issue that cannot be progressed right now label Oct 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

EHN: Support multiple timeseries files for creating a GeoDataset object #453

EHN: Support multiple timeseries files for creating a GeoDataset object #453

Tjalling-dejong commented Jul 20, 2023

hboisgon commented Jul 21, 2023

DirkEilander commented Jul 21, 2023 •

edited

Loading

hboisgon commented Aug 4, 2023

EHN: Support multiple timeseries files for creating a GeoDataset object #453

EHN: Support multiple timeseries files for creating a GeoDataset object #453

Comments

Tjalling-dejong commented Jul 20, 2023

Kind of request

Enhancement Description

Use case

Additional Context

hboisgon commented Jul 21, 2023

DirkEilander commented Jul 21, 2023 • edited Loading

hboisgon commented Aug 4, 2023

DirkEilander commented Jul 21, 2023 •

edited

Loading