Fix tests for Aug 2023 updated remote datasets #2636

seisman · 2023-08-21T07:53:18Z

Description of proposed changes

Some tests related to the GMT remote datasets are failing due to the recent updates of these datasets (see https://www.generic-mapping-tools.org/remote-datasets/changes.html for a list of these changes). The min/max values of these datasets changed slightly in the new versions.

This PR fixes these failing tests by updating the min/max values.

Notes:

In the tests, we use statements like npt.assert_allclose(data.min(), -180.40002, rtol=1e-5) to compare the grid value with the expected value. All datasets have a specific precision (e.g., the earth_geoid dataset has a precision of 0.01), so it makes no sense to compare an earth_geoid value with a number like -180.40002 and a relative tolerance of 1.0e-5. Instead, it makes more sense to compare the value with -180.4 and an absolute tolerance of 0.01 (the precision of the earth_geoid dataset). I've changed rtol to atol in all these tests.
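A minimal sketch of the difference between the two tolerance styles, for illustration only. The value grid_min and the helper passes are hypothetical and not taken from the test suite; the sketch relies on the documented behaviour of npt.assert_allclose, which accepts a pair when |actual - desired| <= atol + rtol * |desired| (with rtol defaulting to 1e-7).

```python
import numpy.testing as npt


def passes(actual, desired, **tolerances):
    """Return True if npt.assert_allclose accepts the pair, False otherwise."""
    try:
        npt.assert_allclose(actual, desired, **tolerances)
    except AssertionError:
        return False
    return True


# Hypothetical grid minimum: half a precision step (0.005) away from -180.4.
grid_min = -180.395

# Old style: the allowed difference is rtol * |desired|, about 0.0018 here,
# which is finer than the 0.01 precision of the dataset.
old_style = passes(grid_min, -180.40002, rtol=1e-5)

# New style: the allowed difference is atol plus the default rtol=1e-7 term,
# about 0.01 here, which matches the dataset precision.
new_style = passes(grid_min, -180.4, atol=0.01)

print(old_style, new_style)
```

With these made-up numbers the relative check rejects a value that differs by less than the dataset precision, while the absolute check accepts it.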

Reminders

Run make format and make check to make sure the code follows the style guide.
Add tests for new features or tests that would have caught the bug that you're fixing.
Add new public functions/methods/classes to doc/api/index.rst.
Write detailed docstrings for all functions/methods.
If wrapping a new module, open a 'Wrap new GMT module' issue and submit reasonably-sized PRs.
If adding new functionality, add an example to docstrings or tutorials.
Use underscores (not hyphens) in names of Python files and directories.

Slash Commands

You can write slash commands (/command) in the first line of a comment to perform
specific operations. Supported slash commands are:

/format: automatically format and lint the code
/test-gmt-dev: run full tests on the latest GMT development version

seisman · 2023-08-21T09:53:13Z

Two synbath tests are not updated due to an upstream dataset issue (GenericMappingTools/gmtserver-admin#213).

weiji14 · 2023-08-22T21:15:10Z

> Two synbath tests are not updated due to an upstream dataset issue (GenericMappingTools/gmtserver-admin#213).

OK, it looks like they're re-processing the synbath dataset again. Let's wait a day or two to see if it gets fixed.

weiji14

> In the tests, we use statements like npt.assert_allclose(data.min(), -180.40002, rtol=1e-5) to compare the grid value with the expected value. All datasets have a specific precision (e.g., the earth_geoid dataset has a precision of 0.01), so it makes no sense to compare an earth_geoid value with a number like -180.40002 and a relative tolerance of 1.0e-5. Instead, it makes more sense to compare the value with -180.4 and an absolute tolerance of 0.01 (the precision of the earth_geoid dataset). I've changed rtol to atol in all these tests.

Cool, setting the absolute tolerance does make more sense. I looked through each dataset and the precision values look OK, except for the ones indicated below, where I couldn't find the precision documented at https://www.generic-mapping-tools.org/remote-datasets/index.html.

I'm wondering if we should file an upstream issue to https://github.com/GenericMappingTools/remote-datasets/issues to have the precision information in the metadata too? That way we could do something like npt.assert_allclose(data.min(), ..., atol=data.attrs["precision"]) instead of hardcoding a value. Just in case the precision changes in the future.
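A rough sketch of what that could look like, for illustration only. The "precision" attribute is hypothetical: the remote datasets do not provide it, and the small array below is a made-up stand-in for a grid loaded through PyGMT.

```python
import numpy as np
import numpy.testing as npt
import xarray as xr

# Stand-in for a grid returned by a dataset loader. The "precision"
# attribute is hypothetical; it is not part of the current metadata.
data = xr.DataArray(
    np.array([[-180.4, 12.25], [3.5, 110.52]]),
    dims=("lat", "lon"),
    attrs={"precision": 0.01},
)

# The tolerance comes from the metadata instead of a hardcoded number.
npt.assert_allclose(data.min(), -180.4, atol=data.attrs["precision"])
npt.assert_allclose(data.max(), 110.52, atol=data.attrs["precision"])
```

If the precision of a dataset changed upstream, a test written this way would pick up the new tolerance without an edit, though the expected min/max values would still need updating.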

weiji14 · 2023-08-22T21:33:12Z

pygmt/tests/test_datasets_earth_relief.py

+    npt.assert_allclose(data.min(), -8600.5, atol=0.5)
+    npt.assert_allclose(data.max(), 5559.0, atol=0.5)


Did you get the uncertainty for the earth_relief grids from the paper(s)? I'm not seeing them on https://www.generic-mapping-tools.org/remote-datasets/earth-relief.html#technical-information.

seisman · 2023-08-22T23:36:52Z

> Looked through each dataset and the precision values look ok, except for the ones indicated below where I couldn't find the precision documented at https://www.generic-mapping-tools.org/remote-datasets/index.html.

I got the precisions from the recipes (https://github.com/GenericMappingTools/gmtserver-admin/tree/master/recipes) that were used to build the grids (the DST_SCALE parameter in the recipes).
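A hedged sketch of why a build-time scale can serve as the absolute tolerance. It assumes that DST_SCALE acts like a packing scale factor, so that grid values are stored as integer multiples of it; that reading is an assumption and is not confirmed in this thread. The elevations below are made up, and 0.5 is simply the atol used for the earth_relief tests in this PR.

```python
import numpy as np

# Assumed storage step; 0.5 is the atol used for earth_relief in this PR.
scale = 0.5

# Hypothetical source elevations in metres.
original = np.array([-8600.37, 5559.12, 0.26])

# Pack to 16-bit integers and unpack again, as scale-factor packing would.
packed = np.round(original / scale).astype(np.int16)
unpacked = packed * scale

# Rounding moves each value by at most half a step.
max_error = np.max(np.abs(unpacked - original))
print(unpacked, max_error)
```

Under that assumption, every stored value is a multiple of the scale, so comparing against an expected value with atol equal to the scale is as strict as the stored data allows.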

> I'm wondering if we should file an upstream issue to https://github.com/GenericMappingTools/remote-datasets/issues to have the precision information in the metadata too? That way we could do something like npt.assert_allclose(data.min(), ..., atol=data.attrs["precision"]) instead of hardcoding a value. Just in case the precision changes in the future.

I'm not sure, because these values are just the precisions that GMT used to build the grids, not exactly the real precisions of the original datasets.

weiji14 · 2023-08-27T23:16:25Z

> > Looked through each dataset and the precision values look ok, except for the ones indicated below where I couldn't find the precision documented at https://www.generic-mapping-tools.org/remote-datasets/index.html.
>
> I get the precisions from the recipes (https://github.com/GenericMappingTools/gmtserver-admin/tree/master/recipes) that were used in building the grids (the DST_SCALE parameter in the recipes).
>
> > I'm wondering if we should file an upstream issue to https://github.com/GenericMappingTools/remote-datasets/issues to have the precision information in the metadata too? That way we could do something like npt.assert_allclose(data.min(), ..., atol=data.attrs["precision"]) instead of hardcoding a value. Just in case the precision changes in the future.
>
> I'm not sure, because these values are just the precisions that GMT used to build the grids, not exactly the real precisions of the original datasets.

Ah, I see. If these are just GMT's re-gridded precisions and not the actual precisions, then they probably shouldn't be in the metadata. Let's just wait for the grids to be fixed then. It seems the earth_synbath_* grids have already been fixed on the candidate server and just need to be copied to the oceania server, according to GenericMappingTools/gmtserver-admin#213 (comment).

weiji14

May need to manually regenerate the cache by uncommenting line 15 of pygmt/.github/workflows/cache_data.yaml (as of dcf1c7f):

# pull_request:

before re-running the tests.

weiji14 · 2023-08-30T22:08:38Z

If you're short on time, I can help push the changes? Just give me a 👍

weiji14

I've updated the cache at 364c1b5 and re-run the tests with the updated min/max numbers. Everything should work now.

seisman added 8 commits August 21, 2023 15:22

Fix tests for earth_age dataset

79f6ca4

Fix tests for earth_faa dataset

d2d630d

Set atol to earth_geoid tests

f28822b

Fix earth_mag and earth_wdmam tests

0fc0b3b

Fix tests for earth_vgg dataset

50fc085

Fix formatting issues

0a9d82d

Fix some tests for earth_relief

2b2a824

Add atol to earth_relief tests

892b5fc

seisman added the maintenance Boring but important stuff for the core devs label Aug 21, 2023

seisman added this to the 0.10.0 milestone Aug 21, 2023

weiji14 changed the title ~~Fix tests for updated datasets~~ Fix tests for Aug 2023 updated remote datasets Aug 22, 2023

weiji14 reviewed Aug 22, 2023

This was referenced Aug 25, 2023

Release PyGMT v0.10.0 #2640

Closed

Figure.text: Support non-ASCII characters in the 'text' parameter #2638

Merged

Merge branch 'main' into fix-datasets

9ea64d8

weiji14 reviewed Aug 28, 2023

weiji14 added 5 commits August 31, 2023 13:37

Merge branch 'main' into fix-datasets

b35e91c

Update minmax values for some earth_relief grids

4d6b42f

Also setting absolute tolerance to 0.5

Rebuild cache for GMT remote datasets

364c1b5

Revert "Rebuild cache for GMT remote datasets"

43bfa5d

This reverts commit 364c1b5.

Check attributes for GEBCO and GEBCOSI datasets

74fa646

weiji14 marked this pull request as ready for review August 31, 2023 02:05

weiji14 approved these changes Aug 31, 2023

seisman merged commit b00181d into main Aug 31, 2023
14 of 17 checks passed

seisman deleted the fix-datasets branch August 31, 2023 05:28
