Improve test coverage for storage classes #2693

maxrjones · 2025-01-13T01:35:07Z

This PR improves the test coverage for the various storage classes. While testing the storage classes, I fixed a few issues:

Implement open() for LoggingStore
Add _is_open property and setter for WrapperStore
Use stdout rather than stderr as the default stream for LoggingStore
Ensure that ZipStore is open before getting or setting any values
Update equality for LoggingStore and WrapperStore such that the types much be equal. This is an opinionated change. For example, previously a LocalStore and LoggingStore instance could be evaluated as equal, whereas now they are distinct.

Here's the change in coverage:

src/zarr/abc/store.py                    84% -> 93%
src/zarr/storage/__init__.py             94% -> 94%
src/zarr/storage/_utils.py               94% -> 97%
src/zarr/storage/common.py               80% -> 91%
src/zarr/storage/fsspec.py               25% -> 90%
src/zarr/storage/local.py                86% -> 92%
src/zarr/storage/logging.py              62% -> 96%
src/zarr/storage/memory.py               82% -> 85%
src/zarr/storage/wrapper.py              56% -> 94%
src/zarr/storage/zip.py                  96% -> 97%
src/zarr/testing/store.py                92% -> 99%

src/zarr/storage/memory.py coverage is low because it includes the GPUStore and I don't have a test environment with cuda. I'm opening this PR now even though it's not at 100% coverage because I don't expect to have much time to work on it during the week and would rather the PR not get stale if the team has time for a review.

The set partial values methods are addressed separately because they require discussion (xref #2688).

src/zarr/storage/_fsspec.py

d-v-b · 2025-01-13T11:25:41Z

src/zarr/storage/_logging.py

@@ -18,6 +19,8 @@

    counter: defaultdict[str, int]

+T_Store = TypeVar("T_Store", bound=Store)
+

 class LoggingStore(WrapperStore[Store]):


Should WrapperStore be generic w.r.t T_Store here?

d-v-b · 2025-01-13T11:27:36Z

src/zarr/testing/store.py

+        with await self.store_cls.open(**open_kwargs) as store:
+            assert store._is_open
+            # Test trying to open an already open store
+            with pytest.raises(ValueError):


can we check that the error message in the ValueError has the expected content? We don't want this test to succeed because of a ValueError unrelated to the store being already open.

d-v-b · 2025-01-13T11:28:27Z

src/zarr/testing/store.py

+                await store._open()
+        assert not store._is_open
+
+    async def test_read_only_store_raises(self, open_kwargs: dict[str, Any]) -> None:


contrary to the name, this test doesn't seem to check that an exception is raised

d-v-b · 2025-01-13T11:30:10Z

src/zarr/testing/store.py

@@ -135,6 +154,26 @@ async def test_get(self, store: S, key: str, data: bytes, byte_range: ByteReques
        expected = data_buf[start:stop]
        assert_bytes_equal(observed, expected)

+    async def test_get_not_open(self, store_not_open: S) -> None:


this is rather surprising -- I would expect that a non-open store would not support IO of any kind. what exactly does open mean? cc @jhamman

d-v-b · 2025-01-13T11:31:33Z

src/zarr/testing/store.py

+    async def test_getsize_raises(self, store: S) -> None:
+        """
+        Test the result of store.getsize().
+        """


I think the method name and the docstring don't quite match the behavior of the test

d-v-b · 2025-01-13T11:32:04Z

src/zarr/testing/store.py

+    async def test_set_not_open(self, store_not_open: S) -> None:
+        """
+        Ensure that data can be written to the store that's not yet open using the store.set method.
+        """


same as https://github.com/zarr-developers/zarr-python/pull/2693/files#r1913045841

src/zarr/testing/store.py

tests/test_store/test_core.py

d-v-b · 2025-01-13T11:41:09Z

tests/test_store/test_core.py

+@pytest.mark.parametrize("zarr_format", [2, 3])
+async def test_contains_group(local_store, write_group: bool, zarr_format: ZarrFormat) -> None:
+    """
+    Test contains group method


can we parametrize this over path, ensuring that we check a level of nesting? e.g. @pytest.mark.parametrize('path', ['foo', 'foo/bar'])

and similarly for the contains_array tests

d-v-b · 2025-01-13T11:41:59Z

tests/test_store/test_core.py

+    with pytest.raises(ValueError):
+        assert await contains_array(store_path, zarr_format="3.0")


Suggested change

with pytest.raises(ValueError):

assert await contains_array(store_path, zarr_format="3.0")

deduplicate

this is a distinct check for contains_array rather than contains_group. I can parameterize these functions to make it more concise and clear.

ah oops, I missed that these were testing different methods

d-v-b · 2025-01-13T11:43:02Z

tests/test_store/test_core.py

+    with pytest.raises(ValueError):
+        await StorePath.open(LocalStore(str(tmpdir), read_only=False), path=None, mode="x")


lets parametrize the test over mode instead of repeating nearly identical checks

looks like we would need to parametrize over (read_only, mode) tuples

d-v-b · 2025-01-13T11:43:45Z

tests/test_store/test_local.py

@@ -53,3 +54,17 @@ def test_creates_new_directory(self, tmp_path: pathlib.Path):

        store = self.store_cls(root=target)
        zarr.group(store=store)
+
+    def test_invalid_root_raises(self):


add a docstring explaining what this test is checking

tests/test_store/test_local.py

d-v-b · 2025-01-13T11:46:47Z

this looks great, I had some minor suggestions.

Co-authored-by: Davis Bennett <[email protected]>

maxrjones added 27 commits January 11, 2025 10:22

Run Store tests on logging

2ea442c

Run store tests on wrapper

7f76575

Add read only open tests to WrapperStore

98b7392

Ignore new coverage files

18be47f

Simplify wrapper tests

69ce1d7

Fix __eq__ method in WrapperStore

5877355

Implement __repr__ for WrapperStore

b4310fd

Allow separate open and init kwargs

d08458e

Add open class method to LoggingStore

f663694

Add __str__ to WrapperStore

cf62f67

Add repr test for LoggingStore

31f9931

Fix __eq__ in LoggingStore

964aeaa

Test getsize for stores

332f564

Test for invalid ByteRequest

4d4d728

Use stdout rather than stderr as the default logging stream

30d1323

Test default logging stream

9764204

Add test for getsize_prefix

fefd666

Document buffer prototype parameter

6f240c2

Add test for invalid modes in StorePath.open()

d2bbd9d

Add test for contains_group

85f44db

Add tests for contains_array

51c0c15

Test for invalid root type for LocalStore

ddd6bc9

Test LocalStore.get with default prototype

62a528c

Test for invalid set buffer arguments

5f00efd

Test get and set on closed stores

6923337

Test using stores in a context manager

0792fa8

Specify abstract methods for StoreTests

dd0de05