We are seeing different kinds of errors when creating a ktk dataset, and it is unclear where they come from. Initially, these were AssertionErrors from somewhere in the parquet stack. More recently, we have seen the following on a dask worker node: Exception: OSError('IOError: ZSTD decompression failed: Corrupted block detected',)
Example code (ideally copy-pastable)
Unfortunately, this is not so easy to provide: essentially, we trigger a long-running (> 3h) ktk job with kartothek.io.dask.dataframe.update_dataset_from_ddf. During this job we sometimes (?) see the following stacktrace:
Debugging the issue hints at an improper fetch in our IO buffer, but the root cause is unknown. The issue might be triggered by a non-thread-safe reader in pyarrow, a bug in our Azure storage backend, or the buffer itself; see also #402.
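To make the "non-thread-safe reader" hypothesis concrete, here is a minimal sketch of how a shared file-like object can be guarded when several threads fetch byte ranges from it. This is not kartothek, pyarrow, or storage backend code: `LockedRangeReader` and `read_range` are hypothetical names, and `io.BytesIO` merely stands in for the real buffer. Without the lock, one thread could move the file position between another thread's `seek()` and `read()`, returning bytes from the wrong offset, which a decompressor would then likely reject as corrupt.

```python
import io
import threading
from concurrent.futures import ThreadPoolExecutor


class LockedRangeReader:
    """Hypothetical wrapper that serializes seek+read on a shared file object."""

    def __init__(self, raw):
        self._raw = raw
        self._lock = threading.Lock()

    def read_range(self, offset, length):
        # seek() and read() must happen as one atomic step; otherwise another
        # thread can change the file position between the two calls.
        with self._lock:
            self._raw.seek(offset)
            return self._raw.read(length)


# Known content, so every fetched range can be checked against the source.
payload = bytes(range(256)) * 64  # 16 KiB
reader = LockedRangeReader(io.BytesIO(payload))

# Many overlapping requests for the same set of 128-byte ranges.
ranges = [(offset, 128) for offset in range(0, len(payload), 128)] * 20

with ThreadPoolExecutor(max_workers=8) as pool:
    chunks = list(pool.map(lambda r: reader.read_range(*r), ranges))

# Count ranges whose bytes do not match the source at the requested offset.
mismatches = sum(
    chunk != payload[offset:offset + length]
    for chunk, (offset, length) in zip(chunks, ranges)
)
```

A comparable check against the real buffer (fetching known ranges concurrently and comparing them with a single-threaded read) might help narrow down whether the corruption originates in the buffer or further down in the storage backend.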