Dagster + SQLMesh Metrics: Use DuckDB as a pre-warmed cache for rolling metrics #2445

ravenac95 · 2024-11-04T05:26:27Z

What is it?

Based on some previous testing I've done (seen as part of #2430) we can actually get the metrics to run in a semi-performant way with a very large duckdb instance. Due to the way that the sqlmesh rolling windows ran upon our initial version, deletes + writes into trino were exceedingly slow. Using duckdb as a pre-warmed cache, we can distribute the calculation of metrics to a cluster of pre-warmed duckdbs and then write that back to the trino warehouse.

github-project-automation bot added this to OSO Nov 4, 2024

ravenac95 self-assigned this Nov 4, 2024

github-project-automation bot moved this to Backlog in OSO Nov 4, 2024

ravenac95 moved this from Backlog to In Progress in OSO Nov 4, 2024

ravenac95 moved this from In Progress to Up Next in OSO Nov 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dagster + SQLMesh Metrics: Use DuckDB as a pre-warmed cache for rolling metrics #2445

Dagster + SQLMesh Metrics: Use DuckDB as a pre-warmed cache for rolling metrics #2445

ravenac95 commented Nov 4, 2024

Dagster + SQLMesh Metrics: Use DuckDB as a pre-warmed cache for rolling metrics #2445

Dagster + SQLMesh Metrics: Use DuckDB as a pre-warmed cache for rolling metrics #2445

Comments

ravenac95 commented Nov 4, 2024

What is it?