[indexer-alt] Fix first_checkpoint option #20079

lxfind · 2024-10-29T22:34:49Z

Description

The "first_checkpoint" option is a bit broken.
If first_checkpoint is larger than watermark + 1, the sequential pipeline loses liveness, because it always expects watermark + 1 as the next checkpoint. The concurrent pipeline will never be able to update its watermark, for a similar reason.
This PR adds a check when registering the pipeline to simply not allow that to happen.
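
The stall described above can be illustrated with a toy model. This is only a sketch: `ToySequentialCommitter` and its methods are hypothetical names invented for illustration, not the indexer's actual types. It assumes the committer buffers incoming checkpoints and commits only the one immediately after the watermark.

```rust
use std::collections::BTreeMap;

/// Toy model of a sequential committer (hypothetical, not the indexer's real type).
/// It buffers incoming checkpoints and only commits the one it expects next.
struct ToySequentialCommitter {
    /// Always `watermark + 1` in this model.
    next_checkpoint: u64,
    /// Checkpoints received but not yet committable.
    pending: BTreeMap<u64, String>,
}

impl ToySequentialCommitter {
    /// Resume from an existing watermark, expecting `watermark + 1` next.
    fn resume_from(watermark_hi_inclusive: u64) -> Self {
        Self {
            next_checkpoint: watermark_hi_inclusive + 1,
            pending: BTreeMap::new(),
        }
    }

    /// Buffer a checkpoint, then commit as many consecutive checkpoints as possible.
    /// Returns how many checkpoints were committed by this call.
    fn receive(&mut self, checkpoint: u64, data: &str) -> usize {
        self.pending.insert(checkpoint, data.to_string());
        let mut committed = 0;
        while self.pending.remove(&self.next_checkpoint).is_some() {
            self.next_checkpoint += 1;
            committed += 1;
        }
        committed
    }
}
```

With a watermark of 10, feeding this model checkpoints 15, 16, ... never commits anything: `next_checkpoint` stays at 11 and the pending buffer only grows.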

If first_checkpoint is smaller than watermark + 1, the intention must be that we want to be able to backfill.
However the sequential pipeline will ignore any data that is below the watermark.
This PR fixes that by still allowing us to commit data even when it is below the watermark.

Test plan

CI.
Probably need to add tests too.

amnn

Thanks @lxfind. I had a slightly different plan in mind for addressing this issue, one that should touch the pipeline logic less. Let me know what you think:

Concurrent Pipelines

For concurrent pipelines, when we add the pipeline, we should check for a gap between an optional first_checkpoint and the loaded watermark at that point.

This allows us to quote the specific pipeline that pulled the watermark lower than first_checkpoint.
We should also honour the --skip-watermark flag in these cases. I.e. it shouldn't matter to a concurrent pipeline that there is a potential gap, if --skip-watermark has been provided, and we should mention that in the error message as a potential way to make progress.
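
A minimal sketch of what such a check might look like. The function name, its parameters, and the error wording are all assumptions made for illustration; only the `--skip-watermark` flag and the idea of naming the offending pipeline come from the proposal above.

```rust
/// Hypothetical registration-time check for a concurrent pipeline.
/// All names here are illustrative; only `--skip-watermark` comes from the discussion.
fn check_concurrent_pipeline(
    pipeline: &str,
    first_checkpoint: Option<u64>,
    watermark_hi_inclusive: Option<u64>,
    skip_watermark: bool,
) -> Result<(), String> {
    // With `--skip-watermark` the pipeline neither reads nor writes the watermark,
    // so a potential gap is left to the operator's judgement.
    if skip_watermark {
        return Ok(());
    }
    match (first_checkpoint, watermark_hi_inclusive) {
        // Starting beyond `hi + 1` would leave checkpoints unprocessed.
        (Some(first), Some(hi)) if first > hi.saturating_add(1) => Err(format!(
            "Pipeline {}: first checkpoint {} is ahead of watermark {}, \
             which would leave a gap. Pass --skip-watermark to run anyway.",
            pipeline, first, hi
        )),
        // No override, no watermark, or no gap: nothing to complain about.
        _ => Ok(()),
    }
}
```

Doing the check per pipeline, at the point it is added, is what lets the error message name the specific pipeline involved.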

Sequential Pipelines

For sequential pipelines, it's more tricky. Today sequential pipelines:

Guarantee that each update only runs once on the table, because the updates themselves are not idempotent.
Guarantee that the watermark is only updated if all the updates have been applied up to and including that watermark, since genesis.
Require that there are no gaps in ingestion/processing -- this requirement is implicit, the pipeline will stall and eventually it will start producing warnings, but there are ways we could have detected this issue statically.

To me, the key thing is that this pipeline deals with non-idempotent updates. For pipelines related to objects (which are all the sequential pipelines we have today) it's also important that we track all updates since genesis, but I'm going to argue that this is secondary (you could reasonably have sequential pipelines that don't care about starting from genesis -- this is even true for the object pipelines if we start from a formal snapshot).

IIUC, the current solution enforces the "no gaps" property, but it weakens the other two properties:

Updates could be run multiple times, because the pipeline was made to restart at some earlier checkpoint. For certain updates (let's say the pipeline's update is applying a delta of some kind) this will just corrupt the data straight away, for other updates it is not so bad (I think all the updates we want to do today are in this second category), but it still...
...means you could read a watermark but the data in the table is not an accurate reflection of the data at that watermark, because the pipeline is replaying some updates.

Instead, we should keep the property that each update will run exactly once, and that the watermark is moved atomically, consistently and durably with the associated updates and weaken the property that we must start from genesis.

We can do this by allowing next_checkpoint to take the value of a non-zero first_checkpoint in case there is no watermark, but complaining early (in a similar fashion to the solution for concurrent pipelines, which we can check when we add the pipeline) if we detect that there would be a gap introduced.
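
Roughly, the initialisation could be sketched as follows. `initial_next_checkpoint` is a hypothetical helper, and resuming from the watermark when the override is lower is my reading of the "exactly once" property rather than something spelled out above.

```rust
/// Hypothetical helper choosing where a sequential pipeline starts.
/// Returns the initial `next_checkpoint`, or an error if a gap would be introduced.
fn initial_next_checkpoint(
    watermark_hi_inclusive: Option<u64>,
    first_checkpoint: Option<u64>,
) -> Result<u64, String> {
    match (watermark_hi_inclusive, first_checkpoint) {
        // Starting past `hi + 1` would skip checkpoints: complain early.
        (Some(hi), Some(first)) if first > hi.saturating_add(1) => Err(format!(
            "first checkpoint {} would leave a gap after watermark {}",
            first, hi
        )),
        // A watermark exists: resume right after it so that no update runs twice.
        (Some(hi), _) => Ok(hi.saturating_add(1)),
        // No watermark yet: honour the override, defaulting to genesis.
        (None, first) => Ok(first.unwrap_or(0)),
    }
}
```

For example, with no watermark and an override of 100 the pipeline starts at 100, while with a watermark of 10 it resumes at 11 whether or not a lower override was given.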

It would also be useful to track where a pipeline started from (so that we could know if the objects pipeline was not started from genesis). I think we can address that by using the reader_lo field in the watermarks table to track that -- if we are starting a pipeline, and we have been given a first checkpoint and there is no initial watermark, we can set reader_lo to first_checkpoint before we start.
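
As a sketch of that bookkeeping: both function names below are hypothetical, and the sketch assumes nothing else moves reader_lo once it has been set.

```rust
/// Decide which `reader_lo` to record before the pipeline starts (illustrative only).
/// `existing_reader_lo` is `Some` when a watermark row already exists.
fn starting_reader_lo(existing_reader_lo: Option<u64>, first_checkpoint: Option<u64>) -> u64 {
    match existing_reader_lo {
        // A row already exists: keep whatever starting point it recorded.
        Some(reader_lo) => reader_lo,
        // Fresh pipeline: record the override, or genesis if there is none.
        None => first_checkpoint.unwrap_or(0),
    }
}

/// Under the assumption above, a pipeline started from genesis iff `reader_lo` is 0.
fn started_from_genesis(reader_lo: u64) -> bool {
    reader_lo == 0
}
```

A fresh pipeline given a first checkpoint of 500 would record 500, and a later restart with a different override would leave that value untouched.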

amnn · 2024-10-30T11:29:15Z

crates/sui-indexer-alt/src/lib.rs

+                if first_checkpoint > self.first_checkpoint_from_watermark {
+                    return Err(anyhow::anyhow!(
+                        "First checkpoint {} is larger than the expected first checkpoint from watermark {}.\
+                        This will create gaps in the data.",
+                        first_checkpoint,
+                        self.first_checkpoint_from_watermark
+                    ));
+                }


nit, this is an ensure! call in disguise:

Suggested change

-                if first_checkpoint > self.first_checkpoint_from_watermark {
-                    return Err(anyhow::anyhow!(
-                        "First checkpoint {} is larger than the expected first checkpoint from watermark {}.\
-                        This will create gaps in the data.",
-                        first_checkpoint,
-                        self.first_checkpoint_from_watermark
-                    ));
-                }
+                ensure!(
+                    first_checkpoint <= self.first_checkpoint_from_watermark,
+                    "First checkpoint {first_checkpoint} is larger than the expected first checkpoint from watermark {}. \
+                    This will create gaps in the data",
+                    self.first_checkpoint_from_watermark
+                );

lxfind · 2024-10-30T15:09:03Z

you could reasonably have sequential pipelines that don't care about starting from genesis -- this is even true for the object pipelines if we start from a formal snapshot).

If we ever do start from a formal snapshot, we should also update the watermark when we do so, such that the watermark is at the right place. So I don't quite see a valid reason that we would ever want a gap here?

Also for the concurrent pipeline, in what scenario would you want to allow the pipeline to run with a gap but skipping the watermark update?

amnn · 2024-10-30T19:40:41Z

If we ever do start from a formal snapshot, we should also update the watermark when we do so, such that the watermark is at the right place. So I don't quite see a valid reason that we would ever want a gap here?

You don't ever want a gap in the sense of "you processed rows for checkpoints up to C, then didn't for C + 1, C + 2, ..., C + k, and then picked back up at C + k + 1" -- that will always be incorrect -- but for many sequential pipelines it might be reasonable to start the pipeline at some checkpoint C and then let it run continuously from there -- technically a gap from genesis to C but otherwise no holes.
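
To make the distinction concrete, here is a small sketch of a hole detector. `first_hole` is a hypothetical helper written for illustration. It only looks between the lowest and highest processed checkpoints, so starting at some C > 0 does not count as a hole.

```rust
/// Returns the first missing checkpoint between the lowest and highest processed
/// checkpoints, or `None` if the processed range is contiguous.
fn first_hole(processed: &[u64]) -> Option<u64> {
    // Work on a sorted, de-duplicated copy so input order does not matter.
    let mut sorted = processed.to_vec();
    sorted.sort_unstable();
    sorted.dedup();
    // Any adjacent pair that is not consecutive marks a hole right after its first element.
    sorted
        .windows(2)
        .find(|w| w[1] != w[0] + 1)
        .map(|w| w[0] + 1)
}
```

Under this definition, processing 100, 101, 102 has no hole, whereas processing 0, 1, 2, 5, 6 has one starting at 3.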

Also for the concurrent pipeline, in what scenario would you want to allow the pipeline to run with a gap but skipping the watermark update?

When dealing with backfills, or other data corruption issues.
When handing over between indexers -- only one can update the watermark.

lxfind · 2024-10-30T20:15:48Z

but for many sequential pipelines it might be reasonable to start the pipeline at some checkpoint C and then let it run continuously from there -- technically a gap from genesis to C but otherwise no holes.

In that case, we should always put an initial watermark when we initialize the index, right?

When dealing with backfills, or other data corruption issues.

Well in that case there should be no gap at all because we would be updating older data entries.

When handing over between indexers -- only one can update the watermark.

I don't see how to pull this off. So you start the new indexer at some future checkpoint with --skip-watermark, then?

amnn · 2024-10-31T11:19:31Z

In that case, we should always put an initial watermark when we initialize the index, right?

Yes, that ends up happening as part of my suggested change, in an off-by-one fashion. Inside the sequential pipeline, next_checkpoint retains its existing meaning as the next watermark to be written, and we allow it to be initialised by a non-zero first_checkpoint if there is no watermark present. After the first checkpoint is processed the watermark will be written with first_checkpoint.
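
The off-by-one can be seen in a toy state machine. This is a sketch only: `ToyPipelineState` and its methods are hypothetical, and just the names next_checkpoint and first_checkpoint come from the discussion.

```rust
/// Toy state for a sequential pipeline (hypothetical type).
struct ToyPipelineState {
    /// The next watermark to be written.
    next_checkpoint: u64,
    /// `None` until the first checkpoint has been committed.
    watermark: Option<u64>,
}

impl ToyPipelineState {
    /// Initialise from an existing watermark, else from the override, else from 0.
    fn start(existing_watermark: Option<u64>, first_checkpoint: Option<u64>) -> Self {
        let next_checkpoint = match existing_watermark {
            Some(hi) => hi + 1,
            None => first_checkpoint.unwrap_or(0),
        };
        Self {
            next_checkpoint,
            watermark: existing_watermark,
        }
    }

    /// Commit the expected checkpoint; anything else is rejected and changes nothing.
    fn commit(&mut self, checkpoint: u64) -> bool {
        if checkpoint != self.next_checkpoint {
            return false;
        }
        self.watermark = Some(checkpoint);
        self.next_checkpoint += 1;
        true
    }
}
```

Starting with no watermark and a first checkpoint of 100, there is no watermark until checkpoint 100 is committed, at which point the watermark becomes 100 and `next_checkpoint` becomes 101.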

Well in that case there should be no gap at all because we would be updating older data entries.

I don't think I understood your question fully earlier -- if the question is why you would want to leave a gap in checkpoints processed with a concurrent pipeline, I agree with you, there isn't a good reason to do that, except at the front (between genesis and some starting checkpoint).

Nevertheless, the committing logic should not prevent us from doing that because we may know we are not introducing a gap even if the pipeline does not, and the examples I gave were primarily related to that: Situations where we needed to run the pipeline without it being able to refer to or update the watermark, but we wanted to run it anyway on that range of checkpoints.

I don't see how to pull this off. So you start the new indexer at some future checkpoint with --skip-watermark, then?

The scenario I had in mind is one where we introduce a change to an existing table (e.g. new column). To backfill the new column, we would need to run the new indexer from some past initial checkpoint to some recent checkpoint without updating the watermark, while the old indexer is mainly running the show.

Once we see that the backfill instance of the indexer has caught up, we could hand over to the new indexer in production as well -- i.e. it could control the watermark.

amnn

This is great, thanks @lxfind. As it stands, this change doesn't include the part where the pipelines initialise their next_checkpoint to first_checkpoint (instead of 0) when the watermark does not exist. Is that to come in a follow-up? (That would be fine; I think this PR is good as it is.)

amnn · 2024-11-01T13:19:49Z

crates/sui-indexer-alt/src/lib.rs

+        if let (Some(watermark), Some(first_checkpoint)) = (watermark, self.first_checkpoint) {
+            ensure!(
+                first_checkpoint as i64 <= watermark.checkpoint_hi_inclusive + 1,
+                "For pipeline {}, first checkpoint override {} is too far ahead than watermark {}. This could create gaps in the data.",


Suggested change

-                "For pipeline {}, first checkpoint override {} is too far ahead than watermark {}. This could create gaps in the data.",
+                "For pipeline {}, first checkpoint override {} is too far ahead of watermark {}. This could create gaps in the data.",

Added TODO. Will follow up in a separate PR.

lxfind requested a review from amnn October 29, 2024 22:35

lxfind force-pushed the indexer-alt-remove-first-checkpoint branch from 3af3e24 to 3dc9d24 Compare October 30, 2024 03:31

lxfind changed the title ~~[indexer-alt] Remove first_checkpoint option~~ [indexer-alt] Fix first_checkpoint option Oct 30, 2024

amnn reviewed Oct 30, 2024

[indexer-alt] Ensure proper use of first_checkpoint

ad32ded

lxfind force-pushed the indexer-alt-remove-first-checkpoint branch from 3dc9d24 to ad32ded Compare October 31, 2024 18:11

lxfind marked this pull request as draft October 31, 2024 18:17

lxfind marked this pull request as ready for review October 31, 2024 18:32

lxfind requested a review from amnn October 31, 2024 18:32

amnn approved these changes Nov 1, 2024

Add TODOs

ff0a8f9

lxfind force-pushed the indexer-alt-remove-first-checkpoint branch from 217f678 to ff0a8f9 Compare November 4, 2024 02:56

lxfind enabled auto-merge (squash) November 4, 2024 02:56

lxfind merged commit c538507 into main Nov 4, 2024
52 checks passed

lxfind deleted the indexer-alt-remove-first-checkpoint branch November 4, 2024 03:26
