[server][dvc] Drop partitions asynchronously #1310

kvargha · 2024-11-15T06:01:35Z

Summary, imperative, start upper case, don't end with a period

When a storage node is transitioning from LEADER -> STANDBY -> OFFLINE -> DROPPED, a race condition can occur. Specifically, the DROPPED state transition may be executed synchronously before the other state transitions are processed (LEADER -> STANDBY -> OFFLINE are executed asynchronously). This results in the store partition being deleted prematurely. Consequently, when the LEADER -> STANDBY message is eventually processed, it triggers a PersistenceFailureException since the storage partition no longer exists.

The solution for this is to drop the store partition asynchronously by adding a DROP_PARTITION message to the consumerActionsQueue if the ingestion task is still running. In the case that it's not running (this can happen if it was killed), the store partition will be dropped synchronously.

How was this PR tested?

Added unit and integration tests.

Does this PR introduce any user-facing changes?

No. You can skip the rest of this section.
Yes. Make sure to explain your proposed changes and call out the behavior change.

lluwm · 2024-11-15T18:46:39Z

...nci-client/src/main/java/com/linkedin/davinci/kafka/consumer/KafkaStoreIngestionService.java

+    final String topic = veniceStore.getStoreVersionName();
+
+    if (isPartitionConsuming(topic, partitionId)) {
+      throw new VeniceException("Tried to drop storage partition that is still consuming");


This exception could cause the ST to be in the ERROR state, is that right? I read function stopConsumptionAndWait and, today, we simply log a warning message if consumption couldn't be stopped in time. This sounds like a behavior change in the new PR and we probably want to be careful about it.

STANDBY->OFFLINE issues an UNSUBSCRIBE message, and so will stopConsumptionAndWait.

By the time SIT processes the DROP_PARTITION message, it should have been already unsubscribed.

I think it's safe to remove this check. What do you think?

lluwm · 2024-11-15T19:00:24Z

...nci-client/src/main/java/com/linkedin/davinci/kafka/consumer/KafkaStoreIngestionService.java

+
+    try (AutoCloseableLock ignore = topicLockManager.getLockForResource(topic)) {
+      StoreIngestionTask ingestionTask = topicNameToIngestionTaskMap.get(topic);
+      if (ingestionTask != null && ingestionTask.isRunning()) {


I am thinking of a race condition that after we add DROP_PARTITION actions to the queue, then SIT terminates due to some exceptions (as we see several cases today) before executing all the remaining actions from the queue and it could probably cause some partition leaks. If this race is possible, we probably need to add some logic in the SIT to make sure that all DROP_PARTITION actions have to be executed before it can terminate itself, or maybe some other measures to avoid it.

clients/da-vinci-client/src/main/java/com/linkedin/davinci/DaVinciBackend.java

...ci-client/src/main/java/com/linkedin/davinci/ingestion/isolated/IsolatedIngestionServer.java

…been processed yet

kvargha and others added 10 commits November 13, 2024 15:44

Drop storage partitions gracefully

682cf65

Pass storageService to constructors

b49d9c4

Merge branch 'linkedin:main' into kvargha/helix-st-race-condition

88a78a5

If ingestion task isn't running, drop partition sychnronously instead

772166b

Cleanup

ee2591d

Merge branch 'linkedin:main' into kvargha/helix-st-race-condition

41836ec

Add unit test testDropStoragePartitionGracefully

de4d3d0

Test SIT dropPartition

7db261b

Add more logging and add integration test

fee2146

Add synchronous partition drop

483fa9d

lluwm reviewed Nov 15, 2024

View reviewed changes

eldernewborn reviewed Nov 15, 2024

View reviewed changes

clients/da-vinci-client/src/main/java/com/linkedin/davinci/DaVinciBackend.java Outdated Show resolved Hide resolved

eldernewborn reviewed Nov 15, 2024

View reviewed changes

...ci-client/src/main/java/com/linkedin/davinci/ingestion/isolated/IsolatedIngestionServer.java Outdated Show resolved Hide resolved

kvargha added 3 commits November 15, 2024 15:14

Remove redundant constructor parameter

13a1de8

When a KILL is issued, make sure to drop any partitions that haven't …

04d43f1

…been processed yet

Fix unit test

fb034c8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[server][dvc] Drop partitions asynchronously #1310

[server][dvc] Drop partitions asynchronously #1310

kvargha commented Nov 15, 2024

lluwm Nov 15, 2024 •

edited

Loading

kvargha Nov 15, 2024

lluwm Nov 15, 2024 •

edited

Loading

[server][dvc] Drop partitions asynchronously #1310

Are you sure you want to change the base?

[server][dvc] Drop partitions asynchronously #1310

Conversation

kvargha commented Nov 15, 2024

Summary, imperative, start upper case, don't end with a period

How was this PR tested?

Does this PR introduce any user-facing changes?

lluwm Nov 15, 2024 • edited Loading

Choose a reason for hiding this comment

kvargha Nov 15, 2024

Choose a reason for hiding this comment

lluwm Nov 15, 2024 • edited Loading

Choose a reason for hiding this comment

lluwm Nov 15, 2024 •

edited

Loading

lluwm Nov 15, 2024 •

edited

Loading