Kube Object Protection Enhancements #1531

raghavendra-talur · 2024-08-27T07:48:39Z

Kubeobjects

Remove the usage of function pointers
- vrg: remove the use of captureInProgressStatusUpdate #1530
- vrg: remove the use of captureStartConditionally #1532
Refactor to have one function for every state (primary, secondary) and action (relocate, failover). We want to make it trivial to understand which code runs in which state of the system. Code that is common to many states/actions can move to helpers.
Comment unexpected behavior (e.g. creating a dummy backup to perform a restore). This may not be needed after we fix the BSL issues, since velero will create the related backups.
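
A minimal sketch of the "one function per state and action" idea, assuming hypothetical `State` and `Action` types and handler names that are not taken from the ramen code. It only illustrates dispatching with an explicit switch instead of function pointers:

```go
package main

import "fmt"

// State and Action are hypothetical types for this sketch, not ramen's API.
type State string
type Action string

const (
	Primary   State = "primary"
	Secondary State = "secondary"

	Relocate Action = "relocate"
	Failover Action = "failover"
)

// commonStep stands in for code shared by several states/actions.
func commonStep() string { return "shared helper" }

// One hypothetical function per (state, action) pair. Each returns a
// description instead of touching a cluster, so the sketch stays runnable.
func primaryRelocate() string   { return "primary/relocate: " + commonStep() }
func primaryFailover() string   { return "primary/failover: " + commonStep() }
func secondaryRelocate() string { return "secondary/relocate: " + commonStep() }
func secondaryFailover() string { return "secondary/failover: " + commonStep() }

// reconcile selects the handler with an explicit switch, so a reader can see
// which code runs for each state and action without following pointers.
func reconcile(s State, a Action) (string, error) {
	switch {
	case s == Primary && a == Relocate:
		return primaryRelocate(), nil
	case s == Primary && a == Failover:
		return primaryFailover(), nil
	case s == Secondary && a == Relocate:
		return secondaryRelocate(), nil
	case s == Secondary && a == Failover:
		return secondaryFailover(), nil
	}

	return "", fmt.Errorf("unsupported state %q and action %q", s, a)
}

func main() {
	out, err := reconcile(Secondary, Failover)
	fmt.Println(out, err)
}
```

With this shape, searching for a state or action name leads directly to the function that handles it.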

Verify what happens to resources that already exist on the target cluster (are they overwritten or skipped?)
Debug if includeClusterResources == true works
Is a backupRef necessary in a recoverWorkflow? If yes, can we make it not required?

We make 2 backups (0 and 1) and store the capture id we need to recover from in the VRG. Find a way to determine the capture id without using the VRG.
When using recipes, the volumes section or a group section of type "volume" should be able to specify consistency groups.

Write more specific logs with only the relevant information for every backup phase. Currently we log all backup/restore properties on every reconcile.
Don't log huge and unneeded values (e.g. the complete CA base64 value). Huge values make it harder to find the needed details.

Include relevant context (e.g. validationErrors when a backup or restore fails with phase FailedValidation)
Use actual phase names from velero (e.g. FailedValidation) instead of fake names (backupFailedValidation) that are not searchable (the code uses "backup" + string(Phase))
Don't log an error and return it - we want one log with all the context for every error. Minimize the time to find the relevant context when debugging.
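
The logging points above could look roughly like this. It is a sketch under assumptions: `checkPhase` and `reconcile` are hypothetical helpers, and the phase is handled as a plain string rather than a velero type. The phase name is passed through unchanged, the validation errors are attached to the error, and only the top-level caller logs.

```go
package main

import (
	"errors"
	"fmt"
	"log"
	"strings"
)

// checkPhase is a hypothetical helper. It returns nil unless the phase is one
// of the failure phases listed here, and otherwise builds one error that keeps
// the phase string exactly as reported and appends any validation errors.
func checkPhase(operation, name, phase string, validationErrors []string) error {
	switch phase {
	case "FailedValidation", "PartiallyFailed", "Failed":
	default:
		return nil
	}

	msg := fmt.Sprintf("%s %q: phase %s", operation, name, phase)
	if len(validationErrors) > 0 {
		msg += ": " + strings.Join(validationErrors, "; ")
	}

	return errors.New(msg)
}

// reconcile adds context and returns the error without logging it.
func reconcile(name, phase string, validationErrors []string) error {
	if err := checkPhase("backup", name, phase, validationErrors); err != nil {
		return fmt.Errorf("kube object protection: %w", err)
	}

	return nil
}

func main() {
	// The validation message is made up for illustration.
	err := reconcile("example", "FailedValidation", []string{"hypothetical validation message"})
	if err != nil {
		// The only log call: one line carrying all the context.
		log.Printf("reconcile failed: %v", err)
	}
}
```

Because the message contains the unmodified phase name, searching the logs for the phase shown on the velero resource finds the matching line.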

Add a test in integration tests/e2e that can perform backup and restore without the hub

We have "backup" and "restore" (velero), "capture" and "recover", and "protect". Use one term for the same thing inside ramen. Since we already use "restore" (e.g. for PVCs), using "backup" and "restore" seems like the best choice.

raghavendra-talur self-assigned this Aug 27, 2024

asn1809 self-assigned this Oct 23, 2024