Add running_workflows to ScheduleListInfo, for ListSchedules #464

lina-temporal · 2024-10-16T17:44:21Z

What changed?

A new field RunningWorkflows was added to ScheduleListInfo, matching the field in ScheduleInfo.

Why?

Customers would like to be able to drill down into running workflow executions without having to follow a ListSchedule call with fan-outs to DescribeSchedule).

Breaking changes

None here

Server PR

[Scheduling] Add RunningWorkflows to schedule memos in visibility, and in ListSchedules results temporal#6665

dnr · 2024-10-16T22:27:05Z

temporal/api/schedule/v1/message.proto

+    // Running workflows returned here are eventually consistent, and their
+    // status may be out-of-date. For a strongly consistent view of a schedule's
+    // running workflows, use the `DescribeSchedule` API.
+    repeated temporal.api.common.v1.WorkflowExecution running_workflows = 7;


This goes in the memo so I think we should keep it as short as possible.

Part of the request was to include the close state of recent workflows, right? Is that going to be another change?

I'm thinking that for the close state, we can put it in ScheduleActionResult and fill it in when we know it.

For running workflows... since we don't track allowall anymore, there will be 0 or 1 running workflows, and if it's 1, it'll be the most recent action. So instead of repeating the wf id and run id from recent_actions here, we can just use a flag in the ScheduleActionResult for whether it's running or not? But actually the close state enum is just that: if it's not set yet, then it's still running, if it's set to completed/failed/timedout/cancelled/terminated, it's not running.

(If we do track running workflows from allowall, then that's not enough since there could be an old one still running. But we have no plans to do that right now so maybe we can wait until we do.)

Part of the request was to include the close state of recent workflows, right? Is that going to be another change?

Discussed during standup, I'll do this as part of this change.

I'm thinking that for the close state, we can put it in ScheduleActionResult and fill it in when we know it.

Sounds good, will do!

So instead of repeating the wf id and run id from recent_actions here, we can just use a flag in the ScheduleActionResult for whether it's running or not?

Sounds good in terms of state, I'll update. For the API, however, that would make the ListSchedulesResult distinct compared to DescribeSchedule (which has the separate running_workflows field), which doesn't seem great from a UX perspective; what do you think about populating a running_workflows field in API responses based on the flag in our mutable state memo's ScheduleActionResult?

For the reverse question, the presence of the new field in Describe: If we just add the extra field to the action result and fill it in in the state, it'll automatically come out in Describe (query) as well as List, and I think that's good, it's slightly more information than is there now (and doesn't need any versioning in the workflow).

For your actual question, about a running_workflows list in List: I'm concerned that we won't always have enough information to fill it in. E.g. if we use the state machine impl and have it track a limited number of concurrent workflows that aren't the latest (from AllowAll), and supply them in Describe, that's great, but we might not be able to fit them in the memo (state machines don't help there at all), so we'd still be inconsistent. Except then also misleading.

Or we might put a bunch in the memo, but only as int timestamps (to save space) and then reconstruct the workflow ids. But we wouldn't have the run ids.

So I'm leaning towards no, just the enum in recents in list results.

For running workflows... since we don't track allowall anymore, there will be 0 or 1 running workflows, and if it's 1, it'll be the most recent action.

For the state machine rewrite this may not be true anymore but the current structure would still work for this, we'd just need to ensure we limit the tracked action count.

This all should have probably gone in a oneof to prepare for when we have other scheduled actions and for consistency with ScheduleAction but that may be too late.

Here's how I think it should have been done:

message ScheduleActionResult { // Time that the action was taken (according to the schedule, including jitter). google.protobuf.Timestamp schedule_time = 1; // Time that the action was taken (real time). google.protobuf.Timestamp actual_time = 2; message Workflow { temporal.api.common.v1.WorkflowExecution start_workflow_result = 1; temporal.api.enums.v1.WorkflowExecutionStatus status = 2; } oneof variant { Workflow workflow = 3; } }

@bergundy I like the structure suggestion. I think it's probably too late for the existing API, but maybe we could consider introducing a V2 API when we support different schedule action types?

Works for me. I approved the PR already.

Agreed the oneof is cleaner, maybe it's not too late though... we could do:

message ScheduleActionResult { // Time that the action was taken (according to the schedule, including jitter). google.protobuf.Timestamp schedule_time = 1; // Time that the action was taken (real time). google.protobuf.Timestamp actual_time = 2; message WorkflowExecutionWithStatus { // superset of common.WorkflowExecution string workflow_id = 1; string run_id = 2; temporal.api.enums.v1.WorkflowExecutionStatus status = 3; } oneof variant { // If action was start_workflow: WorkflowExecutionWithStatus workflow = 11; } }

That's fully backwards compatible at the proto level, though source code will have to change.

This reverts commit afe7d04.

cretz

I like this latest solution of just adding the status much better

dnr · 2024-10-24T23:41:00Z

temporal/api/schedule/v1/message.proto

+    // Running workflows returned here are eventually consistent, and their
+    // status may be out-of-date. For a strongly consistent view of a schedule's
+    // running workflows, use the `DescribeSchedule` API.
+    repeated temporal.api.common.v1.WorkflowExecution running_workflows = 7;


Agreed the oneof is cleaner, maybe it's not too late though... we could do:

message ScheduleActionResult { // Time that the action was taken (according to the schedule, including jitter). google.protobuf.Timestamp schedule_time = 1; // Time that the action was taken (real time). google.protobuf.Timestamp actual_time = 2; message WorkflowExecutionWithStatus { // superset of common.WorkflowExecution string workflow_id = 1; string run_id = 2; temporal.api.enums.v1.WorkflowExecutionStatus status = 3; } oneof variant { // If action was start_workflow: WorkflowExecutionWithStatus workflow = 11; } }

That's fully backwards compatible at the proto level, though source code will have to change.

dnr · 2024-10-24T23:43:04Z

temporal/api/schedule/v1/message.proto

+
+    // If the action was start_workflow, this field will reflect an
+    // eventually-consistent view of the started workflow's status.
+    temporal.api.enums.v1.WorkflowExecutionStatus status = 12;


if we do with this extra field and not an embedded message, we should name it workflow_status or started_workflow_status so it's clear it's just for workflows, if there are more action types in the future.

Add running_workflows to ScheduleListInfo, for ListSchedules

afe7d04

lina-temporal requested review from a team as code owners October 16, 2024 17:44

lina-temporal mentioned this pull request Oct 16, 2024

[Scheduling] Add RunningWorkflows to schedule memos in visibility, and in ListSchedules results temporalio/temporal#6665

Open

Quinn-With-Two-Ns approved these changes Oct 16, 2024

View reviewed changes

dnr reviewed Oct 16, 2024

View reviewed changes

lina-temporal added 2 commits October 23, 2024 10:28

Revert "Add running_workflows to ScheduleListInfo, for ListSchedules"

98e4a92

This reverts commit afe7d04.

Add status to ScheduleActionResult/recent_actions

0e3447a

cretz approved these changes Oct 24, 2024

View reviewed changes

bergundy approved these changes Oct 24, 2024

View reviewed changes

dnr reviewed Oct 24, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add running_workflows to ScheduleListInfo, for ListSchedules #464

Add running_workflows to ScheduleListInfo, for ListSchedules #464

lina-temporal commented Oct 16, 2024 •

edited

Loading

dnr Oct 16, 2024

lina-temporal Oct 21, 2024

dnr Oct 22, 2024

bergundy Oct 24, 2024

lina-temporal Oct 24, 2024

bergundy Oct 24, 2024

dnr Oct 24, 2024

cretz left a comment •

edited

Loading

dnr Oct 24, 2024

dnr Oct 24, 2024

Add running_workflows to ScheduleListInfo, for ListSchedules #464

Are you sure you want to change the base?

Add running_workflows to ScheduleListInfo, for ListSchedules #464

Conversation

lina-temporal commented Oct 16, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cretz left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lina-temporal commented Oct 16, 2024 •

edited

Loading

cretz left a comment •

edited

Loading