-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: support scalar resources metric #3937
base: master
Are you sure you want to change the base?
feat: support scalar resources metric #3937
Conversation
Signed-off-by: zedongh <[email protected]>
ScalerResources is used for extension resources. Keep CPU, and memory not to merge them into scalerResources are fine. |
/lgtm |
/area stable-metrics |
@JesseStutler: The label(s) In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/lgtm |
how about capacity ? such as : volcano_queue_capacity_cpu |
cc @Monokaix |
You mean add the capability of each dimension resource? The cpu and memory metrics is already exposed and we cannot modify its metric name for compatibility: ) |
there is no metrics about capacity in both vc-sheduler and vc-controller. any other place ? |
https://github.com/volcano-sh/volcano/blob/master/docs/design/metrics.md |
Plesse also update doc https://github.com/volcano-sh/volcano/blob/master/docs/design/metrics.md and volcano-monitoring-development.yaml to include the new metrics: ) |
Any test results will be more appreciated:
|
Signed-off-by: zedongh <[email protected]>
Signed-off-by: zedongh <[email protected]>
New changes are detected. LGTM label has been removed. |
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
some changes update:
|
And
Update The original cpu/memeory metrics are not present in dashboard. Update later if necessary. |
We also need export new metrics to describe each resource's capacity of a queue. scene:
The proposal has been agreed by @JesseStutler . Can you implement this feature? @zedongh |
/ok-to-test |
If its not present in |
@yccharles Your capacity should mean the set value of Guarantee/Deserved/Capability, right? |
New commit for queue capacity metric. @yccharles |
@yccharles: changing LGTM is restricted to collaborators In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
metrics.UpdateQueueDeserved(attr.name, attr.deserved.MilliCPU, attr.deserved.Memory, attr.deserved.ScalarResources) | ||
metrics.UpdateQueueAllocated(attr.name, attr.allocated.MilliCPU, attr.allocated.Memory, attr.allocated.ScalarResources) | ||
metrics.UpdateQueueRequest(attr.name, attr.request.MilliCPU, attr.request.Memory, attr.request.ScalarResources) | ||
metrics.UpdateQueueCapacity(attr.name, attr.realCapability.MilliCPU, attr.realCapability.Memory, attr.realCapability.ScalarResources) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The capacity you ask is capability here? @yccharles I misunderstood the meaning of capacity before and wanted to confirm it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for the reminder.
- what i want is queue's capacity . not realCapability.xxx
- but, if realCapability.xxx can be exported is really a good idea. such as : queue_real_capability_xxx ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for the reminder.
- what i want is queue's capacity . not realCapability.xxx
- but, if realCapability.xxx can be exported is really a good idea. such as : queue_real_capability_xxx ?
updated with support both metrics
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@yccharles capacity
refers to the capability value set by the user, right? Because there are too many words now, I am afraid that capacity
and capability
will be confused, we need to be clear. Besides, do we also need the deserved
and guarantee
values set by the user?
docs/design/metrics.md
Outdated
| `unschedule_job_counts` | Gauge | None | The number of jobs could not be scheduled | | ||
| `queue_allocated_milli_cpu` | Gauge | `queue_name`=<queue_name> | Allocated CPU count for one queue | | ||
| `queue_allocated_memory_bytes` | Gauge | `queue_name`=<queue_name> | Allocated memory for one queue | | ||
| `queue_allocated_scalar_resources` | Gauge | `queue_name`=<queue_name>, `resource`=<resource_name> | Allocated scalar resource for one queue | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also add queue_capacity_xxx metrics here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
updated with queue_capacity_xxx
and queue_real_capacity_xxx
metric docs
Signed-off-by: zedongh <[email protected]>
5f58a33
to
c8260b0
Compare
please squash commits to only one. |
capacity metric part need disscuss. Split it to another issue to make all things done? |
Add metric
/close #3931
Need discuss:
Shall we merge
cpu
,memory
bulitin resources into scalar resource ?