TMA 4.8 Release #181

calebbiggers · 2024-05-10T20:37:37Z

Updated TMA metrics to 4.8
Added client platform metrics

- Updated TMA metrics to 4.8 - Added client platform metrics

captain5050 · 2024-05-10T21:12:06Z

SKX/metrics/perf/skylakex_metrics_perf.json

        "MetricName": "tma_frontend_bound",
-        "ScaleUnit": "100%"
+        "ScaleUnit": "100%",
+        "Threshold": "tma_frontend_bound > 15"


Here is the existing perf version:

{ "BriefDescription": "This category represents fraction of slots where the processor's Fronte nd undersupplies its Backend", "MetricExpr": "IDQ_UOPS_NOT_DELIVERED.CORE / tma_info_thread_slots", "MetricGroup": "PGO;TmaL1;TopdownL1;tma_L1_group", "MetricName": "tma_frontend_bound", "MetricThreshold": "tma_frontend_bound > 0.15", "MetricgroupNoGroup": "TopdownL1", "PublicDescription": "This category represents fraction of slots where the processor's Frontend undersupplies its Backend. Frontend denotes the first part of the processor core responsible to fetch operations that are executed later on by the Backend part. Within the Frontend; a branch predictor predicts the next address to fetch; cache-lines are fetched from the memory subsystem; parsed into instructions; and lastly decoded into micro-operations (uops). Ideally the Frontend can issue Pipeline_Width uops every cycle to the Backend. Frontend Bound denotes unutilized issue-slots when there is no Backend stall; i.e. bubbles where Frontend delivered no uops while Backend could have accepted them. For example; stalls due to instruction-cache misses would be categorized under Frontend Bound. Sample with: FRONTEND_RETIRED.LATENCY_GE_4_PS", "ScaleUnit": "100%" },

It looks like Threshold should be MetricThreshold.

slots rather than tma_info_slots

In the description Sample with: FRONTEND_RETIRED.LATENCY_GE_4_PS isn't present.

I've renamed the "Threshold" field to "MetricThreshold"

captain5050 · 2024-05-10T21:15:51Z

SKX/metrics/perf/skylakex_metrics_perf.json

-        "MetricExpr": "( ( BR_MISP_RETIRED.ALL_BRANCHES / ( BR_MISP_RETIRED.ALL_BRANCHES + MACHINE_CLEARS.COUNT ) ) * ( ( UOPS_ISSUED.ANY - ( UOPS_RETIRED.RETIRE_SLOTS ) + ( 4 ) * ( ( INT_MISC.RECOVERY_CYCLES_ANY / 2 ) if #SMT_on else INT_MISC.RECOVERY_CYCLES ) ) / ( ( 4 ) * ( ( CPU_CLK_UNHALTED.THREAD_ANY / 2 ) if #SMT_on else ( CPU_CLK_UNHALTED.THREAD ) ) ) ) )",
-        "MetricGroup": "BadSpec;BrMispredicts;TmaL2;TopdownL2;tma_L2_group;tma_bad_speculation_group",
+        "MetricExpr": "( BR_MISP_RETIRED.ALL_BRANCHES / ( BR_MISP_RETIRED.ALL_BRANCHES + MACHINE_CLEARS.COUNT ) ) * tma_bad_speculation",
+        "MetricGroup": "BadSpec;BrMispredicts;BvMP;TmaL2;TopdownL2;tma_L2_group;tma_bad_speculation_group;Slots",


In the existing generated from a spreadsheet perf version the MetricGroup here is:

"MetricGroup": "BadSpec;BrMispredicts;TmaL2;TopdownL2;tma_L2_group;tma_bad_speculation_group;tma_issueBM"

tma_issueBM in particular is missing. The issue groups come from the threshold column.

I've added the issues to the "MetricGroup" field

- Renamed "Threshold" field to "MetricThreshold" - Added issues to "MetricGroup" field - Edited incorrect event names - Updated EMR metrics

weilinwa · 2024-05-13T23:55:28Z

CLX/metrics/perf/cascadelakex_metrics_perf.json

        "MetricName": "tma_dtlb_load",
        "ScaleUnit": "100%",
-        "Threshold": "tma_dtlb_load > 10 && tma_l1_bound > 10 && tma_memory_bound > 20 && tma_backend_bound > 20"
+        "MetricThreshold": "tma_dtlb_load > 10 && tma_l1_bound > 10 && tma_memory_bound > 20 && tma_backend_bound > 20"


@calebbiggers, current perf JSON uses & instead of &&.

captain5050 · 2024-05-15T16:59:12Z

BDW/metrics/perf/broadwell_metrics_perf.json

+        "Threshold": "tma_frontend_bound > 15"
+    },
+    {
+        "BriefDescription": "This metric represents fraction of slots the CPU was stalled due to Frontend latency issues.  For example; instruction-cache misses; iTLB misses or fetch stalls after a branch misprediction are categorized under Frontend Latency. In such cases; the Frontend eventually delivers no uops for some period.",


The existing definition has "Sample with":

"PublicDescription": "This metric represents fraction of slots the CPU was stalled due to Frontend latency issues. For example; instruction-cache misses; iTLB misses or fetch stalls after a branch misprediction are categorized under Frontend Latency. In such cases; the Frontend eventually delivers no uops for some period. Sample with: RS_EVENTS.EMPTY_END",

https://git.kernel.org/pub/scm/linux/kernel/git/perf/perf-tools-next.git/tree/tools/perf/pmu-events/arch/x86/broadwell/bdw-metrics.json#n281

Which is pulled from the "Locate-with" column here:
https://github.com/intel/perfmon/blob/main/scripts/create_perf_json.py#L836

Could we add this to avoid a regression?

captain5050 · 2024-05-15T17:03:14Z

BDW/metrics/perf/broadwell_metrics_perf.json

+        "Threshold": "tma_dsb_switches > 5 && tma_fetch_latency > 10 && tma_frontend_bound > 15"
+    },
+    {
+        "BriefDescription": "This metric represents fraction of slots the CPU was stalled due to Frontend bandwidth issues.  For example; inefficiencies at the instruction decoders; or restrictions for caching in the DSB (decoded uops cache) are categorized under Fetch Bandwidth. In such cases; the Frontend typically delivers suboptimal amount of uops to the Backend.",


This existing description has "Related metrics":

"PublicDescription": "This metric represents fraction of slots the CPU was stalled due to Frontend bandwidth issues. For example; inefficiencies at the instruction decoders; or restrictions for caching in the DSB (decoded uops cache) are categorized under Fetch Bandwidth. In such cases; the Frontend typically delivers suboptimal amount of uops to the Backend. Related metrics: tma_dsb_switches, tma_info_frontend_dsb_coverage, tma_info_inst_mix_iptb, tma_lcp"

https://git.kernel.org/pub/scm/linux/kernel/git/perf/perf-tools-next.git/tree/tools/perf/pmu-events/arch/x86/broadwell/bdw-metrics.json#n271

That has come from the issues in the threshold column:
https://github.com/intel/perfmon/blob/main/scripts/create_perf_json.py#L1321

Could we add this to avoid a regression?

TMA 4.8 Release

d54c847

- Updated TMA metrics to 4.8 - Added client platform metrics

calebbiggers requested review from 1perrytaylor and kshiprab as code owners May 10, 2024 20:37

calebbiggers added 2 commits May 10, 2024 13:52

Updated PMEM metrics on ICX

bd590c5

Updated PMEM metrics on CLX

927c8a7

captain5050 reviewed May 10, 2024

View reviewed changes

calebbiggers added 2 commits May 10, 2024 15:50

Added perf pkg power event

86c2858

Event updates on ICL & RKL

e7a0e8a

weilinwa mentioned this pull request May 11, 2024

TMA 4.8 perf converted JSON files #182

Merged

Various fixes

fcaa020

- Renamed "Threshold" field to "MetricThreshold" - Added issues to "MetricGroup" field - Edited incorrect event names - Updated EMR metrics

weilinwa reviewed May 13, 2024

View reviewed changes

calebbiggers added 4 commits May 14, 2024 13:32

Fixed slots event usage

4765c72

Fixed duplicate ampersands

bfbfeac

Fixed duplicate operators

d04d4e8

Threshold fixes

e269c6b

1perrytaylor requested a review from captain5050 May 15, 2024 16:48

captain5050 reviewed May 15, 2024

View reviewed changes

Merge branch 'main' into TMA-4.8-Release

a42292f

1perrytaylor approved these changes May 15, 2024

View reviewed changes

calebbiggers added 3 commits May 15, 2024 13:03

TMA naming update for ICL & RKL

d22fc0e

Fix for missing SKX event options

ae7bb86

Fix for events missing options

154f96d

kshiprab approved these changes May 21, 2024

View reviewed changes

calebbiggers merged commit b7d8c00 into main May 21, 2024
7 checks passed

calebbiggers deleted the TMA-4.8-Release branch May 21, 2024 15:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TMA 4.8 Release #181

TMA 4.8 Release #181

calebbiggers commented May 10, 2024

captain5050 May 10, 2024

calebbiggers May 13, 2024

captain5050 May 10, 2024

calebbiggers May 13, 2024

weilinwa May 13, 2024

captain5050 May 15, 2024

captain5050 May 15, 2024

TMA 4.8 Release #181

TMA 4.8 Release #181

Conversation

calebbiggers commented May 10, 2024

captain5050 May 10, 2024

Choose a reason for hiding this comment

calebbiggers May 13, 2024

Choose a reason for hiding this comment

captain5050 May 10, 2024

Choose a reason for hiding this comment

calebbiggers May 13, 2024

Choose a reason for hiding this comment

weilinwa May 13, 2024

Choose a reason for hiding this comment

captain5050 May 15, 2024

Choose a reason for hiding this comment

captain5050 May 15, 2024

Choose a reason for hiding this comment