ci/transformers: add baseline checks for test cases #1269

dvrogozh · 2025-01-09T02:44:07Z

Baseline is set in a python script. Script looks into the results and categorizes test cases as:

New failures - if detected workflow is marked as failed
New passes - if detected workflow is marked as failed (since baseline requires an update)
Skipped flakies - if detected workflow is marked as failed (since baseline requires an update)
Known failures - these are failing tests knowing to fail per baseline, workflow is marked as passed (if nothing from above detected)

Test cases from above categories are printed into summary.

Signed-off-by: Dmitry Rogozhkin <[email protected]>

chuanqi129

LGTM

chuanqi129 · 2025-01-13T04:27:36Z

.github/scripts/check-transformers.py

+failing_cases = {
+    'tests.benchmark.test_benchmark.BenchmarkTest': {
+        'test_inference_encoder_decoder_with_configs': None,
+        'test_inference_fp16': None,


BTW, what does the "None" mean here? Do we need to root cause and fix those test cases?

Yes, these are all test cases which fail and which need to be root caused and fixed. We require that no more tests should fail on top of this list - in such a case workload should fail.

None here is in a place for the dictionary to pass additional information about failing tests. In the simplest case, we don't pass anything, i.e. we pass None, since test name in this list by itself signifies that test fails. However, in some cases we need to mark that test is flaky. That's where this placeholder comes handy:

failing_cases = { 'tests.models.detr.test_image_processing_detr.DetrImageProcessingTest': { 'test_fast_is_faster_than_slow': { 'flaky': True }, ...

I use this already for few flaky tests in the middle of the list. See lines 48-57.

Actually we can further expand this idea if needed and pass for example links to the known bugs or PRs associated with the failing case and further print this in the result table.

dvrogozh force-pushed the transformers2 branch 5 times, most recently from 1b7ce58 to aa79416 Compare January 9, 2025 23:07

dvrogozh mentioned this pull request Jan 10, 2025

ci/transformers: add baseline checks for test cases #1207

Closed

dvrogozh marked this pull request as ready for review January 10, 2025 01:46

dvrogozh requested review from chuanqi129 and RUIJIEZHONG66166 January 10, 2025 01:46

dvrogozh force-pushed the transformers2 branch from aa79416 to 0861a88 Compare January 12, 2025 16:09

ci/transformers: add baseline checks for test cases

0d79274

Signed-off-by: Dmitry Rogozhkin <[email protected]>

dvrogozh force-pushed the transformers2 branch from 0861a88 to 0d79274 Compare January 12, 2025 19:00

chuanqi129 approved these changes Jan 13, 2025

View reviewed changes

chuanqi129 reviewed Jan 13, 2025

View reviewed changes

dvrogozh added this pull request to the merge queue Jan 13, 2025

Merged via the queue into intel:main with commit b2560ac Jan 13, 2025
2 of 5 checks passed

dvrogozh deleted the transformers2 branch January 13, 2025 15:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci/transformers: add baseline checks for test cases #1269

ci/transformers: add baseline checks for test cases #1269

dvrogozh commented Jan 9, 2025 •

edited

Loading

chuanqi129 left a comment

chuanqi129 Jan 13, 2025 •

edited

Loading

dvrogozh Jan 13, 2025

ci/transformers: add baseline checks for test cases #1269

ci/transformers: add baseline checks for test cases #1269

Conversation

dvrogozh commented Jan 9, 2025 • edited Loading

chuanqi129 left a comment

Choose a reason for hiding this comment

chuanqi129 Jan 13, 2025 • edited Loading

Choose a reason for hiding this comment

dvrogozh Jan 13, 2025

Choose a reason for hiding this comment

dvrogozh commented Jan 9, 2025 •

edited

Loading

chuanqi129 Jan 13, 2025 •

edited

Loading