Self-test script rewrite #232

Luc45 · 2024-12-23T23:57:59Z

Background & Motivation

The motivation for this PR came after I wasted a couple of hours while doing the sync with 9.6.0-beta.1. One of the self-tests didn't start, and I only found out about this 2h later. Debugging why it failed was much harder than it should be, because the self-test script would just dispatch 20 tests in parallel, keep them all alive with --wait and --json, where each would poll their own test status individually.

This PR greatly simplifies this setup, we dispatch the tests async and collect their test run IDs, then we poll their status ourselves with one request to the newly introduced get-multiple endpoint, which takes a comma-separated list of test run IDs.

This made it possible to also improve the output, and added extensive logging to all requests to last-self-test.log, that gets reset with each self test.

What Changed: Before vs. After

Before

Multiple Parallel Processes: We would start many processes, each:
1. Dispatching its QIT test.
2. Polling individually for status until completion.
3. Reporting success or failure.
Drawbacks:
- Hard to coordinate and debug: each process produced its own logs and status updates.
- Risk of wasted time: if one test fails to run, it would be very hard to know why.
- More overhead managing N distinct polling loops concurrently.

After

Asynchronous Dispatch
- All tests are dispatched quickly (no blocking).
- We gather every test_run_id upfront.
Unified Polling Loop
- Rather than multiple pollers, a single loop calls qit get-multiple for all test_run_ids in one go.
- This repeats every X seconds until each test reports a final status.
Snapshot Verification
- As soon as a test finishes, its JSON result is stored and checked against the PHPUnit snapshot.
- Any mismatch shows up right away (or updates, if in update mode).

Implementation Details

QitRunner.php:
- Dispatches all tests asynchronously and records their IDs.
- Repeatedly calls qit get-multiple until all have a final result.
QITLiveOutput.php:
- Adjusted to show a unified table of test statuses during each poll iteration.
PhpUnitRunner.php:
- Receives completed test data and checks it against snapshots.
Documentation:
- Updated to explain the single polling flow (instead of many parallel pollers).

Testing instructions

Run the self-tests as usual, see how it goes

…2/get-multiple"

zhongruige · 2024-12-24T02:26:46Z

I'll dig into this a bit more tomorrow and run the tests but so far loving these changes!

Luc45 · 2024-12-24T02:36:26Z

No worries! Ignore if you see some activity in GH, I'm just taking the opportunity to run some self-tests and scheduling the PC to auto-shutdown in a couple of hours.

_tests/managed_tests/README.md

zhongruige · 2024-12-24T19:16:54Z

Went through the self tests (php QITSelfTests.php) and it batched and incremented as it was running:

However, I did hit a timeout and it eventually exited:

Wanted to just double check on the above behavior.

Luc45 · 2024-12-26T23:21:48Z

Wanted to just double check on the above behavior.

Nice catch, I've limitted the get-multiple endpoint to 20 tests per request on the backend, and updated the logic here to:

Stop requesting tests that are complete
Batch polling in chunks of 20 tests, instead of trying to fetch them all at once

I was able to run a mass test with all tests at once, so it seems to have solved it.

zhongruige · 2024-12-27T16:00:56Z

Re-ran this again and just finished (ran this twice just to see) and got this result both times:

Would we expect these all to pass or is this expected as we have some no-op tests we might expect to fail?

Luc45 · 2024-12-27T16:24:32Z

It should pass on the 9.6.0-beta.1 sync branch

zhongruige · 2024-12-27T18:26:34Z

Weird I got the same results on that branch @Luc45:

All snapshots have been verified.

Some snapshots still failed. Final outcome: ❌

For more details, see last-self-test.log.
➜  managed_tests git:(24-12/sync-960)

Not sure, could it be due to the PHP version I'm using?

zhongruige · 2024-12-27T19:47:34Z

Perfect that fixed it, thanks @Luc45!

Luc45 added 30 commits December 18, 2024 01:05

Refactor how we run self-tests

323c579

Improvements + debug log

33fce4d

typo

7626d25

15

2ed9ba5

Break down self-test into smaller classes

504f48b

Reduce log verbosity

6e41723

Add a way to reuse JSON to make it easier to debug phpunit tests

55f17b9

Tweaks to output/verbosity and fix incorrect path

d1885dc

Improve output

1b91d40

Re-add missing arg

310c390

Improve output

75a7c1a

Add maybe_echo function

8ee86a2

maybe_echo

277b1f8

maybe_echo

cf0a7d6

Always print final summary in CI

f95e0a3

Minor tweak

9c05eb8

Output improvement

25ccc9d

Improve output/show snapshot diff

e2215ab

Test snapshot

e1f616e

Add more entries to known strings

e18c55b

Preserve phpunit output until summary

8cb1611

Output tweaks

be214fc

Update tweaks

8e54394

Update snapshots

c8b5508

Tweak

907517c

Tweak to run woo-e2e

d4c6e1a

Throw if fail

438eba3

Merge branch '24-12/get-multiple' into 24-12/self-test-script-rewrite

bd36471

Remove parallel processes

bc054c3

Update README

fc3074a

Update log file name

30abd17

Luc45 self-assigned this Dec 24, 2024

Luc45 requested a review from a team December 24, 2024 01:51

Luc45 marked this pull request as ready for review December 24, 2024 01:51

Luc45 added 3 commits December 23, 2024 23:17

Merge branch '24-12/get-multiple' into 24-12/self-test-script-rewrite

8021085

Auto stash before merge of "24-12/self-test-script-rewrite" and "24-1…

a499ef2

…2/get-multiple"

build

fc67c4e

Luc45 added 2 commits December 23, 2024 23:30

Add known line pattern to output to be ignored

8c5cfe2

Merge branch 'trunk' into 24-12/self-test-script-rewrite

3f4c6e0

zhongruige reviewed Dec 24, 2024

View reviewed changes

_tests/managed_tests/README.md Outdated Show resolved Hide resolved

zhongruige reviewed Dec 24, 2024

View reviewed changes

_tests/managed_tests/README.md Outdated Show resolved Hide resolved

Luc45 added 2 commits December 26, 2024 15:57

Fetch in chunks of 20

319d674

Add comments, use time-based timeout instead of attempt-based, etc

6f0cb75

Luc45 requested a review from zhongruige December 26, 2024 23:29

Remove reference to directory in README.md

808061a

Luc45 mentioned this pull request Dec 27, 2024

Sync 9.6.0 snapshots #234

Merged

zhongruige approved these changes Dec 27, 2024

View reviewed changes

Luc45 merged commit e0d15a1 into trunk Dec 27, 2024

Luc45 deleted the 24-12/self-test-script-rewrite branch December 27, 2024 19:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Self-test script rewrite #232

Self-test script rewrite #232

Luc45 commented Dec 23, 2024 •

edited

Loading

zhongruige commented Dec 24, 2024

Luc45 commented Dec 24, 2024

zhongruige commented Dec 24, 2024

Luc45 commented Dec 26, 2024

zhongruige commented Dec 27, 2024

Luc45 commented Dec 27, 2024

zhongruige commented Dec 27, 2024

zhongruige commented Dec 27, 2024

Self-test script rewrite #232

Self-test script rewrite #232

Conversation

Luc45 commented Dec 23, 2024 • edited Loading

Background & Motivation

What Changed: Before vs. After

Before

After

Implementation Details

Testing instructions

zhongruige commented Dec 24, 2024

Luc45 commented Dec 24, 2024

zhongruige commented Dec 24, 2024

Luc45 commented Dec 26, 2024

zhongruige commented Dec 27, 2024

Luc45 commented Dec 27, 2024

zhongruige commented Dec 27, 2024

zhongruige commented Dec 27, 2024

Luc45 commented Dec 23, 2024 •

edited

Loading