
Force flake8 to use the single-threaded pool. #505

Draft
wants to merge 1 commit into base: rolling

Conversation

clalancette (Contributor)

The comment in the code has more information about why we want to do this.

This is a draft because a) I'm not 100% sure this fixes the issue, and b) I'm not sure what this does to our CI times. @fujitatomoya FYI.
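
For context on the flag being added: flake8's `-j`/`--jobs` option controls how many subprocesses its checkers run in, and `--jobs=1` (spelled `-j=1` in this PR) keeps everything in a single process. A hedged illustration of the equivalent command-line effect follows; the path is a placeholder, and ament_flake8 itself builds an argv list in-process rather than shelling out like this:

```python
# Hedged illustration of the equivalent CLI effect of '-j=1': run flake8
# with its multiprocessing pool disabled. 'src/my_package' is a placeholder
# path, not something from this PR.
import subprocess
import sys

result = subprocess.run(
    [sys.executable, '-m', 'flake8', '--jobs=1', 'src/my_package'],
    check=False,
)
print('flake8 exit code:', result.returncode)
```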

Commit message:

The comment in the code has more information about why we want to do this.

Signed-off-by: Chris Lalancette <[email protected]>
@clalancette (Contributor Author)

clalancette commented Oct 17, 2024

CI (of everything):

  • Linux Build Status
  • Linux-aarch64 Build Status
  • Linux-rhel Build Status
  • Windows Build Status

@fujitatomoya left a comment

👍 Thanks, I will keep an eye on this!

  • Linux-rhel Debug Build Status

@fujitatomoya left a comment

I will give an official LGTM because this turns everything green!

@clalancette (Contributor Author)

So it does indeed seem to make everything green, which is a good start.

In terms of job times, it looks like all jobs except one took the same amount of time as, or less than, their nightly counterparts from last night. The one exception was aarch64, which took 1 hr 34 min on the nightly but 1 hr 55 min here. I'm not sure why that is different.

That said, the jobs run here do not have a 1-1 correspondence with the nightly jobs. The nightly jobs explicitly use either "Release" or "Debug", while the CI jobs use "None" (which is different from both of those). So what I'm going to do here is run another set of jobs, all in Release mode. We'll see how that compares in terms of time.

  • Linux Build Status
  • Linux-aarch64 Build Status
  • Linux-rhel Build Status
  • Windows Build Status

@clalancette (Contributor Author)

Hm, so the results on the Release jobs show that this is slower. Note that in order to compare apples to apples, I took the total time for each job, subtracted the time until we reached the "Run Dockerfile" step, and used the result as the numbers below (the nightly jobs are otherwise at a disadvantage, since they always start from a fresh container; a small sketch of this adjustment follows the table):

| Job | Nightly time | This PR time | Delta |
| --- | --- | --- | --- |
| Linux | 2 hr 33 min | 2 hr 34 min | +1 min |
| Linux aarch64 | 1 hr 29 min | 1 hr 57 min | +28 min |
| RHEL | 2 hr 14 min | 2 hr 18 min | +4 min |
| Windows | 4 hr 28 min | 4 hr 36 min | +8 min |
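
A minimal sketch of that adjustment, with placeholder durations rather than measured ones (the real values come from the Jenkins job timelines):

```python
# Minimal sketch of the apples-to-apples adjustment described above.
# The timedelta values are placeholders, not measurements; in practice
# they are read off each Jenkins job's timeline.
from datetime import timedelta

def adjusted(total: timedelta, until_run_dockerfile: timedelta) -> timedelta:
    """Drop setup time before the 'Run Dockerfile' step so fresh-container
    nightlies are not penalized relative to PR jobs."""
    return total - until_run_dockerfile

nightly = adjusted(timedelta(hours=1, minutes=45), timedelta(minutes=16))
this_pr = adjusted(timedelta(hours=2, minutes=5), timedelta(minutes=8))
print('delta:', this_pr - nightly)  # positive means this PR is slower
```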

So it is clear that this change has something of an impact on CI times, particularly on aarch64. However, aarch64 is the job that can most afford a CI time regression, since the workers aren't as heavily used as the amd64 workers, and it isn't ridiculously long like Windows is.

This one is a tough call to make. My personal opinion is that we should take the CI time hit in favor of making RHEL consistently pass CI. But pinging @cottsay @nuclearsandwich @Crola1702 @claraberendsen @ament/team for thoughts.

Comment on lines +267 to +273
```python
# We've seen some problems, especially on RHEL-9, where using the multi-threaded
# pool in flake8 can cause the Python interpreter to crash. Force flake8 to use
# the single-threaded pool here. This has some performance implications for
# large codebases, but given the distributed nature of ROS packages this shouldn't
# generally be a large problem.
flake8_argv.append('-j=1')
```


@clalancette just an idea: can we do something like this? We have never seen this issue on other platforms.

Suggested change:

```diff
-# We've seen some problems, especially on RHEL-9, where using the multi-threaded
-# pool in flake8 can cause the Python interpreter to crash. Force flake8 to use
-# the single-threaded pool here. This has some performance implications for
-# large codebases, but given the distributed nature of ROS packages this shouldn't
-# generally be a large problem.
-flake8_argv.append('-j=1')
+import platform
+if platform.system() == "Linux" and "el9" in platform.version():
+    # We've seen some problems, especially on RHEL-9, where using the multi-threaded
+    # pool in flake8 can cause the Python interpreter to crash. Force flake8 to use
+    # the single-threaded pool here. This has some performance implications for
+    # large codebases, but given the distributed nature of ROS packages this shouldn't
+    # generally be a large problem.
+    flake8_argv.append('-j=1')
```
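
As a standalone sanity check of the condition above (this keeps the suggestion's assumption that `platform.version()` on RHEL 9 reports a kernel build string containing "el9", which is worth confirming on a real RHEL 9 host):

```python
# Standalone sketch of the platform gate from the suggestion above.
# Assumption carried over from the suggestion: on RHEL 9 the kernel build
# string returned by platform.version() contains 'el9'.
import platform

def is_el9() -> bool:
    return platform.system() == 'Linux' and 'el9' in platform.version()

if __name__ == '__main__':
    print('platform.version():', platform.version())
    print('would force single-threaded flake8:', is_el9())
```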

@cottsay (Contributor)

cottsay commented Oct 18, 2024

Have there been any attempts to force a different version of flake8 on RHEL 9? Or to force a Noble build to use the same flake8 that RHEL 9 currently has?

It would be good to understand if this is a problem with the interpreter, some particular version(s) of flake8, a dependency thereof, or something else entirely.
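
One hedged way to start answering that is to capture the interpreter and linter-stack versions on each platform, so the crashing combination can be pinned down. A diagnostic sketch, not part of this PR (pycodestyle, pyflakes, and mccabe are flake8's core dependencies):

```python
# Diagnostic sketch: print interpreter, platform, and flake8-stack versions
# so the failing combination can be compared across RHEL 9, Noble, etc.
import platform
import sys
from importlib.metadata import PackageNotFoundError, version

print('python  :', sys.version.split()[0])
print('platform:', platform.platform())
for pkg in ('flake8', 'pycodestyle', 'pyflakes', 'mccabe'):
    try:
        print(f'{pkg:<11}:', version(pkg))
    except PackageNotFoundError:
        print(f'{pkg:<11}: not installed')
```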
