From a conversation with @janfb.

Unlike microbenchmarks, whose results become less accurate when parallelized due to resource contention, there is no reason not to run an ML benchmark suite in parallel. We should give an example of how to do that in the docs, or even make it the default (think something like `make -j $N`).

Maybe we can start with a single MP backend, such as `multiprocessing` or `joblib`.

Bonus: It could be wise to restructure (read: drop) the `BenchmarkRunner` before this, and instead expose the collection and run loop APIs as stateless functions, which would then be fairly easy to parallelize over.
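To make the idea concrete, here is a minimal sketch of what such a stateless, process-parallel run loop could look like. `collect_benchmarks`, `run_benchmark`, and `run_parallel` are hypothetical stand-ins rather than existing API, and the placeholder workload just sleeps so the example is self-contained.

```python
import time
from concurrent.futures import ProcessPoolExecutor


def collect_benchmarks() -> list[str]:
    # Hypothetical collection step: in practice this would discover benchmark
    # functions; here it just returns a fixed set of names.
    return ["accuracy", "latency", "memory"]


def run_benchmark(name: str) -> dict:
    # Hypothetical run step: a placeholder workload standing in for one benchmark.
    start = time.perf_counter()
    time.sleep(0.1)
    return {"benchmark": name, "wall_time": time.perf_counter() - start}


def run_parallel(max_workers: int | None = None) -> list[dict]:
    # One worker process per benchmark, analogous to `make -j $N`.
    benchmarks = collect_benchmarks()
    with ProcessPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_benchmark, benchmarks))


if __name__ == "__main__":
    print(run_parallel())
```

Swapping in `joblib` would amount to replacing the pool with `Parallel(n_jobs=-1)(delayed(run_benchmark)(b) for b in benchmarks)`.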
This also has implications for our benchmark structuring recommendations. With a multiprocessing approach, it might be best to structure benchmarks per algorithm (i.e., per model) and send each model's entire benchmark suite to a single Python process to improve memory efficiency.

On the other hand, this clashes with our approach of parametrizing benchmarks over a model input value, which we advertise as a best practice for avoiding code duplication.
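One rough sketch of how per-model dispatch could coexist with parametrization: keep the benchmarks parametrized over the model at collection time, but group the resulting (model, benchmark) pairs by model before handing each group to a worker process, so every process loads exactly one model. All names below are hypothetical.

```python
from collections import defaultdict
from concurrent.futures import ProcessPoolExecutor


def run_model_suite(model_name: str, benchmark_names: list[str]) -> list[dict]:
    # The model would be loaded once here and reused for the whole suite,
    # e.g. model = load_model(model_name)  (hypothetical loader).
    return [{"model": model_name, "benchmark": b} for b in benchmark_names]


def run_grouped(parametrized: list[tuple[str, str]]) -> list[dict]:
    # `parametrized` is the flat list of (model_name, benchmark_name) pairs that a
    # parametrized collection step would produce.
    groups: dict[str, list[str]] = defaultdict(list)
    for model_name, benchmark_name in parametrized:
        groups[model_name].append(benchmark_name)
    # One process per model: a model's whole suite stays in a single worker.
    with ProcessPoolExecutor(max_workers=len(groups)) as pool:
        futures = [pool.submit(run_model_suite, m, bs) for m, bs in groups.items()]
        return [record for f in futures for record in f.result()]
```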