Best practices for collaborative ML R & D: How to structure frameworks and collaboration #26

klieret · 2023-07-19T18:41:56Z

Examples of challenges/discussion points:

Technological aspects:

How can we cut boilerplate and standardize interfaces so that people can focus on developing models without sacrificing "hackability"? PyTorch Lightning is a popular option for PyTorch, but IMO the way it is laid out by default has its own challenges (and might lead to duplicated code).
How can we share results between collaborators and bring everyone "on the same page" (for example, using Weights & Biases)?

Social aspects:

How can we make sure to move in the same direction without constraining ourselves? How do we keep everyone engaged in building a common framework and avoid people "branching off forever"?
How do we balance more technical SW development work with model development? A lot of people want to focus on developing their model; few people want to work on framework issues. A good collaboration needs both.

I originally suggested this as a subtopic for #6 (doing open source). It also overlaps with #1 (packaging), #5 (fitting), and #19 (ML workflows for analysis). However, I think the challenges are very distinct because this targets development and R & D, rather than use in production/integration with other tools (for example, backwards compatibility isn't as big of an issue as is allowing for creativity).

lgray · 2023-07-22T12:49:09Z

This has a large overlap in themes with #19. Usefully different scope and kinds of requirements though!

klieret · 2023-07-24T15:41:40Z

Yes, I was thinking about this too, but the title of #19 led me to believe that it's mainly about MLOps and facilities (?).

lgray · 2023-07-24T15:42:59Z

The user interface necessarily has to deal with collaboration and frameworks.

klieret · 2023-07-27T16:12:48Z

Live notes

ML R & D Breakout session (Tuesday)

Present: Philip, Kilian, Richa, Raghav, Josue, Mike

Some of the questions that were discussed:

What frameworks do people use (lightning & friends)?

PyTorch Lightning

MLflow might also do some of the things that Lightning does

ONNX for plugging ML models into other frameworks/model exchange

Dashboards (wandb & friends)?

MLflow

Weights & Biases

Projects that were mentioned:

https://github.com/jet-net/JetNet: Interface for Jet datasets (combining different data sources).

Kilian: https://github.com/gnn-tracking/gnn_tracking

https://github.com/FAIR4HEP/cookiecutter4fair: Cookie cutter for data science

Conclusions:

Dashboards (like W&B / MLflow) are a good way to bring people "on the same page" and to compare/review/debug performance

Frameworks that are built around hooks and a plugin/callback structure are a good way to allow extensibility without growing "dinosaur classes". For example, Lightning hooks like on_validation_epoch_end allow you to write callbacks to do stuff at the end of an epoch rather than subclassing/modifying your class
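To make the hook/callback conclusion concrete, here is a minimal, framework-free sketch of the pattern. It deliberately does not import Lightning: `MiniTrainer`, `Callback`, and `BestLossTracker` are hypothetical names made up for illustration, and the fake loss values are invented. Only the hook name mirrors Lightning, where the real callback hook also receives the module, i.e. `on_validation_epoch_end(self, trainer, pl_module)`.

```python
from typing import Callable, Dict, List, Optional


class Callback:
    """Base class: every hook is a no-op, so subclasses override only what they need."""

    def on_validation_epoch_end(self, trainer: "MiniTrainer") -> None:
        pass


class MiniTrainer:
    """Hypothetical stand-in for a framework trainer; it only knows when to fire hooks."""

    def __init__(self, max_epochs: int, callbacks: Optional[List[Callback]] = None) -> None:
        self.max_epochs = max_epochs
        self.callbacks = list(callbacks or [])
        self.current_epoch = 0
        self.metrics: Dict[str, float] = {}

    def fit(self, validate: Callable[[int], Dict[str, float]]) -> None:
        for epoch in range(self.max_epochs):
            self.current_epoch = epoch
            # A real trainer would run training and validation here;
            # this sketch just asks the caller for the epoch's metrics.
            self.metrics = validate(epoch)
            for callback in self.callbacks:
                callback.on_validation_epoch_end(self)


class BestLossTracker(Callback):
    """Extra behavior lives in a callback, so the trainer/model class is left untouched."""

    def __init__(self, key: str = "val_loss") -> None:
        self.key = key
        self.best_value: Optional[float] = None
        self.best_epoch: Optional[int] = None

    def on_validation_epoch_end(self, trainer: MiniTrainer) -> None:
        value = trainer.metrics[self.key]
        if self.best_value is None or value < self.best_value:
            self.best_value = value
            self.best_epoch = trainer.current_epoch


# Usage with made-up validation losses, one per epoch.
fake_losses = [0.9, 0.5, 0.7]
tracker = BestLossTracker()
trainer = MiniTrainer(max_epochs=len(fake_losses), callbacks=[tracker])
trainer.fit(lambda epoch: {"val_loss": fake_losses[epoch]})
print(tracker.best_epoch, tracker.best_value)
```

The point of the design is that collaborators can add logging, checkpointing, or plotting as separate small callback classes and pass them in a list, instead of all editing one ever-growing trainer or model class.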

klieret added 2023 PyHEP.dev 2023 ML Machine Learning labels Jul 19, 2023

klieret mentioned this issue Jul 19, 2023

Doing open source #6

Closed

oshadura mentioned this issue Jul 25, 2023

complete ML workflows in analysis: facilities capabilities and user interface #19

Closed

jpivarski closed this as completed Jan 25, 2024
