Scalability test wall clock #239

Merged 71 commits on Nov 7, 2024
Commits
e9167b9
add gpu utilization decorator and begin work on plots
jarlsondre Oct 24, 2024
4cb840e
add decorator for gpu energy utilization
jarlsondre Oct 28, 2024
5c9add1
Added config option to hpo script, styling (#235)
annaelisalappe Oct 22, 2024
2ff5388
Update README.md
matbun Oct 23, 2024
7f0262d
Update README.md
matbun Oct 23, 2024
a38974a
Update createEnvVega.sh
matbun Oct 18, 2024
15de059
remove unused dist file
jarlsondre Oct 28, 2024
00f7af3
run black and isort to fix linting errors
jarlsondre Oct 28, 2024
f3e3752
temporary changes
jarlsondre Oct 28, 2024
cd77243
remove redundant variable
jarlsondre Oct 28, 2024
10f079a
add absolute time plot
jarlsondre Oct 28, 2024
ffefdec
remove trailing whitespace
jarlsondre Oct 28, 2024
16596d2
remove redundant variable
jarlsondre Oct 28, 2024
1a6ef64
remove trailing whitespace
jarlsondre Oct 28, 2024
92ffdc4
begin implementation of backup
jarlsondre Oct 29, 2024
b70ec0f
fix issues from PR
jarlsondre Oct 29, 2024
dedafe7
fix issues from PR
jarlsondre Oct 29, 2024
e4974e1
add backup to gpu monitoring
jarlsondre Oct 29, 2024
d1abf51
fix import in eurac trainer
jarlsondre Oct 29, 2024
b3da765
cleanup backup mechanism slightly
jarlsondre Oct 29, 2024
ddbac51
fix linting errors
jarlsondre Oct 29, 2024
6609266
update logging directory and pattern
jarlsondre Oct 29, 2024
b555bc5
update default pattern for gpu energy plots
jarlsondre Oct 29, 2024
f6b3226
fix isort linting
jarlsondre Oct 29, 2024
b6ef021
add support for none pattern and general cleanup
jarlsondre Oct 29, 2024
1160e96
fix linting errors with black and isort
jarlsondre Oct 29, 2024
9843108
fix import in eurac trainer
jarlsondre Oct 29, 2024
c72fd6b
fix linting errors
jarlsondre Oct 29, 2024
b2d3c69
update logging directory and pattern
jarlsondre Oct 29, 2024
484d02c
update default pattern for gpu energy plots
jarlsondre Oct 29, 2024
b24bf99
fix isort linting
jarlsondre Oct 29, 2024
4612569
add support for none pattern and general cleanup
jarlsondre Oct 29, 2024
b59df16
fix linting errors with black and isort
jarlsondre Oct 29, 2024
6b96e6a
begin implementation of backup
jarlsondre Oct 29, 2024
0002868
add backup to gpu monitoring
jarlsondre Oct 29, 2024
0ee5714
add backup functionality to communication plot
jarlsondre Oct 29, 2024
5b6ebba
rewrite epochtimetracker and refactor scalability plot code
jarlsondre Oct 29, 2024
929c4cb
cleanup scalability plot code
jarlsondre Oct 29, 2024
0257693
updating some epochtimetracker dependencies
jarlsondre Oct 30, 2024
413103b
add configurable and dynamic wait and warmup times for the profiler
jarlsondre Oct 30, 2024
e965637
temporary changes
jarlsondre Oct 28, 2024
de7a941
add absolute time plot
jarlsondre Oct 28, 2024
7978187
begin implementation of backup
jarlsondre Oct 29, 2024
7510377
add backup to gpu monitoring
jarlsondre Oct 29, 2024
2e321ad
cleanup backup mechanism slightly
jarlsondre Oct 29, 2024
57428a8
fix isort linting
jarlsondre Oct 29, 2024
3ca420a
add support for none pattern and general cleanup
jarlsondre Oct 29, 2024
fe02b30
fix linting errors with black and isort
jarlsondre Oct 29, 2024
2f1e5f8
begin implementation of backup
jarlsondre Oct 29, 2024
47169f6
add backup functionality to communication plot
jarlsondre Oct 29, 2024
0b2016e
rewrite epochtimetracker and refactor scalability plot code
jarlsondre Oct 29, 2024
8caa8d1
cleanup scalability plot code
jarlsondre Oct 29, 2024
0155b73
updating some epochtimetracker dependencies
jarlsondre Oct 30, 2024
30fe5ee
Merge branch 'scalability-test-wall-clock' of github.com:interTwin-eu…
jarlsondre Oct 30, 2024
d4ed0ae
fix linting errors
jarlsondre Oct 30, 2024
2bfb939
fix more linting errors
jarlsondre Oct 30, 2024
9d6e204
add utilization percentage plot
jarlsondre Oct 30, 2024
8b850d0
run isort for linting
jarlsondre Oct 30, 2024
922feb8
update default save path for metrics
jarlsondre Oct 30, 2024
0d04824
add decorators to virgo and some cleanup
jarlsondre Nov 1, 2024
7679cf7
add contributions and cleanup
jarlsondre Nov 1, 2024
63db070
fix linting errors
jarlsondre Nov 1, 2024
8785b72
change 'credits' to 'credit'
jarlsondre Nov 1, 2024
7c4db59
update communication plot style
jarlsondre Nov 1, 2024
2c0ba5a
update function names
jarlsondre Nov 1, 2024
df83b65
update scalability function for a more streamlined approach
jarlsondre Nov 1, 2024
88ed336
run isort
jarlsondre Nov 1, 2024
7a660bc
move horovod import
jarlsondre Nov 5, 2024
7053362
fix linting errors
jarlsondre Nov 5, 2024
aa0d0c8
add contributors
jarlsondre Nov 5, 2024
d01a3d7
fix merge conflict
jarlsondre Nov 7, 2024
5 changes: 4 additions & 1 deletion .gitignore
@@ -202,4 +202,7 @@ cython_debug/
 # be found at https://github.com/github/gitignore/blob/main/Global/JetBrains.gitignore
 # and can be added to the global gitignore or merged into this file. For a more nuclear
 # option (not recommended) you can uncomment the following to ignore the entire idea folder.
-#.idea/
+#.idea/
+
+# MacOS
+.DS_Store
1 change: 1 addition & 0 deletions pyproject.toml
@@ -42,6 +42,7 @@ dependencies = [
 # "prov4ml@git+https://github.com/HPCI-Lab/ProvML@main", # Prov4ML
 # "prov4ml@git+https://github.com/matbun/ProvML@main",
 "pandas",
+"seaborn"
 ]
 
 # dynamic = ["version", "description"]