Add global seed setting to all benchmarks, rerun results and sort results by benchmark name for consistent table generation
Bronzila committed Jul 3, 2024
1 parent 704e7ef commit b690ad0
Showing 33 changed files with 92 additions and 86 deletions.
2 changes: 1 addition & 1 deletion benchmarking/generate_summary.py
@@ -37,7 +37,7 @@ def main():
     args = input_arguments()
     results_dict = defaultdict(lambda: defaultdict(pd.DataFrame))
     base_result_path = Path("benchmarking/results")
-    for file_path in base_result_path.glob("**/traj.parquet.gzip"):
+    for file_path in sorted(base_result_path.glob("**/traj.parquet.gzip")):
         version = file_path.parts[-3]
         benchmark = file_path.parts[-2]
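
The change above matters because `Path.glob` yields entries in an arbitrary, filesystem-dependent order, so the generated summary table could list benchmarks differently from run to run. The following is a minimal sketch of the same idea on a throwaway directory tree. The helper names `collect_trajectories` and `demo`, and the three example version/benchmark pairs, are assumptions made for illustration and are not part of `generate_summary.py`.

```python
import tempfile
from pathlib import Path


def collect_trajectories(base_result_path: Path) -> list[tuple[str, str]]:
    """Return (version, benchmark) pairs in a deterministic, sorted order."""
    pairs = []
    # sorted() fixes the iteration order; glob() alone gives no ordering guarantee.
    for file_path in sorted(base_result_path.glob("**/traj.parquet.gzip")):
        version = file_path.parts[-3]
        benchmark = file_path.parts[-2]
        pairs.append((version, benchmark))
    return pairs


def demo() -> list[tuple[str, str]]:
    """Build a small hypothetical results tree and list it."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        # Created deliberately out of order to show that sorting normalizes it.
        for version, benchmark in [
            ("0.1.1", "mfh6"),
            ("0.1.0", "tab_nn"),
            ("0.1.0", "counting_ones"),
        ]:
            result_dir = root / version / benchmark
            result_dir.mkdir(parents=True)
            (result_dir / "traj.parquet.gzip").touch()
        return collect_trajectories(root)


if __name__ == "__main__":
    for version, benchmark in demo():
        print(version, benchmark)
```

Because the paths are sorted as a whole, entries are grouped by version first and by benchmark name within each version.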

3 changes: 3 additions & 0 deletions benchmarking/hpobench_benchmark.py
@@ -1,4 +1,5 @@
 import argparse
+import random
 from pathlib import Path
 
 import numpy as np
@@ -159,6 +160,8 @@ def main():
     trajectories = []
     for seed in seeds:
         print(f"Running benchmark {benchmark_name} on seed {seed}")
+        np.random.seed(seed)
+        random.seed(seed)
         dehb_params["seed"] = int(seed)
         dehb_optimizer = DEHBOptimizerHPOBench(
             dehb_params=dehb_params,
3 changes: 3 additions & 0 deletions benchmarking/mfpbench_benchmark.py
@@ -1,4 +1,5 @@
 import argparse
+import random
 from pathlib import Path
 
 import mfpbench
@@ -122,6 +123,8 @@ def main():
     trajectories = []
     for seed in seeds:
         print(f"Running benchmark {benchmark_name} on seed {seed}")
+        np.random.seed(seed)
+        random.seed(seed)
         dehb_params["seed"] = int(seed)
         dehb_optimizer = DEHBOptimizerMFPBench(
             dehb_params=dehb_params,
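
Both benchmark scripts now reseed the global `random` and NumPy generators at the start of every seed iteration, in addition to passing the seed to DEHB. Presumably this covers code that draws from the module-level generators rather than from the optimizer's own one. The sketch below illustrates the effect in isolation. `seed_everything` and `noisy_run` are hypothetical stand-ins and do not appear in the repository.

```python
import random

import numpy as np


def seed_everything(seed: int) -> None:
    """Seed the global RNGs of `random` and NumPy (hypothetical helper)."""
    random.seed(seed)
    np.random.seed(seed)


def noisy_run(seed: int) -> list[float]:
    """Stand-in for one benchmark run that draws from both global RNGs."""
    seed_everything(seed)
    return [random.random(), float(np.random.uniform())]


if __name__ == "__main__":
    # Re-running with the same seed reproduces the same draws.
    print(noisy_run(0) == noisy_run(0))
    # A different seed gives a different sequence.
    print(noisy_run(0) == noisy_run(1))
```

Seeding inside the loop, rather than once at program start, means each seed's run does not depend on how many random draws earlier runs consumed.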
Binary file not shown.
Binary file modified benchmarking/results/0.1.0/counting_ones/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.0/imagenet_resnet_512/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.0/lm1b_transformer_2048/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.0/mfh3/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.0/mfh6/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.0/nasbench201/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.0/surrogate/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.0/tab_lr/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.0/tab_nn/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.0/tab_rf/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.0/tab_svm/traj.parquet.gzip
Binary file not shown.
Binary file not shown.
Binary file modified benchmarking/results/0.1.1/counting_ones/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.1/mfh3/traj.parquet.gzip
Binary file not shown.
Binary file modified benchmarking/results/0.1.1/mfh6/traj.parquet.gzip
Binary file not shown.
170 changes: 85 additions & 85 deletions benchmarking/results/benchmark_results.md

Large diffs are not rendered by default.

Binary file modified benchmarking/results/cifar100_wideresnet_2048_traj.png
Binary file modified benchmarking/results/counting_ones_traj.png
Binary file modified benchmarking/results/imagenet_resnet_512_traj.png
Binary file modified benchmarking/results/lm1b_transformer_2048_traj.png
Binary file modified benchmarking/results/mfh3_traj.png
Binary file modified benchmarking/results/mfh6_traj.png
Binary file modified benchmarking/results/nasbench201_traj.png
Binary file modified benchmarking/results/surrogate_traj.png
Binary file modified benchmarking/results/tab_lr_traj.png
Binary file modified benchmarking/results/tab_nn_traj.png
Binary file modified benchmarking/results/tab_rf_traj.png
Binary file modified benchmarking/results/tab_svm_traj.png
Binary file modified benchmarking/results/translatewmt_xformer_64_traj.png
