
Loading model #168 (Open)
wants to merge 13 commits into base: Develop_copy
Conversation

shakibamrd (Collaborator):
Saves the return values of optimizer.get_checkpointables() in the checkpoint, and is also able to load them.
Changed:
trainer.py
most of optimizer.py

Please review the use of _set_checkpoint() in trainer.py, which is called before optimizer.before_training(). In the metaclass of the optimizers it is mentioned that _set_checkpoint() should be used c
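For reviewers, a minimal sketch of the save-and-resume flow this PR describes, assuming torch-serialized checkpoints. Apart from get_checkpointables(), set_checkpointables() and before_training(), every name below is a hypothetical illustration, not the PR's actual code:

import torch

def save_checkpoint(optimizer, path):
    # Collect whatever the optimizer exposes as checkpointable state
    # (e.g. the architectural weights) and serialize it in one file.
    torch.save(optimizer.get_checkpointables(), path)

def resume(optimizer, path):
    # Restore the saved objects, then run the pre-training hook; whether
    # the restore belongs before or after before_training() is exactly
    # the ordering question raised above.
    checkpointables = torch.load(path)
    optimizer.set_checkpointables(checkpointables)
    optimizer.before_training()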

@gierle (Contributor) left a comment:
I was able to reproduce the issue of #153 and its solution 👍🏻

batch_size: 64
learning_rate: 0.025
learning_rate_min: 0.001
momentum: 0.9
weight_decay: 0.0003
-epochs: 50
+epochs: 5
Contributor:

Was this change only used for testing? 50 epochs is also the value stated in the paper.

Collaborator:

Yes, please revert to 50.

Collaborator (Author):

I reverted the epoch count to its original value.

@@ -133,6 +133,12 @@ def new_epoch(self, epoch):
"""
Just log the architecture weights.
"""
# print("=====================================")
Contributor:

Why was this code added? Can it be removed?

Collaborator (Author):

The extra debugging code has been removed.


fidelity: 200

# GDAS
Contributor:

To make the yaml files generally more readable, should they focus only on optimizer-specific settings, @Neonkraft?

Collaborator:

Agreed.

Collaborator (Author):

The darts_defaults.yaml was reverted to the format used on the Develop_copy branch.

@@ -146,7 +147,7 @@ def search(self, resume_from="", summary_writer=None, after_epoch: Callable[[int

self.train_loss.update(float(train_loss.detach().cpu()))
self.val_loss.update(float(val_loss.detach().cpu()))

# break
Contributor:

Can this be removed?

Collaborator:

Agreed. Please remove.

Collaborator (Author):

The 'break' was used for debugging and has been removed in the new commit.


def set_checkpointables(self, architectural_weights):
"""
would set the objects saved in the checkpoint during last phase of training
Contributor:

Since the other functions also document this, maybe add a description of the parameters and return types here as well.

Collaborator:

+1, agreed.

Collaborator (Author):

The types of the args have been specified. This function has no return value.
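For illustration, a hedged sketch of what the documented signature could look like; the torch.nn.ParameterList type is an assumption, not necessarily the type the commit uses:

def set_checkpointables(self, architectural_weights):
    """
    Sets the objects saved in the checkpoint during the last phase of training.

    Args:
        architectural_weights (torch.nn.ParameterList): architecture weights
            restored from the checkpoint (type assumed for illustration).

    Returns:
        None
    """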

trainer.search(resume_from="")
trainer.evaluate(resume_from="", dataset_api=dataset_api)

# trainer.search(resume_from="/home/moradias/nas-fix/run/nasbench201/cifar10/darts/97/search/model_0000002.pth")
Contributor:

Since this is most likely for testing purposes, it should be removed.
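If an example call is worth keeping, a relative placeholder avoids the machine-specific path; the path below is purely illustrative:

# Illustrative only: resume from a checkpoint produced by an earlier run,
# using a relative placeholder instead of a hard-coded absolute path.
trainer.search(resume_from="run/nasbench201/cifar10/darts/97/search/model_0000002.pth")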

'transbench101_macro': TransBench101SearchSpaceMacro(),
'asr': NasBenchASRSearchSpace(),
'nasbench201': NasBench201SearchSpace(n_classes=config.n_classes),
# 'nasbench301': NasBench301SearchSpace(n_classes=config.n_classes, auxiliary=False),
Contributor:

Should we uncomment NB301 here, so it can be used?

Collaborator:

Yes. Also, why remove transbench101_macro and asr? @shakibamrd

Collaborator (Author):

The runner had some parts removed during debugging. With the new commit it has been reverted to its original form.
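For reference, a hedged sketch of the reverted registry with NB301 uncommented, as discussed above; the variable name is an assumption, and only the entries visible in the diff are shown:

# Hypothetical shape of the reverted mapping; constructor arguments are
# taken from the snippet above, and other entries are elided.
supported_search_spaces = {
    'transbench101_macro': TransBench101SearchSpaceMacro(),
    'asr': NasBenchASRSearchSpace(),
    'nasbench201': NasBench201SearchSpace(n_classes=config.n_classes),
    'nasbench301': NasBench301SearchSpace(n_classes=config.n_classes, auxiliary=False),
}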

@@ -6,7 +6,7 @@
from naslib.search_spaces.core.graph import Graph, EdgeData
from naslib.search_spaces.core import primitives as ops

-from ..nasbench301.graph import _truncate_input_edges
+# from ..nasbench301.graph import _truncate_input_edges
Contributor:

Can the unused import be removed?

Collaborator:

Why does this fix change things in the SimpleCell graph?

Collaborator (Author):

_truncate_input_edges is not called or used in SimpleCell, which is why I removed it. I have put it back in the new commit.

elif config.dataset == 'ImageNet16-120':
config.n_classes = 120
else:
config.n_classes = 10
Contributor:

Could this conflict with e.g. custom datasets? We could raise an exception or emit a warning instead.

Suggested change:
-config.n_classes = 10
+raise AttributeError

Suggested change:
-config.n_classes = 10
+import warnings
+warnings.warn("Number of classes was not set. Default 10 is set.")

Collaborator (Author):

Thanks for the suggestions; I added the warning.
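For clarity, a minimal sketch of the adopted fallback; branches for datasets other than ImageNet16-120 are elided, and the surrounding config object is assumed from the snippet above:

import warnings

# Sketch of the fallback with the suggested warning applied; other
# dataset branches are elided.
if config.dataset == 'ImageNet16-120':
    config.n_classes = 120
else:
    # Warn instead of silently defaulting, so custom datasets surface it.
    warnings.warn("Number of classes was not set. Default 10 is set.")
    config.n_classes = 10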

@@ -138,6 +138,7 @@ def single_evaluate(self, test_data, zc_api):
logger.info("Querying the predictor")
query_time_start = time.time()

+# TODO: shouldn't mode="val" be passed?
Contributor:

Makes sense to me.
