You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
One of our variable names doesn't use snake case. We need to do a find-and-replace to fix it.
One of the maze tokens values also includes "adjlist" and needs to be updated, which may affect the performance of old models on future maze datasets and vice versa. Announce the tokenization update in the main channel so people are aware.
The text was updated successfully, but these errors were encountered:
Refactor to be compatible with `maze-dataset` versions `0.2.1` and onwards.
See PRs:
- [`maze_dataset` PR #5](understanding-search/maze-dataset#5)
- [`maze_dataset` PR #6](understanding-search/maze-dataset#6)
See related issues:
- #164
- #163
- #77
These changes also revert changes in #118, to be consistent with underscores only appearing once in the special tokens.
# commit history:
* test_cfg_post_init working
* migrated SPECIAL_TOKENS usage
* wip
* wip
* wip, all but 3 in tok tests passing
* test_tokenizers passing
* unit tests passing (but need to update maze_dataset dep)
* poetry lock
* format
* remove deprecated kwarg to process_weights_
Upgrading transformer_lens to 1.4.0 caused
`HookedTransformer.process_weights_()` to no longer accept
the keyword arg `move_state_dict_to_device`
However, I'm not sure if this was important in the first place.
If any issues come up, move the state dict to device manually in
`ZanjHookedTransformer._load_state_dict_wrapper()` where all this
was happening in the first place
* fixed MazeTokenizer not being passed to as_tokens() in some spots
* updated changed dataset config key
since we removed tokenizer stuff from the dataset
* fixed eval_model nb, added ZanjHookedTransformer.config ref
the `eval_model.ipynb` notebook has a function `testdata_plot_predicted_path`
which was using `model.zanj_model_config` to get the tokenizer, an attribute
missing from the `RandomBaseline` class since it only inherits from `HookedTransformer`
to fix this:
- `ZanjHookedTransformer` now has a `config` property which just
accesses the `zanj_model_config` used by the parent `ConfiguredModel`
- `testdata_plot_predicted_path` now uses `model.config` everywhere
* lock after update maze-dataset to 0.2.1
* fixed minor import issue
* update configs refs in train_model notebook
* lock poetry, re-run notebook
* format
* update coverage
One of our variable names doesn't use snake case. We need to do a find-and-replace to fix it.
One of the maze tokens values also includes "adjlist" and needs to be updated, which may affect the performance of old models on future maze datasets and vice versa. Announce the tokenization update in the main channel so people are aware.
The text was updated successfully, but these errors were encountered: