v1.1.0
New release with a number of quality-of-life improvements, including activation patching utilities and the option to stop a forward pass at an earlier layer.
What's Changed
- Tracr demo by @neelnanda-io in #142
- Added option to stop running the model at an earlier layer by @neelnanda-io in #143
- Arthur/loss per token by @neelnanda-io in #144
- Make project versioning clearer by @alan-cooney in #146
- let 'get_dataset' function pass kwargs by @afspies in #141
- Fix deps by @jas-ho in #149
- Fix OPT BOS Prepending issue by @afspies in #154
- Induction heads phase changes demo by @ckkissane in #148
- Add helper enum for torch typing by @dkamm in #145
- In run_with_hooks, remove hooks, even when an error is thrown. by @joelburget in #156
- Correct incorrect equation by @epurdy in #159
- add hook tokens by @callummcdougall in #147
- Update pythia-19m to 70m by @ArthurConmy in #162
- Added Utilities for Activation Patching + A Demo of how to use them by @neelnanda-io in #165
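
The hook cleanup change in #156 can be illustrated with a small, library-independent sketch. It assumes the fix follows the usual try/finally pattern; `TinyModel`, its `run_with_hooks` method, and `failing_hook` are hypothetical stand-ins written for this example, not the library's actual implementation.

```python
class TinyModel:
    """Hypothetical stand-in for a hooked model (not the library's class)."""

    def __init__(self):
        self.hooks = []  # currently registered hook functions

    def forward(self, x):
        # Apply each registered hook, in order, to the running value.
        for hook in self.hooks:
            x = hook(x)
        return x

    def run_with_hooks(self, x, fwd_hooks):
        # Register the temporary hooks, run the model, and always clean up.
        self.hooks.extend(fwd_hooks)
        try:
            return self.forward(x)
        finally:
            # Executes on success and on error, so no temporary hook
            # leaks into later calls.
            for hook in fwd_hooks:
                self.hooks.remove(hook)


def failing_hook(x):
    # Assumed failure mode for the demo: the hook itself raises.
    raise ValueError("hook failed")


model = TinyModel()
try:
    model.run_with_hooks(1, [failing_hook])
except ValueError:
    pass  # the error still propagates to the caller; only cleanup is guaranteed
```

After the failed call, `model.hooks` is empty again, so a later run is not affected by the hook that raised.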
New Contributors
- @afspies made their first contribution in #141
- @jas-ho made their first contribution in #149
- @ckkissane made their first contribution in #148
- @dkamm made their first contribution in #145
- @epurdy made their first contribution in #159
- @callummcdougall made their first contribution in #147
Full Changelog: v1.0.0...v1.1.0