v1.4.0
Note: There is a bug in GPT-J in this version.
What's Changed
- Halve GPU memory when loading by @slavachalnev in #333
- Update to
hook_mlp_in
by @ArthurConmy in #316 names_filter
bug fix by @ArthurConmy in #321- [Ready] Enable Pytorch GPU acceleration for M1 chips by @luciaquirke in #326
- Introduce Global prepend_bos Attribute to HookedTransformer by @soheeyang in #343
- Fix hook_result shape comment by @ckkissane in #347
- Support for reduced precision (#104) by @glerzing in #317
- Added tiny pythia models by @neelnanda-io in #350
- Add Llama-2 7B and 13B models by @ArthurConmy in #352
- Fix API docs by @Smaug123 in #339
- Enhance the API for default_prepend_bos by @soheeyang in #345
- Integrate StableLM (#254) by @glerzing in #354
- add colab buttons to demos by @ckkissane in #359
- Remove n_devices assert in config by @slavachalnev in #357
- Updated readme by @neelnanda-io in #351
- Scalar multiplication by @matthiasdellago in #355
New Contributors
- @soheeyang made their first contribution in #343
- @Smaug123 made their first contribution in #339
- @matthiasdellago made their first contribution in #355
Full Changelog: v1.3.0...v1.4.0