v1.4.0: Improved Support for Huggingface Transformers & LLMs

VainF released this 04 Jun 12:19
· 83 commits to master since this release
b0f0a7c

What's Changed

  • Add support for Grouped Query Attention (GQA) in Hugging Face Transformers models.
  • Include minimal examples for Large Language Models (LLaMA-2 & LLaMA-3).
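
To illustrate why GQA needs dedicated handling when pruning attention heads, here is a minimal sketch in plain Python. It is not this library's API or implementation: `GQAConfig`, `kv_head_for`, and `prune_kv_groups` are hypothetical names introduced only for this example, and removing whole key/value groups is just one possible strategy. The sketch assumes the usual GQA layout in which consecutive query heads share one key/value head, so the query-head count must stay a multiple of the key/value-head count after pruning.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GQAConfig:
    """Hypothetical container for the head layout of one attention layer."""
    num_attention_heads: int   # query heads
    num_key_value_heads: int   # key/value heads shared across query heads
    head_dim: int

    def __post_init__(self):
        # GQA requires every key/value head to serve the same number of query heads.
        if self.num_attention_heads % self.num_key_value_heads != 0:
            raise ValueError("query heads must be a multiple of key/value heads")

    @property
    def group_size(self) -> int:
        # Number of query heads that share a single key/value head.
        return self.num_attention_heads // self.num_key_value_heads


def kv_head_for(cfg: GQAConfig, q_head: int) -> int:
    """Index of the key/value head that query head `q_head` attends with."""
    return q_head // cfg.group_size


def prune_kv_groups(cfg: GQAConfig, kv_heads_to_remove):
    """Remove whole groups: each dropped key/value head takes its query heads with it.

    Returns the new layout and the indices of the query heads that are kept.
    """
    remove = set(kv_heads_to_remove)
    if not remove <= set(range(cfg.num_key_value_heads)):
        raise ValueError("key/value head index out of range")
    if len(remove) >= cfg.num_key_value_heads:
        raise ValueError("at least one key/value head must remain")

    kept_q = [q for q in range(cfg.num_attention_heads)
              if kv_head_for(cfg, q) not in remove]
    new_cfg = GQAConfig(
        num_attention_heads=len(kept_q),
        num_key_value_heads=cfg.num_key_value_heads - len(remove),
        head_dim=cfg.head_dim,
    )
    return new_cfg, kept_q


# Example layout: 32 query heads sharing 8 key/value heads, head_dim 128.
cfg = GQAConfig(num_attention_heads=32, num_key_value_heads=8, head_dim=128)
pruned_cfg, kept_query_heads = prune_kv_groups(cfg, {0, 3})
```

Because whole groups are removed, the group size is unchanged and the pruned layout remains a valid GQA configuration. The output widths of the query and key/value projections follow as `num_attention_heads * head_dim` and `num_key_value_heads * head_dim` respectively.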

Full Changelog: v1.3.7...v1.4.0