Skip to content

Release v0.7.0πŸš€

Compare
Choose a tag to compare
@seungahdev seungahdev released this 25 Sep 06:56
· 3 commits to main since this release
  • Further optimization for running FP8, and INT8 quantization.
  • Support searching automatic calibration dataset batch size for running FMO.
  • Support [AWQ(Activation-aware Weight Quantization)].