Releases: chengzeyi/stable-fast

v0.0.12

24 Nov 07:10
821054e

What's Changed

Full Changelog: v0.0.11...v0.0.12

v0.0.11

22 Nov 09:52

What's Changed

Full Changelog: v0.0.10...v0.0.11

v0.0.10

14 Nov 10:23
e1f6c12

What's Changed

New Contributors

Full Changelog: v0.0.9...v0.0.10

v0.0.9

12 Nov 06:17

What's Changed

  • Build automated CI to publish binary wheels on Linux and Windows
  • Fix use_count of mempool becoming zero by @chengzeyi in #23
  • Dev by @chengzeyi in #24

Full Changelog: v0.0.8...v0.0.9

v0.0.8 release

09 Nov 13:42

What's Changed

Full Changelog: v0.0.7...v0.0.8

v0.0.7 release

06 Nov 05:57
f1916ad

Various improvements:

  • Implement flat_tensors: a complete solution for converting arbitrary Python objects to a list of PyTorch tensors, making JIT tracing more flexible.
  • Add more fusion passes to improve performance on ComfyUI.
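
The idea behind flat_tensors can be illustrated with a small sketch. This is not stable-fast's implementation: the `flatten` and `unflatten` names are hypothetical, and the leaves stay plain Python objects here, whereas the library encodes everything as PyTorch tensors. The sketch only shows the general pattern of turning a nested structure into a flat list plus a description of its shape, so that a function accepting only flat inputs (such as a traced one) can still be called with nested arguments.

```python
from typing import Any, Iterator, List, Tuple


def flatten(obj: Any) -> Tuple[List[Any], Any]:
    """Return (leaves, spec); spec records the container structure."""
    if type(obj) in (list, tuple):
        leaves, child_specs = [], []
        for item in obj:
            item_leaves, item_spec = flatten(item)
            leaves.extend(item_leaves)
            child_specs.append(item_spec)
        return leaves, (type(obj), child_specs)
    if type(obj) is dict:
        leaves, child_specs = [], []
        for key, value in obj.items():
            value_leaves, value_spec = flatten(value)
            leaves.extend(value_leaves)
            child_specs.append((key, value_spec))
        return leaves, (dict, child_specs)
    # Anything else is treated as a leaf (hypothetical simplification).
    return [obj], None


def _build(leaves: Iterator[Any], spec: Any) -> Any:
    """Consume leaves in order and rebuild the structure described by spec."""
    if spec is None:
        return next(leaves)
    kind, children = spec
    if kind is dict:
        return {key: _build(leaves, child) for key, child in children}
    return kind(_build(leaves, child) for child in children)


def unflatten(leaves: List[Any], spec: Any) -> Any:
    """Inverse of flatten: restore the original nested object."""
    return _build(iter(leaves), spec)


nested = {"a": [1, (2, 3)], "b": 4}
leaves, spec = flatten(nested)
restored = unflatten(leaves, spec)
```

Keeping the structure description separate from the leaves means the flat list can be passed through code that only understands flat inputs, and the original nesting is restored afterwards.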

v0.0.6 release

04 Nov 14:21

Fix an attempt to acquire an unreachable GIL when the process exits

v0.0.5 release

04 Nov 04:33

Disable CUDA Graph for SDXL

v0.0.4 release

04 Nov 03:14

Many bug fixes and improvements:

  • Support SDXL
  • Support CUDA Graph with dynamic shape
  • Support development version of Triton
  • Fix crash when process exits because of missing GIL

v0.0.3 release

31 Oct 03:34
187f6fe

Bug fixes:

  • Fix compilation failure when Triton is not enabled.
  • Fix wrong output in Triton NCHW GroupNorm kernel.