Releases · chengzeyi/stable-fast
v0.0.12
What's Changed
- add _jit_pass_eliminate_simple_arith by @chengzeyi in #45
- Dev by @chengzeyi in #46
- Dev by @chengzeyi in #47
- Dev by @chengzeyi in #48
Full Changelog: v0.0.11...v0.0.12
v0.0.11
What's Changed
- Bump version to 0.0.11 and use fixed CUDNN major version for CI by @chengzeyi in #35
- remove triton.autotune to make compilation faster by @chengzeyi in #36
- Feature/quantization by @chengzeyi in #43
- Dev by @chengzeyi in #44
Full Changelog: v0.0.10...v0.0.11
v0.0.10
What's Changed
- Dev by @chengzeyi in #28
- Preserve parameters by @skirsten in #27
- Dev by @chengzeyi in #30
- Dev by @chengzeyi in #33
- Close #32: add env var WITHOUT_CUDA to setup.py and fail if CUDA is not available and WITHOUT_CUDA is not set, by @chengzeyi in #34 (a sketch of this check follows the list below)
- Fix missing linking of CUDNN, CUBLAS, and CUDA in previous CI wheels 😭
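The WITHOUT_CUDA behavior from #34 can be illustrated with the minimal sketch below. It is a hypothetical example assuming the check keys off `torch.utils.cpp_extension.CUDA_HOME`; the actual setup.py in stable-fast may implement the guard differently.

```python
# Hypothetical sketch of the WITHOUT_CUDA guard described in #34.
import os

from torch.utils.cpp_extension import CUDA_HOME  # None when no CUDA toolkit is found


def cuda_extensions_enabled() -> bool:
    # Setting WITHOUT_CUDA=1 lets users explicitly opt out of building CUDA extensions.
    if os.environ.get("WITHOUT_CUDA"):
        return False
    if CUDA_HOME is None:
        # Fail loudly instead of silently producing a wheel without CUDA support.
        raise RuntimeError(
            "CUDA toolkit not found. Install CUDA or set WITHOUT_CUDA=1 "
            "to build without CUDA extensions."
        )
    return True
```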
New Contributors
- @skirsten made their first contribution in #27
Full Changelog: v0.0.9...v0.0.10
v0.0.9
What's Changed
- Build automated CI to publish binary wheels on Linux and Windows
- fix use_count of mempool becoming zero by @chengzeyi in #23
- Dev by @chengzeyi in #24
Full Changelog: v0.0.8...v0.0.9
v0.0.8 release
What's Changed
- Efficient mem cuda graph by @chengzeyi in #18
- Bug Fixes by @chengzeyi in #20
Full Changelog: v0.0.7...v0.0.8
v0.0.7 release
Various improvements:
- Implement flat_tensors: a complete solution for converting arbitrary Python objects to a list of PyTorch tensors, making JIT tracing more flexible (a sketch of the idea follows this list).
- Add more fuse passes to improve performance on ComfyUI.
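The flatten/unflatten idea behind flat_tensors can be sketched as below. The names and structure here are illustrative assumptions, not the actual stable-fast API: an arbitrary nested object is split into a flat tensor list (which a JIT trace can consume) plus a tensor-free spec that lets the original structure be rebuilt.

```python
# Illustrative sketch of a flat_tensors-style flatten/unflatten; hypothetical API.
import torch


def flatten(obj):
    """Split a (possibly nested) Python object into a flat list of tensors
    plus a tensor-free spec recording where each tensor belongs."""
    if isinstance(obj, torch.Tensor):
        return [obj], ("tensor",)
    if isinstance(obj, (list, tuple)):
        tensors, specs = [], []
        for item in obj:
            t, s = flatten(item)
            tensors.extend(t)
            specs.append(s)
        return tensors, (type(obj).__name__, specs)
    if isinstance(obj, dict):
        tensors, specs = [], {}
        for key, value in obj.items():
            t, s = flatten(value)
            tensors.extend(t)
            specs[key] = s
        return tensors, ("dict", specs)
    return [], ("const", obj)  # non-tensor leaves are captured in the spec


def unflatten(tensors, spec):
    """Rebuild the original object from the flat tensor list and the spec."""
    it = iter(tensors)

    def rebuild(s):
        kind = s[0]
        if kind == "tensor":
            return next(it)
        if kind in ("list", "tuple"):
            seq = [rebuild(child) for child in s[1]]
            return seq if kind == "list" else tuple(seq)
        if kind == "dict":
            return {k: rebuild(v) for k, v in s[1].items()}
        return s[1]  # constant leaf

    return rebuild(spec)
```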
v0.0.6 release
Fix acquiring an unreachable GIL when the process exits
v0.0.5 release
Disable CUDA Graph for SDXL
v0.0.4 release
Many bug fixes and improvements:
- Support SDXL
- Support CUDA Graph with dynamic shape
- Support development version of Triton
- Fix crash at process exit caused by a missing GIL
v0.0.3 release
Bug fixes:
- Fix compilation failure when Triton is not enabled.
- Fix wrong output in Triton NCHW GroupNorm kernel.