-
Notifications
You must be signed in to change notification settings - Fork 205
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
refact quantization, support torchao quant and vllm w8a8 #596
base: main
Are you sure you want to change the base?
Commits on Oct 25, 2024
-
baishihao committed
Oct 25, 2024 Configuration menu - View commit details
-
Copy full SHA for 08531cd - Browse repository at this point
Copy the full SHA 08531cdView commit details -
ppl w4a16 for llama-cls models
baishihao committedOct 25, 2024 Configuration menu - View commit details
-
Copy full SHA for e6022da - Browse repository at this point
Copy the full SHA e6022daView commit details
Commits on Oct 28, 2024
-
baishihao committed
Oct 28, 2024 Configuration menu - View commit details
-
Copy full SHA for b4cd881 - Browse repository at this point
Copy the full SHA b4cd881View commit details
Commits on Oct 29, 2024
-
baishihao committed
Oct 29, 2024 Configuration menu - View commit details
-
Copy full SHA for 2b28cd1 - Browse repository at this point
Copy the full SHA 2b28cd1View commit details -
baishihao committed
Oct 29, 2024 Configuration menu - View commit details
-
Copy full SHA for 8cb6526 - Browse repository at this point
Copy the full SHA 8cb6526View commit details -
baishihao committed
Oct 29, 2024 Configuration menu - View commit details
-
Copy full SHA for 416e940 - Browse repository at this point
Copy the full SHA 416e940View commit details
Commits on Nov 1, 2024
-
modify router model backend for quant
baishihao committedNov 1, 2024 Configuration menu - View commit details
-
Copy full SHA for 33ff5e9 - Browse repository at this point
Copy the full SHA 33ff5e9View commit details -
baishihao committed
Nov 1, 2024 Configuration menu - View commit details
-
Copy full SHA for 15ab7db - Browse repository at this point
Copy the full SHA 15ab7dbView commit details -
baishihao committed
Nov 1, 2024 Configuration menu - View commit details
-
Copy full SHA for ed40f35 - Browse repository at this point
Copy the full SHA ed40f35View commit details
Commits on Nov 4, 2024
-
baishihao committed
Nov 4, 2024 Configuration menu - View commit details
-
Copy full SHA for cd92bf6 - Browse repository at this point
Copy the full SHA cd92bf6View commit details
Commits on Nov 5, 2024
-
add vllm fp8 w8a8 (per-channel/per-token)
baishihao committedNov 5, 2024 Configuration menu - View commit details
-
Copy full SHA for 253725c - Browse repository at this point
Copy the full SHA 253725cView commit details -
baishihao committed
Nov 5, 2024 Configuration menu - View commit details
-
Copy full SHA for 309fefe - Browse repository at this point
Copy the full SHA 309fefeView commit details
Commits on Nov 7, 2024
-
refact quantization to support mix quantization
baishihao committedNov 7, 2024 Configuration menu - View commit details
-
Copy full SHA for f57b996 - Browse repository at this point
Copy the full SHA f57b996View commit details
Commits on Nov 8, 2024
-
baishihao committed
Nov 8, 2024 Configuration menu - View commit details
-
Copy full SHA for b038857 - Browse repository at this point
Copy the full SHA b038857View commit details -
baishihao committed
Nov 8, 2024 Configuration menu - View commit details
-
Copy full SHA for 24add93 - Browse repository at this point
Copy the full SHA 24add93View commit details
Commits on Nov 12, 2024
-
baishihao committed
Nov 12, 2024 Configuration menu - View commit details
-
Copy full SHA for dfd16be - Browse repository at this point
Copy the full SHA dfd16beView commit details
Commits on Nov 13, 2024
-
fix load weight with multi-threads
baishihao committedNov 13, 2024 Configuration menu - View commit details
-
Copy full SHA for 116e85a - Browse repository at this point
Copy the full SHA 116e85aView commit details -
baishihao committed
Nov 13, 2024 Configuration menu - View commit details
-
Copy full SHA for f213eab - Browse repository at this point
Copy the full SHA f213eabView commit details -
baishihao committed
Nov 13, 2024 Configuration menu - View commit details
-
Copy full SHA for cba8e07 - Browse repository at this point
Copy the full SHA cba8e07View commit details -
baishihao committed
Nov 13, 2024 Configuration menu - View commit details
-
Copy full SHA for 7ede4a7 - Browse repository at this point
Copy the full SHA 7ede4a7View commit details -
fix gemma2 tp2 for multi-query
baishihao committedNov 13, 2024 Configuration menu - View commit details
-
Copy full SHA for 99a7f76 - Browse repository at this point
Copy the full SHA 99a7f76View commit details -
baishihao committed
Nov 13, 2024 Configuration menu - View commit details
-
Copy full SHA for 5d0eae6 - Browse repository at this point
Copy the full SHA 5d0eae6View commit details -
baishihao committed
Nov 13, 2024 Configuration menu - View commit details
-
Copy full SHA for 17b9839 - Browse repository at this point
Copy the full SHA 17b9839View commit details -
Configuration menu - View commit details
-
Copy full SHA for b51b573 - Browse repository at this point
Copy the full SHA b51b573View commit details -
Merge branch 'quantization' of github.com:ModelTC/lightllm into quant…
…ization Conflicts: lightllm/common/basemodel/layer_weights/meta_weights/__init__.py
Configuration menu - View commit details
-
Copy full SHA for 6d6f83c - Browse repository at this point
Copy the full SHA 6d6f83cView commit details -
baishihao committed
Nov 13, 2024 Configuration menu - View commit details
-
Copy full SHA for 2060038 - Browse repository at this point
Copy the full SHA 2060038View commit details -
Merge branch 'quantization' of https://github.com/ModelTC/lightllm in…
…to quantization
baishihao committedNov 13, 2024 Configuration menu - View commit details
-
Copy full SHA for 5e132aa - Browse repository at this point
Copy the full SHA 5e132aaView commit details -
baishihao committed
Nov 13, 2024 Configuration menu - View commit details
-
Copy full SHA for 65de425 - Browse repository at this point
Copy the full SHA 65de425View commit details -
baishihao committed
Nov 13, 2024 Configuration menu - View commit details
-
Copy full SHA for ed84426 - Browse repository at this point
Copy the full SHA ed84426View commit details
Commits on Nov 14, 2024
-
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for ef06da6 - Browse repository at this point
Copy the full SHA ef06da6View commit details -
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for b55ea10 - Browse repository at this point
Copy the full SHA b55ea10View commit details -
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for 49dfb51 - Browse repository at this point
Copy the full SHA 49dfb51View commit details -
Configuration menu - View commit details
-
Copy full SHA for 28ada22 - Browse repository at this point
Copy the full SHA 28ada22View commit details -
Configuration menu - View commit details
-
Copy full SHA for 20b4909 - Browse repository at this point
Copy the full SHA 20b4909View commit details -
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for 60e16db - Browse repository at this point
Copy the full SHA 60e16dbView commit details -
Merge branch 'quantization' of https://github.com/ModelTC/lightllm in…
…to quantization
baishihao committedNov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for 0a21710 - Browse repository at this point
Copy the full SHA 0a21710View commit details -
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for 61c4ceb - Browse repository at this point
Copy the full SHA 61c4cebView commit details -
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for 3428699 - Browse repository at this point
Copy the full SHA 3428699View commit details -
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for 1fcd74e - Browse repository at this point
Copy the full SHA 1fcd74eView commit details -
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for 0171a12 - Browse repository at this point
Copy the full SHA 0171a12View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3fcfa82 - Browse repository at this point
Copy the full SHA 3fcfa82View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7a40bed - Browse repository at this point
Copy the full SHA 7a40bedView commit details -
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for a82148d - Browse repository at this point
Copy the full SHA a82148dView commit details -
Merge branch 'quantization' of https://github.com/ModelTC/lightllm in…
…to quantization
baishihao committedNov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for f1ee848 - Browse repository at this point
Copy the full SHA f1ee848View commit details -
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for c5f8288 - Browse repository at this point
Copy the full SHA c5f8288View commit details -
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for 211ffc9 - Browse repository at this point
Copy the full SHA 211ffc9View commit details -
baishihao committed
Nov 14, 2024 Configuration menu - View commit details
-
Copy full SHA for 91132c4 - Browse repository at this point
Copy the full SHA 91132c4View commit details
Commits on Nov 15, 2024
-
baishihao committed
Nov 15, 2024 Configuration menu - View commit details
-
Copy full SHA for 50135e5 - Browse repository at this point
Copy the full SHA 50135e5View commit details -
Configuration menu - View commit details
-
Copy full SHA for e58ac95 - Browse repository at this point
Copy the full SHA e58ac95View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3bde26a - Browse repository at this point
Copy the full SHA 3bde26aView commit details -
baishihao committed
Nov 15, 2024 Configuration menu - View commit details
-
Copy full SHA for 84dfadb - Browse repository at this point
Copy the full SHA 84dfadbView commit details -
Merge branch 'quantization' of https://github.com/ModelTC/lightllm in…
…to quantization
baishihao committedNov 15, 2024 Configuration menu - View commit details
-
Copy full SHA for 1099279 - Browse repository at this point
Copy the full SHA 1099279View commit details -
Configuration menu - View commit details
-
Copy full SHA for 76cda5a - Browse repository at this point
Copy the full SHA 76cda5aView commit details -
Configuration menu - View commit details
-
Copy full SHA for b8467de - Browse repository at this point
Copy the full SHA b8467deView commit details -
Configuration menu - View commit details
-
Copy full SHA for 32b542d - Browse repository at this point
Copy the full SHA 32b542dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5d88f4a - Browse repository at this point
Copy the full SHA 5d88f4aView commit details -
baishihao committed
Nov 15, 2024 Configuration menu - View commit details
-
Copy full SHA for 8b1b2a1 - Browse repository at this point
Copy the full SHA 8b1b2a1View commit details -
baishihao committed
Nov 15, 2024 Configuration menu - View commit details
-
Copy full SHA for b1d4f52 - Browse repository at this point
Copy the full SHA b1d4f52View commit details