[Question] Difference between the quantization methods of other LLM engines. #3107

Open
BrandonLee0626 opened this issue Jan 23, 2025 · 0 comments
Labels
question Question about the usage

Comments

BrandonLee0626 commented Jan 23, 2025

❓ General Questions

I am curious whether this engine's quantization methods, such as q4f16_0 and q4f32_0, differ from the q4_0 quantization used by other LLM engines. If they do, what is the difference?

BrandonLee0626 added the "question" (Question about the usage) label Jan 23, 2025