[Question] Difference between the quantization methods of other LLM engines. #3107

BrandonLee0626 · 2025-01-23T10:01:03Z

❓ General Questions

I am curious if there is a difference between the quantization methods, such as q4f16_0 and q4f32_0 of this engine, and the q4_0 quantization of other LLM engines. If there is a difference, what is it?

The text was updated successfully, but these errors were encountered:

BrandonLee0626 added the question Question about the usage label Jan 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] Difference between the quantization methods of other LLM engines. #3107

[Question] Difference between the quantization methods of other LLM engines. #3107

BrandonLee0626 commented Jan 23, 2025 •

edited

Loading

[Question] Difference between the quantization methods of other LLM engines. #3107

[Question] Difference between the quantization methods of other LLM engines. #3107

Comments

BrandonLee0626 commented Jan 23, 2025 • edited Loading

❓ General Questions

BrandonLee0626 commented Jan 23, 2025 •

edited

Loading