You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am curious if there is a difference between the quantization methods, such as q4f16_0 and q4f32_0 of this engine, and the q4_0 quantization of other LLM engines. If there is a difference, what is it?
The text was updated successfully, but these errors were encountered:
❓ General Questions
I am curious if there is a difference between the quantization methods, such as
q4f16_0
andq4f32_0
of this engine, and theq4_0
quantization of other LLM engines. If there is a difference, what is it?The text was updated successfully, but these errors were encountered: