What is `general.quantization_version` in the GGUF spec used for? #1366

qskousen · 2025-10-13T23:02:09Z

Reading through the required keys in the spec, one of them is `general.quantization_version`, but I'm not clear on exactly what this field is for. According to the docs, it is unrelated to the quantization scheme. In what case would it be used?

CISC · 2025-10-17T12:53:59Z

It's used for changes to the quantization formats themselves; see ggml-org/llama.cpp#1508, where the scaling factor in Q4 and Q8 changed from F32 to F16.
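
To make the answer concrete: because the field versions the on-disk block layout rather than naming a scheme, a loader can compare it against the layout version it was built for and refuse files it would otherwise misread. Below is a minimal sketch of that check, assuming a little-endian GGUF v2+ header layout (magic, version, tensor count, KV count, then key/value pairs). The function names and the `supported` parameter are hypothetical and this is not llama.cpp's actual loader code.

```python
import io
import struct

# GGUF scalar metadata value types mapped to struct formats.
# Little-endian is assumed; big-endian GGUF files are not handled here.
_SCALARS = {0: "<B", 1: "<b", 2: "<H", 3: "<h", 4: "<I", 5: "<i",
            6: "<f", 7: "<?", 10: "<Q", 11: "<q", 12: "<d"}
_STRING, _ARRAY = 8, 9


def _read(f, fmt):
    """Read one fixed-size value from the stream."""
    size = struct.calcsize(fmt)
    data = f.read(size)
    if len(data) != size:
        raise EOFError("truncated GGUF header")
    return struct.unpack(fmt, data)[0]


def _read_string(f):
    """GGUF strings are a uint64 length followed by UTF-8 bytes."""
    length = _read(f, "<Q")
    return f.read(length).decode("utf-8")


def _read_value(f, vtype):
    """Read one metadata value of the given GGUF value type."""
    if vtype in _SCALARS:
        return _read(f, _SCALARS[vtype])
    if vtype == _STRING:
        return _read_string(f)
    if vtype == _ARRAY:
        elem_type = _read(f, "<I")
        count = _read(f, "<Q")
        return [_read_value(f, elem_type) for _ in range(count)]
    raise ValueError(f"unknown GGUF value type {vtype}")


def read_quantization_version(f):
    """Return general.quantization_version, or None if the key is absent."""
    if f.read(4) != b"GGUF":
        raise ValueError("not a GGUF file")
    if _read(f, "<I") < 2:
        raise ValueError("GGUF v1 uses 32-bit counts; not handled in this sketch")
    _read(f, "<Q")             # tensor count, not needed here
    kv_count = _read(f, "<Q")  # number of metadata key/value pairs
    for _ in range(kv_count):
        key = _read_string(f)
        value = _read_value(f, _read(f, "<I"))
        if key == "general.quantization_version":
            return value
    return None


def check_quantization_version(f, supported):
    """Hypothetical loader-side guard: reject files whose quantized block
    layout differs from the one this reader was written for."""
    found = read_quantization_version(f)
    if found is not None and found != supported:
        raise ValueError(
            f"quantization version {found} does not match supported {supported}"
        )
    return found
```

Usage would look like `check_quantization_version(open(path, "rb"), supported=N)`, where `N` is whatever layout version the reading code implements. A file written before a change like the F32 to F16 scale switch would carry a different value and be rejected instead of being decoded with the wrong block size.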
