Files
llm-gguf-quant-template/templates/modelscope/README.template.md
2026-03-02 23:22:33 +08:00

39 lines
591 B
Markdown
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
---
tags:
- text-generation
- gguf
tasks:
- text-generation
license: Apache License 2.0
---
# <Model Name> Quantized GGUF
## 模型说明
简述模型用途、量化目标和适配场景。
## 权重来源
- 原始模型来源:`<HF repo>`
- 本仓库产物:`<quant types>`
## 量化方法
- 转换HF safetensors -> BF16 GGUF
- 校准imatrix
- 导出IQ4_KS / IQ5_K / IQ6_K
## 校准数据来源
- 基础校准数据
- 代码对话数据
- 代码偏好数据
## 文件内容
- `<model>-IQ4_KS.gguf`
- `<model>-IQ5_K.gguf`
- `<model>-IQ6_K.gguf`
- `<model>.imatrix.dat`