first commit
5
templates/modelscope/.gitattributes
vendored
Normal file
@@ -0,0 +1,5 @@
*.gguf filter=lfs diff=lfs merge=lfs -text
*.dat filter=lfs diff=lfs merge=lfs -text
*.md text eol=lf
*.json text eol=lf
.gitattributes text eol=lf
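As a rough illustration of what the `.gitattributes` rules above imply, the following Python sketch classifies file names by the two LFS patterns. The helper name `is_lfs_tracked` is hypothetical, and `fnmatchcase` on the basename is only an approximation of Git's own pattern matching for slash-free patterns, not a reimplementation of it.

```python
from fnmatch import fnmatchcase

# Patterns copied from the .gitattributes in this commit.
LFS_PATTERNS = ("*.gguf", "*.dat")


def is_lfs_tracked(path: str) -> bool:
    """Approximate check: does the basename match one of the LFS patterns?

    Hypothetical helper for illustration only; Git's real attribute
    matching has more rules than this basename comparison.
    """
    name = path.rsplit("/", 1)[-1]
    return any(fnmatchcase(name, pattern) for pattern in LFS_PATTERNS)


if __name__ == "__main__":
    for candidate in ("demo-IQ4_KS.gguf", "demo.imatrix.dat", "README.md"):
        print(candidate, is_lfs_tracked(candidate))
```

Under these assumptions, the quantized weights and the imatrix file would be stored through LFS, while the Markdown and JSON files stay as normal text with LF line endings.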
38
templates/modelscope/README.template.md
Normal file
@@ -0,0 +1,38 @@
---
tags:
  - text-generation
  - gguf
tasks:
  - text-generation
license: Apache License 2.0
---

# <Model Name> Quantized GGUF

## Model Description

Briefly describe the model's purpose, quantization goals, and target use cases.

## Weight Source

- Original model source: `<HF repo>`
- Artifacts in this repository: `<quant types>`

## Quantization Method

- Conversion: HF safetensors -> BF16 GGUF
- Calibration: imatrix
- Export: IQ4_KS / IQ5_K / IQ6_K

## Calibration Data Sources

- Base calibration data
- Code conversation data
- Code preference data

## File Contents

- `<model>-IQ4_KS.gguf`
- `<model>-IQ5_K.gguf`
- `<model>-IQ6_K.gguf`
- `<model>.imatrix.dat`
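The file list at the end of the README template follows a simple naming pattern, which could be expanded for a concrete model roughly as sketched below. The function `expected_files` and the example model name are hypothetical; only the quantization type names and the file name shapes come from the template.

```python
# Quantization types listed in the README template.
QUANT_TYPES = ("IQ4_KS", "IQ5_K", "IQ6_K")


def expected_files(model: str, quant_types=QUANT_TYPES) -> list[str]:
    """Return the artifact names the template lists for one model.

    Hypothetical helper: one GGUF file per quantization type,
    followed by the imatrix calibration file.
    """
    files = [f"{model}-{quant}.gguf" for quant in quant_types]
    files.append(f"{model}.imatrix.dat")
    return files


if __name__ == "__main__":
    # "demo-model" is a placeholder name, not a real repository.
    for name in expected_files("demo-model"):
        print(name)
```

A list like this could be compared against the files actually present before a release is uploaded.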
10
templates/modelscope/configuration.template.json
Normal file
@@ -0,0 +1,10 @@
{
  "framework": "ggml",
  "task": "text-generation",
  "model": {
    "type": "qwen35"
  },
  "pipeline": {
    "type": "text-generation"
  }
}
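A filled-in copy of the configuration template could be sanity-checked with a short script such as the one below. This is a sketch under stated assumptions: the function `check_config` is hypothetical, the set of required keys is simply the set of top-level keys in the template, and the rule that `task` should equal `pipeline.type` is inferred from the template's values rather than taken from any ModelScope specification.

```python
import json

# Contents of configuration.template.json from this commit.
CONFIG_TEXT = """
{
  "framework": "ggml",
  "task": "text-generation",
  "model": {"type": "qwen35"},
  "pipeline": {"type": "text-generation"}
}
"""

# Assumption: every top-level key in the template is required.
REQUIRED_KEYS = ("framework", "task", "model", "pipeline")


def check_config(cfg: dict) -> list[str]:
    """Return a list of human-readable problems; empty means no issue found."""
    problems = [f"missing key: {key}" for key in REQUIRED_KEYS if key not in cfg]
    # Assumption: the task name and the pipeline type are meant to agree.
    if cfg.get("task") != cfg.get("pipeline", {}).get("type"):
        problems.append("task and pipeline.type differ")
    return problems


if __name__ == "__main__":
    print(check_config(json.loads(CONFIG_TEXT)))
```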