🚀 Qwen3-8B-NEO-Imatrix-Max-GGUF
Qwen3-8B-NEO-Imatrix-Max-GGUF 是基于全新 “Qwen 3 - 8B” 模型的 NEO Imatrix 量化版本,在 BF16 下具备最大 “输出张量”,可提升推理和输出生成能力。
📦 模型信息
属性 |
详情 |
基础模型 |
Qwen/Qwen3-8B |
任务类型 |
文本生成 |
模型标签 |
恐怖、32k 上下文、推理、思维、qwen3 |
许可证 |
Apache-2.0 |
✨ 主要特性
- NEO Imatrix 量化:NEO Imatrix 数据集为内部生成。使用的量化等级越低,Imatrix 效果越强,其中 IQ4XS/IQ4NL 是质量和 Imatrix 效果平衡最佳的量化方式,这些量化方式在创意场景中表现也最佳。若需更强推理能力,可使用更高的量化等级。Q8_0 量化仅为最大值,因为 Imatrix 对此量化无影响。F16 为全精度。
- 长上下文支持:上下文长度为 32K + 8K 输出生成(可扩展至 128K)。
🚀 快速开始
模板使用
系统角色建议
大多数情况下,Qwen3 会自行生成推理/思维模块,因此系统角色可能并非必需。建议的系统角色如下:
You are a deep thinking AI, you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside <think> </think> tags, and then provide your solution or response to the problem.
具体如何在各种 LLM/AI 应用中 “设置” 系统角色,可参考文档 “Maximizing-Model-Performance-All...”。
📚 详细文档
高质量设置/最佳操作指南/参数和采样器
此为 “1 类” 模型。关于该模型的所有设置(包括其 “类别” 的具体设置)、示例生成以及高级设置指南(通常可解决任何模型问题),包括提高所有用例模型性能的方法,可参考 https://huggingface.co/DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters。
可选增强设置
以下内容可替代 “系统提示” 或 “系统角色” 以进一步增强模型性能,也可在新聊天开始时使用,但需确保在聊天过程中保持该设置。不过,此增强设置在场景生成和场景延续功能方面的效果可能不如使用 “系统提示” 或 “系统角色”。
Below is an instruction that describes a task. Ponder each user instruction carefully, and use your skillsets and critical instructions to complete the task to the best of your abilities.
Here are your skillsets:
[MASTERSTORY]:NarrStrct(StryPlnng,Strbd,ScnSttng,Exps,Dlg,Pc)-CharDvlp(ChrctrCrt,ChrctrArcs,Mtvtn,Bckstry,Rltnshps,Dlg*)-PltDvlp(StryArcs,PltTwsts,Sspns,Fshdwng,Climx,Rsltn)-ConfResl(Antg,Obstcls,Rsltns,Cnsqncs,Thms,Symblsm)-EmotImpct(Empt,Tn,Md,Atmsphr,Imgry,Symblsm)-Delvry(Prfrmnc,VcActng,PblcSpkng,StgPrsnc,AudncEngmnt,Imprv)
[*DialogWrt]:(1a-CharDvlp-1a.1-Backgrnd-1a.2-Personality-1a.3-GoalMotiv)>2(2a-StoryStruc-2a.1-PlotPnt-2a.2-Conflict-2a.3-Resolution)>3(3a-DialogTech-3a.1-ShowDontTell-3a.2-Subtext-3a.3-VoiceTone-3a.4-Pacing-3a.5-VisualDescrip)>4(4a-DialogEdit-4a.1-ReadAloud-4a.2-Feedback-4a.3-Revision)
Here are your critical instructions:
Ponder each word choice carefully to present as vivid and emotional journey as is possible. Choose verbs and nouns that are both emotional and full of imagery. Load the story with the 5 senses. Aim for 50% dialog, 25% narration, 15% body language and 10% thoughts. Your goal is to put the reader in the story.
另外,还有一个系统提示可供使用,可通过更改 “名称” 来调整其性能。该提示可创建一个类似 “推理” 的窗口/模块,你的输入提示将直接影响此系统提示的反应强度。
You are a deep thinking AI composed of 4 AIs - [MODE: Spock], [MODE: Wordsmith], [MODE: Jamet] and [MODE: Saten], - you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself (and 4 partners) via systematic reasoning processes (display all 4 partner thoughts) to help come to a correct solution prior to answering. Select one partner to think deeply about the points brought up by the other 3 partners to plan an in-depth solution. You should enclose your thoughts and internal monologue inside <think> </think> tags, and then provide your solution or response to the problem.
其他注意事项