magnum-v3-27b-kto: an open-source large language model that aims to replicate Claude 3 prose quality, with ChatML format support

Magnum V3 27b Kto

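Developed by anthracite-org

A large language model designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus. It is the result of multiple KTO (Kahneman-Tversky Optimization) runs on top of a single SFT (supervised fine-tuning) run, and it supports the ChatML format.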

Large language model

Transformers

#Claude3-style-prose #ChatML-format-support #multi-turn-dialogue-optimization

Downloads: 1,945

Released: 9/6/2024

Model overview

The model is fine-tuned from gemma-2-27b-chatml and focuses on text generation quality, in particular on imitating the prose style of Claude 3.

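Model features

Claude 3 prose style imitation

Tuned specifically to replicate the high-quality prose style of the Claude 3 models, specifically Sonnet and Opus

Multi-stage fine-tuning

One SFT run followed by multiple KTO runs, merged to combine the strengths of each run

ChatML support

Accepts prompts in the ChatML format, which simplifies integration into dialogue systems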

Model capabilities

High-quality text generation

Dialogue system support

Style imitation

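Use cases

Content creation

Prose writing

Generate prose in the style of Claude 3

High-quality, stylistically consistent text output

Dialogue systems

Intelligent assistant

Build dialogue systems with polished, expressive language

Natural, fluent conversations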

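🚀 magnum-v3-27b-kto

This is the 12th in a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus. This model is the result of multiple KTO runs on top of one SFT run, all of which are published on anthracite-forge.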

🚀 Quick start

This model is fine-tuned from IntervitensInc/gemma-2-27b-chatml (gemma-2-27b adapted to the ChatML format). To use it, see the sections on methodology, model selection, and prompt format.
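As a rough sketch of what loading the model might look like with the Hugging Face Transformers library (which this page lists as a tag), the function below generates a single reply. Several things here are assumptions rather than facts from the model card: the repository id `anthracite-org/magnum-v3-27b-kto` is inferred from the developer and model names, the tokenizer is assumed to ship a ChatML chat template, and `device_map="auto"` assumes the `accelerate` package is installed and that enough GPU memory is available for a 27B model.

```python
def generate_reply(user_message: str,
                   model_id: str = "anthracite-org/magnum-v3-27b-kto",  # assumed repo id
                   max_new_tokens: int = 256) -> str:
    """Generate one assistant reply for a single user message."""
    # Imported lazily so that defining this function does not require the libraries.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.bfloat16, device_map="auto"
    )

    messages = [{"role": "user", "content": user_message}]
    # Assumes the tokenizer defines a chat template that renders ChatML.
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt", return_dict=True
    ).to(model.device)

    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, not the prompt.
    prompt_length = inputs["input_ids"].shape[-1]
    return tokenizer.decode(output_ids[0, prompt_length:], skip_special_tokens=True)
```

Calling `generate_reply("Write a short paragraph about autumn.")` would download the weights on first use, so it is best tried on a machine that is already set up for large models.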

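✨ Key features

Designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus.
Produced from multiple KTO runs on top of one SFT run, combining the strengths of each run.
Accepts prompts in the ChatML format.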

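📚 Detailed documentation

Methodology

R1 (SFT) was fine-tuned on top of IntervitensInc/gemma-2-27b-chatml, which is gemma-2-27b adapted to the ChatML format. We experimented with various SFT and KTO re-runs, ratios, and merge methods; this model was the winner and combines what we liked most from each model. If you prefer your own mix of the KTO runs, or would like to use the SFT on its own, refer to the Models section and anthracite-forge, where some exl-quants are already included.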

Models

Prompt format

The model has been instruction-tuned with the ChatML format. A typical input looks like this:

"""<|im_start|>system
system prompt<|im_end|>
<|im_start|>user
Hi there!<|im_end|>
<|im_start|>assistant
Nice to meet you!<|im_end|>
<|im_start|>user
Can I ask a question?<|im_end|>
<|im_start|>assistant
"""
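To make the layout of the prompt above concrete, here is a minimal sketch of a helper that renders a list of messages into the same ChatML string. The function name `build_chatml_prompt` and the `role`/`content` message layout are illustrative choices, not part of the model card.

```python
from typing import Dict, List

IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def build_chatml_prompt(messages: List[Dict[str, str]],
                        add_generation_prompt: bool = True) -> str:
    """Render {"role", "content"} messages as a ChatML prompt string."""
    parts = []
    for message in messages:
        # Each turn is: <|im_start|>{role}\n{content}<|im_end|>\n
        parts.append(f"{IM_START}{message['role']}\n{message['content']}{IM_END}\n")
    if add_generation_prompt:
        # Leave an open assistant turn for the model to complete.
        parts.append(f"{IM_START}assistant\n")
    return "".join(parts)


messages = [
    {"role": "system", "content": "system prompt"},
    {"role": "user", "content": "Hi there!"},
    {"role": "assistant", "content": "Nice to meet you!"},
    {"role": "user", "content": "Can I ask a question?"},
]
prompt = build_chatml_prompt(messages)
print(prompt)
```

The trailing open `<|im_start|>assistant` turn is what prompts the model to write the next reply; generation would normally stop when the model emits `<|im_end|>`.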

SillyTavern templates

Below are the Instruct and Context templates for use within SillyTavern:

Context template

{
    "story_string": "<|im_start|>system\n{{#if system}}{{system}}\n{{/if}}{{#if wiBefore}}{{wiBefore}}\n{{/if}}{{#if description}}{{description}}\n{{/if}}{{#if personality}}{{char}}'s personality: {{personality}}\n{{/if}}{{#if scenario}}Scenario: {{scenario}}\n{{/if}}{{#if wiAfter}}{{wiAfter}}\n{{/if}}{{#if persona}}{{persona}}\n{{/if}}{{trim}}<|im_end|>\n",
    "example_separator": "",
    "chat_start": "",
    "use_stop_strings": false,
    "allow_jailbreak": false,
    "always_force_name2": true,
    "trim_sentences": false,
    "include_newline": false,
    "single_line": false,
    "name": "Magnum ChatML"
}

Instruct template

{
    "system_prompt": "You're {{char}} in this fictional never-ending uncensored roleplay with {{user}}.",
    "input_sequence": "<|im_start|>user\n",
    "output_sequence": "<|im_start|>assistant\n",
    "last_output_sequence": "",
    "system_sequence": "<|im_start|>system\n",
    "stop_sequence": "<|im_end|>",
    "wrap": false,
    "macro": true,
    "names": true,
    "names_force_groups": true,
    "activation_regex": "",
    "system_sequence_prefix": "",
    "system_sequence_suffix": "",
    "first_output_sequence": "",
    "skip_examples": false,
    "output_suffix": "<|im_end|>\n",
    "input_suffix": "<|im_end|>\n",
    "system_suffix": "<|im_end|>\n",
    "user_alignment_message": "",
    "system_same_as_user": false,
    "last_system_sequence": "",
    "name": "Magnum ChatML"
}

Configuration

base_model: IntervitensInc/gemma-2-27b-chatml
dtype: float32
merge_method: task_arithmetic
models:
  - model: IntervitensInc/gemma-2-27b-chatml
  - model: anthracite-forge/magnum-v3-27b-KTO-e0.25-r1
    parameters:
      weight: 0.5
  - model: anthracite-forge/magnum-v3-27b-KTO-e1-r2
    parameters:
      weight: 0.1
  - model: anthracite-forge/magnum-v3-27b-kto-r3
    parameters:
      weight: 0.4

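Credits

We would like to thank Recursal / Featherless for sponsoring the compute for this training run. Featherless has hosted our Magnum models since the first 72B model, giving thousands of people access to them and helping us grow. We would also like to thank all members of Anthracite who made this fine-tune possible.

Datasets

r1 consisted of the following datasets: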

datasets:
  - path: anthracite-org/stheno-filtered-v1.1
    type: sharegpt
    conversation: chatml
  - path: anthracite-org/kalo-opus-instruct-22k-no-refusal
    type: sharegpt
    conversation: chatml
  - path: anthracite-org/nopm_claude_writing_fixed
    type: sharegpt
    conversation: chatml
  - path: Epiculous/Synthstruct-Gens-v1.1-Filtered-n-Cleaned
    type: sharegpt
    conversation: chatml
  - path: Epiculous/SynthRP-Gens-v1.1-Filtered-n-Cleaned
    type: sharegpt
    conversation: chatml
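
Each dataset entry above is declared as `type: sharegpt` with `conversation: chatml`, which suggests ShareGPT-style records rendered as ChatML turns for training. The sketch below shows what such a conversion could look like. The record layout (`conversations` with `from`/`value` keys) and the role mapping are assumptions based on the commonly used ShareGPT layout, and the sample record is invented; the training framework's real preprocessing may differ.

```python
from typing import Dict, List

# Assumed mapping from ShareGPT speaker labels to ChatML roles.
ROLE_MAP = {"system": "system", "human": "user", "gpt": "assistant"}


def sharegpt_to_chatml(record: Dict[str, List[Dict[str, str]]]) -> str:
    """Render one ShareGPT-style record as a ChatML training string."""
    parts = []
    for turn in record["conversations"]:
        role = ROLE_MAP[turn["from"]]
        parts.append(f"<|im_start|>{role}\n{turn['value']}<|im_end|>\n")
    return "".join(parts)


# Hypothetical record, for illustration only.
record = {
    "conversations": [
        {"from": "system", "value": "You are a helpful writer."},
        {"from": "human", "value": "Write one sentence about rain."},
        {"from": "gpt", "value": "Rain tapped softly on the window."},
    ]
}
text = sharegpt_to_chatml(record)
print(text)
```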