base_model:
- TheDrummer/Cydonia-24B-v2
- ReadyArt/Gaslit-Transgression-24B-v1.0
- ReadyArt/Forgotten-Safeword-24B-v4.0
- ReadyArt/The-Omega-Directive-M-24B-v1.1
- ReadyArt/Omega-Darker_The-Final-Directive-24B
- Mawdistical/Mawdistic-NightLife-24b
- Undi95/MistralThinker-v1.1
- cognitivecomputations/Dolphin-Mistral-24B-Venice-Edition
- AlexBefest/CardProjector-24B-v3
- arcee-ai/Arcee-Blitz
- TroyDoesAI/BlackSheep-24B
library_name: transformers
tags:
- mergekit
- merge
- sce
- della
- cabs
- not-for-all-audiences
- rp
- roleplay
language:
- en
pipeline_tag: text-generation
True-Abomination-24B
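This is a merge of pre-trained language models created using mergekit.
Merge Details
Description
This merge follows the same idea as Casual-Autopsy/L3-Super-Nova-RP-8B: stay compatible with as many SillyTavern features and extensions as possible while retaining roleplay ability.
Reasoning is not perfect, but the merge noticeably improves the model's old-school reasoning (stat blocks and thinking-box chain of thought), which I personally find less immersion-breaking than modern reasoning. I recommend setting the maximum number of reasoning prompts to 3 or more and/or injecting a chain-of-thought format prompt.
Merge Method
This model was merged using the SCE, Della, and CABS merge methods, with TheDrummer/Cydonia-24B-v2 as the base.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configurations were used to produce this model: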
Gaslit-Safeword
models:
  - model: TheDrummer/Cydonia-24B-v2
  - model: ReadyArt/Forgotten-Safeword-24B-v4.0
    parameters:
      weight: 0.4
      density: 0.35
      epsilon: 0.3
  - model: ReadyArt/Gaslit-Transgression-24B-v1.0
    parameters:
      weight: 0.4
      density: 0.35
      epsilon: 0.3
merge_method: della
base_model: TheDrummer/Cydonia-24B-v2
parameters:
  normalize: true
dtype: bfloat16
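To make the `density` and `epsilon` values in the Della stages more concrete, here is a minimal sketch. It assumes, as I understand mergekit's documentation, that per-parameter probabilities span `density - epsilon` to `density + epsilon` and must stay within 0 and 1. The helper names and the linear spread across magnitude ranks are illustrative assumptions, not mergekit's actual code.

```python
def della_probability_range(density, epsilon):
    """Return the (low, high) probability range implied by density and epsilon."""
    low, high = density - epsilon, density + epsilon
    if low < 0.0 or high > 1.0:
        raise ValueError("density +/- epsilon must stay within [0, 1]")
    return low, high


def rank_scaled_probabilities(magnitudes, density, epsilon):
    """Spread probabilities linearly over magnitude ranks (illustrative assumption).

    The smallest-magnitude entry gets the low end of the range and the
    largest-magnitude entry gets the high end.
    """
    low, high = della_probability_range(density, epsilon)
    n = len(magnitudes)
    if n == 1:
        return [density]
    # Indices sorted from smallest to largest absolute value.
    order = sorted(range(n), key=lambda i: abs(magnitudes[i]))
    probs = [0.0] * n
    for rank, idx in enumerate(order):
        probs[idx] = low + (high - low) * rank / (n - 1)
    return probs


# Values taken from the Della configs in this card.
low, high = della_probability_range(density=0.35, epsilon=0.3)
print(f"probability range: {low:.2f} to {high:.2f}")
```

With `density: 0.35` and `epsilon: 0.3`, the range is roughly 0.05 to 0.65, so it stays inside the valid interval.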
Omega-Duo
models:
  - model: TheDrummer/Cydonia-24B-v2
  - model: ReadyArt/The-Omega-Directive-M-24B-v1.1
    parameters:
      weight: 0.4
      density: 0.35
      epsilon: 0.3
  - model: ReadyArt/Omega-Darker_The-Final-Directive-24B
    parameters:
      weight: 0.4
      density: 0.35
      epsilon: 0.3
merge_method: della
base_model: TheDrummer/Cydonia-24B-v2
parameters:
  normalize: true
dtype: bfloat16
SCE-Abomination
models:
  - model: TheDrummer/Cydonia-24B-v2
  - model: Mawdistical/Mawdistic-NightLife-24b
  - model: Gaslit-Safeword
  - model: Omega-Duo
merge_method: sce
base_model: TheDrummer/Cydonia-24B-v2
parameters:
  select_topk: 0.8
dtype: bfloat16
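As a rough illustration of what `select_topk` controls in the SCE stages, the sketch below keeps the fraction of positions whose deltas (differences from the base model) vary most across the merged models. This is my reading of the parameter, and the function name `sce_select_mask` is hypothetical. It is a toy version on plain lists, not mergekit's implementation.

```python
from statistics import pvariance


def sce_select_mask(deltas, select_topk):
    """Return a boolean mask keeping the highest-variance positions.

    deltas: one equal-length list of delta values per merged model.
    select_topk: fraction of positions to keep, e.g. 0.8 or 0.45.
    """
    n = len(deltas[0])
    # Variance of each position across the models' deltas.
    variances = [pvariance([d[i] for d in deltas]) for i in range(n)]
    keep_count = max(1, int(select_topk * n))
    ranked = sorted(range(n), key=lambda i: variances[i], reverse=True)
    kept = set(ranked[:keep_count])
    return [i in kept for i in range(n)]


# Toy deltas for three models over four positions.
toy_deltas = [
    [0.0, 1.0, 0.2, -0.5],
    [0.0, -1.0, 0.2, 0.5],
    [0.0, 0.0, 0.2, 0.0],
]
print(sce_select_mask(toy_deltas, select_topk=0.5))
```

Positions where all models agree have zero variance and are dropped first. A higher `select_topk` such as 0.8 therefore keeps more of each model's contribution than the 0.45 used in the final stage.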
UNC-Reasoning
models:
  - model: SCE-Abomination
  - model: Undi95/MistralThinker-v1.1
    parameters:
      weight: 0.6
      n_val: 16
      m_val: 32
  - model: cognitivecomputations/Dolphin-Mistral-24B-Venice-Edition
    parameters:
      weight: 0.4
      n_val: 11
      m_val: 33
merge_method: cabs
default_n_val: 8
default_m_val: 32
pruning_order:
  - Undi95/MistralThinker-v1.1
  - cognitivecomputations/Dolphin-Mistral-24B-Venice-Edition
base_model: SCE-Abomination
dtype: bfloat16
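The `n_val` and `m_val` pairs in the CABS stages can be read as n:m sparsity settings. The sketch below assumes that n:m pruning keeps the `n` largest-magnitude entries in every block of `m` consecutive values, which is how I understand the CABS method. The function names are hypothetical, and the conflict-aware handling implied by `pruning_order` is not modeled.

```python
def nm_prune(values, n, m):
    """Keep the n largest-magnitude entries in each block of m; zero the rest."""
    if not 0 < n <= m:
        raise ValueError("expected 0 < n <= m")
    pruned = []
    for start in range(0, len(values), m):
        block = values[start:start + m]
        # Block-local indices ordered from largest to smallest magnitude.
        ranked = sorted(range(len(block)), key=lambda i: abs(block[i]), reverse=True)
        keep = set(ranked[:n])
        pruned.extend(v if i in keep else 0.0 for i, v in enumerate(block))
    return pruned


def kept_fraction(n, m):
    """Fraction of entries that survive n:m pruning."""
    return n / m


# n_val / m_val pairs taken from the CABS configs in this card.
settings = [
    ("Undi95/MistralThinker-v1.1", 16, 32),
    ("cognitivecomputations/Dolphin-Mistral-24B-Venice-Edition", 11, 33),
    ("default", 8, 32),
]
for label, n, m in settings:
    print(f"{label}: keeps {kept_fraction(n, m):.0%} of each block")
```

Under this reading, the 0.6-weight model in each CABS stage retains half of its delta entries, the 0.4-weight model about a third, and the defaults a quarter.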
INT-Multitasks
models:
  - model: SCE-Abomination
  - model: AlexBefest/CardProjector-24B-v3
    parameters:
      weight: 0.6
      n_val: 16
      m_val: 32
  - model: arcee-ai/Arcee-Blitz
    parameters:
      weight: 0.4
      n_val: 11
      m_val: 33
merge_method: cabs
default_n_val: 8
default_m_val: 32
pruning_order:
  - AlexBefest/CardProjector-24B-v3
  - arcee-ai/Arcee-Blitz
base_model: SCE-Abomination
dtype: bfloat16
True-Abomination-24B
models:
  - model: SCE-Abomination
  - model: TroyDoesAI/BlackSheep-24B
  - model: UNC-Reasoning
  - model: INT-Multitasks
merge_method: sce
base_model: SCE-Abomination
parameters:
  select_topk: 0.45
dtype: bfloat16
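Because later stages consume the outputs of earlier ones (for example, `SCE-Abomination` takes `Gaslit-Safeword` and `Omega-Duo` as inputs), the configs have to be run in dependency order. The sketch below derives one valid order with Python's standard `graphlib` and prints a `mergekit-yaml` command per stage. The config file names, output directories, and the `final-merge` label are hypothetical, and it assumes a mergekit build that supports the `cabs` method.

```python
from graphlib import TopologicalSorter

# Each stage mapped to the intermediate merges it consumes (from the configs above).
# "final-merge" is a placeholder label for the last config in this card.
STAGE_INPUTS = {
    "Gaslit-Safeword": [],
    "Omega-Duo": [],
    "SCE-Abomination": ["Gaslit-Safeword", "Omega-Duo"],
    "UNC-Reasoning": ["SCE-Abomination"],
    "INT-Multitasks": ["SCE-Abomination"],
    "final-merge": ["SCE-Abomination", "UNC-Reasoning", "INT-Multitasks"],
}


def build_order(stage_inputs):
    """Return the stages ordered so every input is built before it is used."""
    return list(TopologicalSorter(stage_inputs).static_order())


for stage in build_order(STAGE_INPUTS):
    # Hypothetical layout: one YAML file per stage, output in a same-named folder,
    # so later configs can reference earlier outputs by their local path.
    print(f"mergekit-yaml {stage}.yml ./{stage}")
```

The two Della stages are independent of each other, as are the two CABS stages, so each pair can run in either order or in parallel.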