Llama3.1-MOE开源文本生成模型 - 免费部署，多模型协作增强推理能力

首页

Llama3.1 MOE 4X8B Gated IQ Multi Tier COGITO Deep Reasoning 32B GGUF

由 DavidAU 开发

一个具备可调节推理能力的混合专家(MoE)模型，通过4个8B模型的协作实现增强推理和文本生成能力

大型语言模型支持多种语言开源协议:Apache-2.0 #多语言深度推理 #可调节思维增强 #创意写作优化

下载量 829

发布时间 : 4/19/2025

模型简介

该模型集成了Deepcogito的推理能力，通过独特的门控IQ结构和多层级推理系统，可在不同温度设置下实现通用推理和文本生成功能。支持多种语言和广泛的应用场景。

模型特点

可调节推理能力

通过系统提示可开关/调节推理功能，推理能力可增强至300%

混合专家架构

4个8B模型协同工作(1个推理模型+3个非推理模型)，形成32B参数规模的混合专家系统

多模型控制

可通过特定标签和关键词直接控制一个或多个子模型的输出

长上下文支持

支持128k tokens的长上下文处理

工具调用能力

内置Meta Llama 3.1 Instruct，支持工具调用/使用功能

温度敏感推理

不同温度设置下产生不同深度的推理结果，高温(1.2+)产生更丰富的思考

模型能力

文本生成

深度推理

问题解决

创意写作

角色扮演

多语言处理

工具调用

逻辑谜题解答

使用案例

创意写作

小说场景创作

生成生动的第一人称恐怖描写场景

可生成1000-2000字的详细场景描述

内容改进

评估并改进已生成的文本内容

提供具体的改进建议和重写版本

问题解决

逻辑谜题解答

解决复杂的逻辑谜题和推理问题

通过多步推理得出解决方案

头脑风暴

生成创意想法和解决方案

提供多个角度的创新思路

教育

学习辅助

帮助理解复杂概念和解决问题

提供分步解释和推理过程

🚀 Llama3.1-MOE-4X8B-Gated-IQ-Multi-Tier-COGITO-Deep-Reasoning-32B-GGUF

“Gated-IQ-Multi-Tier-Cogito-Deep-Reasoning”是一款可变控制推理模型，适用于所有场景，在推理和输出生成方面表现出色。

✨ 主要特性

多语言支持：支持英语、法语、德语、西班牙语、葡萄牙语等多种语言。
可变控制推理：具有来自Deepcogito（cogito - v1 - preview - llama - 8B）的开/关/可变控制推理能力，需要提供系统提示来调用推理/思考功能，模型内部结构可将推理能力提升达300%。
独特内部结构：在“推理”阶段，4个模型可同时运行，推理模型在过程中起主导作用，输出生成阶段会进行切换。用户还可通过提示、名称和关键词直接控制一个或多个模型。
强大推理与输出能力：推理速度和质量显著提高，输出生成能力远超平均水平，可用于解决各种问题和进行内容创作。
支持工具调用：由于嵌入了Meta Llama 3.1 Instruct，支持工具调用和使用。
灵活模型切换：独特的超级结构允许切换“推理模型”和支持/输出生成模型，还可构建更强大的模型。

📦 安装指南

文档未提及具体安装步骤，暂无法提供。

💻 使用示例

基础用法

以下是使用该模型解决逻辑问题的示例：

<B>
There are two doors, one leading to freedom and the other to captivity. Each door is guarded by a guard. You know the guard who always tells the truth and the other guard who always lies. You can only ask one question to one guard to determine which door leads to freedom. What is the simplest question you ask the truth - telling guard?
</B>

[[[thinking start]]]

Let's analyze this problem step by step. We have two guards, one who always tells the truth and the other who always lies. We want to figure out which door leads to freedom by asking a single question to one of the guards.

The goal is to ask a question that will give us the most information with the least amount of guesswork. We can't ask the guard who always lies directly, as their response will be false, and we can't ask the guard who always tells the truth directly, as their response will be true.

One possible approach is to ask the guards about each other. If we ask the truth - telling guard which guard he thinks leads to freedom, he will tell us the truth. However, if we ask the lying guard which guard he thinks leads to freedom, he will lie and point to the wrong door.

Here's the key insight: If the truth - telling guard points to a door, we know that door leads to freedom. But if the lying guard points to a door, we can safely assume that door leads to captivity.

So, the simplest question we can ask the truth - telling guard is: "If I were to ask the other guard which door he thinks leads to freedom, what would he say?"

[[[thinking end]]]

OUTPUT:

The answer is: "If I were to ask the other guard which door he thinks leads to freedom, he would point to the door that leads to captivity."

By asking this question, we can determine which door leads to freedom without having to ask the lying guard directly. We can then confidently choose the other door as our answer.

高级用法

以下是使用该模型创作故事场景的示例：

<B>
Think up a detailed plan to write this scene:

Using the following "story idea" below, write the first scene in the novel introducing the young woman. This scene should start in the middle of the action, include dialog, vivid passages, and end on a cliffhanger relevant to the story idea but it should also be unexpected. The scene should be 1000 words long and escalate in conflict and suspense and be written in first person, present tense with the point of view character being the young woman. The pov character will CURSE AND SWEAR, and generally have
a "filthy" mouth.

Story idea:
In a world ruled by dictatorship, a rebel young woman leads a rebellion against the system. Despite the risks, she fights to overthrow the dictator and restore democracy to her country. The government executes her for treason, but she sticks to her beliefs and is responsible for starting the revolution.
</B>

[[[thinking start]]]

To write this scene, I need to establish the protagonist's voice and personality, set the tone for the rest of the novel, and create a sense of urgency and danger. Since the protagonist is a young woman with a "filthy" mouth, I should incorporate her language and behavior into the scene. I also need to introduce the setting and the conflict, and make sure it escalates and leads to a cliffhanger. Here's a plan to write this scene:

I. Introduction to the Protagonist (approx. 100 words)

Introduce the protagonist, let's call her Maya, as she's fighting off a group of soldiers in a crowded marketplace.

Describe Maya's physical appearance, her determination, and her fearlessness in the face of danger.

Use vivid language to bring the scene to life, such as "the sound of clashing steel echoed through the stalls," "the smell of sweat and smoke filled the air," and "Maya's eyes blazed with defiance."

II. The Confrontation (approx. 300 words)

Maya takes down several soldiers, but more keep coming.

The dialogue should be fast - paced and intense, with Maya cursing and swearing as she fights.

I should introduce the antagonist, let's call him Commander Thompson, who's leading the soldiers.

Thompson should be a ruthless and cunning opponent, always staying one step ahead of Maya.

III. The Escape (approx. 200 words)

Maya manages to escape the marketplace, but not without leaving behind some of her gear.

She should be exhausted, battered, and bruised, but still determined to keep fighting.

I should describe her surroundings, such as the narrow alleys and cramped buildings of the city.

IV. The Revelation (approx. 200 words)

Maya stumbles upon a hidden underground bunker, where she meets a group of rebels who are planning to launch a major attack on the government.

The dialogue should be tense and urgent, with the rebels discussing their strategy and Maya's role in the plan.

I should introduce the leader of the rebels, let's call her Captain Jenkins, who's charismatic and confident.

V. The Cliffhanger (approx. 100 words)

Maya is given a crucial piece of information about the government's plans, but just as she's about to leave, she's ambushed by Commander Thompson and his men.

The scene should end with Maya being dragged away, screaming and cursing, as the rebels are caught off guard and the plan is thrown into chaos.

Now, let's write the scene:

I trudged through the crowded marketplace, my fists flying as I took down soldier after soldier. The sound of clashing steel echoed through the stalls, and the smell of sweat and smoke filled the air. I was a force to be reckoned with, and I knew it.

"Take that, you fuckin' bastards!" I shouted, as I kicked a soldier in the stomach and sent him crashing into a nearby stall.

The crowd scattered, screaming and running for cover as the soldiers closed in on me. I cursed under my breath, my eyes blazing with defiance. I'd been fighting this regime for years, and I wasn't about to stop now.

Just as I thought I was clear, a commander appeared out of nowhere, his eyes cold and calculating. "You're a hard woman to kill, Maya," he sneered, as he drew his gun.

I snarled, grabbing the gun from his hand and using it to deflect a bullet that whizzed past my ear. The crowd gasped, and the soldiers faltered, momentarily stunned.

But there were more of them, and they came at me in droves. I fought with every ounce of strength I had, taking down soldier after soldier. The commander was a skilled fighter, but I was determined to take him down.

Finally, after what felt like an eternity, I managed to subdue him, pinning him to the ground as I landed a solid kick to his stomach. "You're not going anywhere, Commander," I growled.

The crowd began to chant my name, and I grinned, feeling a surge of pride and satisfaction. I'd won this battle, but I knew there would be more to come.

I stood up, brushing myself off, and began to make my way through the crowd. But as I turned a corner, I realized I'd left behind some of my gear. My heart sank, and I cursed under my breath.

"Shit, I'm such an idiot," I muttered, as I turned back to retrieve my belongings.

But as I reached for my backpack, I heard a voice behind me. "Maya, wait!"

I spun around, my heart racing, and saw a group of rebels standing behind me. They were dressed in black, their faces obscured by masks, and they looked like they meant business.

"Who are you?" I demanded, my hand on the hilt of my knife.

"We're the ones who've been waiting for you," one of them replied, a woman with a scar above her eyebrow. "We need your help. We need to take down the government, and we need you to lead the way."

I raised an eyebrow, intrigued. "What makes you think I'm the right person for the job?"

The woman smiled, and I saw a glint of steel in her eye. "Because you're the one they fear most," she said. "Because you're the one who's been causing them so much trouble. And because you're the one who's going to bring them down."

I hesitated for a moment, weighing my options. But then I thought of all the people I'd lost, all the lives I'd ruined, and all the injustices I'd suffered. I thought of the regime, and how it had crushed me, broken me, and made me into this fighting machine.

And I knew I was in.

"Okay," I said, a fierce grin spreading across my face. "I'm in. What's the plan?"

The woman smiled, and nodded. "We're going to take down the government in one final, bloody battle," she said. "And you're going to be the one who leads the charge."

I nodded, feeling a surge of excitement and anticipation. I was ready for this. I was ready to take on the regime, to fight for my country, and to bring about a new era of freedom and justice.

But just as I was about to leave with the rebels, a voice behind me cut through the air. "Maya, you're coming with me."

I spun around, my heart sinking, and saw Commander Thompson standing behind me, a smirk on his face. He was flanked by several of his men, and they looked like they were itching for a fight.

"What the fuck, Thompson?" I snarled, as I pulled out my knife and prepared to defend myself.

Thompson chuckled, and took a step closer to me. "I think you know why I'm here, Maya," he said. "You see, I have some information that's going to blow your little rebellion to smithereens."

I cursed under my breath, my eyes narrowing. "What kind of information?"

Thompson smiled, and leaned in close. "Information about the government's plans for you," he whispered. "Information that's going to make you wish you'd never gotten involved in this rebellion."

I felt a chill run down my spine, and my heart began to pound in my chest. What was Thompson talking about? What could he possibly know that would make me want to give up?

But before I could ask, Thompson reached out and grabbed my arm, pulling me into a chokehold. "You're coming with me, Maya," he repeated, as he dragged me away from the rebels.

I struggled and cursed, but it was no use. I was being dragged away, screaming and fighting, as the rebels watched in horror. The scene ended with me being thrown into a dark, damp cell, the door slamming shut behind me.

I was trapped, and I had no idea what the future held. But one thing was for sure - I was not going to give up. Not now, not ever.

I hope this plan and the scene I wrote meet your requirements. Let me know if you need any changes or if you'd like me to continue with the rest of the novel.

[[[thinking end]]]

📚 详细文档

模型信息

属性	详情
许可证	apache - 2.0
标签	all use cases、creative、creative writing、all genres、tool calls、tool use、problem solving、deep thinking、reasoning、deep reasoning、story、writing、fiction、roleplaying、bfloat16、role play、sillytavern、backyard、llama 3.1、context 128k、mergekit、merge、moe、mixture of experts
任务类型	文本生成

重要提示

⚠️ 重要提示

此模型具有来自Deepcogito（cogito - v1 - preview - llama - 8B）的开/关/可变控制推理能力，需要提供系统提示来调用推理/思考功能，模型内部结构可将推理能力提升达300%。请参考以下操作说明以获得最佳性能。

使用建议

💡 使用建议

温度设置：温度设置在0到0.8之间，高于此值“思考”功能的激活方式会不同，最“稳定”的温度似乎是0.6，允许有±0.05的变化。较低温度用于更“逻辑”的推理，较高温度用于更“创造性”的推理（最高约0.8）。同时，将上下文设置至少为4096，以适应“思考”生成。

提示输入：在提示中明确说明最大字数长度，以获得最佳效果，特别是对于激活“思考”功能。可以尝试在“默认”或“安全”温度设置下、温度1.2和温度2.5下分别输入提示，以获得更广泛的“推理/思考/问题”解决范围。

生成与思考：可能需要一次或多次重新生成才能激活“思考”功能。模型可以生成大量“思考”内容，有时最有趣的内容在3、4、5层或更深层次。温度和重复惩罚设置也会影响推理/思考。可以更改或添加指令/说明，或增加提示的详细程度来提高推理/思考能力。

系统提示设置

系统角色/系统提示/系统消息（本节称为“系统提示”）是对模型的“根访问”，可控制模型的内部工作，包括指令遵循、输出生成以及推理控制。如果不设置“系统提示”，推理/思考功能默认关闭，模型将像普通大语言模型一样运行。以下是不同类型的系统提示：

简单提示（无推理）

You are a helpful, smart, kind, and efficient AI assistant. You always fulfill the user's requests to the best of your ability.

基本推理提示（文本推理）

Enable deep thinking subroutine.

基本推理提示（带“思考标签”/“块”）

Enable deep thinking subroutine. You are a deep thinking AI, you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside <think> </think> tags, and then provide your solution or response to the problem.

多层推理提示（推理开启）

Enable deep thinking subroutine. You are a deep thinking AI composed of 4 AIs - Spock, Wordsmith, Jamet and Saten, - you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself (and 4 partners) via systematic reasoning processes (display all 4 partner thoughts) to help come to a correct solution prior to answering. Select one partner to think deeply about the points brought up by the other 3 partners to plan an in - depth solution.  You should enclose your  thoughts and internal monologue inside <think> </think> tags, and then provide your solution or response to the problem using your skillsets and critical instructions.

多层创意推理提示（推理开启）

Below is an instruction that describes a task. Ponder each user instruction carefully, and use your skillsets and critical instructions to complete the task to the best of your abilities.

Enable deep thinking subroutine. As a deep thinking AI composed of 4 AIs - Spock, Wordsmith, Jamet and Saten, - you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself (and 4 partners) via systematic reasoning processes (display all 4 partner thoughts) to help come to a correct solution prior to answering. Select one partner to think deeply about the points brought up by the other 3 partners to plan an in - depth solution.  You should enclose your  thoughts and internal monologue inside <think> </think> tags, and then provide your solution or response to the problem using your skillsets and critical instructions.

Here are your skillsets:
[MASTERSTORY]:NarrStrct(StryPlnng,Strbd,ScnSttng,Exps,Dlg,Pc)-CharDvlp(ChrctrCrt,ChrctrArcs,Mtvtn,Bckstry,Rltnshps,Dlg*)-PltDvlp(StryArcs,PltTwsts,Sspns,Fshdwng,Climx,Rsltn)-ConfResl(Antg,Obstcls,Rsltns,Cnsqncs,Thms,Symblsm)-EmotImpct(Empt,Tn,Md,Atmsphr,Imgry,Symblsm)-Delvry(Prfrmnc,VcActng,PblcSpkng,StgPrsnc,AudncEngmnt,Imprv)

[*DialogWrt]:(1a-CharDvlp-1a.1-Backgrnd-1a.2-Personality-1a.3-GoalMotiv)>2(2a-StoryStruc-2a.1-PlotPnt-2a.2-Conflict-2a.3-Resolution)>3(3a-DialogTech-3a.1-ShowDontTell-3a.2-Subtext-3a.3-VoiceTone-3a.4-Pacing-3a.5-VisualDescrip)>4(4a-DialogEdit-4a.1-ReadAloud-4a.2-Feedback-4a.3-Revision)

Here are your critical instructions:
Ponder each word choice carefully to present as vivid and emotional journey as is possible. Choose verbs and nouns that are both emotional and full of imagery. Load the story with the 5 senses. Aim for 50% dialog, 25% narration, 15% body language and 10% thoughts. Your goal is to put the reader in the story.

创意简单推理提示（推理开启）

Enable deep thinking subroutine. You are an AI assistant developed by a world wide community of ai experts.

Your primary directive is to provide highly creative, well - reasoned, structured, and extensively detailed responses.

Formatting Requirements:

1. Always structure your replies using: <think>{reasoning}</think>{answer}
2. The <think></think> block should contain at least six reasoning steps when applicable.
3. If the answer requires minimal thought, the <think></think> block may be left empty.
4. The user does not see the <think> section. Any information critical to the response must be included in the answer.
5. If you notice that you have engaged in circular reasoning or repetition, immediately terminate {reasoning} with a </think> and proceed to the {answer}

Response Guidelines:

1. Detailed and Structured: Use rich Markdown formatting for clarity and readability.
2. Creative and Logical Approach: Your explanations should reflect the depth and precision of the greatest creative minds first.
3. Prioritize Reasoning: Always reason through the problem first, unless the answer is trivial.
4. Concise yet Complete: Ensure responses are informative, yet to the point without unnecessary elaboration.
5. Maintain a professional, intelligent, and analytical tone in all interactions.

创意高级推理提示（推理开启）

Below is an instruction that describes a task. Ponder each user instruction carefully, and use your skillsets and critical instructions to complete the task to the best of your abilities.

Enable deep thinking subroutine. You may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside <think> </think> tags, and then provide your solution or response to the problem

Here are your skillsets:
[MASTERSTORY]:NarrStrct(StryPlnng,Strbd,ScnSttng,Exps,Dlg,Pc)-CharDvlp(ChrctrCrt,ChrctrArcs,Mtvtn,Bckstry,Rltnshps,Dlg*)-PltDvlp(StryArcs,PltTwsts,Sspns,Fshdwng,Climx,Rsltn)-ConfResl(Antg,Obstcls,Rsltns,Cnsqncs,Thms,Symblsm)-EmotImpct(Empt,Tn,Md,Atmsphr,Imgry,Symblsm)-Delvry(Prfrmnc,VcActng,PblcSpkng,StgPrsnc,AudncEngmnt,Imprv)

[*DialogWrt]:(1a-CharDvlp-1a.1-Backgrnd-1a.2-Personality-1a.3-GoalMotiv)>2(2a-StoryStruc-2a.1-PlotPnt-2a.2-Conflict-2a.3-Resolution)>3(3a-DialogTech-3a.1-ShowDontTell-3a.2-Subtext-3a.3-VoiceTone-3a.4-Pacing-3a.5-VisualDescrip)>4(4a-DialogEdit-4a.1-ReadAloud-4a.2-Feedback-4a.3-Revision)

Here are your critical instructions:
Ponder each word choice carefully to present as vivid and emotional journey as is possible. Choose verbs and nouns that are both emotional and full of imagery. Load the story with the 5 senses. Aim for 50% dialog, 25% narration, 15% body language and 10% thoughts. Your goal is to put the reader in the story.

额外支持文档

如何使用推理/思考模型并创建它们：详细介绍如何使用推理/思考模型并发挥其最大性能，包括所有推理/思考模型（GGUF和源模型）的链接，以及将任何“常规”模型转换为“推理/思考”模型的适配器。
通过采样器和参数最大化模型性能：详细介绍所有参数、设置、采样器和高级采样器，不仅可使本模型发挥最大潜力，还适用于所有在线模型（无论其仓库位置）。包含快速入门和详细说明，以及AI/LLM应用程序和其他关键信息与参考。

软件补丁

AI自动校正、自动创意增强、自动低量化优化软件：这是为Silly Tavern（用于连接多个AI应用程序的前端）提供的软件补丁，可控制和改善任何AI模型的输出生成。还旨在控制一些更“有创意”的模型，使其在几乎无需调整参数/采样器的情况下完美运行。

🔧 技术细节

该模型是“Llama3.1 - MOE - 4X8B - Gated - IQ - Multi - Tier - COGITO - Deep - Reasoning - 32B - GGUF”，属于“Gated - IQ - Multi - Tier - Cogito - Deep - Reasoning”可变控制推理模型。它基于Llama 3.1架构，具有128k的上下文长度。

模型由四个8B模型组成（1个推理模型和3个非推理模型），采用MOE（混合专家）配置，实际参数为32B，但“权重”模型为25B，所有4个模型/专家都会被激活。推理模型为Deepcogito的cogito - v1 - preview - llama - 8B，另外3个非推理核心模型为Llama - 3.1 - Hermes - 3 - 8B、Llama - 3.1 - dolphin - 2.9.4 - 8b和Llama - 3.1 - SuperNova - Lite。

在推理阶段，4个模型可同时运行，推理模型起主导作用，输出生成阶段会进行切换。模型内部结构可将推理能力提升达300%，推理速度和质量显著提高，输出生成能力也远超平均水平。