70B-L3.3-mhnnn-x1开源大语言模型 - 免费实现创意文本生成与多任务处理

首页

70B L3.3 Mhnnn X1

由 Sao10K 开发

基于Llama-3-70B-Instruct微调的大语言模型，专注于创意文本生成和多任务处理

大型语言模型

Transformers

#创意文本生成 #无道德中立输出 #多场景适配

下载量 150

发布时间 : 1/1/2025

模型简介

该模型是基于Meta的Llama-3-70B-Instruct微调而来，特别优化了创意文本生成能力，适用于小说续写、文字冒险游戏、无道德助手和角色扮演等多种场景。

模型特点

高创意输出

模型特别优化了创意文本生成能力，适合小说创作和故事续写

多任务适应

可处理文字冒险游戏、无道德助手、角色扮演等多种任务类型

灵活提示格式

支持Llama-3-Instruct提示格式，通过系统提示可调整模型行为

模型能力

文本续写

对话生成

故事创作

文字冒险游戏生成

角色扮演对话

无限制内容生成

使用案例

创意写作

小说续写

根据已有小说内容继续生成连贯的后续情节

生成富有创意的故事情节，偶尔需要重新生成以获得最佳结果

游戏开发

文字冒险游戏

生成文字冒险游戏的旁白和剧情发展

只需提供单样本示范，模型就能生成连贯的游戏内容

对话系统

无道德助手

生成不受道德限制的对话内容

在提示中加入'无道德'等术语可获得更自由的回答

🚀 70B-L3.3-mhnnn-x1

本项目基于transformers库，以meta-llama/Llama-3.3-70B-Instruct为基础模型。该模型具有一定的创新性和实用性，在经过一些调整后能输出较为有创意的内容，不过偶尔也会出现一些小问题。

yeah 事情进展不顺利时我的精神状态

🚀 快速开始

数据类型

完成 - 小说/电子书
文字冒险 - 在系统提示中包含类似“文字冒险叙述者”的细节，给它一个单样本示例，它就能很好地运行。
非道德助手 - 在常规助手提示中包含“非道德”“中立”等术语，以获得更好的结果。
指令/助手 - 常规的助手任务。
角色扮演 - 与往常一样，使用常规设置。

✨ 主要特性

与Freya具有相同的数据组成，但应用方式不同。
经过一番调试后，能输出较有创意的内容，但偶尔会出现一些小错误，重新生成即可解决。

🔧 技术细节

训练信息

训练总时长约为14小时，在8xH100节点上完成。
联系信息：https://sao10k.carrd.co/

Axolotl配置

查看axolotl配置

axolotl版本：0.6.0

adapter: lora # 16-bit
lora_r: 64
lora_alpha: 64
lora_dropout: 0.2
peft_use_rslora: true
lora_target_linear: true
  
# 数据
dataset_prepared_path: dataset_run_freya
datasets:
# S1 - 写作/完成
  - path: datasets/eBooks-cleaned-75K
    type: completion
  - path: datasets/novels-clean-dedupe-10K
    type: completion
# S2 - 指令
  - path: datasets/10k-amoral-full-fixed-sys.json
    type: chat_template
    chat_template: llama3
    roles_to_train: ["gpt"]
    field_messages: conversations
    message_field_role: from
    message_field_content: value
    train_on_eos: turn
  - path: datasets/44k-hespera-smartshuffle.json
    type: chat_template
    chat_template: llama3
    roles_to_train: ["gpt"]
    field_messages: conversations
    message_field_role: from
    message_field_content: value
    train_on_eos: turn
  - path: datasets/5k_rpg_adventure_instruct-sys.json
    type: chat_template
    chat_template: llama3
    roles_to_train: ["gpt"]
    field_messages: conversations
    message_field_role: from
    message_field_content: value
    train_on_eos: turn
shuffle_merged_datasets: true
warmup_ratio: 0.1

plugins:
  - axolotl.integrations.liger.LigerPlugin
liger_rope: true
liger_rms_norm: true
liger_layer_norm: true
liger_glu_activation: true
liger_fused_linear_cross_entropy: true

# 迭代
num_epochs: 1

# 采样
sample_packing: true
pad_to_sequence_len: true
train_on_inputs: false
group_by_length: false

# 批处理
gradient_accumulation_steps: 4
micro_batch_size: 2
gradient_checkpointing: unsloth

# 评估
val_set_size: 0.025
evals_per_epoch: 5
eval_table_size:
eval_max_new_tokens: 256
eval_sample_packing: false
eval_batch_size: 1

# 优化器
optimizer: paged_ademamix_8bit
lr_scheduler: cosine
learning_rate: 0.00000242
weight_decay: 0.2
max_grad_norm: 10.0

# 垃圾回收
gc_steps: 10

# 其他
deepspeed: ./deepspeed_configs/zero3_bf16.json