Humanish-Roleplay-Llama-3.1-8B开源模型 - 免费部署实现人性化角色扮演互动

首页

Humanish Roleplay Llama 3.1 8B

由 vicgalle 开发

这是一个经过DPO微调的Llama-3.1模型，能表现得更'人性化'，避免AI助手式的生硬回答，适用于角色扮演场景。

大型语言模型

Transformers

开源协议:Apache-2.0 #人性化对话 #角色扮演优化 #非机械回复

下载量 1,880

发布时间 : 8/3/2024

模型简介

该模型通过DPO微调优化了Llama-3.1的表现，使其在对话中更自然、更具人性化特点，特别适合角色扮演场景。

模型特点

人性化对话

避免传统AI助手的生硬表述，提供更自然的人类对话体验

角色扮演优化

专门针对角色扮演场景进行优化，支持动作格式描述

情感表达丰富

能够通过文本表达丰富的情感和肢体语言

模型能力

自然语言生成

情感化对话

角色扮演

上下文理解

使用案例

娱乐应用

角色扮演游戏

用于创建沉浸式的角色扮演对话体验

生成富有情感和肢体语言的对话内容

社交互动

模拟人类社交互动中的自然对话

提供更真实的社交对话体验

🚀 Humanish-Roleplay-Llama-3.1-8B

这是一个经过DPO微调的Llama-3.1模型，能表现得更“人性化”，即避免所有AI助手式的生硬回答。它也适用于角色扮演（RP）场景。

image/webp

为了让模型表现得更“人性化”，即避免出现AI助手常见的生硬表述，同时适用于角色扮演（RP）场景，该模型在一系列数据集上进行了微调：

来自Claude Opus的通用对话数据，源自 Undi95/Meta-Llama-3.1-8B-Claude
Undi95/Weyaxi-humanish-dpo-project-noemoji，用于让模型像人类一样做出反应，拒绝类似助手或过于中立的回复
ResplendentAI/NSFW_RP_Format_DPO，引导模型在角色扮演场景中使用动作格式。如果在第一条消息中自然地使用这种格式，效果最佳（见示例）

🚀 快速开始

本部分将为你展示如何使用该模型进行对话生成。

💻 使用示例

基础用法

conversation = [{'role': 'user', 'content': """*With my face blushing in red* Tell me about your favorite film!"""}]

prompt = tokenizer.apply_chat_template(conversation, tokenize=False, add_generation_prompt=True)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device) 

outputs = model.generate(**inputs, max_new_tokens=512, do_sample=True, temperature=0.8)

模型的回复如下：

*Giggles nervously* Oh my, um, *clears throat*, well... I'm not really sure if I should be sharing my favorite film with you like this! *Blushes even more* But if you insist, *gulps*, I suppose I could tell you about this really romantic movie I saw once... *Leans in close and whispers* It's called "When Harry Met Sally"! *Squeals and covers mouth* You know, it's this super cute story about two friends who try to keep things platonic for a whole year, but end up falling for each other in the end! *Squirms in seat* The chemistry between the leads is just... *Sighs wistfully* There's this famous scene in a diner where... *Waves hand in front of face*... where you know what happens, and it's just so *Blushes bright red*... realistic and relatable! *Giggles* I mean, we've all been there, right? *Winks* Anyway, it's just such a classic rom-com with great dialogue and a perfect soundtrack! *Sways dreamily* I could watch it over and over again. *Tilts head* What about you, do you have a favorite film? *Peeks at you with big, hopeful eyes* *Squeals* I'm dying to know! *Bounces in seat excitedly*

📚 详细文档

微调脚本

用于训练该模型的脚本可在此文件中找到。在T4 GPU上训练耗时不到一小时，且需要以下库：

peft                     0.11.1
transformers             4.44.0.dev0
trl                      0.9.6

📄 许可证

本项目采用 apache-2.0 许可证。

精选推荐AI模型

Llama 3 Typhoon V1.5x 8b Instruct

专为泰语设计的80亿参数指令模型，性能媲美GPT-3.5-turbo，优化了应用场景、检索增强生成、受限生成和推理任务

Cadet-Tiny是一个基于SODA数据集训练的超小型对话模型，专为边缘设备推理设计，体积仅为Cosmo-3B模型的2%左右。

Roberta Base Chinese Extractive Qa

基于RoBERTa架构的中文抽取式问答模型，适用于从给定文本中提取答案的任务。

问答系统中文

uer

2,694

智启未来，您的人工智能解决方案智库

简体中文