drama-large-xnli-anli开源零样本分类模型 - 支持15种语言自然语言推理任务

首页

Drama Large Xnli Anli

由 mjwong 开发

基于facebook/drama-large在XNLI和ANLI数据集上微调的零样本分类模型，支持15种语言的自然语言推理任务。

大型语言模型

Safetensors

支持多种语言#多语言NLI #零样本分类 #语义推理

下载量 23

发布时间 : 3/1/2025

模型简介

该模型是用于零样本分类和多语言自然语言推理任务的微调版本，特别适用于跨语言文本分类和推理场景。

模型特点

多语言支持

支持15种语言的零样本分类和自然语言推理任务

高性能推理

在XNLI和ANLI数据集上表现出色，特别是在英语和其他主要语言上

零样本分类能力

无需特定任务训练即可对新类别进行分类

模型能力

零样本文本分类

多语言自然语言推理

跨语言文本理解

多类别分类

使用案例

文本分类

多语言内容分类

对多种语言的文本内容进行分类，如新闻分类、产品评论分类等

在15种语言上表现出良好的分类准确率

自然语言推理

跨语言文本推理

判断两种语言文本之间的逻辑关系（蕴含、中立或矛盾）

在XNLI数据集上英语准确率达79.9%，其他语言准确率在59.4%-75.4%之间

🚀 drama-large-xnli-anli

本模型是 facebook/drama-large 在XNLI和ANLI数据集上的微调版本。它可用于零样本分类任务，支持多种语言，为跨语言的文本分类提供了强大的解决方案。

🚀 快速开始

模型描述

DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers。 Xueguang Ma, Xi Victoria Lin, Barlas Oguz, Jimmy Lin, Wen - tau Yih, Xilun Chen, arXiv 2025

如何使用模型

零样本分类管道

可以使用 zero-shot-classification 管道加载模型，示例代码如下：

from transformers import AutoTokenizer, pipeline
model = "mjwong/drama-large-xnli-anli"
classifier = pipeline("zero-shot-classification",
                      model=model)

然后可以使用此管道将序列分类到指定的任何类别名称中。

sequence_to_classify = "one day I will see the world"
candidate_labels = ['travel', 'cooking', 'dancing']
classifier(sequence_to_classify, candidate_labels)

如果多个候选标签都可能正确，可以传递 multi_class=True 来独立计算每个类别：

candidate_labels = ['travel', 'cooking', 'dancing', 'exploration']
classifier(sequence_to_classify, candidate_labels, multi_class=True)

手动使用PyTorch

该模型也可用于NLI任务，示例代码如下：

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# device = "cuda:0" or "cpu"
device = torch.device("cuda") if torch.cuda.is_available() else torch.device("cpu")

model_name = "mjwong/drama-large-xnli-anli"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

premise = "But I thought you'd sworn off coffee."
hypothesis = "I thought that you vowed to drink more coffee."

input = tokenizer(premise, hypothesis, truncation=True, return_tensors="pt")
output = model(input["input_ids"].to(device))
prediction = torch.softmax(output["logits"][0], -1).tolist()
label_names = ["entailment", "neutral", "contradiction"]
prediction = {name: round(float(pred) * 100, 2) for pred, name in zip(prediction, label_names)}
print(prediction)

📊 评估结果

多语言XNLI测试集评估

该模型使用XNLI测试集在15种语言上进行了评估，评估指标为准确率。

数据集	英语 (en)	阿拉伯语 (ar)	保加利亚语 (bg)	德语 (de)	希腊语 (el)	西班牙语 (es)	法语 (fr)	印地语 (hi)	俄语 (ru)	斯瓦希里语 (sw)	泰语 (th)	土耳其语 (tr)	乌尔都语 (ur)	越南语 (vi)	中文 (zh)
drama-base-xnli-anli	0.788	0.689	0.708	0.715	0.696	0.732	0.737	0.647	0.711	0.636	0.676	0.664	0.588	0.708	0.710
drama-large-xnli-anli	0.799	0.698	0.730	0.721	0.717	0.754	0.754	0.649	0.718	0.652	0.678	0.656	0.594	0.719	0.719

MultiNLI开发集和ANLI测试集评估

该模型还使用MultiNLI开发集和ANLI测试集进行了评估，评估指标为准确率。

数据集	mnli_dev_m	mnli_dev_mm	anli_test_r1	anli_test_r2	anli_test_r3
drama-base-xnli-anli	0.781	0.787	0.500	0.420	0.440
drama-large-xnli-anli	0.794	0.796	0.534	0.446	0.452

🔧 技术细节

训练超参数

训练期间使用了以下超参数：

学习率（learning_rate）：2e - 05
训练批次大小（train_batch_size）：16
评估批次大小（eval_batch_size）：16
随机种子（seed）：42
优化器（optimizer）：Adam，其中betas = (0.9, 0.999)，epsilon = 1e - 08
学习率调度器类型（lr_scheduler_type）：线性
学习率调度器预热比例（lr_scheduler_warmup_ratio）：0.1

框架版本

Transformers：4.49.0
Pytorch：2.6.0 + cu124
Datasets：3.2.0
Tokenizers：0.21.0

📄 许可证

本模型使用CC - BY - NC - 4.0许可证。

📦 数据集

xnli
facebook/anli

🛠️ 任务类型

零样本分类（zero - shot - classification）

📋 模型信息

属性	详情
模型类型	drama - large - xnli - anli
基础模型	facebook/drama - large
支持语言	多语言（英语、阿拉伯语、保加利亚语、德语、希腊语、西班牙语、法语、印地语、俄语、斯瓦希里语、泰语、土耳其语、乌尔都语、越南语、中文）