IndoBERTweet-HateSpeech开源模型 - 精准识别印尼语中的仇恨言论

首页

Indobertweet HateSpeech

由 Exqrch 开发

IndoBERTweet-HateSpeech是一个在IndoToxic2024数据集上微调的模型，用于识别印尼语中的仇恨言论，具有较高的准确性和F1分数。

文本分类

Transformers

#印尼语仇恨言论检测 #高精度微调模型 #社交媒体内容审核

下载量 1,485

发布时间 : 7/26/2024

模型简介

该模型基于IndoBERTweet架构，在印尼语仇恨言论检测任务上进行了微调，适用于社交媒体内容审核等场景。

模型特点

高性能

在IndoToxic2024数据集上准确率达到0.89，宏F1分数为0.78

专业分词器支持

支持使用indolem/indobertweet-base-uncased分词器

交叉验证结果

性能指标通过分层10折交叉验证获得

模型能力

印尼语文本分类

仇恨言论检测

社交媒体内容分析

使用案例

内容审核

社交媒体仇恨言论过滤

自动识别印尼语社交媒体中的仇恨言论内容

准确识别率89%

学术研究

印尼语仇恨言论研究

为语言学和社交媒体研究提供分析工具

🚀 IndoBERTweet-HateSpeech

IndoBERTweet-HateSpeech是一个在IndoToxic2024数据集上微调的模型，可用于识别印尼语中的仇恨言论。该模型具有较高的准确性和F1分数，能有效助力相关领域的研究和应用。

🚀 快速开始

你可以按照以下步骤使用该模型：

import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Specify the model and tokenizer name
model_name = "Exqrch/IndoBERTweet-HateSpeech"
tokenizer_name = "indolem/indobertweet-base-uncased"

# Load the pre-trained model
model = AutoModelForSequenceClassification.from_pretrained(model_name)

# Load the tokenizer
tokenizer = AutoTokenizer.from_pretrained(tokenizer_name)

text = "selamat pagi semua!"

output = model(**tokenizer(text, return_tensors="pt"))
logits = output.logits

# Get the predicted class label
predicted_class = torch.argmax(logits, dim=-1).item()

print(predicted_class)
--- Output ---
> 0
--- End of Output ---

✨ 主要特性

高性能：IndoBERTweet在IndoToxic2024数据集上进行了微调，准确率达到0.89，宏F1分数为0.78。这些性能指标是通过分层10折交叉验证获得的。
支持的分词器：支持使用indolem/indobertweet-base-uncased分词器。

💻 使用示例

基础用法

import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Specify the model and tokenizer name
model_name = "Exqrch/IndoBERTweet-HateSpeech"
tokenizer_name = "indolem/indobertweet-base-uncased"

# Load the pre-trained model
model = AutoModelForSequenceClassification.from_pretrained(model_name)

# Load the tokenizer
tokenizer = AutoTokenizer.from_pretrained(tokenizer_name)

text = "selamat pagi semua!"

output = model(**tokenizer(text, return_tensors="pt"))
logits = output.logits

# Get the predicted class label
predicted_class = torch.argmax(logits, dim=-1).item()

print(predicted_class)

示例输出

Model name: Exqrch/IndoBERTweet-HateSpeech
Text 1: Kenapa sih mereka berantem terus?
Prediction: 0
Text 2: Orang gila emang elu!
Prediction: 1

📚 详细文档

局限性

该模型仅在印尼语文本上进行了训练，对于代码混合文本（code-switched text）的性能没有相关信息。

引用方式

如果使用该模型，请引用以下论文：

@article{susanto2024indotoxic2024,
      title={IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language}, 
      author={Lucky Susanto and Musa Izzanardi Wijanarko and Prasetia Anugrah Pratama and Traci Hong and Ika Idris and Alham Fikri Aji and Derry Wijaya},
      year={2024},
      eprint={2406.19349},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2406.19349}, 
}