indobert-analisis-sentimen-review-produk开源模型 - 免费部署做印尼语产品评论情感分类

首页

Indobert Analisis Sentimen Review Produk

由 siRendy 开发

基于IndoBERT微调的印尼语产品评论情感分类模型，支持正面和负面两种情感分类。

文本分类

Transformers

其他开源协议:MIT #印尼语情感分析 #电商评论分类 #高准确率BERT

下载量 58

发布时间 : 4/7/2025

模型简介

该模型专用于印尼语产品评论的情感分类任务，能够将用户评论划分为正面(POSITIF)和负面(NEGATIF)两类情感。

模型特点

高准确率

在验证集上达到94.43%的准确率和94.42%的F1值。

电商场景优化

专门针对Tokopedia电商平台的产品评论数据进行训练和优化。

轻量级微调

基于预训练的IndoBERT模型进行轻量级微调，训练周期仅3个epoch。

模型能力

印尼语文本分类

情感分析

产品评论分析

使用案例

电商分析

产品评论情感分析

自动分析电商平台上用户对产品的评论情感倾向。

准确识别94%以上的评论情感

客户反馈监控

实时监控客户反馈中的负面评论，及时发现问题产品。

🚀 IndoBERT情感分类ID - 二分类

本模型是基于indobenchmark/indobert-base-p2进行微调得到的，用于对印尼语产品评论进行二分类情感分析（积极和消极）。所使用的数据集包含了来自电商平台Tokopedia的10600条产品评论。

🚀 快速开始

本模型旨在将评论或评价分为两种情感类别：积极和消极。

✨ 主要特性

评估指标：本模型使用准确率（Accuracy）和F1分数（F1 Score）进行评估。
基础模型：基于indobenchmark/indobert-base-p2。

📦 安装指南

文档未提及安装步骤，暂不展示。

💻 使用示例

基础用法

# Load model dan tokenizer
from transformers import AutoTokenizer, AutoModelForSequenceClassification, pipeline
import torch

# Load dari Hugging Face Hub
model = AutoModelForSequenceClassification.from_pretrained("siRendy/indobert-analisis-sentimen-review-produk")
tokenizer = AutoTokenizer.from_pretrained("siRendy/indobert-analisis-sentimen-review-produk")

# Fungsi prediksi
def predict_sentiment(text):
    classifier = pipeline(
        "text-classification",
        model=model,
        tokenizer=tokenizer,
        device=0 if torch.cuda.is_available() else -1
    )

    result = classifier(text)[0]
    return {
        "sentiment": str(result["label"]),
        "confidence": round(result["score"], 4)
    }

# Contoh penggunaan
text = "Produk ini sangat buruk dan tidak layak dibeli."
prediction = predict_sentiment(text)
print(prediction)

📚 详细文档

训练参数配置

以下是训练时使用的训练参数配置（TrainingArguments）：

from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./results",
    num_train_epochs=3,
    per_device_train_batch_size=4,
    gradient_accumulation_steps=4,
    per_device_eval_batch_size=4,
    weight_decay=0.05,
    eval_strategy="epoch",
    save_strategy="epoch",
    seed=42,
    load_best_model_at_end=True,
    metric_for_best_model="f1",
    logging_dir="./logs",
    report_to="tensorboard",
    logging_steps=100,
    warmup_ratio=0.05,
)