KeywordGen-v2开源关键词生成模型 - 免费部署助力产品评论分析

首页

Keywordgen V2

由 mrutyunjay-patil 开发

KeywordGen-v2是基于T5的模型，专门用于从文本中生成关键词，特别适用于产品评论分析。

文本生成

Transformers

支持多种语言开源协议:Apache-2.0 #产品评论关键词提取 #T5微调 #电商文本分析

下载量 83

发布时间 : 8/6/2023

模型简介

该模型通过微调T5基础模型，能够从输入文本中提取2到8个单词的关键词，尤其擅长处理产品评论，帮助用户快速获取文本的核心主题。

模型特点

产品评论优化

特别针对产品评论进行优化，能够高效提取关键点或主题。

多关键词生成

能够生成2到8个单词的关键词，覆盖文本的多个核心主题。

前缀优化

通过在输入前添加'Keyword: '前缀，显著提升生成效果。

模型能力

文本生成

关键词提取

产品评论分析

使用案例

产品评论分析

电子产品评论关键词提取

从电子产品评论中提取关键词，帮助快速了解用户反馈的核心点。

生成如'屏幕色彩鲜艳'、'电池续航出色'等关键词。

多语言评论分析

支持从英文评论中提取关键词，适用于国际化产品分析。

生成与评论内容高度相关的英文关键词。

🚀 KeywordGen-v2模型

KeywordGen-v2是一款基于T5架构微调的模型，专门用于从文本中提取关键词。输入一段文本，模型就能输出与之相关的关键词，为信息提取提供便利。

🚀 快速开始

你可以直接使用文本生成管道来调用这个模型。使用时，为了获得最佳效果，请在输入文本前加上 "Keyword: "。

以下是使用Hugging Face Transformers库在Python中调用该模型的示例：

💻 使用示例

基础用法

from transformers import T5Tokenizer, T5ForConditionalGeneration

# Initialize the tokenizer and model
tokenizer = T5Tokenizer.from_pretrained("mrutyunjay-patil/keywordGen-v2")
model = T5ForConditionalGeneration.from_pretrained("mrutyunjay-patil/keywordGen-v2")

# Define your input sequence, prefixing with "Keyword: "
input_sequence = "Keyword: I purchased the new Android smartphone last week and I've been thoroughly impressed. The display is incredibly vibrant and sharp, and the battery life is surprisingly good, easily lasting a full day with heavy usage."

# Encode the input sequence
input_ids = tokenizer.encode(input_sequence, return_tensors="pt")

# Generate output
outputs = model.generate(input_ids)
output_sequence = tokenizer.decode(outputs[0], skip_special_tokens=True)

print(output_sequence)

高级用法

from transformers import T5Tokenizer, T5ForConditionalGeneration

# Initialize the tokenizer and model
tokenizer = T5Tokenizer.from_pretrained("mrutyunjay-patil/keywordGen-v2")
model = T5ForConditionalGeneration.from_pretrained("mrutyunjay-patil/keywordGen-v2")

# Define the prefix
task_prefix = "Keyword: "

# Define your list of input sequences
inputs = [
    "Absolutely love this tablet. It has a clear, sharp screen and runs apps smoothly without any hiccups.",
    "The headphones are fantastic with great sound quality, but the build quality could be better.",
    "Bought this smartwatch last week, and I'm thrilled with its performance. Battery life is impressive.",
    "This laptop exceeded my expectations. Excellent speed, plenty of storage, and light weight. Perfect for my needs.",
    "The camera quality on this phone is exceptional. It captures detailed and vibrant photos. However, battery life is not the best."
]

# Loop through each input and generate keywords
for sample in inputs:
    input_sequence = task_prefix + sample
    input_ids = tokenizer.encode(input_sequence, return_tensors="pt")
    outputs = model.generate(input_ids)
    output_sequence = tokenizer.decode(outputs[0], skip_special_tokens=True)
    print(sample, "\n --->", output_sequence)

✨ 主要特性

针对性微调：该模型 "KeywordGen-v2" 是 "KeywordGen" 系列的第二代产品，基于T5基础模型进行微调，专注于从文本输入中生成关键词，尤其擅长处理产品评论。
输出规范：模型输出的关键词长度在2到8个单词之间，能有效提取产品评论中的关键点或主题。
输入要求：当输入文本至少有2 - 3个句子时，模型性能更佳。