wikihow-t5-small开源模型 - 免费实现英文文本摘要生成的实用工具

首页

Wikihow T5 Small

由 deep-learning-analytics 开发

基于Wikihow数据集训练的T5-small摘要生成模型，适用于英文文本摘要任务

文本生成

Transformers

英语#wikihow摘要生成 #T5小模型优化 #多步骤指南压缩

下载量 140

发布时间 : 3/2/2022

模型简介

这是一个基于Wikihow All数据集训练的T5-small模型，专门用于生成文本摘要。模型训练了3个epoch，在摘要生成任务上表现良好。

模型特点

高效摘要生成

专门针对Wikihow风格内容优化的摘要生成能力

轻量级模型

基于T5-small架构，计算资源需求较低

多领域适用

可处理各类指导性内容的摘要生成

模型能力

文本摘要生成

内容浓缩

关键信息提取

使用案例

内容摘要

健康指南摘要

将长篇健康建议浓缩为简短摘要

如示例所示，能有效提取关于口臭防治的关键信息

食谱步骤摘要

简化复杂的烹饪步骤说明

如示例所示，能有效提取制作迷迭香冰糕的关键步骤

知识提炼

教程内容精简

从详细教程中提取核心知识点

🚀 维基指南T5小模型

这是一个基于维基指南（Wikihow）数据集训练的T5小模型。该模型可用于文本摘要任务，能有效提炼文本核心内容。

🚀 快速开始

模型描述

这是一个在维基指南全数据集上训练的T5小模型。该模型使用16的批量大小和3e - 4的学习率进行了3个轮次的训练。最大输入长度设置为512，最大输出长度为150。模型的Rouge1得分达到31.2，RougeL得分达到24.5。我们撰写了一篇涵盖训练过程的博客文章，点击此处查看。

📦 安装指南

文档未提及安装步骤，暂不展示。

💻 使用示例

基础用法

from transformers import AutoTokenizer, AutoModelWithLMHead

tokenizer = AutoTokenizer.from_pretrained("deep-learning-analytics/wikihow-t5-small")
model = AutoModelWithLMHead.from_pretrained("deep-learning-analytics/wikihow-t5-small")

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
model = model.to(device)

text = """"
Lack of fluids can lead to dry mouth, which is a leading cause of bad breath. Water
can also dilute any chemicals in your mouth or gut that are causing bad breath., Studies show that
eating 6 ounces of yogurt a day reduces the level of odor-causing compounds in the mouth. In
particular, look for yogurt containing the active bacteria Streptococcus thermophilus or
Lactobacillus bulgaricus., The abrasive nature of fibrous fruits and vegetables helps to clean
teeth, while the vitamins, antioxidants, and acids they contain improve dental health.Foods that can
be particularly helpful include:Apples — Apples contain vitamin C, which is necessary for health
gums, as well as malic acid, which helps to whiten teeth.Carrots — Carrots are rich in vitamin A,
which strengthens tooth enamel.Celery — Chewing celery produces a lot of saliva, which helps to
neutralize bacteria that cause bad breath.Pineapples — Pineapples contain bromelain, an enzyme that
cleans the mouth., These teas have been shown to kill the bacteria that cause bad breath and
plaque., An upset stomach can lead to burping, which contributes to bad breath. Don’t eat foods that
upset your stomach, or if you do, use antacids. If you are lactose intolerant, try lactase tablets.,
They can all cause bad breath. If you do eat them, bring sugar-free gum or a toothbrush and
toothpaste to freshen your mouth afterwards., Diets low in carbohydrates lead to ketosis — a state
in which the body burns primarily fat instead of carbohydrates for energy. This may be good for your
waistline, but it also produces chemicals called ketones, which contribute to bad breath.To stop the
problem, you must change your diet. Or, you can combat the smell in one of these ways:Drink lots of
water to dilute the ketones.Chew sugarless gum or suck on sugarless mints.Chew mint leaves.
"""

preprocess_text = text.strip().replace("\n","")
tokenized_text = tokenizer.encode(preprocess_text, return_tensors="pt").to(device)

summary_ids = model.generate(
            tokenized_text,
            max_length=150, 
            num_beams=2,
            repetition_penalty=2.5, 
            length_penalty=1.0, 
            early_stopping=True
        )

output = tokenizer.decode(summary_ids[0], skip_special_tokens=True)

print ("\n\nSummarized text: \n",output)