flan-t5-base-summarization-pt-br: an open-source model fine-tuned for Brazilian Portuguese text summarization

Flan T5 Base Summarization Pt Br

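Developed by PamelaBorelli

A Brazilian Portuguese text summarization model based on FLAN-T5 and optimized through instruction fine-tuning

Text Generation

Transformers

License: MIT #PortugueseSummarization #InstructionTuning #T5Architecture

Downloads: 71

Released: 6/14/2024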

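Model Overview

This text summarization model is built on the FLAN-T5 architecture and fine-tuned specifically for Brazilian Portuguese.

Model Features

Multilingual base

The underlying FLAN-T5 model supports multiple languages, but this checkpoint is fine-tuned specifically for Brazilian Portuguese.

Instruction fine-tuning

The model was fine-tuned twice, first on a text translation task and then on a text summarization task, to improve its performance on summarization.

Summary generation

Generates concise summaries of Portuguese text.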

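Model Capabilities

Text summarization

Portuguese text processing

Use Cases

News summarization

News article summaries

Automatically generates short summaries of news articles so readers can grasp the main points quickly.

Academic research

Paper summary generation

Generates concise summaries of academic papers so researchers can skim them quickly.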

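🚀 Model Card for PamelaBorelli/flan-t5-base-summarization-pt-br

This model performs text summarization in Portuguese. It is fine-tuned from flan-t5-base on a task-specific dataset to improve summarization of Portuguese text.

🚀 Quick Start

Usage Example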

from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("PamelaBorelli/flan-t5-base-summarization-pt-br")
model = T5ForConditionalGeneration.from_pretrained("PamelaBorelli/flan-t5-base-summarization-pt-br")

input_text = "O corpo está mais propenso a sentir dores com exercícios de alta intensidade | Foto: Getty Images O problema está em saber identificar qual é qual. \"Em algumas situações, é difícil diferenciar uma da outra\", reconhece Juan Francisco Marco, professor do Centro de Ciência do Esporte, Treinamento e Fitness Alto Rendimento, na Espanha. \"A dor boa é aquela que associamos ao exercício físico, que não limita (o movimento) e permite continuar (a se exercitar) até o momento em que o músculo fica realmente esgotado e não trabalha mais\", explica. É importante detectar qual é o tipo de dor que você está sentindo, para evitar ter problemas mais sérios | Foto: Getty Images Para Francisco Sánchez Diego, diretor do centro de treinamento Corpore 10, \"a dor boa se sente no grupo muscular que você trabalhou, tanto durante o treinamento como nos dias seguintes\"."
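# Tokenize the article and return PyTorch tensors
input_ids = tokenizer(input_text, return_tensors="pt").input_ids

# Without a length argument, generate() uses the default generation length, which can cut the summary short;
# 256 matches max_target_length in the tokenization parameters further down this card
outputs = model.generate(input_ids, max_new_tokens=256)
# skip_special_tokens drops markers such as <pad> and </s> from the printed summary
print(tokenizer.decode(outputs[0], skip_special_tokens=True))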
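
The tokenization parameters listed further down this card wrap each training input in an instruction prompt (`start_prompt` and `end_prompt`), while the example above passes raw text. The following is a minimal sketch of inference that mirrors that prompt, on the assumption that matching the training format helps; the card does not confirm this. The helper names `build_prompt` and `summarize`, and the choice to truncate inputs at 256 tokens, are illustrative and not part of the original card.

```python
# Prompt pieces copied verbatim from the card's tokenization parameters.
START_PROMPT = "Sumarize: \n"
END_PROMPT = "\n\nSumário: "


def build_prompt(text: str) -> str:
    """Wrap raw text in the instruction prompt described in the card (hypothetical helper)."""
    return START_PROMPT + text.strip() + END_PROMPT


def summarize(text, tokenizer, model, max_input_length=256, max_new_tokens=256):
    """Summarize text with an already-loaded tokenizer and model (hypothetical helper).

    Assumption: inputs are truncated to max_input_length tokens, as in the
    card's tokenization parameters.
    """
    inputs = tokenizer(
        build_prompt(text),
        return_tensors="pt",
        truncation=True,
        max_length=max_input_length,
    )
    # Unpacking passes input_ids (and attention_mask, when present) to generate().
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
```

With the `tokenizer` and `model` objects loaded as in the usage example, the call would be `summarize(input_text, tokenizer, model)`. Note that truncation removes tokens from the end of the sequence, so for long articles the closing `end_prompt` may be cut off.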

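✨ Key Features

Multilingual base model: the original model, [flan-t5-base](https://huggingface.co/google/flan-t5-base#model-details), is a multilingual model with 248 million parameters and an encoder-decoder architecture based on T5 (Text-to-Text Transfer Transformer).
Two-stage fine-tuning: the final model, [PamelaBorelli/flan-t5-base-summarization-pt-br](https://huggingface.co/PamelaBorelli/flan-t5-base-summarization-pt-br), was fine-tuned twice, first on text translation and then on Portuguese text summarization.
Language-specific: designed for summarizing Portuguese text; it has not been tested on other languages.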

📚 Detailed Documentation

General Information

Property	Details
Name	[PamelaBorelli/flan-t5-base-summarization-pt-br](https://huggingface.co/PamelaBorelli/flan-t5-base-summarization-pt-br)
Type	Language model, Transformer encoder-decoder
License	MIT
Base model	[google/flan-t5-base](https://huggingface.co/google/flan-t5-base#model-details)
Related models	FLAN-T5 checkpoints
Original checkpoints	FLAN-T5 checkpoints

Training Data

Training Parameters

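evaluation_strategy="steps"         # when evaluation runs
eval_steps=                         # number of steps between evaluations
learning_rate=                      # learning rate
per_device_train_batch_size=        # training batch size per device
per_device_eval_batch_size=         # evaluation batch size per device
gradient_accumulation_steps=        # number of batches accumulated before each weight update
weight_decay=                       # weight decay (L2 regularization)
num_train_epochs=                   # number of training epochs
save_strategy="steps"               # when checkpoints are saved
save_steps=                         # number of steps between checkpoint saves
push_to_hub=False                   # whether to push the model to the Hugging Face Hub
load_best_model_at_end=True         # load the best model when training ends (required by the callback)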
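
The numeric values above are left blank in the card, so the sketch below uses clearly labeled placeholder numbers only to show how the settings relate to one another. It checks a constraint that Hugging Face `TrainingArguments` enforces when `load_best_model_at_end=True` with step-based strategies (the evaluation and save strategies must match, and `save_steps` must be a round multiple of `eval_steps`), and it computes the effective batch size implied by gradient accumulation. The function names and every numeric value are hypothetical.

```python
# Placeholder values: the card leaves these blank, so every number here is hypothetical.
training_kwargs = {
    "evaluation_strategy": "steps",
    "eval_steps": 500,
    "learning_rate": 5e-5,
    "per_device_train_batch_size": 8,
    "per_device_eval_batch_size": 8,
    "gradient_accumulation_steps": 4,
    "weight_decay": 0.01,
    "num_train_epochs": 3,
    "save_strategy": "steps",
    "save_steps": 1000,
    "push_to_hub": False,
    "load_best_model_at_end": True,
}


def check_best_model_settings(kwargs: dict) -> None:
    """Raise ValueError if the settings conflict with load_best_model_at_end."""
    if not kwargs.get("load_best_model_at_end"):
        return
    if kwargs["evaluation_strategy"] != kwargs["save_strategy"]:
        raise ValueError("evaluation and save strategies must match")
    if kwargs["save_strategy"] == "steps" and kwargs["save_steps"] % kwargs["eval_steps"] != 0:
        raise ValueError("save_steps must be a round multiple of eval_steps")


def effective_batch_size(kwargs: dict, num_devices: int = 1) -> int:
    """Number of examples contributing to each optimizer update."""
    return (
        kwargs["per_device_train_batch_size"]
        * kwargs["gradient_accumulation_steps"]
        * num_devices
    )


check_best_model_settings(training_kwargs)
print(effective_batch_size(training_kwargs))  # 8 * 4 * 1 = 32
```

A dictionary validated this way could then be unpacked into `Seq2SeqTrainingArguments` together with an `output_dir`. Newer `transformers` releases use the name `eval_strategy` in place of `evaluation_strategy`, so the key may need renaming depending on the installed version.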

Tokenization Parameters

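start_prompt = "Sumarize: \n"              # start of the summarization instruction
end_prompt = "\n\nSumário: "               # end of the summarization instruction
input_name = "coluna_imput"                # name of the source-text column in the dataset
target_name = "coluna_target"              # name of the target-text column in the dataset
max_input_length = 256                     # maximum tokenized input length
max_target_length = 256                    # maximum tokenized target length
columns_to_remove = ['coluna_to_remove']   # columns to drop from the original dataset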
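
One way these parameters might come together is a preprocessing function applied to each batch of the dataset. The sketch below is an assumption about the workflow, not the author's confirmed code. `make_preprocess_fn` is a hypothetical helper that accepts any Hugging Face-style tokenizer callable; the parameter values are copied from the list above.

```python
# Values copied from the tokenization parameters in the card.
start_prompt = "Sumarize: \n"
end_prompt = "\n\nSumário: "
input_name = "coluna_imput"
target_name = "coluna_target"
max_input_length = 256
max_target_length = 256


def make_preprocess_fn(tokenizer):
    """Return a function that tokenizes one batch of examples (hypothetical helper)."""

    def preprocess(batch):
        # Wrap every source text in the instruction prompt.
        prompts = [start_prompt + text + end_prompt for text in batch[input_name]]
        model_inputs = tokenizer(prompts, max_length=max_input_length, truncation=True)
        # Tokenize the reference summaries; their token ids become the training labels.
        labels = tokenizer(
            text_target=batch[target_name],
            max_length=max_target_length,
            truncation=True,
        )
        model_inputs["labels"] = labels["input_ids"]
        return model_inputs

    return preprocess
```

With the `datasets` library, the returned function could be passed to `Dataset.map` with `batched=True` and `remove_columns=columns_to_remove`, so that only the tokenized fields remain.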