t5-end2end-question-generation开源模型 - 免费根据上下文自动生成相关问题

首页

T5 End2end Question Generation

由 ThomasSimonini 开发

基于T5-base微调的端到端问题生成模型，可根据给定上下文自动生成相关问题。

问答系统

Transformers

开源协议:Apache-2.0 #问题生成 #教育辅助 #文本理解

下载量 386

发布时间 : 3/2/2022

模型简介

该模型是在SQuAD数据集上微调的T5-base版本，专门用于从文本上下文中生成自然语言问题。适用于教育、问答系统等场景。

模型特点

端到端问题生成

可直接从文本上下文生成完整问题，无需中间处理步骤

多问题生成

单个上下文可生成多个相关问题，提高信息提取效率

SQuAD优化

在权威问答数据集上微调，生成问题质量较高

模型能力

文本理解

问题生成

上下文分析

使用案例

教育技术

自动试题生成

从教材内容自动生成测验题目

示例显示能准确生成事实性问题（如创建者、发布时间等）

问答系统

知识库增强

为现有文档自动生成FAQ问题

模型示例显示能生成与历史事件相关的精确问题

🚀 t5端到端问题生成

该模型是 t5-base 在squad数据集上的微调版本，用于根据上下文生成问题。它能帮助用户基于给定的文本内容自动生成相关问题，提升信息获取和处理的效率。

👉 如果你想了解如何微调t5模型以实现相同功能，可以参考这个教程

示例

Context: "Python is an interpreted, high-level, general-purpose programming language. Created by Guido van Rossum and first released in 1991, Python's design philosophy emphasizes code readability with its notable use of significant whitespace."

Questions:
    Who created Python?,
    When was Python first released?
    What is Python's design philosophy?

评估集结果

该模型在评估集上取得了以下结果：

损失值：1.5691

🚀 快速开始

模型使用

from transformers import T5ForConditionalGeneration, T5TokenizerFast

hfmodel = T5ForConditionalGeneration.from_pretrained("ThomasSimonini/t5-end2end-question-generation")

text= "The abolition of feudal privileges by the National Constituent Assembly on 4 August 1789 and the Declaration \\nof the Rights of Man and of the Citizen (La Déclaration des Droits de l'Homme et du Citoyen), drafted by Lafayette \\nwith the help of Thomas Jefferson and adopted on 26 August, paved the way to a Constitutional Monarchy \\n(4 September 1791 – 21 September 1792). Despite these dramatic changes, life at the court continued, while the situation \\nin Paris was becoming critical because of bread shortages in September. On 5 October 1789, a crowd from Paris descended upon Versailles \\nand forced the royal family to move to the Tuileries Palace in Paris, where they lived under a form of house arrest under \\nthe watch of Lafayette's Garde Nationale, while the Comte de Provence and his wife were allowed to reside in the \\nPetit Luxembourg, where they remained until they went into exile on 20 June 1791."

def run_model(input_string, **generator_args):
  generator_args = {
  "max_length": 256,
  "num_beams": 4,
  "length_penalty": 1.5,
  "no_repeat_ngram_size": 3,
  "early_stopping": True,
  }
  input_string = "generate questions: " + input_string + " </s>"
  input_ids = tokenizer.encode(input_string, return_tensors="pt")
  res = hfmodel.generate(input_ids, **generator_args)
  output = tokenizer.batch_decode(res, skip_special_tokens=True)
  output = [item.split("<sep>") for item in output]
  return output
  
run_model(text)

=> [['When did the National Constituent Assembly abolish feudal privileges?',
  ' Who drafted the Declaration of the Rights of Man and of the Citizen?',
  ' When was the Constitutional Monarchy established?',
  ' What was the name of the Declaration that paved the way to a constitutional monarchy?',
  '']]

训练超参数

以下是训练过程中使用的超参数：

属性	详情
学习率	0.0001
训练批次大小	4
评估批次大小	4
随机种子	42
梯度累积步数	16
总训练批次大小	64
优化器	Adam（beta=(0.9, 0.999)，epsilon=1e-08）
学习率调度器类型	线性
训练轮数	7

训练结果

训练损失	轮数	步数	验证损失
2.5834	0.34	100	1.9107
1.9642	0.68	200	1.7227
1.8526	1.02	300	1.6627
1.7383	1.36	400	1.6354
1.7223	1.69	500	1.6154
1.6871	2.03	600	1.6096
1.6309	2.37	700	1.6048
1.6242	2.71	800	1.5923
1.6226	3.05	900	1.5855
1.5645	3.39	1000	1.5874
1.5705	3.73	1100	1.5822
1.5543	4.07	1200	1.5817
1.5284	4.41	1300	1.5841
1.5275	4.75	1400	1.5741
1.5269	5.08	1500	1.5715
1.5079	5.42	1600	1.5701
1.4876	5.76	1700	1.5754
1.498	6.1	1800	1.5699
1.4852	6.44	1900	1.5693
1.4776	6.78	2000	1.5691