longformer-base-4096-finetuned-squadv1开源问答模型

首页

Longformer Base 4096 Finetuned Squadv1

由 valhalla 开发

基于LONGFORMER-BASE-4096模型在SQuAD v1问答数据集上的微调版本，适用于处理长文档问答任务

问答系统开源协议:MIT #长文本问答 #全局注意力机制 #SQuAD微调

下载量 806

发布时间 : 3/2/2022

模型简介

该模型是Longformer在SQuAD v1数据集上微调后的版本，专门用于问答任务，能够处理最长4096个标记的序列。

模型特点

长文档处理能力

能够处理最长4096个标记的序列，适合长文档问答任务。

全局注意力机制

在问答任务中，自动为问题标记设置全局注意力，提升问答准确性。

高效训练

采用滑动窗口局部注意力机制，降低计算复杂度，提升训练效率。

模型能力

长文档问答

文本理解

答案提取

使用案例

问答系统

阅读理解

从长文档中提取问题的答案

精确匹配率85.1466，F1分数91.5415

🚀 LONGFORMER-BASE-4096在SQuAD v1上微调

这是一个在SQuAD v1数据集上针对问答任务进行微调的longformer-base-4096模型。该模型能够有效处理问答场景，为长文本问答提供了强大的支持。

数据集

squad_v1

许可证

🚀 快速开始

本模型是基于Transformer架构的问答模型，可用于处理长文本的问答任务。以下是使用该模型的示例代码：

import torch
from transformers import AutoTokenizer, AutoModelForQuestionAnswering

tokenizer = AutoTokenizer.from_pretrained("valhalla/longformer-base-4096-finetuned-squadv1")
model = AutoModelForQuestionAnswering.from_pretrained("valhalla/longformer-base-4096-finetuned-squadv1")

text = "Huggingface has democratized NLP. Huge thanks to Huggingface for this."
question = "What has Huggingface done ?"
encoding = tokenizer(question, text, return_tensors="pt")
input_ids = encoding["input_ids"]

# default is local attention everywhere
# the forward method will automatically set global attention on question tokens
attention_mask = encoding["attention_mask"]

start_scores, end_scores = model(input_ids, attention_mask=attention_mask)
all_tokens = tokenizer.convert_ids_to_tokens(input_ids[0].tolist())

answer_tokens = all_tokens[torch.argmax(start_scores) :torch.argmax(end_scores)+1]
answer = tokenizer.decode(tokenizer.convert_tokens_to_ids(answer_tokens))
# output => democratized NLP

目前，LongformerForQuestionAnswering 还不支持 pipeline。待支持添加后，我会更新此文档。

✨ 主要特性

长文本处理能力：预训练模型可以处理长达4096个标记的序列，适合处理长文档问答。
自动处理全局注意力：LongformerForQuestionAnswering 模型会自动为问题标记设置全局注意力。

📦 安装指南

文档未提及安装步骤，故跳过此章节。

💻 使用示例

基础用法

import torch
from transformers import AutoTokenizer, AutoModelForQuestionAnswering

tokenizer = AutoTokenizer.from_pretrained("valhalla/longformer-base-4096-finetuned-squadv1")
model = AutoModelForQuestionAnswering.from_pretrained("valhalla/longformer-base-4096-finetuned-squadv1")

text = "Huggingface has democratized NLP. Huge thanks to Huggingface for this."
question = "What has Huggingface done ?"
encoding = tokenizer(question, text, return_tensors="pt")
input_ids = encoding["input_ids"]

# default is local attention everywhere
# the forward method will automatically set global attention on question tokens
attention_mask = encoding["attention_mask"]

start_scores, end_scores = model(input_ids, attention_mask=attention_mask)
all_tokens = tokenizer.convert_ids_to_tokens(input_ids[0].tolist())

answer_tokens = all_tokens[torch.argmax(start_scores) :torch.argmax(end_scores)+1]
answer = tokenizer.decode(tokenizer.convert_tokens_to_ids(answer_tokens))
# output => democratized NLP