flan - t5-base-arxiv开源数学问答模型 - 专注解答数学题，供研究参考

首页

Flan T5 Base Arxiv Math Question Answering

由 AlgorithmicResearchGroup 开发

这是一个基于arxiv数学问答数据集训练的FLAN-T5模型，专注于数学问题解答任务，但输出结果不可靠，仅供研究使用。

大型语言模型

Transformers

英语开源协议:Apache-2.0 #数学问答生成 #学术研究专用 #arXiv论文摘要

下载量 14

发布时间 : 6/22/2023

模型简介

该模型是基于FLAN-T5架构的语言模型，专门针对arXiv数学问答数据集进行微调，用于生成数学问题的摘要或答案。

模型特点

数学领域专注

专门针对arXiv数学内容进行微调，适合处理数学相关问题

轻量级模型

基于T5-base架构，参数规模适中，适合研究用途

研究导向

明确标注为研究用途，不推荐生产环境使用

模型能力

数学问题解答

文本摘要生成

学术内容理解

使用案例

学术研究

数学概念解释

对数学术语或概念进行解释说明

生成简短的概念描述

论文内容摘要

对arXiv数学论文中的问题进行摘要

生成问题的简要回答

🚀 FLAN - T5数学问答模型

这是一个基于Transformer架构的语言模型，专门针对数学问答场景在特定数据集上进行训练。它能处理英文的数学相关问题，但目前仅用于研究，输出可靠性较低，不建议用于生产环境。

🚀 快速开始

此模型仅用于研究目的，请勿在生产环境中使用，其输出结果极不可靠。

✨ 主要特性

模型类型：语言模型
支持语言：英文
许可证：Apache 2.0
相关模型：所有FLAN - T5检查点

属性	详情
模型类型	语言模型
语言（NLP）	英文
许可证	Apache 2.0
相关模型	所有FLAN - T5检查点

💻 使用示例

基础用法

以下是在transformers库中使用该模型的示例脚本：

使用PyTorch模型

在CPU上运行模型

from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("ArtifactAI/flan-t5-base-arxiv-math-question-answering")
model = T5ForConditionalGeneration.from_pretrained("ArtifactAI/flan-t5-base-arxiv-math-question-answering")

input_text = "What is the spectral isolation of bi-invariant metrics?"
input_ids = tokenizer(input_text, return_tensors="pt").input_ids

outputs = model.generate(input_ids)
print(tokenizer.decode(outputs[0]))

在GPU上运行模型

# pip install accelerate
from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("ArtifactAI/flan-t5-base-arxiv-math-question-answering")
model = T5ForConditionalGeneration.from_pretrained("ArtifactAI/flan-t5-base-arxiv-math-question-answering", device_map="auto")

input_text = "What is the spectral isolation of bi-invariant metrics?"
input_ids = tokenizer(input_text, return_tensors="pt").input_ids.to("cuda")

outputs = model.generate(input_ids)
print(tokenizer.decode(outputs[0]))

在HF管道中运行模型

FP16

# load model and tokenizer from huggingface hub with pipeline
qa = pipeline("summarization", model="ArtifactAI/flan-t5-base-arxiv-math-question-answering")

query = "What is the spectral isolation of bi-invariant metrics?"
print(f"query: {query}")
res = qa("answer: " + query)

print(f"{res[0]['summary_text']}")

🔧 技术细节

训练数据

该模型在ArtifactAI/arxiv - math - instruct - 50k数据集上进行训练，这是一个问答对数据集。问题使用t5 - base模型生成，答案使用GPT - 3.5 - turbo模型生成。

📄 许可证

本模型采用Apache 2.0许可证。

📚 引用

@misc{flan-t5-base-arxiv-math-question-answering,
    title={flan-t5-base-arxiv-math-question-answering},
    author={Matthew Kenney},
    year={2023}
}