OpenThaiGPT R1 32b开源泰语推理模型 - 免费部署，数学逻辑代码推理超出色

首页

Openthaigpt R1 32b Instruct

由 openthaigpt 开发

OpenThaiGPT R1 32b是一款320亿参数的泰语推理模型，在泰语数学、逻辑和代码推理任务中表现优异，性能超越更大规模的模型。

大型语言模型

Transformers

支持多种语言开源协议:其他 #泰语推理专家 #数学逻辑优化 #显式推理过程

下载量 403

发布时间 : 3/25/2025

模型简介

先进的320亿参数泰语推理模型，专为泰语和英语的复杂推理任务优化，包括数学、逻辑和代码推理。

模型特点

顶尖的泰语推理能力

在数学和逻辑推理任务中超越更大规模的模型，如DeepSeek R1 70b和Typhoon R1 70b

显式推理过程

能够展示逐步的思考过程，增强推理的可解释性

高效模型体积

仅32b参数规模，性能却优于70b模型，资源效率更高

泰语优化

专门针对泰语推理任务优化，包括复杂的数学和逻辑问题

模型能力

泰语文本生成

英语文本生成

数学推理

逻辑推理

代码推理

使用案例

教育

数学问题解答

解决泰语数学问题，如计算圆形面积

在MATH500-TH数据集上达到83.8的准确率

编程

代码生成与理解

生成和理解泰语和英语代码

在LiveCodeBench-TH上达到62.16的准确率

逻辑推理

复杂逻辑问题解决

处理需要多步推理的逻辑问题

在AIME24-TH上达到56.67的准确率

🚀 🇹🇭 OpenThaiGPT R1 32b

🇹🇭 OpenThaiGPT R1 32b 是一款先进的 320 亿参数泰语推理模型。尽管其规模不到 DeepSeek R1 70b 和 Typhoon R1 70b 等大模型的一半，但在性能上却更胜一筹。该模型在复杂推理任务中表现出色，能够处理泰语环境下的数学、逻辑和代码推理问题。

🚀 快速开始

在线网页界面

你可以通过此链接访问在线网页界面使用该模型。

Transformers

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "openthaigpt/openthaigpt-r1-32b-instruct"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "จงหาพื้นที่ของวงกลมที่มีรัศมี 7 หน่วย"
messages = [
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=16384,
    temperature=0.6
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]

vLLM

安装 VLLM（安装链接）。
运行服务器：

vllm serve openthaigpt/openthaigpt-r1-32b --tensor-parallel-size 2

注意：将 --tensor-parallel-size 2 更改为可用的 GPU 卡数量。

运行推理（CURL 示例）：

curl -X POST 'http://127.0.0.1:8000/v1/chat/completions' \
-H 'Content-Type: application/json' \
-d '{
  "model": "openthaigpt/openthaigpt-r1-32b-instruct",
  "messages": [
    {
      "role": "user",
      "content": "จงหาพื้นที่ของวงกลมที่มีรัศมี 7 หน่วย"
    }
  ],
  "max_tokens": 16384,
  "temperature": 0.6,
  "top_p": 0.95,
  "top_k": 40
}'

✨ 主要特性

最先进的泰语推理模型：在数学和逻辑推理任务上超越了更大规模的模型。
显式推理能力：能够展示逐步的思维过程。
显著的小模型优势：参数规模仅 320 亿，却能胜过 700 亿参数的模型。
专注泰语推理：擅长处理复杂的数学和逻辑问题。
代码推理高性能：在泰语和英语代码推理方面均表现出色。

📊 基准测试结果

SkyThought	OpenThaiGPT R1 32b	DeepSeek R1 70b	Typhoon R1 Distill 70b
AIME24 - TH	56.67	33.33	53.33
AIME24	63.36	53.33	53.33
MATH500 - TH	83.8	75.4	81
MATH500	89.4	88.88	90.2
LiveCodeBench - TH	62.16	53.15	47.75
LiveCodeBench	69.67	64.97	54.79
OpenThaiEval	76.05	74.17	77.59
AVERAGE	71.58	63.31	65.42

📦 安装指南

GPU 内存要求

参数数量	FP 16 位	8 位（量化）	4 位（量化）
32b	64 GB	32 GB	16 GB

📚 详细文档

模型技术报告

你可以通过此链接查看模型技术报告。

引用方式

如果你在工作中使用了 OpenThaiGPT，请考虑按以下方式引用：

@misc{yuenyong2025openthaigpt16r1thaicentric,
      title={OpenThaiGPT 1.6 and R1: Thai-Centric Open Source and Reasoning Large Language Models}, 
      author={Sumeth Yuenyong and Thodsaporn Chay-intr and Kobkrit Viriyayudhakorn},
      year={2025},
      eprint={2504.01789},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2504.01789}, 
}

聊天模板

{% if not add_generation_prompt is defined %}{% set add_generation_prompt = false %}{% endif %}{% set ns = namespace(is_first=false, is_tool=false, is_output_first=true, system_prompt='') %}{%- for message in messages %}{%- if message['role'] == 'system' %}{% set ns.system_prompt = message['content'] %}{%- endif %}{%- endfor %}{{bos_token}}{{ns.system_prompt}}{%- for message in messages %}{%- if message['role'] == 'user' %}{%- set ns.is_tool = false -%}{{'<｜User｜>' + message['content']}}{%- endif %}{%- if message['role'] == 'assistant' and message['content'] is none %}{%- set ns.is_tool = false -%}{%- for tool in message['tool_calls']%}{%- if not ns.is_first %}{{'<｜Assistant｜><｜tool▁calls▁begin｜><｜tool▁call▁begin｜>' + tool['type'] + '<｜tool▁sep｜>' + tool['function']['name'] + '\\n' + '```json' + '\\n' + tool['function']['arguments'] + '\\n' + '```' + '<｜tool▁call▁end｜>'}}{%- set ns.is_first = true -%}{%- else %}{{'\\n' + '<｜tool▁call▁begin｜>' + tool['type'] + '<｜tool▁sep｜>' + tool['function']['name'] + '\\n' + '```json' + '\\n' + tool['function']['arguments'] + '\\n' + '```' + '<｜tool▁call▁end｜>'}}{{'<｜tool▁calls▁end｜><｜end▁of▁sentence｜>'}}{%- endif %}{%- endfor %}{%- endif %}{%- if message['role'] == 'assistant' and message['content'] is not none %}{%- if ns.is_tool %}{{'<｜tool▁outputs▁end｜>' + message['content'] + '<｜end▁of▁sentence｜>'}}{%- set ns.is_tool = false -%}{%- else %}{% set content = message['content'] %}{% if '</think>' in content %}{% set content = content.split('</think>')[-1] %}{% endif %}{{'<｜Assistant｜>' + content + '<｜end▁of▁sentence｜>'}}{%- endif %}{%- endif %}{%- if message['role'] == 'tool' %}{%- set ns.is_tool = true -%}{%- if ns.is_output_first %}{{'<｜tool▁outputs▁begin｜><｜tool▁output▁begin｜>' + message['content'] + '<｜tool▁output▁end｜>'}}{%- set ns.is_output_first = false %}{%- else %}{{'\\n<｜tool▁output▁begin｜>' + message['content'] + '<｜tool▁output▁end｜>'}}{%- endif %}{%- endif %}{%- endfor -%}{% if ns.is_tool %}{{'<｜tool▁outputs▁end｜>'}}{% endif %}{% if add_generation_prompt and not ns.is_tool %}{{'<｜Assistant｜>'}}{% endif %}