Moirai-1.0-R-large开源大型时序模型 - 助力通用时间序列预测

首页

Moirai 1.0 R Large

由 Salesforce 开发

Moirai是基于掩码编码器的通用时间序列预测Transformer，在LOTSA数据集上预训练的大型时序模型

气候模型

Transformers

#时序预测基础模型 #多变量混合分布预测 #长序列补丁处理

下载量 35.58k

发布时间 : 2/11/2024

模型简介

Moirai是一个通用的时间序列预测模型，采用Transformer架构，通过掩码编码器在LOTSA数据集上进行预训练，适用于多种时间序列预测任务

模型特点

通用时间序列预测

能够处理多种时间序列预测任务，不局限于特定领域

预训练模型

在LOTSA数据集上进行预训练，具有强大的泛化能力

Transformer架构

采用先进的Transformer架构，有效捕捉时间序列中的长期依赖关系

多变量支持

支持处理多变量时间序列，包括目标变量和协变量

模型能力

时间序列预测

多变量时间序列处理

长期依赖关系建模

滚动窗口预测

使用案例

金融预测

股票价格预测

预测未来股票价格走势

销售预测

产品销售预测

预测未来产品销售趋势

能源预测

电力需求预测

预测未来电力需求变化

🚀 Moirai-1.0-R-Large

Moirai是基于掩码编码器的通用时间序列预测Transformer，是一个在LOTSA数据上预训练的大型时间序列模型。如需了解Moirai架构、训练和结果的更多详细信息，请参考论文。

图1：Moirai的整体架构。可视化展示的是一个三元时间序列，其中变量0和1是目标变量（即要进行预测的变量），变量2是动态协变量（预测范围内的值已知）。基于64的块大小，每个变量被划分为3个标记。块嵌入与序列和变量ID一起输入到Transformer中。阴影块表示要预测的预测范围，其对应的输出表示被映射到混合分布参数中。

🚀 快速开始

📦 安装指南

要使用Moirai进行推理，需从我们的GitHub仓库安装uni2ts库。

克隆仓库：

git clone https://github.com/SalesforceAIResearch/uni2ts.git
cd uni2ts

创建虚拟环境：

virtualenv venv
. venv/bin/activate

从源代码构建：

pip install -e '.[notebook]'

创建.env文件：

touch .env

💻 使用示例

基础用法

以下是一个简单的入门示例：

import torch
import matplotlib.pyplot as plt
import pandas as pd
from gluonts.dataset.pandas import PandasDataset
from gluonts.dataset.split import split

from uni2ts.eval_util.plot import plot_single
from uni2ts.model.moirai import MoiraiForecast, MoiraiModule


SIZE = "small"  # model size: choose from {'small', 'base', 'large'}
PDT = 20  # prediction length: any positive integer
CTX = 200  # context length: any positive integer
PSZ = "auto"  # patch size: choose from {"auto", 8, 16, 32, 64, 128}
BSZ = 32  # batch size: any positive integer
TEST = 100  # test set length: any positive integer

# Read data into pandas DataFrame
url = (
    "https://gist.githubusercontent.com/rsnirwan/c8c8654a98350fadd229b00167174ec4"
    "/raw/a42101c7786d4bc7695228a0f2c8cea41340e18f/ts_wide.csv"
)
df = pd.read_csv(url, index_col=0, parse_dates=True)

# Convert into GluonTS dataset
ds = PandasDataset(dict(df))

# Split into train/test set
train, test_template = split(
    ds, offset=-TEST
)  # assign last TEST time steps as test set

# Construct rolling window evaluation
test_data = test_template.generate_instances(
    prediction_length=PDT,  # number of time steps for each prediction
    windows=TEST // PDT,  # number of windows in rolling window evaluation
    distance=PDT,  # number of time steps between each window - distance=PDT for non-overlapping windows
)

# Prepare pre-trained model by downloading model weights from huggingface hub
model = MoiraiForecast(
    module=MoiraiModule.from_pretrained(f"Salesforce/moirai-1.0-R-{SIZE}"),
    prediction_length=PDT,
    context_length=CTX,
    patch_size=PSZ,
    num_samples=100,
    target_dim=1,
    feat_dynamic_real_dim=ds.num_feat_dynamic_real,
    past_feat_dynamic_real_dim=ds.num_past_feat_dynamic_real,
)

predictor = model.create_predictor(batch_size=BSZ)
forecasts = predictor.predict(test_data.input)

input_it = iter(test_data.input)
label_it = iter(test_data.label)
forecast_it = iter(forecasts)

inp = next(input_it)
label = next(label_it)
forecast = next(forecast_it)

plot_single(
    inp, 
    label, 
    forecast, 
    context_length=200,
    name="pred",
    show_label=True,
)
plt.show()

📚 详细文档

莫伊拉伊家族

模型	参数数量
Moirai-1.0-R-Small	1400万
Moirai-1.0-R-Base	9100万
Moirai-1.0-R-Large	3.11亿

引用

如果您在研究或应用中使用了Uni2TS，请使用以下BibTeX进行引用：

@article{woo2024unified,
  title={Unified Training of Universal Time Series Forecasting Transformers},
  author={Woo, Gerald and Liu, Chenghao and Kumar, Akshat and Xiong, Caiming and Savarese, Silvio and Sahoo, Doyen},
  journal={arXiv preprint arXiv:2402.02592},
  year={2024}
}

伦理考量

此版本仅用于支持学术论文的研究目的。我们的模型、数据集和代码并非专门为所有下游用途而设计或评估。我们强烈建议用户在部署此模型之前，评估并解决与准确性、安全性和公平性相关的潜在问题。我们鼓励用户考虑人工智能的常见局限性，遵守适用法律，并在选择用例时采用最佳实践，特别是在错误或滥用可能对人们的生活、权利或安全产生重大影响的高风险场景中。有关用例的更多指导，请参考我们的AUP和AI AUP。