🧠 Time-R1 Reinforced Model Weights

These are the official reinforcement learning (RL) fine-tuned model checkpoints for the paper: "Time Series Forecasting as Reasoning: A Slow-Thinking Approach with Reinforced LLMs".

📦 Model Details

Base Model: Qwen2.5-7B
Tuning Framework: Verl + LLaMA Factory
Final Stage: Trained using GRIP (Group-based Relative Importance for Policy Optimization)
Objective: Multi-horizon time series forecasting with structured reasoning

📦 Files Included

This model follows the standard Hugging Face transformers checkpoint layout, with the weights stored as four safetensors shards plus an index file.

Time-R1/
├── config.json
├── generation_config.json
├── model.safetensors.index.json
├── model-00001-of-00004.safetensors
├── model-00002-of-00004.safetensors
├── model-00003-of-00004.safetensors
├── model-00004-of-00004.safetensors
├── tokenizer_config.json
├── tokenizer.json
└── vocab.json
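
Because the weights are split across four shards, a partial download is easy to miss. The sketch below checks a local copy against the index file. It assumes `model.safetensors.index.json` follows the usual transformers layout, with a top-level `weight_map` that maps each tensor name to the shard containing it. The helper name `missing_shards` is hypothetical and not part of this repository.

```python
import json
from pathlib import Path


def missing_shards(model_dir):
    """Return shard filenames listed in the index but absent from model_dir.

    Assumes the index file has a top-level "weight_map" dict that maps
    tensor names to shard filenames (the usual sharded-checkpoint layout).
    """
    model_dir = Path(model_dir)
    index_path = model_dir / "model.safetensors.index.json"
    index = json.loads(index_path.read_text())

    # Many tensors share one shard, so deduplicate the filenames.
    shard_names = sorted(set(index["weight_map"].values()))
    return [name for name in shard_names if not (model_dir / name).is_file()]
```

Calling `missing_shards("Time-R1")` on a complete download should return an empty list; any names it does return are shards that still need to be fetched.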

✅ Fully compatible with Hugging Face transformers and AutoModelForCausalLM.
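
A minimal loading sketch with `AutoModelForCausalLM` follows. It assumes the repository ID `ustc-zyt/Time-R1` from the model page, BF16 weights, and that `torch`, `transformers`, and `accelerate` (needed for `device_map="auto"`) are installed. The function names are hypothetical, and this card does not document the prompt template the model was trained on, so the prompt is left to the caller.

```python
MODEL_ID = "ustc-zyt/Time-R1"  # repository ID as shown on the model page


def load_time_r1(model_id=MODEL_ID, device_map="auto"):
    """Load the tokenizer and model in BF16 (hypothetical helper)."""
    # Imported lazily so this module can be imported without the heavy deps.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # matches the BF16 checkpoint
        device_map=device_map,       # requires the accelerate package
    )
    return tokenizer, model


def generate_text(tokenizer, model, prompt, max_new_tokens=512):
    """Generate a continuation and return only the newly produced text."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens so only the model's answer is decoded.
    prompt_len = inputs["input_ids"].shape[1]
    return tokenizer.decode(output_ids[0][prompt_len:], skip_special_tokens=True)
```

Typical use would be `tokenizer, model = load_time_r1()` followed by `generate_text(tokenizer, model, prompt)`, where `prompt` contains the historical series and the forecasting instruction in whatever format the paper's code release specifies.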

Model size: 8B params
Tensor type: BF16
