7 11 5

weize

weizechen

AI & ML interests

None yet

Recent Activity

updated a model 16 days ago

weizechen/RL-Compositionality-Stage-1-Model

updated a dataset 16 days ago

weizechen/RL-Compositionality-Stage2-RL-Level8-TestData

updated a dataset 16 days ago

weizechen/RL-Compositionality-Stage2-RL-Level2-TrainData

View all activity

Organizations

updated a model 16 days ago

weizechen/RL-Compositionality-Stage-1-Model

8B • Updated 16 days ago • 23

updated 4 datasets 16 days ago

updated a collection 16 days ago

RL Compositionality

Collection

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones. https://huggingface.co/papers/2509.25123 • 5 items • Updated 16 days ago

published a model 16 days ago

weizechen/RL-Compositionality-Stage-1-Model

8B • Updated 16 days ago • 23

updated a collection 16 days ago

RL Compositionality

Collection

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones. https://huggingface.co/papers/2509.25123 • 5 items • Updated 16 days ago

published 4 datasets 16 days ago

weizechen/RL-Compositionality-Stage2-RL-Level8-TestData

Viewer • Updated 16 days ago • 2.05k • 25

weizechen/RL-Compositionality-Stage2-RL-Level2-TrainData

Viewer • Updated 16 days ago • 500k • 28

weizechen/RL-Compositionality-Stage2-RL-Level1-TrainData

Viewer • Updated 16 days ago • 500k • 26

weizechen/RL-Compositionality-Stage1-RFT-Data

Viewer • Updated 16 days ago • 118k • 49

upvoted a paper about 1 month ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29 • 18

commented a paper about 1 month ago

From $f(x)$ and $g(x)$ to $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29 • 18 •

authored a paper about 1 month ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 49

liked a model about 1 month ago

openbmb/VoxCPM-0.5B

Text-to-Speech • Updated Sep 19 • 3.68k • 753

upvoted a paper about 2 months ago

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Paper • 2509.09674 • Published Sep 11 • 78

weize

AI & ML interests

Recent Activity

Organizations

weizechen's activity