10 21 28

Yang Chen

ychenNLP

https://edchengg.github.io/

AI & ML interests

NLP

Recent Activity

upvoted a paper 18 days ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

upvoted a paper 18 days ago

Agent Learning via Early Experience

upvoted a paper 24 days ago

Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks

View all activity

Organizations

upvoted 2 papers 18 days ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published 18 days ago • 67

Agent Learning via Early Experience

Paper • 2510.08558 • Published 18 days ago • 243

upvoted a paper 24 days ago

Tree-based Dialogue Reinforced Policy Optimization for Red-Teaming Attacks

Paper • 2510.02286 • Published 25 days ago • 28

upvoted a paper about 2 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 83

New activity in nvidia/AceReason-Nemotron-14B 4 months ago

Is it possible to open-source the 2k+ difficult samples from math stage3 separately, as well as the code training data?

#2 opened 4 months ago by

Suu

Add link to paper and project page

#1 opened 4 months ago by

nielsr

New activity in nvidia/AceReason-Math 4 months ago

Add task category and link to new model paper

#1 opened 4 months ago by

nielsr

upvoted a paper 4 months ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 42

updated a model 4 months ago

nvidia/AceReason-Nemotron-14B

Text Generation • 15B • Updated Jun 17 • 7.82k • • 93

authored a paper 4 months ago

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16 • 26

updated a model 4 months ago

nvidia/AceReason-Nemotron-7B

Text Generation • 8B • Updated Jun 17 • 4.66k • • 19

commented a paper 4 months ago

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16 • 26 •

upvoted 2 papers 4 months ago

AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy

Paper • 2506.13284 • Published Jun 16 • 26

AR-RAG: Autoregressive Retrieval Augmentation for Image Generation

Paper • 2506.06962 • Published Jun 8 • 28

liked a dataset 4 months ago

nvidia/AceReason-1.1-SFT

Viewer • Updated Jun 18 • 3.96M • 4.81k • 91

liked a model 4 months ago

nvidia/AceReason-Nemotron-1.1-7B

Text Generation • 8B • Updated Jul 11 • 7.73k • • 56

updated a dataset 4 months ago

nvidia/AceReason-1.1-SFT

Viewer • Updated Jun 18 • 3.96M • 4.81k • 91

upvoted a paper 5 months ago

LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer

Paper • 2506.06952 • Published Jun 8 • 9

updated a dataset 5 months ago

nvidia/AceReason-Math

Viewer • Updated Jun 18 • 49.6k • 1.02k • 35

liked a model 5 months ago

nvidia/Nemotron-H-8B-Reasoning-128K

Text Generation • 8B • Updated Jul 11 • 1.35k • 23

Yang Chen

AI & ML interests

Recent Activity

Organizations

ychenNLP's activity

Is it possible to open-source the 2k+ difficult samples from math stage3 separately, as well as the code training data?

Add link to paper and project page

Add task category and link to new model paper