Yulei Qin

yolay

https://yuleichin.github.io/

AI & ML interests

Medical Imaging, Computer Vision, Language Models

Recent Activity

updated a model 15 days ago

yolay/SPEAR-ReTool-Qwen2.5-32B

updated a model 15 days ago

yolay/SPEAR-ReTool-Qwen3-32B

updated a model 15 days ago

yolay/SPEAR-ALFWorld-DrBoT-GRPO-1.5B

upvoted a paper 20 days ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published 20 days ago • 43

upvoted a collection 20 days ago

Reinforcement learning

Collection • 69 items • Updated 7 days ago • 6

upvoted a paper 20 days ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26 • 29

upvoted a paper 2 months ago

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published Aug 20 • 82

upvoted 5 papers 3 months ago

Complex Logical Instruction Generation

Paper • 2508.09125 • Published Aug 12 • 39

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12 • 31

Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following

Paper • 2508.02150 • Published Aug 4 • 36

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 131

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 88

upvoted a paper 4 months ago

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 120

upvoted a collection 4 months ago

RAIF

Collection • Datasets and models in the paper "Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models" [github.com/yuleiqin/RAIF] • 12 items • Updated Jul 17 • 1

upvoted 2 papers 5 months ago

WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks

Paper • 2506.01952 • Published Jun 2 • 10

Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models

Paper • 2506.01413 • Published Jun 2 • 16

upvoted an article 9 months ago

Open-R1: Update #1

Article • Feb 2 • 305

upvoted a collection 9 months ago

🧠 Reasoning datasets

Collection • Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 172

upvoted an article 9 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 884

upvoted a paper 12 months ago

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 86

upvoted 3 papers about 1 year ago

Rethinking Data Selection at Scale: Random Selection is Almost All You Need

Paper • 2410.09335 • Published Oct 12, 2024 • 17

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

Paper • 2408.15915 • Published Aug 28, 2024 • 19

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

Paper • 2408.02085 • Published Aug 4, 2024 • 19
