Cheng Qian's picture

2 22

Cheng Qian

chengq9

·

https://qiancheng0.github.io

qiancheng0

AI & ML interests

Agent, Tool Learning

Recent Activity

upvoted a paper 14 days ago

Multimodal Policy Internalization for Conversational Agents

upvoted a paper 15 days ago

Self-Improving LLM Agents at Test-Time

upvoted a paper 28 days ago

Where LLM Agents Fail and How They can Learn From Failures

View all activity

Organizations

Collections 1

Papers 17

arxiv:2509.19736

arxiv:2509.09614

arxiv:2507.22034

arxiv:2507.21046

models 3

chengq9/ToolRL-Qwen2.5-1.5B

2B • Updated Apr 22

chengq9/ToolRL-Qwen2.5-3B

3B • Updated Apr 22 • 2 • 1

chengq9/ToolRL-Llama3.2-3B

4B • Updated Apr 22

datasets 0

None public yet