RL agent - a MercedeSnape Collection

MercedeSnape 's Collections

RAG

future

kg

memory

Evolve

reasoning evaluation

agent reasoning

mas

RL agent

updated about 15 hours ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5 • 81

Note for online RL training “提炼为经验模型”
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published about 1 month ago • 109