MercedeSnape's Collections

RL agent

updated about 15 hours ago

  • Scaling Agent Learning via Experience Synthesis

    Paper • 2511.03773 • Published Nov 5 • 81

    Note: for online RL training; "distilled into an experience model"


  • ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

    Paper • 2511.21689 • Published about 1 month ago • 109