Agentic Policy Optimization via Instruction-Policy Co-Evolution Paper • 2512.01945 • Published about 1 month ago • 3
Agentic Policy Optimization via Instruction-Policy Co-Evolution Paper • 2512.01945 • Published about 1 month ago • 3
Agentic Policy Optimization via Instruction-Policy Co-Evolution Paper • 2512.01945 • Published about 1 month ago • 3 • 2
AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning Paper • 2511.19304 • Published Nov 24, 2025 • 90
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems Paper • 2508.07407 • Published Aug 10, 2025 • 98
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators Paper • 2403.16950 • Published Mar 25, 2024 • 4
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners Paper • 2406.02537 • Published Jun 4, 2024
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments Paper • 2406.11370 • Published Jun 17, 2024
From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation Paper • 2502.00330 • Published Feb 1, 2025
Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies Paper • 2502.02533 • Published Feb 4, 2025 • 4
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31, 2025 • 301
AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning Paper • 2301.12132 • Published Jan 28, 2023 • 1
Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering Paper • 2309.17249 • Published Sep 29, 2023 • 1
Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning Paper • 2310.12774 • Published Oct 19, 2023
Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems Paper • 2307.14031 • Published Jul 26, 2023