king zhu's picture

king zhu

kangz

·

AI & ML interests

None yet

Recent Activity

authored a paper 13 days ago

A$^2$FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning

upvoted a paper 13 days ago

A^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning

authored a paper 19 days ago

LIME: Less Is More for MLLM Evaluation

View all activity

Organizations

upvoted a paper 13 days ago

A^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning

Paper • 2510.12838 • Published 19 days ago • 22

upvoted 3 papers 19 days ago

LIME: Less Is More for MLLM Evaluation

Paper • 2409.06851 • Published Sep 10, 2024 • 2

ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems

Paper • 2510.11652 • Published 19 days ago • 28

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published 20 days ago • 46

upvoted a paper about 2 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2 • 83

upvoted 2 papers 2 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24 • 80

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 127

upvoted 4 papers 3 months ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published Aug 11 • 109

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published Aug 6 • 158

Efficient Agents: Building Effective Agents While Reducing Cost

Paper • 2508.02694 • Published Jul 24 • 85

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31 • 113

upvoted 9 papers 4 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Paper • 2507.06181 • Published Jul 8 • 43

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 92

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8 • 75

SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval

Paper • 2401.13478 • Published Jan 24, 2024 • 3

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents

Paper • 2406.13923 • Published Jun 20, 2024 • 24

OAgents: An Empirical Study of Building Effective Agents

Paper • 2506.15741 • Published Jun 17 • 35

PersonaFeedback: A Large-scale Human-annotated Benchmark For Personalization

Paper • 2506.12915 • Published Jun 15 • 20

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published Jun 11 • 31