MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games Paper • 2510.15414 • Published Oct 17 • 1
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published Nov 19 • 226
AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models Paper • 2511.14295 • Published Nov 18 • 71
Part-X-MLLM: Part-aware 3D Multimodal Large Language Model Paper • 2511.13647 • Published Nov 17 • 70
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published Nov 17 • 136
P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published Nov 17 • 133
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14 • 164
LLM-Powered Fully Automated Chaos Engineering: Towards Enabling Anyone to Build Resilient Software Systems at Low Cost Paper • 2511.07865 • Published Nov 11 • 3
TopoPerception: A Shortcut-Free Evaluation of Global Visual Perception in Large Vision-Language Models Paper • 2511.11831 • Published Nov 14 • 1
A Brain Wave Encodes a Thousand Tokens: Modeling Inter-Cortical Neural Interactions for Effective EEG-based Emotion Recognition Paper • 2511.13954 • Published Nov 17 • 3
Error-Driven Scene Editing for 3D Grounding in Large Language Models Paper • 2511.14086 • Published Nov 18 • 5
Proactive Hearing Assistants that Isolate Egocentric Conversations Paper • 2511.11473 • Published Nov 14 • 6
Agent READMEs: An Empirical Study of Context Files for Agentic Coding Paper • 2511.12884 • Published Nov 17 • 10
Orion: A Unified Visual Agent for Multimodal Perception, Advanced Visual Reasoning and Execution Paper • 2511.14210 • Published Nov 18 • 19
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning Paper • 2511.14460 • Published Nov 18 • 18
Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework Paper • 2511.13189 • Published Nov 17 • 38