Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine Paper • 2510.21614 • Published 3 days ago • 12
AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite Paper • 2510.21652 • Published 3 days ago • 2
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published 11 days ago • 19
Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation Paper • 2510.21583 • Published 3 days ago • 25
DeepAgent: A General Reasoning Agent with Scalable Toolsets Paper • 2510.21618 • Published 3 days ago • 58
Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs Paper • 2510.18245 • Published 7 days ago • 6
From Masks to Worlds: A Hitchhiker's Guide to World Models Paper • 2510.20668 • Published 4 days ago • 6
Search Self-play: Pushing the Frontier of Agent Capability without Supervision Paper • 2510.18821 • Published 6 days ago • 15
Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets Paper • 2510.19944 • Published 5 days ago • 14
DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion Paper • 2510.20766 • Published 4 days ago • 28
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives Paper • 2510.20822 • Published 4 days ago • 34
AlphaOPT: Formulating Optimization Programs with Self-Improving LLM Experience Library Paper • 2510.18428 • Published 6 days ago • 3
Unified Reinforcement and Imitation Learning for Vision-Language Models Paper • 2510.19307 • Published 6 days ago • 24
GigaBrain-0: A World Model-Powered Vision-Language-Action Model Paper • 2510.19430 • Published 5 days ago • 39
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts Paper • 2510.19363 • Published 5 days ago • 55
UltraGen: High-Resolution Video Generation with Hierarchical Attention Paper • 2510.18775 • Published 6 days ago • 15