ReCode: Unify Plan and Action for Universal Granularity Control Paper • 2510.23564 • Published 1 day ago • 97
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published 12 days ago • 38
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 6 days ago • 98
Unified Reinforcement and Imitation Learning for Vision-Language Models Paper • 2510.19307 • Published 6 days ago • 24
LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts Paper • 2510.19363 • Published 6 days ago • 56
Benchmarking Language Model Performance on 5th Gen Xeon at GCP Article • Dec 17, 2024 • 7
Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face Article • 13 days ago • 14
Attention Is All You Need for KV Cache in Diffusion LLMs Paper • 2510.14973 • Published 12 days ago • 36
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Paper • 2510.14967 • Published 12 days ago • 32
Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training Paper • 2510.12586 • Published 14 days ago • 107
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation Paper • 2510.08673 • Published 19 days ago • 117
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published 15 days ago • 160