Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning • Paper • 2511.16043 • Published Nov 20 • 108
LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction • Paper • 2509.07403 • Published Sep 9 • 34
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR • Paper • 2508.14029 • Published Aug 19 • 118
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning • Paper • 2506.01713 • Published Jun 2 • 48
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning • Paper • 2502.19634 • Published Feb 26 • 63
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies • Paper • 2407.13623 • Published Jul 18, 2024 • 56
D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models • Paper • 2406.13035 • Published Jun 18, 2024 • 3
A Survey on Model Compression for Large Language Models • Paper • 2308.07633 • Published Aug 15, 2023 • 3
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression • Paper • 2403.07378 • Published Mar 12, 2024 • 4
IoT in the Era of Generative AI: Vision and Challenges • Paper • 2401.01923 • Published Jan 3, 2024 • 1
Electrocardiogram Instruction Tuning for Report Generation • Paper • 2403.04945 • Published Mar 7, 2024 • 2