view article Article Building the Open Agent Ecosystem Together: Introducing OpenEnv 8 days ago • 103
IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech Paper • 2506.21619 • Published Jun 23 • 3
view article Article Introducing RTEB: A New Standard for Retrieval Evaluation about 1 month ago • 118
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution Paper • 2307.06304 • Published Jul 12, 2023 • 33
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 70
view article Article RexBERT: Encoders for a brave new world of E-Commerce By thebajajra and 1 other • Sep 20 • 48
AnyAccomp: Generalizable Accompaniment Generation via Quantized Melodic Bottleneck Paper • 2509.14052 • Published Sep 17 • 1
Long-Context Language Modeling with Parallel Context Encoding Paper • 2402.16617 • Published Feb 26, 2024 • 2
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published Sep 10 • 673
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15 • 48