UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning Paper β’ 2509.02544 β’ Published Sep 2 β’ 123
Running 23 23 BrowserGym Leaderboard π Tracks perf of LLMs, VLMs and agents on web navigation tasks
view article Article Automatic Prompt Optimization with DSPy and Cross Encoders By dleemiller β’ Aug 2 β’ 1
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories Paper β’ 2504.08942 β’ Published Apr 11 β’ 28
view article Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*β‘ By xhluca β’ Jul 9, 2024 β’ 70
Running 41 41 Gradio Hackathon Registration Winter 25 π Gradio Agents & MCP Hackathon Winter 2025 Registration Page
Running 17 17 Smart Customer Support Agent π¬ Agentic retrieval & smart routing for customer support