OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows Paper • 2510.24411 • Published 7 days ago • 63 • 2
Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning Paper • 2510.20150 • Published 12 days ago • 2 • 2
Revisiting Multimodal Positional Encoding in Vision-Language Models Paper • 2510.23095 • Published 8 days ago • 10 • 2
Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification Paper • 2510.24078 • Published 7 days ago • 2
Exploring Conditions for Diffusion models in Robotic Control Paper • 2510.15510 • Published 18 days ago • 39 • 2
Performance Trade-offs of Optimizing Small Language Models for E-Commerce Paper • 2510.21970 • Published 11 days ago • 2 • 2
L^2M^3OF: A Large Language Multimodal Model for Metal-Organic Frameworks Paper • 2510.20976 • Published 12 days ago • 2 • 2
Surfer 2: The Next Generation of Cross-Platform Computer Use Agents Paper • 2510.19949 • Published 13 days ago • 36 • 2
CityRiSE: Reasoning Urban Socio-Economic Status in Vision-Language Models via Reinforcement Learning Paper • 2510.22282 • Published 10 days ago • 2 • 2
SelectMix: Enhancing Label Noise Robustness through Targeted Sample Mixing Paper • 2509.11265 • Published Sep 14 • 1 • 1
DEIM: DETR with Improved Matching for Fast Convergence Paper • 2412.04234 • Published Dec 5, 2024 • 2 • 1