RetinaLogos: Fine-Grained Synthesis of High-Resolution Retinal Images Through Captions Paper • 2505.12887 • Published May 19 • 1
Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback Paper • 2510.18353 • Published 14 days ago • 1
UniMedVL: Unifying Medical Multimodal Understanding and Generation Through Observation-Knowledge-Analysis Paper • 2510.15710 • Published 18 days ago • 6 • 3
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science Paper • 2510.16872 • Published 16 days ago • 90 • 4
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 13 days ago • 107 • 4
AgentInstruct: Toward Generative Teaching with Agentic Flows Paper • 2407.03502 • Published Jul 3, 2024 • 50 • 16
PICABench: How Far Are We from Physically Realistic Image Editing? Paper • 2510.17681 • Published 15 days ago • 61 • 3
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published 14 days ago • 107 • 3
World-in-World: World Models in a Closed-Loop World Paper • 2510.18135 • Published 14 days ago • 86 • 3
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning Paper • 2510.15444 • Published 18 days ago • 144 • 6
LoRA: Low-Rank Adaptation of Large Language Models Paper • 2106.09685 • Published Jun 17, 2021 • 52 • 6
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published 18 days ago • 86 • 4
MusicSwarm: Biologically Inspired Intelligence for Music Composition Paper • 2509.11973 • Published Sep 15 • 1
Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator Paper • 2505.19236 • Published May 25 • 3 • 3
RoseCDL: Robust and Scalable Convolutional Dictionary Learning for Rare-event Detection Paper • 2509.07523 • Published Sep 9 • 1
Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR Paper • 2509.18174 • Published Sep 17 • 124 • 9