WhiteRabbitNeo-V3 Collection The latest and most capable cybersecurity model we've ever created β’ 1 item β’ Updated Jun 25 β’ 10
view article Article LLMGameHub: How We Won the Gradio Agents & MCP HackathonΒ 2025 By kikikita and 1 other β’ Jul 28 β’ 18
Rethinking Verification for LLM Code Generation: From Generation to Testing Paper β’ 2507.06920 β’ Published Jul 9 β’ 28
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test Paper β’ 2506.21551 β’ Published Jun 26 β’ 28
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development Paper β’ 2506.05010 β’ Published Jun 5 β’ 79
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation Paper β’ 2506.09790 β’ Published Jun 11 β’ 53
Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs Paper β’ 2506.05629 β’ Published Jun 5 β’ 37
General-Reasoner: Advancing LLM Reasoning Across All Domains Paper β’ 2505.14652 β’ Published May 20 β’ 23
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards Paper β’ 2505.24760 β’ Published May 30 β’ 73
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper β’ 2506.01844 β’ Published Jun 2 β’ 140
Taming LLMs by Scaling Learning Rates with Gradient Grouping Paper β’ 2506.01049 β’ Published Jun 1 β’ 38
ARIA: Training Language Agents with Intention-Driven Reward Aggregation Paper β’ 2506.00539 β’ Published May 31 β’ 30
Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering Paper β’ 2505.23604 β’ Published May 29 β’ 23
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning Paper β’ 2505.16410 β’ Published May 22 β’ 57
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning Paper β’ 2505.16933 β’ Published May 22 β’ 34