Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective Paper • 2509.22921 • Published Sep 26 • 11
Article Bridging the Gap: Making Robotics Feel Like Machine Learning By hba123 • Aug 12 • 12
Experience is the Best Teacher: Grounding VLMs for Robotics through Self-Generated Memory Paper • 2507.16713 • Published Jul 22 • 21
Article Bourbaki (7b): SOTA 7B Algorithms for Putnam Bench (Part I: Reasoning MDPs) By hba123 and 2 others • Jul 13 • 11
Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving Paper • 2507.02726 • Published Jul 3 • 14
Ark: An Open-source Python-based Framework for Robot Learning Paper • 2506.21628 • Published Jun 24 • 16
Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity Paper • 2505.21411 • Published May 27 • 17
Article Deepseek R1 Robotic Reasoning with Checkers By codyreading and 4 others • Mar 5 • 14
Almost Surely Safe Alignment of Large Language Models at Inference-Time Paper • 2502.01208 • Published Feb 3 • 11
Article Accelerating Language Model Inference with Mixture of Attentions By hba123 and 1 other • Jan 7 • 24
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level Paper • 2411.03562 • Published Nov 5, 2024 • 68
Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches Paper • 2408.04567 • Published Aug 8, 2024 • 26
HumanRankEval: Automatic Evaluation of LMs as Conversational Assistants Paper • 2405.09186 • Published May 15, 2024 • 22
Human-like Episodic Memory for Infinite Context LLMs Paper • 2407.09450 • Published Jul 12, 2024 • 62
Towards Robust Speech Representation Learning for Thousands of Languages Paper • 2407.00837 • Published Jun 30, 2024 • 11