DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents Paper • 2510.19336 • Published 11 days ago • 16
AndesVL Collection • AndesVL is a suite of mobile-optimized Multimodal Large Language Models (MLLMs) with 0.6B to 4B parameters • 8 items • Updated 18 days ago • 10
AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model Paper • 2510.11496 • Published 19 days ago • 3
Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models • By nvidia and 3 others • Jul 18 • 50
SmolLM3 pretraining datasets Collection • Datasets used in SmolLM3 pretraining • 15 items • Updated Aug 12 • 35
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 73
GenRecal: Generation after Recalibration from Large to Small Vision-Language Models Paper • 2506.15681 • Published Jun 18 • 39
MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning Paper • 2505.10557 • Published May 15 • 47
GenX: Mastering Code and Test Generation with Execution Feedback Paper • 2412.13464 • Published Dec 18, 2024 • 1
Improved Visual-Spatial Reasoning via R1-Zero-Like Training Paper • 2504.00883 • Published Apr 1 • 66
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 420
PixMo Collection • A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated Apr 30 • 81
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published Nov 7, 2024 • 127