小明

xiaoming

xiaominghero

AI & ML interests

nlp

Recent Activity

upvoted a paper 12 days ago

Step-DeepResearch Technical Report

upvoted a paper 19 days ago

Step-GUI Technical Report

upvoted an article 25 days ago

We Got Claude to Fine-Tune an Open Source LLM

View all activity

Organizations

None yet

upvoted a paper 12 days ago

Step-DeepResearch Technical Report

Paper • 2512.20491 • Published 13 days ago • 80

upvoted a paper 19 days ago

Step-GUI Technical Report

Paper • 2512.15431 • Published 19 days ago • 128

upvoted an article 25 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

558

liked a Space 2 months ago

The Smol Training Playbook

📚

2.8k

The secrets to building world-class LLMs

liked a dataset 3 months ago

allenai/CoSyn-400K

Viewer • Updated Feb 28, 2025 • 408k • 1.98k • 44

upvoted a collection 4 months ago

MobileLLM-R1

Collection

MobileLLM-R1, a series of sub-billion parameter reasoning models • 10 items • Updated Nov 21, 2025 • 27

liked 3 datasets 4 months ago

liked 2 models 4 months ago

stepfun-ai/Step-Audio-2-mini

Any-to-Any • 8B • Updated Sep 5, 2025 • 1.13k • 241

ByteDance-Seed/Seed-OSS-36B-Base

Text Generation • 36B • Updated Aug 26, 2025 • 3.71k • 57

upvoted a paper 4 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

upvoted a paper 5 months ago

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 291

liked a dataset 5 months ago

nvidia/Nemotron-Pretraining-Dataset-sample

Viewer • Updated 14 days ago • 27.7k • 1.07k • 35

upvoted a collection 5 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 13 days ago • 87

liked a model 5 months ago

deepseek-ai/DeepSeek-V3.1-Base

Text Generation • 685B • Updated Aug 26, 2025 • 4.87k • 1.01k

liked a dataset 5 months ago

stemdataset/STEM

Viewer • Updated Apr 30, 2024 • 1.07M • 600 • 5

upvoted a paper 5 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14, 2025 • 145

upvoted an article 5 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025

•

743

liked a dataset 5 months ago

nvidia/Llama-Nemotron-VLM-Dataset-v1

Viewer • Updated Oct 22, 2025 • 2.86M • 2.15k • 155

小明

AI & ML interests

Recent Activity

Organizations

xiaoming's activity

We Got Claude to Fine-Tune an Open Source LLM

The Smol Training Playbook

SmolLM3: smol, multilingual, long-context reasoner