1 2 14

Mike White

seleven11

AI & ML interests

None yet

Recent Activity

liked a dataset 4 days ago

opencsg/Fineweb-Edu-Chinese-V2.1

liked a dataset about 1 month ago

HuggingFaceTB/smollm-corpus

liked a dataset about 2 months ago

Leon-Leee/unofficial-pyedu

View all activity

Organizations

None yet

liked a dataset 4 days ago

opencsg/Fineweb-Edu-Chinese-V2.1

Viewer • Updated Feb 27, 2025 • 958M • 29.2k • 56

liked a dataset about 1 month ago

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M • 14.5k • 408

liked a dataset about 2 months ago

Leon-Leee/unofficial-pyedu

Viewer • Updated Mar 12, 2025 • 7.68M • 90 • 3

upvoted an article 2 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

•

436

liked a Space 2 months ago

The Smol Training Playbook

📚

2.77k

The secrets to building world-class LLMs

liked 3 datasets 2 months ago

upvoted an article 5 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025

•

liked a Space 6 months ago

Predict Memory

🧮

100

Calculate memory usage for model configurations

liked a Space 11 months ago

The Ultra-Scale Playbook

🌌

3.62k

The ultimate guide to training LLM on large GPU Clusters

liked 2 models about 1 year ago

Qwen/Qwen2-7B-Instruct

Text Generation • 8B • Updated Aug 21, 2024 • 165k • • 680

Alibaba-NLP/gte-Qwen2-7B-instruct

liked a model over 1 year ago

Qwen/Qwen2-72B-Instruct

Text Generation • 73B • Updated Oct 8, 2024 • 20.6k • • 718

liked 2 models about 2 years ago

meta-llama/Llama-2-13b-hf

Text Generation • 13B • Updated Apr 17, 2024 • 38.8k • 620

FlagAlpha/Llama2-Chinese-13b-Chat

Question Answering • Updated Feb 23, 2024 • 1.08k • 275