ruins

ruinnight

AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

History of State Space Models (SSM) in 2022

upvoted an article about 2 months ago

Introduction to State Space Models (SSM)

liked a Space about 2 months ago

HuggingFaceH4/on-policy-distillation

Organizations

None yet

upvoted 2 articles about 2 months ago

Article: History of State Space Models (SSM) in 2022 (Apr 11, 2024)

Article: Introduction to State Space Models (SSM) (Jul 19, 2024) • 197

liked 3 Spaces about 2 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

Apply on-policy distillation to any model family

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

Evaluate multilingual models using FineTasks

The Smol Training Playbook

📚

2.73k

The secrets to building world-class LLMs

liked 3 models 4 months ago

liked a dataset 4 months ago

GPUMODE/KernelBook

Viewer • Updated Jun 25 • 18.2k • 771 • 45

liked a dataset 9 months ago

OpenDILabCommunity/MasterMind

Viewer • Updated Mar 20 • 696k • 526 • 5

liked 2 Spaces 10 months ago

Number Tokenization Blog

📈

105

Explore how tokenization affects arithmetic in LLMs

The Ultra-Scale Playbook

🌌

3.61k

The ultimate guide to training LLM on large GPU Clusters
