8 40 44

Leon Tsou

xxrjun

AI & ML interests

None yet

Recent Activity

new activity about 2 months ago

nvidia/DeepSeek-R1-0528-NVFP4:What does “AA Ref” mean in NVIDIA model benchmarks?

liked a Space about 2 months ago

HuggingFaceTB/smol-training-playbook

liked a model 3 months ago

deepseek-ai/DeepSeek-R1-0528

View all activity

Organizations

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.79k

The secrets to building world-class LLMs

liked a model 3 months ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • 685B • Updated May 29, 2025 • 344k • • 2.39k

liked a model 4 months ago

kernels-community/vllm-flash-attn3

Updated Oct 27, 2025 • 35

liked a model 5 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 436k • • 12.9k

liked a dataset 7 months ago

GPUMODE/KernelBook

Viewer • Updated Jun 25, 2025 • 18.2k • 588 • 45

liked a Space 10 months ago

The Ultra-Scale Playbook

🌌

3.62k

The ultimate guide to training LLM on large GPU Clusters

liked 3 models 11 months ago

liked a model 12 months ago

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

Text Generation • 71B • Updated Feb 24, 2025 • 95.1k • • 737

liked a Space about 1 year ago

Model Memory Utility

🚀

993

Calculate vRAM needed for model training and inference

liked 2 models about 1 year ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6, 2025 • 1.14M • • 1.25k

BAAI/bge-reranker-v2-minicpm-layerwise

Text Classification • 3B • Updated Mar 19, 2024 • 1.67k • 63

liked a dataset over 1 year ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 23.3k • 1.59k

liked a model over 1 year ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 657k • • 12.1k

liked a Space over 1 year ago

Calculate Model Flops

🔥

Calculate FLOPs and parameters for transformer models

liked a model over 1 year ago

meta-llama/CodeLlama-7b-Python-hf

Text Generation • 7B • Updated Mar 14, 2024 • 593 • 25

liked 3 datasets over 1 year ago

ise-uiuc/Magicoder-OSS-Instruct-75K

Viewer • Updated Dec 4, 2023 • 75.2k • 1.63k • 157

google-research-datasets/mbpp

Viewer • Updated Jan 4, 2024 • 1.4k • 2.24M • 200

codeparrot/apps

Updated Oct 20, 2022 • 14.6k • 190