Running on CPU Upgrade Featured 2.78k The Smol Training Playbook π 2.78k The secrets to building world-class LLMs
deepseek-ai/DeepSeek-R1-0528 Text Generation β’ 685B β’ Updated May 29, 2025 β’ 307k β’ β’ 2.39k
Running 3.62k The Ultra-Scale Playbook π 3.62k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation β’ 71B β’ Updated Feb 24, 2025 β’ 95.5k β’ β’ 736
Running on CPU Upgrade Featured 993 Model Memory Utility π 993 Calculate vRAM needed for model training and inference
BAAI/bge-reranker-v2-minicpm-layerwise Text Classification β’ 3B β’ Updated Mar 19, 2024 β’ 1.67k β’ 63