Running on CPU Upgrade Featured 2.77k The Smol Training Playbook 📚 2.77k The secrets to building world-class LLMs
view article Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11, 2025 • 95
Running 3.62k The Ultra-Scale Playbook 🌌 3.62k The ultimate guide to training LLM on large GPU Clusters
Alibaba-NLP/gte-Qwen2-7B-instruct Sentence Similarity • 8B • Updated Mar 24, 2025 • 87.6k • 476