Running 3.65k The Ultra-Scale Playbook 🌌 3.65k The ultimate guide to training LLM on large GPU Clusters
gradientai/Llama-3-70B-Instruct-Gradient-1048k Text Generation • 71B • Updated Oct 28, 2024 • 4 • 122