Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models Paper • 2508.21365 • Published Aug 29, 2025 • 29
TALKPLAY: Multimodal Music Recommendation with Large Language Models Paper • 2502.13713 • Published Feb 19, 2025 • 4
gradientai/Llama-3-8B-Instruct-Gradient-1048k Text Generation • 8B • Updated Oct 29, 2024 • 11.3k • 679
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 390