DSAEval: Evaluating Data Science Agents on a Wide Range of Real-World Data Science Problems Paper β’ 2601.13591 β’ Published 4 days ago β’ 2
DSAEval: Evaluating Data Science Agents on a Wide Range of Real-World Data Science Problems Paper β’ 2601.13591 β’ Published 4 days ago β’ 2
deepseek-ai/deepseek-coder-33b-instruct Text Generation β’ 33B β’ Updated Mar 7, 2024 β’ 25.6k β’ 560
meta-llama/Meta-Llama-3-8B-Instruct Text Generation β’ 8B β’ Updated Jun 18, 2025 β’ 1.48M β’ β’ 4.36k
Running on CPU Upgrade 13.8k Open LLM Leaderboard π 13.8k Track, rank and evaluate open LLMs and chatbots