view article Article ABBL: NextGen LLM Benchmark & Leaderboard for evaluating Arabic models May 18 • 2
view article Article SILMA RAGQA V1.0: A Comprehensive Benchmark for Evaluating LLMs on RAG QA Use-Cases Dec 18, 2024 • 1