Phare LLM benchmark V2: Reasoning models don't guarantee better security Article • 8 days ago • 9
LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs Article • Jul 2 • 16
DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training Paper • 2504.09710 • Published Apr 13 • 19
RealHarm: A Collection of Real-World Language Model Application Failures Paper • 2504.10277 • Published Apr 14 • 10