ASTRAL: Automated Safety Testing of Large Language Models • Paper • 2501.17132 • Published Jan 28, 2025
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation • Paper • 2501.17749 • Published Jan 29, 2025