Pablo Valle

pablovalle

authored 3 papers 11 months ago

ASTRAL: Automated Safety Testing of Large Language Models

Paper • 2501.17132 • Published Jan 28, 2025 • 2

o3-mini vs DeepSeek-R1: Which One is Safer?

Paper • 2501.18438 • Published Jan 30, 2025 • 23

Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation

Paper • 2501.17749 • Published Jan 29, 2025 • 14