Mikhail Terekhov

terekhov
  • MikhailTerekhov

AI & ML interests

Reinforcement Learning, Multi-objective Reinforcement Learning, RLHF

Organizations

CLAIRE Lab @EPFL

authored 3 papers 3 months ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 84

Control Tax: The Price of Keeping AI in Check

Paper • 2506.05296 • Published Jun 5

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Paper • 2510.09462 • Published Oct 10 • 5