Alexander Panfilov's picture

2 11 2

Alexander Panfilov

kotekjedi

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

authored a paper 3 months ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

upvoted a paper 3 months ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

View all activity

Organizations

authored 2 papers 3 months ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Paper • 2510.09462 • Published Oct 10, 2025 • 5

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM

Paper • 2509.18058 • Published Sep 22, 2025 • 12

authored a paper 7 months ago

Capability-Based Scaling Laws for LLM Red-Teaming

Paper • 2505.20162 • Published May 26, 2025 • 4

authored a paper over 1 year ago

Provable Compositional Generalization for Object-Centric Learning

Paper • 2310.05327 • Published Oct 9, 2023