Oleg Y. Rogov

qubitter

https://scholar.google.com/citations?user=gIx9BE0AAAAJ&hl=en

AI & ML interests

Adversarial ML

Recent Activity

upvoted 2 papers 10 days ago

DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion

Paper • 2510.15264 • Published 13 days ago • 1

Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs

Paper • 2510.11288 • Published 17 days ago • 45

upvoted a paper 13 days ago

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

Paper • 2510.04849 • Published 24 days ago • 108

upvoted a paper 27 days ago

The Rogue Scalpel: Activation Steering Compromises LLM Safety

Paper • 2509.22067 • Published Sep 26 • 27

upvoted 2 papers 3 months ago

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published Aug 7 • 46

∇NABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 123

upvoted a paper 4 months ago

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published Jul 8 • 118

upvoted an article 4 months ago

The Common Pile v0.1

Article • Jun 6 • 51

upvoted 4 papers 5 months ago

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7 • 71

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Paper • 2506.07491 • Published Jun 9 • 50

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8 • 113

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27 • 139

upvoted a paper 7 months ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24 • 119

upvoted a paper 8 months ago

RuCCoD: Towards Automated ICD Coding in Russian

Paper • 2502.21263 • Published Feb 28 • 132

upvoted an article 8 months ago

AI Watermarking 101: Tools and Techniques

Article • Feb 26, 2024 • 26

upvoted a paper 8 months ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 174

upvoted a paper 12 months ago

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published Oct 31, 2024 • 63

upvoted a paper about 1 year ago

CLEAR: Character Unlearning in Textual and Visual Modalities

Paper • 2410.18057 • Published Oct 23, 2024 • 209

upvoted a paper over 1 year ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19, 2024 • 158

upvoted a paper almost 2 years ago

Kandinsky 3.0 Technical Report

Paper • 2312.03511 • Published Dec 6, 2023 • 46