RepIt: Representing Isolated Targets to Steer Language Models Paper • 2509.13281 • Published Sep 16 • 4
SteeringControl: Holistic Evaluation of Alignment Steering in LLMs Paper • 2509.13450 • Published Sep 16 • 7
Verification Collection Data and verifier from the paper "Budget-aware Test-time Scaling via Discriminative Verification". • 6 items • Updated Oct 17 • 1
Predicting Task Performance with Context-aware Scaling Laws Paper • 2510.14919 • Published Oct 16 • 3
Budget-aware Test-time Scaling via Discriminative Verification Paper • 2510.14913 • Published Oct 16 • 4