Shashwat Goel

shash42

https://www.shash42.github.io

AI & ML interests

Science of Deep Learning, Safe AI

Recent Activity

upvoted a paper about 1 month ago

Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision

commented on a paper about 1 month ago

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

upvoted a paper about 2 months ago

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

upvoted a paper about 1 month ago

Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision

Paper • 2509.14234 • Published Sep 17 • 5

upvoted a paper about 2 months ago

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

Paper • 2509.09677 • Published Sep 11 • 34

upvoted a collection 4 months ago

answer-matching

Free-form datasets, human annotations, and sample-level model outputs for "Answer Matching Outperforms Multiple Choice for Language Model Evaluation" • 2 items • Updated Jul 3 • 2

upvoted a paper 5 months ago

Pitfalls in Evaluating Language Model Forecasters

Paper • 2506.00723 • Published May 31 • 3

upvoted 2 papers 9 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 150

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published Feb 6 • 33