Ruochen Zhao
ruochenzhao
AI & ML interests
NLP interpretability
Recent Activity
upvoted a paper 1 day ago
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits