None defined yet.
Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
Evaluate LLMs on multiple-choice questions