Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models Paper • 2503.05005 • Published Mar 6, 2025 • 1