Retrospective Sparse Attention for Efficient Long-Context Generation Paper • 2508.09001 • Published Aug 12 • 2