B-score: Detecting biases in large language models using response history • Paper • 2505.18545 • Published May 24, 2025