Predicting the Order of Upcoming Tokens Improves Language Modeling • Paper • 2508.19228 • Published Aug 26