Mercury 2, the first diffusion-based reasoning large language model, introduces a new approach to token generation by refining multiple tokens in parallel rather than sequentially. This shift enables ...