Google's open-source diffusion language model generates 256 tokens in parallel and self-corrects, hitting 4x speed on one GPU at a cost to quality.
Stability AI, the company behind Stable Diffusion, is releasing a new family of audio models, called Stability Audio 3.0. The top model can generate professional-grade music of more than six minutes ...
Both models trade word-by-word generation for parallel denoising. Only one of them does it without losing intelligence in the ...