
Since the 2021 release of DALL-E, the first AI image-generating model to popularize the technology, text-to-image generators have improved considerably in quality, speed, and prompt adherence. However, even the fastest image generators typically take a couple of seconds to create an image. This one is the exception.
HART, short for Hybrid Autoregressive Transformer, is an AI text-to-image generator developed by MIT, Nvidia, and Tsinghua University. It generates images with 3.1 to 5.9 times lower latency than state-of-the-art diffusion models. The key difference? How HART was trained.
Without getting too technical, instead of using a…