Diffusion Technology Achieves 10x Faster Text Generation

Inception Labs has introduced the Mercury diffusion language model family, a new approach to speed up text generation. Unlike traditional sequential (autoregressive) language models, Mercury uses diffusion technology, promising significant improvements in speed and efficiency. While currently focused on code generation, this technology could transform the entire field of text generation.

How Diffusion Models Work

Diffusion models gradually recover clean, meaningful information from noisy data. The process has two steps:

  • Forward Process: Noise is added step by step to real data until it becomes random noise.

  • Reverse Process: The model learns to remove the noise, eventually producing high-quality data.

Based on principles from non-equilibrium thermodynamics, diffusion models offer advantages like more stable training, easier parallel processing, and flexible design. This helps them outperform traditional GANs or autoregressive models in tasks like generation.

Inception Labs’ Mercury Models

Unlike traditional models (which generate text left-to-right), Mercury uses a “coarse-to-fine” approach. Starting with pure noise, it refines the output over multiple steps.

Its main application today is code generation. Mercury Coder provides an interactive preview of generated code, improving developers’ workflows by showing how random characters evolve into functional code. The model can generate thousands of tokens per second—up to 10 times faster than traditional methods. Mercury is also available in downloadable versions, making it easy for businesses to integrate into their systems.

Potential Impact of Diffusion Technology

  • Speed & Efficiency: Runs on standard GPUs, speeding up development cycles and application response times.

  • Lower Cost: Works with existing infrastructure, reducing the need for specialized hardware.

  • New Research Opportunities: Combining diffusion and autoregressive models could advance tasks requiring structured logic, like coding or math problem-solving. 

Share this post
Artificial Intelligence in Network Management and Maintenance
Ericsson recently presented its strategic plans for 2025 at the Mobile World Congress 2025 (MWC25). These ideas are particularly intriguing as they demonstrate how artificial intelligence is being integrated into industrial processes that impact our daily lives—yet remain unnoticed as long as they function smoothly.
GTC 2025: NVIDIA's Blackwell-Based Servers and DGX Station
The GTC (GPU Technology Conference), held annually since 2009, will be hosted by NVIDIA this year from March 17 to 21. The conference is designed to showcase the latest developments and to promote collaboration and further innovation across different industries. It is attended mainly by developers, researchers, and technology leaders. NVIDIA CEO Jensen Huang has been saying for some time that companies will become token factories in the future—meaning that every workflow will be supported by artificial intelligence. Currently, large servers play a major role in this process, but AI integration will increasingly extend to personal computers. In the future, computers and laptops will have hardware capable of running even large language models in the background. This is necessary because programmers, engineers, and almost everyone will work with AI assistance.
Figure AI Prepares for Mass Production of Humanoid Robots
The rapid growth of artificial intelligence and robotics has turned the development of humanoid robots into one of the most exciting industries of our time. These robots are designed to complement or even partially replace human work in everyday settings such as production lines, warehouses, and logistics centers. Recently, the company Figure AI unveiled its BotQ – a factory optimized for high-volume production – where production will soon begin in the tens of thousands.
App-Free Experience by Deutsche Telekom
Deutsche Telekom is following Spain’s Telefónica in introducing a digital assistant—essentially an artificial butler—to its phones. This solution is promised to debut on new handsets by the second half of the year. Older devices will also offer an AI-based service, available under the name Magenta AI.