Diffusion Technology Achieves 10x Faster Text Generation

Inception Labs has introduced the Mercury diffusion language model family, a new approach to speed up text generation. Unlike traditional sequential (autoregressive) language models, Mercury uses diffusion technology, promising significant improvements in speed and efficiency. While currently focused on code generation, this technology could transform the entire field of text generation.

How Diffusion Models Work

Diffusion models gradually recover clean, meaningful information from noisy data. The process has two steps:

  • Forward Process: Noise is added step by step to real data until it becomes random noise.

  • Reverse Process: The model learns to remove the noise, eventually producing high-quality data.

Based on principles from non-equilibrium thermodynamics, diffusion models offer advantages like more stable training, easier parallel processing, and flexible design. This helps them outperform traditional GANs or autoregressive models in tasks like generation.

Inception Labs’ Mercury Models

Unlike traditional models (which generate text left-to-right), Mercury uses a “coarse-to-fine” approach. Starting with pure noise, it refines the output over multiple steps.

Its main application today is code generation. Mercury Coder provides an interactive preview of generated code, improving developers’ workflows by showing how random characters evolve into functional code. The model can generate thousands of tokens per second—up to 10 times faster than traditional methods. Mercury is also available in downloadable versions, making it easy for businesses to integrate into their systems.

Potential Impact of Diffusion Technology

  • Speed & Efficiency: Runs on standard GPUs, speeding up development cycles and application response times.

  • Lower Cost: Works with existing infrastructure, reducing the need for specialized hardware.

  • New Research Opportunities: Combining diffusion and autoregressive models could advance tasks requiring structured logic, like coding or math problem-solving. 

Share this post
Sovereign AI, secret share sales – what is going on behind the scenes at NVIDIA?
The artificial intelligence industry has experienced unprecedented momentum in recent years, and one of the biggest winners of this wave is undoubtedly NVIDIA. Known for its graphics processors, the company is now not only a favorite among gamers and engineers, but has also become a central player in international technology strategies. Its shares are hitting historic highs on the US stock market, while more and more government cooperation and geopolitical threads are beginning to weave around it. But what does all this tell us about the future, and how well-founded is the current optimism?
Facebook's new AI feature quietly opens the door to mass analysis of personal photos
Users who want to share a post on Facebook are greeted with a new warning: a pop-up window asking for permission for “cloud-based processing.” If we approve, the system can access our entire phone photo library—including photos we've never uploaded to the social network. The goal: to generate creative ideas using artificial intelligence, such as collages, themed selections, or stylized versions.
openEuler 24.03-LTS-SP2 is the platform of choice for large enterprises in China
The future of digital infrastructure is increasingly based on operating systems that can meet the stability, innovation and compatibility requirements of different industries. openEuler, China's first community open source operating system, is not just a technology product, but the result of a long-term strategic effort to create an independent and diverse technology ecosystem. The latest major milestone in this development is openEuler 24.03 LTS SP2.
Will ASICs replace NVIDIA GPUs?
The development of artificial intelligence over the past decade has been closely linked to the name NVIDIA, which has become the dominant player in the market with its graphics processing units (GPUs). A significant portion of today's AI models are built on these GPUs, and NVIDIA's decade-old software ecosystem—especially the CUDA platform—has become an indispensable tool for research, development, and industrial applications. At the same time, in recent years, the biggest players in the technology sector – including Google, Amazon, Meta, and Microsoft – have been turning with increasing momentum toward AI chips developed in-house and optimized for specific tasks, known as ASICs.
Google Gemini CLI, a powerful offering in the field of AI accessible from the terminal
Google's recently announced Gemini CLI is an open source, command line AI tool that integrates the Gemini 2.5 Pro large language model directly into the terminal. The goal of the initiative is nothing less than to transform natural language commands into real technical workflows, in an environment that has already been synonymous with efficiency for many.
Satya Nadella's thoughts on the role, future, and responsibility of artificial intelligence
Rapid change is not uncommon in the world of technology, but rarely does it affect so many sectors at once as today's artificial intelligence (AI) revolution. In an interview with Y Combinator, Satya Nadella, CEO of Microsoft, not only assessed technological developments, but also placed the development of AI in a broader social and economic context. His approach is restrained, calm, and purposeful: AI is not a mystical entity, but a tool that must be properly applied and interpreted.

Linux distribution updates released in the last few days