Real-time music composition with Google Magenta RT

The use of artificial intelligence in music composition is not a new endeavor, but real-time operation has long faced significant obstacles. The Google Magenta team has now unveiled a development that could expand both the technical and creative possibilities of the genre. The new model, called Magenta RealTime (Magenta RT for short), generates music in real time and is accessible to anyone thanks to its open source code.

The goal of the project is to bring machine-generated and live, human-created music closer together. The development is based on an 800-million-parameter transformer-based language model that works with 48 kHz stereo sound quality. The system uses a so-called neural audio codec to break music down into small, manageable pieces of sound and rebuilds the compositions generated during the process from these. An important new feature is that Magenta RT is capable of composing music even faster than it can be played back in real time, thereby minimizing latency during interactions.

From a musical control perspective, it is particularly noteworthy that the model not only responds to text commands, but is also capable of changing styles and moods based on sound samples. This dual approach—the simultaneous use of text and sound—allows users to specify the desired genre, tempo, or instrumentation, as well as to continue or change the soundscape of previously played sections.

The model operates on 2-second sound segments, which are built on a 10-second historical context. This temporal framing not only ensures technical efficiency, but also reinforces the sense of musical continuity. Magenta RT's capabilities are further enhanced by an embedding module called MusicCoCa, which can transform both text and audio-based information into a unified musical meaning.

One of the most interesting aspects of the technology is its open licensing. Under the Apache 2.0 license, Magenta RT is freely available on GitHub and the Hugging Face platform, opening up significant opportunities not only for developers but also for artists and educators. For example, the model can be used in live performances, interactive art installations, as a music education tool, or even for rapid creation of creative prototypes.

It is worth noting, however, that Magenta RT is an experimental technology that is primarily trained on instrumental music and does not yet offer complete compositional autonomy. Machine music creation remains a complement to creative human presence, not a replacement for it. However, developments are moving in the direction of increasingly direct, rapid, and nuanced collaboration between algorithms and humans.

Compared to other models, Magenta RT stands out in particular because it not only offers pre-generated music tracks, but is also capable of responding to user commands in real time. This is a significant difference compared to Google's other model, MusicLM, or Meta MusicGen, which produce the entire piece of music at once. Magenta RT's streaming-based operation thus enables a new kind of musical experimentation and interactive performance.

Google's future plans include releasing a customizable version of the model and exploring the possibility of running it on mobile devices. These developments could represent further steps toward making artificial intelligence an active part of live music creation.

Magenta RealTime is therefore not only a technological advance, but also represents a way of thinking: that artificial intelligence can be not only a tool, but also a partner in creation. 

Share this post
Where is Artificial Intelligence Really Today?
The development of artificial intelligence has produced spectacular and often impressive results in recent years. Systems like ChatGPT can generate natural-sounding language, solve problems, and in many tasks, even surpass human performance. However, a growing number of prominent researchers and technology leaders — including John Carmack and François Chollet — caution that these achievements don’t necessarily indicate that artificial general intelligence (AGI) is just around the corner. Behind the impressive performances, new types of challenges and limitations are emerging that go far beyond raw capability.
Rhino Linux Releases New Version: 2025.3
In the world of Linux distributions, two main approaches dominate: on one side, stable systems that are updated infrequently but offer predictability and security; on the other, rolling-release distributions that provide the latest software at the cost of occasional instability. Rhino Linux aims to bridge this divide by combining the up-to-dateness of rolling releases with the stability offered by Ubuntu as its base.
SEAL: The Harbinger of Self-Taught Artificial Intelligence
For years, the dominant belief was that human instruction—through data, labels, fine-tuning, and carefully designed interventions—was the key to advancing artificial intelligence. Today, however, a new paradigm is taking shape. In a recent breakthrough, researchers at MIT introduced SEAL (Self-Adapting Language Models), a system that allows language models to teach themselves. This is not only a technological milestone—it also raises a fundamental question: what role will humans play in the training of intelligent systems in the future?
All it takes is a photo and a voice recording – Alibaba's new artificial intelligence creates a full-body avatar from them
A single voice recording and a photo are enough to create lifelike, full-body virtual characters with facial expressions and emotions – without a studio, actor, or green screen. Alibaba's latest development, an open-source artificial intelligence model called OmniAvatar, promises to do just that. Although the technology is still evolving, it is already worth paying attention to what it enables – and what new questions it raises.
ALT Linux 11.0 Education is the foundation of Russian educational institutions
ALT Linux is a Russian-based Linux distribution built on the RPM package manager, based on the Sisyphus repository. It initially grew out of Russian localization efforts, collaborating with international distributions such as Mandrake and SUSE Linux, with a particular focus on supporting the Cyrillic alphabet.
Spatial intelligence is the next hurdle for AGI to overcome
With the advent of LLM, machines have gained impressive capabilities. What's more, their pace of development has accelerated, with new models appearing every day that make machines even more efficient and give them even better capabilities. However, upon closer inspection, this technology has only just enabled machines to think in one dimension. The world we live in, however, is three-dimensional based on human perception. It is not difficult for a human to determine that something is under or behind a chair, or where a ball flying towards us will land. According to many artificial intelligence researchers, in order for AGI, or artificial general intelligence, to be born, machines must be able to think in three dimensions, and for this, spatial intelligence must be developed.

Linux distribution updates released in the last few days