Gödel machine: AI that develops itself

June 13, 2025, 15:10, by Attila Fodor

Imagine a computer program that can independently modify its own code without human intervention to become even better and smarter! This futuristic-sounding concept is called the “Gödel machine.”

Jürgen Schmidhuber, a renowned figure in AI research, proposed this form of self-improving AI more than two decades ago. In the original formulation, the Gödel machine rewrites its own code only when it can mathematically prove that a given self-modification will improve its performance. Such proofs are extremely difficult to obtain in practice, so the Gödel machine has so far remained a theoretical concept.

In May, a research paper that could be a significant step toward realizing the Gödel machine caused a stir on social media. The study, titled “Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents,” was authored by researchers at the University of British Columbia in Canada and Sakana AI.

The newly presented “Darwin Gödel Machine (DGM)” sidesteps the difficulty of mathematical proof. Instead of proving in advance that a change helps, the DGM combines evolutionary algorithms with empirical evaluation: multiple self-modifying AI systems compete with each other on benchmark tasks, and their measured results decide which variants are developed further. This continuous cycle of competition and evaluation drives the ongoing self-modification and improvement of the systems.

The research team applied the DGM approach to “coding agents” that automatically generate program code. They allowed these agents to modify their own Python code, for example by adding new tools or suggesting different workflows. The modified agents were then evaluated in coding tests. Interestingly, even the worst-performing agents were archived if their behavior was unique, ensuring evolutionary diversity. This idea helps prevent agents from “getting stuck” in a local optimum and encourages the discovery of innovative solutions.
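The loop described above (modify, evaluate, archive, repeat) can be illustrated with a minimal toy sketch. It is not the research team's implementation: the names `Agent`, `evaluate`, `self_modify`, and `run_dgm_sketch` are hypothetical, the "agent" is just a list of numbers standing in for source code, the benchmark is an invented objective, and the parent-selection weights are an assumption. As a simplification, every child is archived rather than only those with unique behavior.

```python
import random
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Agent:
    """Toy stand-in for a coding agent; `genome` plays the role of its code."""
    genome: List[float]
    score: float = 0.0
    parent: Optional[int] = None  # archive index of the parent, if any


def evaluate(agent: Agent) -> float:
    """Hypothetical benchmark: returns a score in (0, 1], higher is better."""
    # Invented objective: how close the genome is to an all-ones target.
    err = sum((g - 1.0) ** 2 for g in agent.genome) / len(agent.genome)
    return 1.0 / (1.0 + err)


def self_modify(parent: Agent, parent_idx: int, rng: random.Random) -> Agent:
    """Stand-in for an agent rewriting its own code: a random perturbation."""
    child_genome = [g + rng.gauss(0.0, 0.3) for g in parent.genome]
    return Agent(genome=child_genome, parent=parent_idx)


def run_dgm_sketch(generations: int = 200, seed: int = 0) -> List[Agent]:
    rng = random.Random(seed)
    root = Agent(genome=[0.0, 0.0, 0.0])
    root.score = evaluate(root)
    archive = [root]
    for _ in range(generations):
        # Any archived agent can become a parent. Higher scores are more
        # likely, but the constant offset (an assumption) keeps weak agents
        # in play, which preserves diversity.
        weights = [a.score + 0.05 for a in archive]
        idx = rng.choices(range(len(archive)), weights=weights, k=1)[0]
        child = self_modify(archive[idx], idx, rng)
        # Empirical evaluation takes the place of a formal proof.
        child.score = evaluate(child)
        # Simplification: archive every child, even if worse than its parent.
        archive.append(child)
    return archive


if __name__ == "__main__":
    archive = run_dgm_sketch()
    best = max(archive, key=lambda a: a.score)
    print(f"archive size: {len(archive)}, best score: {best.score:.3f}")
```

Keeping weak agents in the archive is the point of the design: a low-scoring variant can still be selected later and become the ancestor of a strong one, which is how the approach avoids settling into a local optimum.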

Thanks to this “evolution,” the performance of the coding agents improved significantly. On SWE-bench, a benchmark built from real-world GitHub issues, the success rate rose from 20.0% to 50.0%. On Polyglot, which measures coding ability across multiple programming languages, it rose from 14.2% to 30.7%.
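To put these figures in perspective, it helps to separate the gain in percentage points from the relative gain. The short sketch below assumes each pair of numbers is a before and after score (20.0 to 50.0 on SWE-bench, 14.2 to 30.7 on Polyglot), which is how the paper's abstract reports them. The helper name `gains` is hypothetical.

```python
from typing import Tuple


def gains(before: float, after: float) -> Tuple[float, float]:
    """Return (gain in percentage points, relative gain in percent)."""
    points = after - before
    relative = points / before * 100.0
    return points, relative


# Assumed before/after benchmark scores, in percent.
for name, before, after in [("SWE-bench", 20.0, 50.0), ("Polyglot", 14.2, 30.7)]:
    points, relative = gains(before, after)
    print(f"{name}: +{points:.1f} points, +{relative:.0f}% relative")
# prints:
# SWE-bench: +30.0 points, +150% relative
# Polyglot: +16.5 points, +116% relative
```

Under that reading, both scores more than doubled.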

Of course, self-improving AI research of this kind raises safety concerns. Many fear that AI evolution will slip out of human control or that the AI will “cheat” during testing. The research team addresses these concerns by running the self-improvement process in a “sandbox” environment under human supervision.

The second most mentioned research article in May came from NVIDIA and explores how AI models reason. The study, titled “ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models,” examines how the latest AI models, such as OpenAI o1 and DeepSeek-R1, achieve their exceptional logical reasoning abilities.

There is a lively debate among AI researchers about the extent to which reinforcement learning influences the reasoning abilities of foundation models. The crux of the debate is whether reinforcement learning merely unlocks existing reasoning abilities in foundation models or endows them with entirely new reasoning abilities. The latest research tends to support the former view.

NVIDIA's research challenges this trend. Using their reinforcement learning method called “ProRL,” which enables prolonged, stable training, the researchers demonstrated that the model was able to “discover” new reasoning strategies and solve tasks that the original foundation model could not answer correctly. This suggests that reinforcement learning can indeed endow foundation models with new reasoning abilities.

These research breakthroughs show that the development of artificial intelligence is progressing at an astonishing rate. Self-improving AIs, such as the Darwin Gödel machine, could revolutionize software development and many other fields. At the same time, it is crucial that we address the ethical and safety issues involved in a responsible and thoughtful manner, ensuring that the development of AI serves the good of humanity.
