Gödel machine: AI that develops itself

Imagine a computer program that can independently modify its own code without human intervention to become even better and smarter! This futuristic-sounding concept is called the “Gödel machine.”

Jürgen Schmidhuber, a renowned figure in AI research, proposed the idea of self-improving AI more than two decades ago and called it the “Gödel machine.” In the original formulation, the Gödel machine rewrites its own code only when it can mathematically prove that a given self-modification will improve its performance. Such proofs are extremely difficult to obtain in practice, so the Gödel machine has remained a theoretical concept until now.

In May, however, a research article that could be a significant step toward realizing the Gödel machine caused a stir on social media. The study, titled “Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents,” was authored by researchers at the University of British Columbia in Canada and Sakana AI.

The newly presented “Darwin Gödel Machine (DGM)” elegantly sidesteps the difficulty of mathematical proof. Instead of proving in advance that a change is beneficial, the DGM combines evolutionary algorithms with empirical evaluation: multiple self-modifying AI systems are tested on a set of tasks (benchmarks), and their measured results decide which variants are developed further. This continuous cycle of modification and evaluation drives the ongoing improvement of the AIs.

The research team applied the DGM approach to “coding agents” that automatically generate program code. They allowed these agents to modify their own Python code, for example by adding new tools or suggesting different workflows. The modified agents were then evaluated on coding benchmarks. Interestingly, even the worst-performing agents were kept in the archive if their behavior was unique, which preserves evolutionary diversity. This helps prevent the agents from “getting stuck” in a local optimum and encourages the discovery of innovative solutions.
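The archive-based loop described above can be illustrated with a small toy sketch. This is not the researchers' implementation: the `Agent` class, the `evaluate` and `self_modify` functions, and the parent-selection weights are all hypothetical stand-ins. A single number replaces the agent's real code, and every variant is archived, which simplifies the rule of keeping weak agents whose behavior is unique.

```python
import random
from dataclasses import dataclass


@dataclass
class Agent:
    """Toy stand-in for a coding agent; `skill` replaces its real source code."""
    skill: float
    score: float = 0.0


def evaluate(agent: Agent) -> float:
    # Hypothetical benchmark: the score is simply the skill clamped to [0, 1].
    return max(0.0, min(1.0, agent.skill))


def self_modify(parent: Agent, rng: random.Random) -> Agent:
    # Hypothetical self-modification: a random change that may help or hurt.
    return Agent(skill=parent.skill + rng.uniform(-0.1, 0.15))


def evolve(generations: int, seed: int = 0) -> list[Agent]:
    rng = random.Random(seed)
    first = Agent(skill=0.2)
    first.score = evaluate(first)
    archive = [first]
    for _ in range(generations):
        # Favor better agents, but give every archived agent a nonzero
        # chance, so weak variants can still become parents later.
        weights = [a.score + 0.05 for a in archive]
        parent = rng.choices(archive, weights=weights, k=1)[0]
        child = self_modify(parent, rng)
        child.score = evaluate(child)
        archive.append(child)  # simplification: every variant is archived
    return archive


archive = evolve(generations=50)
best = max(archive, key=lambda a: a.score)
print(f"archive size: {len(archive)}, best score: {best.score:.2f}")
```

Because parents are drawn from the whole archive rather than only from the current best agent, a lineage that looks weak early on can still produce a strong descendant later.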

Thanks to this “evolution,” the performance of the coding agents improved significantly. Their score rose from 20% to 50% on the SWE-bench benchmark, which measures the ability to resolve real-world GitHub issues, and from 14.2% to 30.7% on the Polyglot benchmark, which measures coding across multiple programming languages.
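To put these figures in perspective, the short calculation below separates the gain in percentage points from the relative gain. It assumes that the reported numbers are before and after scores (20% to 50% on SWE-bench, 14.2% to 30.7% on Polyglot); the `gains` helper is purely illustrative.

```python
def gains(before: float, after: float) -> tuple[float, float]:
    """Return (gain in percentage points, relative gain in percent)."""
    points = after - before
    relative = 100.0 * points / before
    return round(points, 1), round(relative, 1)


swe_bench = gains(20.0, 50.0)   # assumed before/after scores on SWE-bench
polyglot = gains(14.2, 30.7)    # assumed before/after scores on Polyglot

print("SWE-bench (points, relative %):", swe_bench)
print("Polyglot (points, relative %):", polyglot)
```

Under this reading, both scores more than doubled, although the resulting success rates still leave considerable room for improvement.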

Of course, there are safety concerns associated with such self-improving AI research. Many fear that AI evolution will slip out of human control or that AI will “cheat” during testing. The research team responds to these concerns by running the AI's self-improvement in a “sandbox” environment under human supervision.

The second most mentioned research article in May came from NVIDIA and explores the mystery of how AI models reason. The study, titled “ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models,” examines how the latest AI models, such as OpenAI o1 and DeepSeek-R1, achieve their exceptional logical reasoning abilities.

There is a lively debate among AI researchers about the extent to which reinforcement learning influences the reasoning abilities of foundation models. The crux of the debate is whether reinforcement learning merely unlocks existing reasoning abilities in foundation models or endows them with entirely new reasoning abilities. The latest research tends to support the former view.

However, NVIDIA's research challenges this trend. Using their reinforcement learning method called “ProRL,” which enables long-term, stable learning, they demonstrated that the model was able to “discover” new reasoning strategies and find solutions to tasks that the original foundation model could not answer correctly. This suggests that reinforcement learning can indeed endow base models with new reasoning abilities.

These research breakthroughs show that the development of artificial intelligence is progressing at an astonishing rate. Self-improving AIs, such as the Darwin Gödel Machine, could revolutionize software development and many other fields. At the same time, it is crucial that we address the ethical and safety issues involved responsibly and thoughtfully, so that the development of AI serves the good of humanity.
