DeepSeek R1-0528, the latest release from the Chinese company DeepSeek, represents a significant advance in the reasoning capabilities of artificial intelligence models. It is an improved version of the DeepSeek R1 model released in January. According to the company, DeepSeek R1-0528 already rivals OpenAI's o3 model and approaches the capabilities of Google Gemini 2.5 Pro.
The model's reasoning and inference capabilities have improved substantially, a result of increased computing resources, algorithmic optimization, and a jump in average token usage per question from roughly 12,000 to 23,000. These deeper reasoning chains translate into measurable gains on benchmarks: on AIME 2025, for example, accuracy rose from 70% to 87.5%.
The DeepSeek R1-0528 architecture contains 685 billion parameters (up from 671 billion in the previous R1) and uses a Mixture-of-Experts (MoE) design in which only 37 billion parameters are active per token. The model has a 128K-token context window and can generate up to 64K tokens. It supports function calling and JSON output. In addition, the hallucination rate has been reduced, especially when rewriting and summarizing content, and code generation has also improved.
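To make the MoE idea concrete, here is a minimal sketch of top-k expert routing in PyTorch. It is illustrative only: the layer sizes, the router, and the TopKMoELayer name are invented for this example, and DeepSeek's production routing (shared experts, fine-grained expert segmentation, load balancing) is considerably more elaborate. What it demonstrates is the core mechanism: each token passes through only a small, router-selected subset of the experts, which is how a 685-billion-parameter model can activate only about 37 billion parameters per token.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoELayer(nn.Module):
    """Minimal top-k Mixture-of-Experts layer (illustrative sketch,
    not DeepSeek's actual implementation)."""

    def __init__(self, d_model=64, d_ff=256, n_experts=8, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(),
                          nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        ])

    def forward(self, x):  # x: (n_tokens, d_model)
        scores = self.router(x)                     # (n_tokens, n_experts)
        weights, idx = scores.topk(self.k, dim=-1)  # k experts per token
        weights = F.softmax(weights, dim=-1)        # normalize selected scores
        out = torch.zeros_like(x)
        # Only the selected experts run for each token; all other expert
        # parameters stay idle for that token.
        for e, expert in enumerate(self.experts):
            rows, slots = (idx == e).nonzero(as_tuple=True)
            if rows.numel() == 0:
                continue
            out[rows] += weights[rows, slots].unsqueeze(-1) * expert(x[rows])
        return out

x = torch.randn(4, 64)
print(TopKMoELayer()(x).shape)  # torch.Size([4, 64])
```

With 8 experts and k=2, each token touches only a quarter of the expert parameters; scaled up, the same principle yields R1-0528's roughly 5% active-parameter ratio.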
The model has achieved remarkable results across benchmarks. On mathematical tasks, its performance reaches or exceeds that of leading models such as OpenAI o3 and Google Gemini 2.5 Pro. On LiveCodeBench, a programming and coding benchmark, it ranks just behind OpenAI's o4-mini and o3 reasoning models. Its general reasoning has also improved, as evidenced by the jump in its GPQA-Diamond score from 71.5% to 81.0%.
DeepSeek has also released a smaller, distilled version of the main R1-0528 model, called DeepSeek-R1-0528-Qwen3-8B. Built on Qwen3-8B with reasoning knowledge distilled from DeepSeek-R1-0528, it delivers outstanding performance among open-source models: it outperforms the base Qwen3-8B by 10.0% on AIME 2024 and matches the performance of the much larger Qwen3-235B-thinking. It can run on a single GPU with at least 40 GB of VRAM.
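For readers who want to try the distilled model, the following is a minimal loading sketch using Hugging Face transformers. The repository ID is assumed from DeepSeek's usual naming convention, and the prompt is purely illustrative. In bfloat16, the 8B weights occupy roughly 16 GB, leaving the rest of a 40 GB card for the KV cache that long reasoning traces require.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed Hugging Face repo ID, following DeepSeek's naming convention.
model_id = "deepseek-ai/DeepSeek-R1-0528-Qwen3-8B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # ~16 GB of weights for an 8B model
    device_map="auto",           # place layers on the available GPU
)

messages = [{"role": "user", "content": "What is 12 * 17? Think step by step."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Reasoning models emit long chains of thought, so allow generous output.
output = model.generate(inputs, max_new_tokens=512)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```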