GTC 2025: NVIDIA's Blackwell-Based Servers and DGX Station

The GTC (GPU Technology Conference), held annually since 2009, will be hosted by NVIDIA this year from March 17 to 21. The conference is designed to showcase the latest developments and to promote collaboration and further innovation across different industries. It is attended mainly by developers, researchers, and technology leaders. NVIDIA CEO Jensen Huang has been saying for some time that companies will become token factories in the future—meaning that every workflow will be supported by artificial intelligence. Currently, large servers play a major role in this process, but AI integration will increasingly extend to personal computers. In the future, computers and laptops will have hardware capable of running even large language models in the background. This is necessary because programmers, engineers, and almost everyone will work with AI assistance.

The New Blackwell GPU

However, let’s not get ahead of ourselves. According to Jensen Huang, achieving these goals requires first dramatically increasing computing power (scale-up) and then expanding the system by combining the scaled-up components (scale-out). He explained that the concept was first tried in the Grace Hopper architecture and the Ranger server model presented three years ago. Although that server was physically too large, it validated the idea. The size problem stemmed partly from air cooling, which the new Blackwell GPU replaces with liquid cooling, allowing it to fit into standard server racks. Another problem was that the NVLink-connected CPU and GPU were housed in the same module. NVLink is a high-speed interconnect between the CPU and GPU with lower latency than the PCI Express interface traditional systems use for this link. By disaggregating NVLink, the CPU and GPU can be placed in separate modules, so each component can be replaced independently in the server.

Despite these improvements, a third problem remains: the optical transceivers used to connect the GPUs. They are extremely expensive (six are needed per GPU, adding roughly $6,000 to the GPU's price), and they add an extra 180 watts of power consumption per GPU. To address this, Jensen Huang presented a solution based on silicon photonics that enables GPUs to communicate using photons. Incidentally, Google is reportedly already using similar optical technology in its data centers, achieving a roughly 40% reduction in power consumption.
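To get a feel for what these transceiver figures mean at rack scale, here is a back-of-the-envelope calculation using the numbers quoted above. The 72-GPU rack size is an illustrative assumption (an NVL72-style rack), not a figure from the talk.

```python
# Cost and power overhead of optical transceivers per rack,
# using the per-GPU figures quoted in the text above.
TRANSCEIVERS_PER_GPU = 6
COST_PER_GPU_USD = 6000        # six transceivers add ~$6,000 per GPU
EXTRA_POWER_PER_GPU_W = 180    # extra consumption per GPU

def rack_overhead(gpus_per_rack: int) -> tuple[int, float]:
    """Return (extra cost in USD, extra power in kW) for one rack."""
    cost = gpus_per_rack * COST_PER_GPU_USD
    power_kw = gpus_per_rack * EXTRA_POWER_PER_GPU_W / 1000
    return cost, power_kw

# Assumed 72-GPU rack (hypothetical, NVL72-style):
cost, power = rack_overhead(72)
print(f"Extra cost per rack:  ${cost:,}")      # $432,000
print(f"Extra power per rack: {power} kW")     # 12.96 kW
```

At this scale the interconnect alone costs hundreds of thousands of dollars and burns over a dozen kilowatts per rack, which is why replacing the transceivers with silicon photonics is worth the engineering effort.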

Server performance roadmap

Thanks to the reduction in size, a single server rack now reaches a performance of 1 exaflop (1,000 petaflops), with an impressive memory bandwidth of 570 TB/s. For comparison, an NVIDIA RTX 4070 (admittedly not designed for servers) offers 504 GB/s, roughly a thousand times less. For a more concrete performance comparison, Jensen Huang cited an AI company with a 1-megawatt power budget: today, 1,400 server racks built on H100s can process 300 million tokens per second when running a large language model. With the new solution, at the same 1 MW consumption, 600 racks of Blackwell compute units replace the old 1,400 and deliver 12 billion tokens per second. The pace of improvement is so dramatic that it is hard to keep track, and successors have already been announced: Blackwell Ultra will arrive at the end of this year, followed by the Rubin and Rubin Ultra GPUs next year and in 2027. Rubin Ultra will deliver 15 exaflops per rack instead of the current 1 exaflop.
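The 1 MW comparison above is easier to appreciate broken down per rack. A quick sketch with the quoted figures (fewer racks, far more tokens per second):

```python
# Throughput comparison from the 1 MW example above:
# 1,400 H100 racks at 300M tokens/s total vs. 600 Blackwell racks
# at 12 billion tokens/s total, at the same power budget.
h100_racks, h100_tokens_per_s = 1400, 300e6
bw_racks, bw_tokens_per_s = 600, 12e9

h100_per_rack = h100_tokens_per_s / h100_racks   # ~214k tokens/s per rack
bw_per_rack = bw_tokens_per_s / bw_racks         # 20M tokens/s per rack

print(f"H100:      {h100_per_rack:,.0f} tokens/s per rack")
print(f"Blackwell: {bw_per_rack:,.0f} tokens/s per rack")
print(f"Per-rack speedup:      {bw_per_rack / h100_per_rack:.0f}x")        # 93x
print(f"Total speedup at 1 MW: {bw_tokens_per_s / h100_tokens_per_s:.0f}x") # 40x
```

In other words, each Blackwell rack does roughly the work of 93 H100 racks on this workload, and the data center as a whole gets 40 times the throughput from the same power budget.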

DGX Station

As mentioned earlier, Jensen Huang speaks of 30 million programmers who will soon work with some form of AI assistance. This is an important distinction between those who claim programmers will become obsolete and those who believe programmers will continue to be needed. However, for programmers to run large language models locally, they need adequate memory bandwidth and enough memory. The DGX Station is NVIDIA's answer to this market need. It features 8 TB/s of memory bandwidth, 20,000 AI TFLOPS, and 784 GB of RAM, of which 288 GB is available to the GPU, so it can run relatively large models. Naturally, it uses the newly announced Blackwell chip, just like the GeForce RTX 5xxx series of graphics cards. The big question, of course, will be its price. The lower-performing DGX Spark, also launched earlier this year, costs $4,000 while offering only 128 GB of RAM, 273 GB/s of memory bandwidth, and 1,000 AI TFLOPS, making it 20 times less powerful and much smaller. The Spark's small size has its advantages, since several units can be linked to build a capable small server for an office, but the price is still quite steep.
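Laying the two machines' quoted specs side by side makes the gap concrete; the ratios below use only the figures cited in the text above.

```python
# Side-by-side of the two desktop systems, using the specs quoted above.
specs = {
    "DGX Station": {"ram_gb": 784, "bandwidth_gbs": 8000, "ai_tflops": 20000},
    "DGX Spark":   {"ram_gb": 128, "bandwidth_gbs": 273,  "ai_tflops": 1000},
}

station, spark = specs["DGX Station"], specs["DGX Spark"]
print(f"Compute ratio:   {station['ai_tflops'] / spark['ai_tflops']:.0f}x")        # 20x
print(f"Bandwidth ratio: {station['bandwidth_gbs'] / spark['bandwidth_gbs']:.1f}x") # 29.3x
print(f"RAM ratio:       {station['ram_gb'] / spark['ram_gb']:.1f}x")               # 6.1x
```

Note that the bandwidth gap (about 29x) is even larger than the compute gap (20x), which matters because LLM token generation tends to be limited by memory bandwidth rather than raw compute.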

DGX Station