TxGemma - New Open Model for Drug Development

One of the biggest challenges in drug development is getting lead compounds through clinical trials, as about 90% of candidates fail beyond phase 1 trials. In this context, TxGemma, a collection of open models built on Gemma, Google DeepMind's family of modern, lightweight open models, represents a breakthrough. TxGemma aims to harness the power of large language models to improve the efficiency of therapeutic discovery, from identifying promising targets to predicting clinical trial outcomes.

TxGemma is the Successor to Tx-LLM

Launched last October, Tx-LLM was trained to perform a range of therapeutic tasks in the drug development process. The model generated significant interest, so the developers quickly fine-tuned it based on user feedback, resulting in TxGemma. TxGemma is available in three sizes (2B, 9B, and 27B), each with a "predict" version optimized for narrow therapeutic tasks, such as predicting the toxicity of a molecule or its ability to cross the blood-brain barrier.

TxGemma was trained on millions of practical examples that enable the model to excel at three kinds of tasks: classification, regression, and generation. The largest predict version, at 27B, outperforms or at least keeps pace with the previous Tx-LLM model in almost all tasks tested, and even outperforms many models optimized for specific tasks. According to the detailed performance data, it produced similar or better results than Tx-LLM in 64 out of 66 tasks and strictly better results in 45 of them.

Chat Capabilities and Further Fine-Tuning Options

The developers focused not only on raw predictive capabilities but also integrated chat features into the models. This allows the models to answer complex questions, justify their decisions, and provide feedback in multi-step conversations. For example, a researcher can ask why a particular molecule was classified as toxic, and the model can justify its answer by referring to the molecule’s structure.

The release of TxGemma offers not only an end product but also a customizable platform for developers and researchers. With the included Colab notebook, it is easy to fine-tune the model based on your own therapeutic data and tasks—such as predicting adverse events in clinical trials. Additionally, TxGemma can be integrated with the Agentic-Tx system, which includes 18 advanced tools, such as PubMed and Wikipedia search, as well as molecular, gene, and protein tools. This solution helps combine everyday research workflows with the multi-step inference capabilities provided by agent systems.

Availability

TxGemma is available in the Vertex AI Model Garden and on the Hugging Face platform, so anyone interested can explore how the system works, try out its inference and fine-tuning features, and experiment with the complex workflows offered by Agentic-Tx. As an open model, TxGemma also offers the possibility for further development, since researchers can tailor it to their specific therapeutic development needs using their own data.
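As a rough illustration of what trying out a "predict" model from Hugging Face might look like, the sketch below builds a prompt for the blood-brain barrier task mentioned earlier and passes it to a causal language model through the `transformers` library. The prompt wording, the helper names `build_bbb_prompt` and `predict`, and the model identifier `google/txgemma-2b-predict` are assumptions made for illustration; the official model card should be consulted for the exact prompt templates and model names.

```python
# Minimal sketch, not the official TxGemma usage.
# The template wording below is an assumption, not the official prompt.
BBB_TEMPLATE = (
    "Instructions: Answer the following question about drug properties.\n"
    "Question: Given a drug SMILES string, predict whether it\n"
    "(A) does not cross the blood-brain barrier "
    "(B) crosses the blood-brain barrier\n"
    "Drug SMILES: {smiles}\n"
    "Answer:"
)


def build_bbb_prompt(smiles: str) -> str:
    """Fill the (hypothetical) template with a molecule's SMILES string."""
    smiles = smiles.strip()
    if not smiles:
        raise ValueError("SMILES string must not be empty")
    return BBB_TEMPLATE.format(smiles=smiles)


def predict(prompt: str,
            model_id: str = "google/txgemma-2b-predict",  # assumed model ID
            max_new_tokens: int = 8) -> str:
    """Generate a short answer for the prompt with a Hugging Face model.

    Requires the `transformers` and `torch` packages and access to the
    model weights; the import is local so the prompt helper works without them.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = outputs[0][inputs["input_ids"].shape[-1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()


if __name__ == "__main__":
    # Caffeine, used here purely as an example molecule.
    print(build_bbb_prompt("CN1C=NC2=C1C(=O)N(C(=O)N2C)C"))
```

Calling `predict(build_bbb_prompt(...))` would download the model weights on first use, so the script above only prints the prompt by default.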

The advent of TxGemma could open a new chapter in drug development, significantly shortening the process from the laboratory to the patient’s bedside and reducing development costs. 
