Generative biology helps us understand the location of proteins in cells

 A deep learning model based on artificial intelligence, ProtGPS, can predict how proteins are arranged within cells. This breakthrough not only reveals previously hidden layers of cellular organization but also offers new opportunities in drug development and biotechnology.

The spatial arrangement of proteins within cells plays a critical role in their function. Until now, applications of artificial intelligence in biology have primarily focused on predicting protein structures. The Nobel Prize-winning AI model AlphaFold was able to determine the three-dimensional shape of proteins. However, a protein’s structure alone is not always sufficient to fully understand its function within the cell.

ProtGPS bridges this gap: it can predict not only a protein’s structure but also its precise location within the cell. This capability allows scientists to better target and position proteins, which could represent a major advancement in drug discovery.

A new piece of the cellular map puzzle

Researchers have long known that proteins destined for specific cellular compartments, such as the nucleus or mitochondria, carry special tags. These tiny molecular markers serve as guides, ensuring that proteins reach the correct location. However, a significant portion of the cell functions as an open space, where proteins organize themselves into biomolecular condensates based on more subtle signals. These dynamic, fluid-like clusters regulate gene activity, help cells cope with stress, and play a role in the development of certain diseases.

ProtGPS can detect hidden amino acid sequence patterns that direct proteins to their proper cellular destinations. This capability enables the design of proteins that do not naturally exist but have specific, engineered localizations.

How is AI taught the language of proteins?

ProtGPS is a so-called protein language model, functioning similarly to AI-based language models like ChatGPT. Instead of analyzing words and sentences, it learns from the amino acid sequences of proteins, where each amino acid is represented by a specific letter combination. Therefore, rather than being a generative language model like ChatGPT, ProtGPS is a generative biology model.

The model utilizes a deep learning framework called Evolutionary Scale Modeling (ESM), originally developed by Meta to predict protein structure and function. The uniqueness of ESM lies in its approach: while AlphaFold performs detailed physical calculations, ESM relies on sequence-based learning, allowing it to operate much faster and on larger datasets. This has enabled ProtGPS to rapidly and efficiently decode the principles governing protein localization within cells.

A new tool for drug development and disease research

One of the most exciting applications of ProtGPS lies in disease research and drug development. The model can predict how specific mutations affect the compartmentalization, or localization, of proteins within the cell. This capability is particularly valuable in understanding diseases such as cancer and genetic disorders, where protein mislocalization plays a critical role.

The biotechnology company Dewpoint Therapeutics has already integrated ProtGPS into its drug discovery processes, aiming to develop new therapies for diseases in which proteins aggregate into abnormal condensates. Other researchers also see great potential in this tool, particularly in fields where targeted protein modifications could help combat disease.

A new perspective in Biology

ProtGPS is not just a new biotechnological tool—it represents a shift in scientific perspective. For decades, biology has primarily focused on molecular structures, but it is becoming increasingly clear that spatial organization within the cell is equally important. Just as the mere presence of furniture is not enough to create a well-designed home—its placement also matters—precise molecular organization is essential within cells.

The hidden patterns uncovered by ProtGPS open up new possibilities in biology and drug development. For the first time, scientists can manipulate and precisely target proteins within cells, potentially leading to the development of new drugs and therapies. In the future, artificial intelligence could provide even deeper insights into cellular processes, revolutionizing our understanding of life. 

Share this post
Artificial intelligence, space, and humanity
Elon Musk, founder and CEO of SpaceX, Tesla, Neuralink, and xAI, shared his thoughts on the possible directions of the future in a recent interview, with a particular focus on artificial intelligence, space exploration, and the evolution of humanity.
Real-time music composition with Google Magenta RT
The use of artificial intelligence in music composition is not a new endeavor, but real-time operation has long faced significant obstacles. The Google Magenta team has now unveiled a development that could expand both the technical and creative possibilities of the genre. The new model, called Magenta RealTime (Magenta RT for short), generates music in real time and is accessible to anyone thanks to its open source code.
What would the acquisition of Perplexity AI mean for Apple?
Apple has long been trying to find its place in the rapidly evolving market of generative artificial intelligence. The company waited strategically for decades before directing significant resources into artificial intelligence-based developments. Now, however, according to the latest news, the Cupertino-based company may be preparing to take a bigger step than ever before: internal discussions have begun on the possible acquisition of a startup called Perplexity AI.
The new AI chip that is revolutionizing medicine and telecommunications makes decisions in nanoseconds
As more and more devices connect to the internet and demand grows for instant, high-bandwidth applications such as cloud-based gaming, video calls, and smart homes, the efficient operation of wireless networks is becoming an increasingly serious challenge. The problem is further exacerbated by the fact that the wireless spectrum—the available frequency band—is limited. In their search for a solution, engineers are increasingly turning to artificial intelligence, but current systems are often slow and energy-intensive. A new development that brings data transmission and processing up to the speed of light could change this situation.
This is how LLM distorts
With the development of artificial intelligence (AI), more and more attention is being paid to so-called large language models (LLMs), which are now present not only in scientific research but also in many areas of everyday life—for example, in legal work, health data analysis, and computer program coding. However, understanding how these models work remains a serious challenge, especially when they make seemingly inexplicable mistakes or give misleading answers.
MiniMax-M1 AI model, targeting the handling of large texts
With the development of artificial intelligence systems, there is a growing demand for models that are not only capable of interpreting language, but also of carrying out complex, multi-step thought processes. Such models can be crucial not only in theoretical tasks, but also in software development or real-time decision-making, for example. However, these applications are particularly sensitive to computational costs, which are often difficult to control using traditional approaches.

Linux distribution updates released in the last few days