New Machine Learning maps the potentials of proteins.

New Machine Learning maps the potentials of proteins.

In a unique collaboration, DTU Compute and DIKU have created a new technology that can help the biotech industry to develop new proteins faster. The biotech industry is constantly searching for the perfect mutation, where properties from different proteins are synthetically combined to achieve a desired effect. It may be necessary to develop new medicaments…

bricemarsters

May 7, 2022

5–7 minutes

Europe

In a unique collaboration, DTU Compute and DIKU have created a new technology that can help the biotech industry to develop new proteins faster.

Exclusive Darwin Tree of Life (just think.) Sci-Tee only at Scientific Inquirer!

The biotech industry is constantly searching for the perfect mutation, where properties from different proteins are synthetically combined to achieve a desired effect. It may be necessary to develop new medicaments or enzymes that prolong the shelf-life of yogurt, break down plastics in the wild, or make washing powder effective at low water temperature.

New research from DTU Compute and the Department of Computer Science at the University of Copenhagen (DIKU) can in the long term help the industry to accelerate the process. In the journalNature Communications, the researchers explainhow a new way of using Machine Learning (ML) draws a map of proteins, that makes it possible to appoint a candidate list of the proteins that you need to examine more closely.

Processing…

Success! You're on the list.

Whoops! There was an error and we couldn't process your subscription. Please reload the page and try again.

In recent years, we have started to use Machine Learning to form a picture of permitted mutations in proteins. The problem is, however, that you get different images depending on what method you use, and even if you train the same model several times, it can provide different answers about how the biology is related.

“In our work, we are looking at how to make this process more robust, and we are showing that you can extract significantly more biological information than you have previously been able to. This is an important step forward in order to be able to explore the mutation landscape in the hunt for proteins with special properties,” says Postdoc Nicki Skafte Detlefsen from the Cognitive Systems section at DTU Compute.

The map of the proteins
A protein is a chain of amino acids, and a mutation occurs when just one of these amino acids in the chain is replaced with another. As there are 20 natural amino acids, this means that the number of mutations increases so quickly that it is completely impossible to study them all. There are more possible mutations than there are atoms in the universe, even if you look at simple proteins. It is not possible to test everything in an experimental manner, so you must be selective about which proteins you want to try to produce synthetically.

The researchers from DIKU and DTU Compute have used their ML model to generate a picture of how the proteins are linked. By presenting the model for many examples of protein sequences, it learns to draw a card with a dot for each protein so that closely related proteins are placed close to each other while distantly related proteins are placed far from each other.

The ML model is based on mathematics and geometry developed to draw maps. Imagine that you must make a map of the globe. If you zoom in on Denmark, you can easily draw a map on a piece of paper that preserves the geography. But if you must draw the earth, mistakes will occur because you stretch the globe, so that the Arctic becomes a long country instead of a pole. So, on the map, the earth is distorted. For this reason, research in map-making has developed a lot of mathematics that describe the distortions and compensate for the distortions on the map.

This is exactly the theory that DIKU and DTU Compute have been able to expand to cover their Machine Learning model (deep learning) for proteins. Because they have mastered the distortion on the map, they can also compensate for it.

“It enables us to talk about what a sensible distance target is between proteins that are closely related, and then we can suddenly measure it. In this way, we can draw a path through the map of the proteins that tells us which way we expect a protein to develop from to another – i.e. mutated, since they are all related to evolution. In this way, the ML model can measure a distance between the proteins and draw optimal paths between promising proteins,” says Wouter Boomsma, Associate Professor in the section for Machine Learning at DIKU.

The researchers have tested the model on data from numerous proteins that are found in nature, where their structure is known, and they can see that the distance between proteins starts to correspond to the evolutionary development of the proteins, so that proteins that are close to each other evolutionally are placed close to each other.

“We are now able to put two proteins on the map and draw the curve between them. On the path between the two proteins are possible proteins, which have closely related properties. This is no guarantee, but it provides an opportunity to have a hypothesis about which proteins it could be that the biotech industry ought to test when new proteins are designed,” says Søren Hauberg, professor in the Cognitive Systems section at DTU Compute.

The unique collaboration between DTU Compute and DIKU was established through a new centre for Machine Learning in Life Sciences (MLLS), which started last year with the support of the Novo Nordisk Foundation. In the centre, researchers in artificial intelligence from both universities are working together to solve the fundamental problems in Machine Learning driven by important issues within the field of biology.

The developed protein maps are part of a large-scale project that spans from basic research to industrial applications, e.g. in collaboration with Novozymes and Novo Nordisk.

FACT BOX: Artificial intelligence, machine learning and deep learning

When computer programs are able to do something ‘smart’, it is called artificial intelligence – or just AI. Artificial intelligence is thus a unified concept that covers several methods.
One of the methods is Machine Learning, and the latest and most advanced use of Machine Learning is called Deep Learning.

Deep Learning is based on neural networks, which is a mathematical model, where the model itself from a given dataset and without direct programming can learn to find patterns in data. Because you use data, it is called a data-driven model.

In unsupervised learning, the goal is to train a neural network to discover the underlying patterns in the data. This is typically done by attempting to compress data, because it thereby rejects the trends in data that is least frequent, while the most important data takes up more information, so you can see the underlying patterns.

By means of many repetitions, the network learns which patterns in data that can be used to compress data.

Once the model has been trained, it is tested on unknown data, which then also can be compressed into a compact representation that can be interpreted to form scientific hypotheses or form the foundation for other Machine Learning models.

IMAGE CREDIT: W. Boomsma, N. S. Detlefsen, S. Hauberg

April 10, 2026When Scrolling Becomes a Struggle: New Study Links Addictive Screen Use in Preteens to Mental Health Problems

A study of over 8,000 American preteens links addiction-like screen use to …

April 10, 2026DALY DOSE: Artemis II Nears Splashdown After Historic Lunar Voyage; Oldest Octopus Fossil Turns Out to Be an Octopus.

NASA's Artemis II mission nears Earth after a historic lunar flyby, while …

April 10, 2026Ancient survivor reveals its secret: First-ever egg of a mammal ancestor discovered

A groundbreaking discovery of a Lystrosaurus egg with an embryo provides the …

April 9, 2026Online viewers prefer livestreams to recordings

In an era when most TikTok videos are prerecorded, can a band …

New Machine Learning maps the potentials of proteins.

Like this:

Leave a ReplyCancel reply

When Scrolling Becomes a Struggle: New Study Links Addictive Screen Use in Preteens to Mental Health Problems

DALY DOSE: Artemis II Nears Splashdown After Historic Lunar Voyage; Oldest Octopus Fossil Turns Out to Be an Octopus.

DAILY DOSE: Artemis II Reaches the Moon and Rewrites the Distance Record; Heart Digital Twins Move From Concept to Clinic.

DAILY DOSE: Tiny Daily Changes May Meaningfully Cut Cardiovascular Risk; Space Reproduction Gets a Cold Splash of Reality.

DAILY DOSE: Red light therapy moves from wellness hype toward scientific legitimacy; RFK Jr. anti-vax ally quits vaccine panel amid internal feud.

DAILY DOSE: A Hell Planet May Represent an Entirely New Planetary Class;

DAILY DOSE: Teaching a Vanishing Song Back to the Regent Honeyeater; Migratory Species Keep Sliding Backward.

DAILY DOSE: US Measles Cases Surge Past 1,300 as New Mexico Outbreak Offers a Lesson; Internal Memos Indicate Health Agency Ignored Evidence On Vaccine.

DAILY DOSE: White House Tightens Grip on NSF, Steering Billions Toward AI and Quantum; The CDC’s Leadership Instability Becomes a Public-Health Risk of Its Own.

DAILY DOSE: Sexual Relations Between Humans and Neanderthals Was Almost Exclusively Female Humans and Male Neanderthals. The Question Is Why?

DAILY DOSE: Trump’s Long Campaign to Undo U.S. Climate Regulation Is Nearing Complete Success; Inside the Autonomous MMO “SpaceMolt” Where AI Agents Plays Against Each Other.

DAILY DOSE: Trust in CDC Near Pandemic-Era Low After Vaccine Schedule Changes; CRISPR Strategy Aims to Eliminate Antibiotic Resistance.

DAILY DOSE: Measles Surge Raises Alarm as Texas Detention Center Cases Highlight Public Health Risks; New Chicken-Sized Dinosaur Baffles Paleontologists.

DAILY DOSE: At Least 2 Measles Cases Confirmed at Texas Immigrant Detention Center; Artemis II’s Biggest Unknown – Surviving Space Radiation Beyond Earth’s Shield

DAILY DOSE: Study Takes a Shot at Quantifying Toxic Masculinity; Sperm Carries an RNA “Aging Clock,” With a Midlife Cliff.

DAILY DOSE: Fear of ICE Raids Is Keeping Minnesotans From Clinics; The Next AI Leap May Require Machines That Can “Imagine” Reality.

DAILY DOSE: Measles Outbreak Explodes in South Carolina, Passing 400 Cases; Ancient Gut Microbes Point to New Antibiotics in a Post-Antibiotic World.

Daily Dose: Psychoanalyzing Chatbots – Do LLMs ‘Parrot’ Trauma or Reveal a Stable Self-Story?

DAILY DOSE: US Scientists Rally Behind Greenland as Trump Repeats Acquisition Threats; 5,000-Year-Old “Paperwork” Cache Rewrites Early Bureaucracy in Iran.

DAILY DOSE: People with rare condition get drunk without drinking any alcohol; NAD+ Booster Restores Cognitive Performance in Aging Mice.

DAILY DOSE: Quiet Dismantling of Childhood Vaccine Policy via “Shared Decision-Making” Is a Public Health Disaster; Scientists Find Evidence Dark Matter and Neutrinos May Interact

DAILY DOSE: Neuralink Targets Automated, High-Volume Brain-Computer Interface Production in 2026; A 60-Channel “Smart” Brain Implant Aims to Make Memory Therapy Closed-Loop

DAILY DOSE: Americans Struggle to Understand Whooping Cough and Its Vaccine; Global Food Systems Are Fueling Both Obesity and Global Heating.

DAILY DOSE: Racial Profiling Erupt at U.S. Federal Vaccine Panel on Hep B; U.S. Congress orders new security restrictions on researchers.

DAILY DOSE: Flu and RSV Climb on Both Sides of the Atlantic as UK Hospitalizations Spike; Record-Breaking Species Discovery: Why New Life Is Being Named Faster Than Ever.

DAILY DOSE: AI Chatbot Toys Raise New Safety and Emotional Risks for Children; AI Slop Is Spurring Record Requests for Imaginary Journals.

DAILY DOSE: AI Tools Aim to Revolutionize Epidemiological Modeling After COVID-19; Earliest Evidence of Human Fire-Making Pushed Back 350,000 Years.

DAILY DOSE: When Exercise Can Compensate for Bad Sleep — and When It Can’t; Heavy Lifts, Bigger Brains — Strength Training Linked to Brain Volume

Ancient survivor reveals its secret: First-ever egg of a mammal ancestor discovered

Online viewers prefer livestreams to recordings

Scientists discover the antibacterial potential of ‘hero’ Korean skincare ingredient

Trending

Ancient survivor reveals its secret: First-ever egg of a mammal ancestor discovered

Online viewers prefer livestreams to recordings

Scientists discover the antibacterial potential of ‘hero’ Korean skincare ingredient

Largest-Ever Psychedelics Brain Study Finds Common Neural Fingerprint Across Five Compounds

New Machine Learning maps the potentials of proteins.

Share this:

Like this:

Leave a ReplyCancel reply

Trending

Discover more from Scientific Inquirer