The largest study of ocean DNA reveals the rich world of ocean microbes. With more than 317 million gene groups, the KMAP Global Ocean Gene Catalog 1.0 is an open-source database that can drive biotechnology innovation and uncover the role of marine microbes in mitigating global warming and safeguarding human and planetary health.
The ocean is the worldโs largest habitat, yet much of its biodiversity is still unknown. A study published in Frontiers in Science marks a significant breakthrough, reporting the largest and most comprehensive database of marine microbes to date โ matched with biological function, location, and habitat type.
โThe KMAP Global Ocean Gene Catalog 1.0 is a leap toward understanding the oceanโs full diversity, containing more than 317 million gene groups from marine organisms around the world,โ said lead author Elisa Laiolo of the King Abdullah University of Science and Technology (KAUST) in Saudi Arabia. โThe catalog focuses on marine microbes, which greatly impact human lives through their influence on the oceanโs health and the Earthโs climate.โ
โThe catalog is freely available through the KAUST Metagenomic Analysis Platform (KMAP),โ added the studyโs senior author, Prof Carlos Duarte, a world leader in biological oceanography and marine ecology also at KAUST. โScientists can access the catalog remotely to investigate how different ocean ecosystems work, track the impact of pollution and global warming, and search for biotechnology applications such as new antibiotics or new ways to break down plastics โ the possibilities are endless!โ

Researchers have been mapping marine biodiversity for hundreds of years, but faced various challenges to creating a full atlas of ocean life. One is that most marine organisms cannot be studied in a laboratory. The advent of DNA sequencing technologies overcame this by allowing organisms to be identified directly from ocean water and sediments.
โSince each species has its own set of genes, we can identify which organisms are in an ocean sample by analyzing its genetic material,โ Laiolo explained. โTwo technological advances have made this possible at scale.
โThe first is the enormous increase in speed, and decrease in cost, of DNA sequencing technologies. This has allowed researchers to sequence all the genetic material in thousands of ocean samples.โ
โThe second is the development of massive computational power and AI technologies, which make it possible to analyze these millions of sequences.โ
The team used KMAP to scan DNA sequences from 2,102 ocean samples taken at different depths and locations around the world. This advanced computing infrastructure identified 317.5 million gene groups, of which more than half could be classified according to organism type and gene function. By matching this information with the sample location and habitat type, the resulting catalog provides unprecedented information on which microbes live where and what they do.
โThis achievement reflects the critical importance of open science,โ said Duarte. โBuilding the catalog was only possible thanks to ambitious global sailing expeditions where the samples were collected and the sharing of the samplesโ DNA in the open-access European Nucleotide Archive. We are continuing these collaborative efforts by making the catalog freely available.โ
A wealth of scientific and industrial applications
The catalog has already revealed a difference in microbial activity in the water column and ocean floor, as well as a surprising number of fungi living in the โtwilightโ mesopelagic zone. These and other insights will help scientists understand how microbes living in different habitats shape ecosystems, contribute to ocean health, and influence the climate.
The catalog also serves as a baseline for tracking the effect of human impacts like pollution and global warming on marine life. And it offers a wealth of genetic material that researchers can scan for novel genes that could be used for drug development, energy, and agriculture.
Toward a global ocean genome
The KMAP Ocean Gene Catalog 1.0 is a first step towards developing an atlas of the global ocean genome, which will document every gene from every marine species worldwide โ from bacteria and fungi to plants and animals.
โOur analysis highlights the need to continue sampling the oceans, focusing on areas that are under-studied, such as the deep sea and the ocean floor. Also, since the ocean is forever changing โ both due to human activity and to natural processes โ the catalog will need continual updating,โ said Laiolo. Duarte cautions that despite its clear benefit, the future of the catalog is uncertain. A major obstacle is the status of international legislation on benefit-sharing from discoveries made in international waters.
โWhile the 2023 Treaty of the High Seas offers some solutions, it may inadvertently impede research by reducing incentives for companies and governments to invest. Such uncertainty must be resolved now we have reached the point where genetic and artificial intelligence technologies could unlock unprecedented innovation and progress in blue biotechnology,โ he concluded.
The article is part of theย Frontiers in Scienceย multimedia article hub โThe KMAP Global Ocean Gene Catalog 1.0โ. The hub features an explainer as well as anย editorial,ย viewpoint, andย policy outlookย from other eminent experts: Prof Enric Sala (National Geographic Society, USA), Prof Andreas Teske (University of North Carolina at Chapel Hill, USA), and Peggy Rodgers Kalas (International Ocean Policy Advisor to the Oceano Azul Foundation, and former Director of the High Seas Alliance).
This article was originally published in Frontiers In.
IMAGE CREDIT: Frontiers In.
Sign up for the Daily Dose Newsletter and get the morning’s best science news from around the web delivered straight to your inbox? It’s easy like Sunday morning.





Leave a Reply