Skip to main navigation Skip to search Skip to main content

Functional and evolutionary significance of unknown genes from uncultivated taxa

  • Álvaro Rodríguez del Río
  • , Joaquín Giner-Lamia
  • , Carlos P. Cantalapiedra
  • , Jorge Botas
  • , Ziqi Deng
  • , Ana Hernández-Plaza
  • , Martí Munar-Palmer
  • , Saray Santamaría-Hernando
  • , José J. Rodríguez-Herva
  • , Hans Joachim Ruscheweyh
  • , Lucas Paoli
  • , Thomas S.B. Schmidt
  • , Shinichi Sunagawa
  • , Peer Bork
  • , Emilia López-Solanilla
  • , Luis Pedro Coelho
  • , Jaime Huerta-Cepas
  • 28223
  • Technical University of Madrid
  • Instituto de Bioquímica Vegetal y Fotosíntesis (CSIC-IBVF)
  • Swiss Federal Institute of Technology Zurich
  • European Molecular Biology Laboratory
  • Max Delbrück Center for Molecular Medicine in the Helmholtz Association
  • University of Würzburg
  • Fudan University
  • Queensland University of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Many of the Earth’s microbes remain uncultured and understudied, limiting our understanding of the functional and evolutionary aspects of their genetic material, which remain largely overlooked in most metagenomic studies1. Here we analysed 149,842 environmental genomes from multiple habitats2–6 and compiled a curated catalogue of 404,085 functionally and evolutionarily significant novel (FESNov) gene families exclusive to uncultivated prokaryotic taxa. All FESNov families span multiple species, exhibit strong signals of purifying selection and qualify as new orthologous groups, thus nearly tripling the number of bacterial and archaeal gene families described to date. The FESNov catalogue is enriched in clade-specific traits, including 1,034 novel families that can distinguish entire uncultivated phyla, classes and orders, probably representing synapomorphies that facilitated their evolutionary divergence. Using genomic context analysis and structural alignments we predicted functional associations for 32.4% of FESNov families, including 4,349 high-confidence associations with important biological processes. These predictions provide a valuable hypothesis-driven framework that we used for experimental validatation of a new gene family involved in cell motility and a novel set of antimicrobial peptides. We also demonstrate that the relative abundance profiles of novel families can discriminate between environments and clinical conditions, leading to the discovery of potentially new biomarkers associated with colorectal cancer. We expect this work to enhance future metagenomics studies and expand our knowledge of the genetic repertory of uncultivated organisms.

Original languageEnglish
Pages (from-to)377-384
Number of pages8
JournalNature
Volume626
Issue number7998
DOIs
Publication statusPublished - 8 Feb 2024
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Fingerprint

Dive into the research topics of 'Functional and evolutionary significance of unknown genes from uncultivated taxa'. Together they form a unique fingerprint.

Cite this