NMPFamsDB: a database of novel protein families from microbial metagenomes and metatranscriptomes

Published in:

Nucleic Acids Research 52(D1), D502-D512 (2023)

Author(s):

Baltoumas, Fotis A; Karatzas, Evangelos; Liu, Sirui; Ovchinnikov, Sergey; Sofianatos, Yorgos; Chen, I-Min; Kyrpides, Nikos C; Pavlopoulos, Georgios A

DOI:

10.1093/nar/gkad800

Abstract:

The Novel Metagenome Protein Families Database (NMPFamsDB) is a database of metagenome- and metatranscriptome-derived protein families, whose members have no hits to proteins of reference genomes or Pfam domains. Each protein family is accompanied by multiple sequence alignments, Hidden Markov Models, taxonomic information, ecosystem and geolocation metadata, sequence and structure predictions, as well as 3D structure models predicted with AlphaFold2. In its current version, NMPFamsDB hosts over 100 000 protein families, each with at least 100 members. The reported protein families significantly expand (more than double) the number of known protein sequence clusters from reference genomes and reveal new insights into their habitat distribution, origins, functions and taxonomy. We expect NMPFamsDB to be a valuable resource for microbial proteome-wide analyses and for further discovery and characterization of novel functions. NMPFamsDB is publicly available at http://www.nmpfamsdb.org/ or https://bib.fleming.gr/NMPFamsDB.
