Title : eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses
Publish Date : 2019-01-08 00:00:00.0
Zotero Link: zotero://select/library/items/UQGIR4AH

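  • An Orthologous Group is a set of genes or proteins from different species that share a common ancestor and have similar functions.
Terminology
Orthologs: genes in different species that arose from the same ancestral gene through speciation; their functions are usually similar.
Paralogs: genes that arose through gene duplication, typically compared within the same species; their functions may diverge.
Orthologous Group: multiple orthologs (plus some reliably assigned paralogs) organized into one cluster, forming an "orthologous group".
  • An orthologous group ≠ a gene family
  • Orthologous groups put more emphasis on cross-species homology and conserved function
  • Protein families (e.g. Pfam) lean more toward domain similarity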

Schematic structure

Orthologous Group: ENOG12345
 ├── Human_Gene1
 ├── Mouse_GeneA
 ├── Yeast_GeneX
 └── Ecoli_GeneZ
→ Annotations: GO:0008152 (metabolism), KO:K00001 (dehydrogenase)
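
The schematic above can be mirrored in a small data structure. This is only a minimal sketch for note-taking purposes, not eggNOG's actual data model: the class name `OrthologousGroup`, its fields, and the assumption that member names follow a `Species_Gene` pattern are all hypothetical, and the ID `ENOG12345` is the illustrative placeholder from the schematic.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class OrthologousGroup:
    """Hypothetical container for one orthologous group (OG)."""
    og_id: str
    members: list                      # member names, assumed "Species_Gene"
    annotations: dict = field(default_factory=dict)

    def by_species(self):
        """Group member gene names by the species prefix before the first '_'."""
        grouped = defaultdict(list)
        for name in self.members:
            species, gene = name.split("_", 1)
            grouped[species].append(gene)
        return dict(grouped)

    def species(self):
        """Sorted list of distinct species represented in the group."""
        return sorted(self.by_species())


# The example group from the schematic, with its two annotation terms.
og = OrthologousGroup(
    og_id="ENOG12345",
    members=["Human_Gene1", "Mouse_GeneA", "Yeast_GeneX", "Ecoli_GeneZ"],
    annotations={"GO": ["GO:0008152"], "KO": ["K00001"]},
)
```

Calling `og.species()` lists the four species in the group, and `og.by_species()` shows which genes each species contributes, which is where in-group paralogs would show up as species with more than one gene.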

eggNOG is a public database of orthology relationships, gene evolutionary histories and functional annotations. Here, we present version 5.0, featuring a major update of the underlying genome sets, which have been expanded to 4445 representative bacteria and 168 archaea derived from 25 038 genomes, as well as 477 eukaryotic organisms and 2502 viral proteomes that were selected for diversity and filtered by genome quality. In total, 4.4M orthologous groups (OGs) distributed across 379 taxonomic levels were computed together with their associated sequence alignments, phylogenies, HMM models and functional descriptors. Precomputed evolutionary analysis provides fine-grained resolution of duplication/speciation events within each OG. Our benchmarks show that, despite doubling the amount of genomes, the quality of orthology assignments and functional annotations (80% coverage) has persisted without significant changes across this update. Finally, we improved eggNOG online services for fast functional annotation and orthology prediction of custom genomics or metagenomics datasets. All precomputed data are publicly available for downloading or via API queries at http://eggnog.embl.de
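
To make "resolution of duplication/speciation events within each OG" concrete, here is a minimal sketch of one common heuristic, species overlap: an internal node of a gene tree is called a duplication if its two child subtrees share at least one species, and a speciation otherwise. The abstract does not say which method eggNOG uses, so this is an illustrative assumption, as are the nested-tuple tree encoding, the `Species_Gene` leaf naming, and every function name below.

```python
def species_of(leaf):
    """Species prefix of a leaf name, assuming the form 'Species_Gene'."""
    return leaf.split("_", 1)[0]


def leaves(node):
    """All leaf names under a node; a node is a leaf string or a (left, right) tuple."""
    if isinstance(node, str):
        return [node]
    left, right = node
    return leaves(left) + leaves(right)


def label_events(node, events=None):
    """Collect (left_leaves, right_leaves, event) for every internal node."""
    if events is None:
        events = []
    if isinstance(node, str):
        return events
    left, right = node
    l, r = leaves(left), leaves(right)
    shared = {species_of(x) for x in l} & {species_of(x) for x in r}
    # Species overlap rule: shared species between children implies duplication.
    events.append((l, r, "duplication" if shared else "speciation"))
    label_events(left, events)
    label_events(right, events)
    return events


def classify_pairs(tree):
    """Label each gene pair by the event at the node that separates the two genes."""
    pairs = {}
    for l, r, event in label_events(tree):
        relation = "paralogs" if event == "duplication" else "orthologs"
        for a in l:
            for b in r:
                pairs[frozenset((a, b))] = relation
    return pairs


# Toy gene tree: a duplication before the human/mouse split, with yeast as outgroup.
tree = ((("Human_A1", "Mouse_A1"), ("Human_A2", "Mouse_A2")), "Yeast_A")
pairs = classify_pairs(tree)
```

In this toy tree, `Human_A1` and `Mouse_A1` come out as orthologs, while `Human_A1` and `Mouse_A2` come out as paralogs even though they belong to different species, because the node separating them is a duplication.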