Towards comprehensive annotation of Drosophila melanogaster enzymes in FlyBase

Database (Oxford). 2019 Jan 1:2019:bay144. doi: 10.1093/database/bay144.

Abstract

The catalytic activities of enzymes can be described using Gene Ontology (GO) terms and Enzyme Commission (EC) numbers. These annotations are available from numerous biological databases and are routinely accessed by researchers and bioinformaticians to direct their work. However, enzyme data may not be congruent between different resources, while the origin, quality and genomic coverage of these data within any one resource are often unclear. GO/EC annotations are assigned either manually by expert curators or inferred computationally, and there is potential for errors in both types of annotation. If such errors remain unchecked, false positive annotations may be propagated across multiple resources, significantly degrading the quality and usefulness of these data. Similarly, the absence of annotations (false negatives) from any one resource can lead to incorrect inferences or conclusions. We are systematically reviewing and enhancing the functional annotation of the enzymes of Drosophila melanogaster, focusing on improvements within the FlyBase (www.flybase.org) database. We have reviewed four major enzyme groups to date: oxidoreductases, lyases, isomerases and ligases. Herein, we describe our review workflow, the improvement in the quality and coverage of enzyme annotations within FlyBase and the wider impact of our work on other related databases.
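The incongruence between resources described above can be illustrated with a minimal sketch. This is not the authors' review workflow. It only assumes that each resource can be reduced to a mapping from gene identifier to a set of EC numbers. The gene identifiers (`gene1`, `gene2`, ...), the resource names and the helper functions `compare_annotations` and `ec_class` are hypothetical and introduced purely for illustration. A discrepancy flagged this way is only a candidate false positive or false negative that would still need manual review.

```python
from typing import Dict, Set

# Assumption: a resource is modelled as gene identifier -> set of EC numbers.
Annotations = Dict[str, Set[str]]

# Top-level EC classes, keyed by the first digit of the EC number.
EC_CLASSES = {
    "1": "oxidoreductases",
    "2": "transferases",
    "3": "hydrolases",
    "4": "lyases",
    "5": "isomerases",
    "6": "ligases",
    "7": "translocases",
}


def ec_class(ec_number: str) -> str:
    """Return the top-level enzyme class for an EC number such as '1.1.1.1'."""
    return EC_CLASSES.get(ec_number.split(".")[0], "unknown")


def compare_annotations(a: Annotations, b: Annotations) -> Dict[str, Dict[str, Set[str]]]:
    """List genes whose EC annotations differ between two resources.

    For each such gene, report the EC numbers found only in `a` and only in `b`.
    Genes with identical annotations in both resources are omitted.
    """
    report = {}
    for gene in sorted(set(a) | set(b)):
        ec_a = a.get(gene, set())
        ec_b = b.get(gene, set())
        if ec_a != ec_b:
            report[gene] = {"only_in_a": ec_a - ec_b, "only_in_b": ec_b - ec_a}
    return report


# Hypothetical example data: placeholder gene identifiers, not real records.
resource_a = {
    "gene1": {"1.1.1.1"},
    "gene2": {"4.2.1.1"},
    "gene3": {"5.3.1.1"},
}
resource_b = {
    "gene1": {"1.1.1.1"},
    "gene2": {"4.2.1.1", "6.3.1.2"},
    "gene4": {"6.3.1.2"},
}

report = compare_annotations(resource_a, resource_b)
for gene, diff in report.items():
    print(gene, diff)
```

Here `gene1` is congruent and is not reported, whereas `gene2`, `gene3` and `gene4` are flagged because their EC sets differ or the gene is missing from one resource.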

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Databases, Genetic*
  • Drosophila Proteins / genetics*
  • Drosophila melanogaster* / enzymology
  • Drosophila melanogaster* / genetics
  • Enzymes / genetics*
  • Gene Ontology
  • Genes, Insect / genetics*
  • Genomics
  • Molecular Sequence Annotation / methods*

Substances

  • Drosophila Proteins
  • Enzymes