Homology-based annotation yields 1,042 new candidate genes in the Drosophila melanogaster genome

Show full item record

Title: Homology-based annotation yields 1,042 new candidate genes in the Drosophila melanogaster genome
Author: Gopal, Shuba; Schroeder, Mark; Pieper, Ursula; Sczyrba, Alexander; Aytekin-Kurban, Gulriz; Bekiranov, Stefan; Fajardo, Eduardo; Eswar, Narayanan; Sanchez, Roberto; Sali, Andrej; Gaasterland, Terry
Abstract: The approach to annotating a genome critically affects the number and accuracy of genes identified in the genome sequence. Genome annotation based on stringent gene identification is prone to underestimate the complement of genes encoded in a genome. In contrast, over-prediction of putative genes followed by exhaustive computational sequence, motif and structural homology search will find rarely expressed, possibly unique, new genes at the risk of including non-functional genes. We developed a two-stage approach that combines the merits of stringent genome annotation with the benefits of over-prediction. First we identify plausible genes regardless of matches with EST, cDNA or protein sequences from the organism (stage 1). In the second stage, proteins predicted from the plausible genes are compared at the protein level with EST, cDNA and protein sequences, and protein structures from other organisms (stage 2). Remote but biologically meaningful protein sequence or structure homologies provide supporting evidence for genuine genes. The method, applied to the Drosophila melanogaster genome, validated 1,042 novel candidate genes after filtering 19,410 plausible genes, of which 12,124 matched the original 13,601 annotated genes1. This annotation strategy is applicable to genomes of all organisms, including human.
Record URI: http://hdl.handle.net/1850/2370
Publishers URL: http://dx.doi.org/10.1038/85922
Date: 2001-03

Files in this item

Files Size Format View

An open access version of this file is not available. Check "Publisher URL" field for access

This item appears in the following Collection(s)

Show full item record

Search RIT DML

Advanced Search