Acceptor splice site prediction

Title: Acceptor splice site prediction
Author: Foster, Eric
Abstract: Gene finding is an important aspect of biological research. Many approaches exist, yet the problem itself remains largely unsolved. The various signals involved in gene location and modification offer an opportunity for accurate gene prediction, and many algorithms break the problem into smaller parts, each focusing on particular signals and properties. This warrants studying those signals individually. This work focuses on splice site prediction, and more specifically on acceptor splice site prediction. Several current approaches (weight matrix models and Markov models) are applied, along with a novel approach known as the log odds ratio. The log odds ratio is found to be able to double the positive predictive value obtained with the other methods. In agreement with similar work by Lukas Habegger, the log odds ratio models that incorporate 2nd order Markov models perform favorably. A maximum dependency decomposition is also performed; consistent with Habegger’s findings, it highlights a position close to that of the branch point sequence as a position of maximum dependency. These results suggest that maximum dependency decomposition may be a novel way to examine the elusive branch point sequence in eukaryotic organisms. Habegger observed a stronger maximum dependency in Leishmania major, most likely because of differences in spliceosome function between lower and higher eukaryotes.
Record URI: http://hdl.handle.net/1850/4629
Date: 2007-05

Files in this item

File: EFosterThesis05-2007.pdf
Size: 2.141Mb
Format: PDF
