Computer Prediction of the Exon-intron Structure of Mammalian pre-mRNAs
M.S. Gelfand
Nucleic Acids Research, 18(19):5865-5869 (Oct 11, 1990)
Abstract
A novel approach to the problem of prediction of protein-coding
regions is suggested. This approach combines the site prediction
methods to predict splicing sites and the global coding region
prediction methods to choose the best variant of spliced mRNA. One
of the advantages of the suggested algorithm is that the resulting
mRNA or protein sequence may then be immediately analyzed further.
The true mRNA either coincides with the predicted one or ranks high
in the list of variants. In the latter situation the predicted mRNA
usually differs from the true one in only one or two of several
exons. The combined approach allows the use of a priori information
(e.g. the putative protein length or the number of exons). It is
possible to use additional parameters not considered here, such as
the preferred lengths of exons and introns, and particularly the
preferred position of introns in the reading frame and the preferred
codon position of exon termini.