Latent Sequence Periodicity of Some Oncogenes
and DNA-binding Protein Genes
EV Korotkov, MA Korotkova1 and JS Tulko
Center of Bioengineering of the Russian Academy of
Sciences, 60-Oktybrya prospect, 7/1, Moscow,
117312
and Moscow Physical Engineering Institute,
Department of Cybernetics, Ministry of Education of
Russia, 115409, Moscow, Russia and
1To whom correspondence should be addressed at: Moscow
Physical Engineering Institute, Department of
Cybernetics, Ministry of Education of Russia,
Kashirskoe shosse, 31, 115409, Moscow, Russia
CABIO, 13(1), 37--44(January 1997).
Abstract
A method of latent periodicity search is developed.
We use mutual information to reveal the latent
periodicity of mRNA sequences. The latent
periodicity of an mRNA sequence is a periodicity
with a low level of similarity between any two
periods inside the mRNA sequence. The mutual
information between an artificial numerical sequence
and an mRNA sequence is calculated. The length of
the artificial sequence period is varied from 2 to 150.
The high level of the mutual information between
artificial and mRNA sequences allows us to find any
type of latent periodicity of mRNA sequence. The
latent periodicity of many mRNA coding regions has
been found. For example, the retinoblastoma gene of
HSRBS clone contains a region with a latent period
equal to 45 bases. The A-RAF oncogene of
HSARAF1R clone contains a region with a latent
period equal to 84 bases. Integrated sequences for the
regions with latent periodicity are determined. The
potential significance of latent periodicity is
discussed.