The "Clustered Structure" of the Purines/Pyrimidines
Distribution in DNA Distinguishes Systematically
Between Coding and Non-coding Sequences.
Y Almirantis, A Provata
Bulletin of Mathematical Biology,
59(5):975-992 (1997)
Abstract
A method for measuring the inhomogeneous distribution of
purines/pyrimidines in nucleotide sequences is developed. We show
that this measure relates to the coding or non-coding character of the
sequence considered. Coding sequences present a nearly random
Pu or Py distribution. This property is shared by both protein-coding
DNA and functional RNA-coding DNA. Non-coding sequences
present a highly clustered, inhomogeneous distribution. We propose the
hypothesis, supported by appropriate computer simulations, that this is
due to the action of various transposition events accumulated over long
time periods.
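
The abstract does not specify the measure itself, so the following is only an illustrative sketch of how purine/pyrimidine clustering might be quantified, not the authors' method. It assumes a simple proxy: map each base to purine (R) or pyrimidine (Y) and compute the mean length of runs of identical symbols. For an independent random sequence with equal purine and pyrimidine frequencies, run lengths are geometrically distributed with mean 2, so values well above 2 would suggest clustering. The function names `to_pu_py`, `run_lengths`, and `mean_run_length` are hypothetical.

```python
from itertools import groupby

PURINES = set("AG")
PYRIMIDINES = set("CTU")


def to_pu_py(seq):
    """Map a nucleotide sequence to 'R' (purine) / 'Y' (pyrimidine) symbols.

    Assumption: symbols other than A, G, C, T, U (e.g. N or gaps) are
    dropped, which joins the runs on either side of them.
    """
    out = []
    for base in seq.upper():
        if base in PURINES:
            out.append("R")
        elif base in PYRIMIDINES:
            out.append("Y")
    return "".join(out)


def run_lengths(symbols):
    """Return the lengths of consecutive runs of identical symbols."""
    return [len(list(group)) for _, group in groupby(symbols)]


def mean_run_length(seq):
    """Mean Pu/Py run length; an illustrative proxy for clustering."""
    runs = run_lengths(to_pu_py(seq))
    return sum(runs) / len(runs) if runs else 0.0


if __name__ == "__main__":
    # Strictly alternating purine/pyrimidine: every run has length 1.
    print(mean_run_length("ACACACAC"))
    # Two long blocks (8 purines, then 8 pyrimidines): strongly clustered.
    print(mean_run_length("AAAAGGGGCCCCTTTT"))
```

In practice such a statistic would be compared against shuffled versions of the same sequence, so that base composition is held fixed while any clustering is destroyed.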