Publication View

A unified model explaining the offsets of overlapping and near-overlapping prokaryotic genes. (2007)

Abstract
Overlapping genes are a common phenomenon. Among sequenced prokaryotes, more than 29% of all annotated genes overlap at least 1 of their 2 flanking genes. We present a unified model for the creation and repair of overlaps among adjacent genes where the 3# ends either overlap or nearly overlap. Our model, derived from a comprehensive analysis of complete prokaryotic genomes in GenBank, explains the nonuniform distribution of the lengths of such overlap regions far more simply than previously proposed models. Specifically, we explain the distribution of overlap lengths based on random extensions of genes to the next occurring downstream stop codon. Our model also provides an explanation for a newly observed (here) pattern in the distribution of the separation distances of closely spaced nonoverlapping genes. We provide evidence that the newly described biased distribution of separation distances is driven by the same phenomenon that creates the uneven distribution of overlap lengths. This suggests a dynamic picture of continual overlap creation and elimination.

Publication details
Download http://hdl.handle.net/1903/7986
Publisher Molecular Biology and Evolution
Repository Digital Repository at the University of Maryland (United States)
Keywords genome analysis, gene finding, overlapping genes, prokaryotes
Type Article
Language English
Relation College of Computer, Mathematical & Physical Sciences, Computer Science, Digital Repository at the University of Maryland, University of Maryland (College Park, MD)