Abstract
BACKGROUND: Transcription slippage occurs on certain patterns of repeat mononucleotides, resulting in synthesis of a heterogeneous population of mRNAs. Individual mRNA molecules within this population differ in the number of nucleotides they contain that are not specified by the template. When transcriptional slippage occurs in a coding sequence, translation of the resulting mRNAs yields more than one protein product. Except where the products of the resulting mRNAs have distinct functions, transcription slippage occurring in a coding region is expected to be disadvantageous. This probably leads to selection against most slippage-prone sequences in coding regions. RESULTS: To find a length at which such selection is evident, we analyzed the distribution of repetitive runs of A and T of different lengths in 108 bacterial genomes. This length varies significantly among different bacteria, but in a large proportion of available genomes corresponds to nine nucleotides. Comparative sequence analysis of these genomes was used to identify occurrences of 9A and 9T transcriptional slippage-prone sequences used for gene expression. CONCLUSIONS: IS element genes are the largest group found to exploit this phenomenon. A number of genes with disrupted open reading frames (ORFs) have slippage-prone sequences at which transcriptional slippage would result in uninterrupted ORF restoration at the mRNA level. The ability of such genes to encode functional full-length protein products brings into question their annotation as pseudogenes and in these cases is pertinent to the significance of the term 'authentic frameshift' frequently assigned to such genes.
| Original language | English |
|---|---|
| Pages (from-to) | R25 |
| Journal | Genome Biology |
| Volume | 6 |
| Issue number | 3 |
| DOIs | |
| Publication status | Published - 2005 |
Keywords
- Slippage
- Gene
- Genome
- Biology
- Bacteria
- Genetics
- Expression (computer science)
- Gene expression
- Distribution (mathematics)
- Element (criminal law)
- Computational biology
- Computer science
- Political science
- Mathematics
- Engineering
- Structural engineering
- Mathematical analysis
- Law
- Programming language