Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near-full-length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) these rDNA sequences have a propensity for being centromere proximal, and 3) sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S+IGS) can be found distributed throughout the genome and are identifiable as an ‘rDNA-like signal’, representing 0.26% of the q arm of HSA21 and ∼2% of the total sequence of other regions tested. The sizes of sequence strings found in the rDNA-like signal intergrade with the sizes of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggest that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The production of rDNA pseudogenes as a by-product of concerted evolution is a previously underappreciated process; we demonstrate its importance here.
Ribosomal RNA genes contribute to the formation of pseudogenes and junk DNA in the human genome
Brent M. Robicheau, Edward Susko, Amye M. Harrigan, Marlene Snyder; Ribosomal RNA genes contribute to the formation of pseudogenes and junk DNA in the human genome. Genome Biol Evol 2017 evw307. doi: 10.1093/gbe/evw307
© 2017 Oxford University Press