Article thumbnail

A Comprehensive Map of Mobile Element Insertion Polymorphisms in Humans

By Chip Stewart, Deniz Kural, Michael P. Strömberg, Jerilyn A. Walker, Miriam K. Konkel, Adrian M. Stütz, Alexander E. Urban, Fabian Grubert, Hugo Y. K. Lam, Wan-Ping Lee, Michele Busby, Amit R. Indap, Erik Garrison, Chad Huff, Jinchuan Xing, Michael P. Snyder, Lynn B. Jorde, Mark A. Batzer, Jan O. Korbel and Gabor T. Marth

Abstract

As a consequence of the accumulation of insertion events over evolutionary time, mobile elements now comprise nearly half of the human genome. The Alu, L1, and SVA mobile element families are still duplicating, generating variation between individual genomes. Mobile element insertions (MEI) have been identified as causes for genetic diseases, including hemophilia, neurofibromatosis, and various cancers. Here we present a comprehensive map of 7,380 MEI polymorphisms from the 1000 Genomes Project whole-genome sequencing data of 185 samples in three major populations detected with two detection methods. This catalog enables us to systematically study mutation rates, population segregation, genomic distribution, and functional properties of MEI polymorphisms and to compare MEI to SNP variation from the same individuals. Population allele frequencies of MEI and SNPs are described, broadly, by the same neutral ancestral processes despite vastly different mutation mechanisms and rates, except in coding regions where MEI are virtually absent, presumably due to strong negative selection. A direct comparison of MEI and SNP diversity levels suggests a differential mobile element insertion rate among populations

Topics: Research Article
Publisher: Public Library of Science
OAI identifier: oai:pubmedcentral.nih.gov:3158055
Provided by: PubMed Central

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.

Suggested articles

Citations

  1. (in preparation) SPANNER: a structural variation detection tool.
  2. (2008). Accurate whole human genome sequencing using reversible terminator chemistry.
  3. (2010). Alu repeat discovery and characterization within human genomes. Genome Res.
  4. (2002). Alu repeats and human genomic diversity.
  5. (2006). An initial map of insertion and deletion (INDEL) variation in the human genome.
  6. (2001). Automated finishing with autofinish.
  7. (1998). Base-calling of automated sequencer traces using phred. II. Error probabilities.
  8. (2002). BLAT–the BLAST-like alignment tool.
  9. (2009). BreakDancer: an algorithm for high-resolution mapping of genomic structural variation.
  10. (2005). Calibrating a coalescent simulation of human genome sequence variation.
  11. Chimpanzee Sequencing and Analysis Consortium (2005) Initial sequence of the chimpanzee genome and comparison with the human genome.
  12. (1987). Clustering with local equivalence relations.
  13. (2011). CNVnator: An approach to discover, genotype and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res.
  14. (2009). Combinatorial algorithms for structural variation detection in high-throughput sequenced genomes.
  15. (2010). Complete Khoisan and Bantu genomes from southern Africa.
  16. (2011). Discovery and genotyping of genome structural polymorphism by sequencing on a population scale.
  17. (2000). Estimate of the mutation rate per nucleotide in humans.
  18. (2003). Estimating ancestral population sizes and divergence times.
  19. (2006). Estimating the retrotransposition rate of human Alu elements.
  20. (2002). Estimation of animal abundance and related parameters.
  21. (2008). Estimation of hominoid ancestral population sizes under bayesian coalescent models incorporating mutation rate variation and sequencing errors.
  22. (2007). Evolutionary history of 7SL RNA-derived SINEs in Supraprimates.
  23. (1968). Evolutionary rate at the molecular level.
  24. (2009). Finescaled human genetic structure revealed by SNP microarrays.
  25. (2001). Genomic divergences between humans and other hominoids and the effective population size of the common ancestor of humans and chimpanzees.
  26. (2006). Global variation in copy number in the human genome.
  27. (1988). Haemophilia A resulting from de novo insertion of L1 sequences represents a novel mechanism for mutation in man.
  28. (1996). High frequency retrotransposition in cultured mammalian cells.
  29. (2010). High-throughput sequencing reveals extensive variation in human-specific L1 content in individual human genomes. Genome Res.
  30. (2003). Hot L1s account for the bulk of retrotransposition in the human population.
  31. Huang W (in preparation) ART: Next-generation read simulator.
  32. (2006). Human genomic deletions mediated by recombination between Alu elements.
  33. (2007). Identification and characterization of novel polymorphic LINE-1 insertions through comparison of two human genome sequence assemblies.
  34. (2009). Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data.
  35. (2008). L1 recombination-associated deletions generate human genomic variation.
  36. (2009). L1 retrotransposition in human neural progenitor cells.
  37. (2009). LINE dancing in the human genome: transposable elements and disease.
  38. (2010). LINE-1 retrotransposition activity in human genomes.
  39. (2003). LINE-mediated retrotransposition of marked Alu sequences.
  40. (2008). Mapping and sequencing of structural variation from eight human genomes.
  41. (2008). Mapping short DNA sequencing reads and calling variants using mapping quality scores.
  42. (2011). Mapping structural variation at fine-scale by population-scale genome sequencing.
  43. Marth GT (in preparation) MOSAIK: A nextgeneration reference-guided aligner.
  44. (1992). Master genes in mammalian repetitive DNA amplification.
  45. (2010). Measurements of spontaneous rates of mutations in the recent past and the near future.
  46. (2010). Mobile element scanning (ME-Scan) by targeted high-throughput sequencing.
  47. (2009). Mobile elements create structural variation: analysis of a complete human genome.
  48. (2010). Mobile interspersed repeats are major structural variants in the human genome.
  49. (1996). Mutation analysis in the BRCA2 gene in primary breast cancers.
  50. (2010). Natural mutagenesis of human genomes by endogenous retrotransposons.
  51. (1975). On the number of segregating sites in genetical models without recombination.
  52. (2007). Pairedend mapping reveals extensive structural variation in the human genome.
  53. (2009). Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads.
  54. (2007). Principles of population genetics.
  55. (2007). Progress in understanding the biology of the human mutagen LINE-1.
  56. (2007). Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering.
  57. (2000). Recent common ancestry of human Y chromosomes: evidence from DNA sequence data.
  58. (2006). Recently mobilized transposons in the human and chimpanzee genomes.
  59. (2005). Repbase Update, a database of eukaryotic repetitive elements.
  60. (2004). Retrotransposition of Alu elements: how many sources?
  61. (2009). Sensitive and accurate detection of copy number variants using read depth of coverage.
  62. (1989). Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.
  63. (1995). Statistical properties of segregating sites.
  64. (2003). SVA elements are nonautonomous retrotransposons that cause disease in humans.
  65. (2005). SVA elements: a hominid-specific retroposon family.
  66. (2004). The allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations.
  67. (1968). The art of computer programming.
  68. (2007). The diploid genome sequence of an individual human.
  69. (2009). The impact of retrotransposons on human genome evolution.
  70. (2009). The regulated retrotransposon transcriptome of mammalian cells.
  71. (2010). The UCSC Genome Browser database: update 2010.
  72. (2010). Towards a comprehensive map of human sequence variation.
  73. (2010). Towards a comprehensive structural variation map of an individual human genome.
  74. (2007). Which transposable elements are active in the human genome?
  75. (2010). Whole-genome resequencing allows detection of many rare LINE-1 insertion alleles in humans. Genome Res.