10 research outputs found

    From cheek swabs to consensus sequences : an A to Z protocol for high-throughput DNA sequencing of complete human mitochondrial genomes

    Get PDF
    Background: Next-generation DNA sequencing (NGS) technologies have made huge impacts in many fields of biological research, but especially in evolutionary biology. One area where NGS has shown potential is for high-throughput sequencing of complete mtDNA genomes (of humans and other animals). Despite the increasing use of NGS technologies and a better appreciation of their importance in answering biological questions, there remain significant obstacles to the successful implementation of NGS-based projects, especially for new users. Results: Here we present an ‘A to Z’ protocol for obtaining complete human mitochondrial (mtDNA) genomes – from DNA extraction to consensus sequence. Although designed for use on humans, this protocol could also be used to sequence small, organellar genomes from other species, and also nuclear loci. This protocol includes DNA extraction, PCR amplification, fragmentation of PCR products, barcoding of fragments, sequencing using the 454 GS FLX platform, and a complete bioinformatics pipeline (primer removal, reference-based mapping, output of coverage plots and SNP calling). Conclusions: All steps in this protocol are designed to be straightforward to implement, especially for researchers who are undertaking next-generation sequencing for the first time. The molecular steps are scalable to large numbers (hundreds) of individuals and all steps post-DNA extraction can be carried out in 96-well plate format. Also, the protocol has been assembled so that individual ‘modules’ can be swapped out to suit available resources

    Geographic population structure analysis of worldwide human populations infers their biogeographical origins

    Get PDF
    The search for a method that utilizes biological information to predict humans’ place of origin has occupied scientists for millennia. Over the past four decades, scientists have employed genetic data in an effort to achieve this goal but with limited success. While biogeographical algorithms using next-generation sequencing data have achieved an accuracy of 700 km in Europe, they were inaccurate elsewhere. Here we describe the Geographic Population Structure (GPS) algorithm and demonstrate its accuracy with three data sets using 40,000–130,000 SNPs. GPS placed 83% of worldwide individuals in their country of origin. Applied to over 200 Sardinians villagers, GPS placed a quarter of them in their villages and most of the rest within 50 km of their villages. GPS’s accuracy and power to infer the biogeography of worldwide individuals down to their country or, in some cases, village, of origin, underscores the promise of admixture-based methods for biogeography and has ramifications for genetic ancestry testing

    Population differentiation of Southern Indian male lineages correlates with agricultural expansions predating the caste system

    Get PDF
    Christina J. Adler, Alan Cooper, Clio S.I. Der Sarkissian and Wolfgang Haak are contributors to the Genographic ConsortiumPrevious studies that pooled Indian populations from a wide variety of geographical locations, have obtained contradictory conclusions about the processes of the establishment of the Varna caste system and its genetic impact on the origins and demographic histories of Indian populations. To further investigate these questions we took advantage that both Y chromosome and caste designation are paternally inherited, and genotyped 1,680 Y chromosomes representing 12 tribal and 19 non-tribal (caste) endogamous populations from the predominantly Dravidian-speaking Tamil Nadu state in the southernmost part of India. Tribes and castes were both characterized by an overwhelming proportion of putatively Indian autochthonous Y-chromosomal haplogroups (H-M69, F-M89, R1a1-M17, L1-M27, R2-M124, and C5-M356; 81% combined) with a shared genetic heritage dating back to the late Pleistocene (10–30 Kya), suggesting that more recent Holocene migrations from western Eurasia contributed, <20% of the male lineages. We found strong evidence for genetic structure, associated primarily with the current mode of subsistence. Coalescence analysis suggested that the social stratification was established 4–6 Kya and there was little admixture during the last 3 Kya, implying a minimal genetic impact of the Varna(caste) system from the historically-documented Brahmin migrations into the area. In contrast, the overall Y-chromosomal patterns, the time depth of population diversifications and the period of differentiation were best explained by the emergence of agricultural technology in South Asia. These results highlight the utility of detailed local genetic studies within India, without prior assumptions about the importance of Varna rank status for population grouping, to obtain new insights into the relative influences of past demographic events for the population structure of the whole of modern India.GaneshPrasad ArunKumar, David F. Soria-Hernanz, Valampuri John Kavitha, Varatharajan Santhakumari Arun, Adhikarla Syama, Kumaran Samy Ashokan, Kavandanpatti Thangaraj Gandhirajan, Koothapuli Vijayakumar, Muthuswamy Narayanan, Mariakuttikan Jayalakshmi, Janet S. Ziegle, Ajay K. Royyuru, Laxmi Parida, R. Spencer Wells, Colin Renfrew, Theodore G. Schurr, Chris Tyler Smith, Daniel E. Platt, Ramasamy Pitchappan, The Genographic Consortiu

    Genetic diversity in Puerto Rico and its implications for the peopling of the Island and the West Indies

    No full text
    Puerto Rico and the surrounding islands rest on the eastern fringe of the Caribbean's Greater Antilles, located less than 100 miles northwest of the Lesser Antilles. Puerto Ricans are genetic descendants of pre-Columbian peoples, as well as peoples of European and African descent through 500 years of migration to the island. To infer these patterns of pre-Columbian and historic peopling of the Caribbean, we characterized genetic diversity in 326 individuals from the southeastern region of Puerto Rico and the island municipality of Vieques. We sequenced the mitochondrial DNA (mtDNA) control region of all of the samples and the complete mitogenomes of 12 of them to infer their putative place of origin. In addition, we genotyped 121 male samples for 25 Y-chromosome single nucleotide polymorphism and 17 STR loci. Approximately 60% of the participants had indigenous mtDNA haplotypes (mostly from haplogroups A2 and C1), while 25% had African and 15% European haplotypes. Three A2 sublineages were unique to the Greater Antilles, one of which was similar to Mesoamerican types, while C1b haplogroups showed links to South America, suggesting that people reached the island from the two distinct continental source areas. However, none of the male participants had indigenous Y-chromosomes, with 85% of them instead being European/Mediterranean and 15% sub-Saharan African in origin. West Eurasian Y-chromosome short tandem repeat haplotypes were quite diverse and showed similarities to those observed in southern Europe, North Africa and the Middle East. These results attest to the distinct, yet equally complex, pasts for the male and female ancestors of modern day Puerto Ricans.Miguel G. Vilar, Carlalynne Melendez, Akiva B. Sanders, Akshay Walia, Jill B. Gaieski, Amanda C. Owings, Theodore G. Schurr and The Genographic Consortiu

    Y-chromosome analysis reveals genetic divergence and new founding native lineages in Athapaskan- and Eskimoan-speaking populations

    No full text
    University of Adelaide consortium members: Christina J. Adler, Alan Cooper, Clio S. I. Der Sarkissian, Wolfgang Haak.For decades, the peopling of the Americas has been explored through the analysis of uniparentally inherited genetic systems in Native American populations and the comparison of these genetic data with current linguistic groupings. In northern North America, two language families predominate: Eskimo-Aleut and Na-Dene. Although the genetic evidence from nuclear and mtDNA loci suggest that speakers of these language families share a distinct biological origin, this model has not been examined using data from paternally inherited Y chromosomes. To test this hypothesis and elucidate the migration histories of Eskimoan- and Athapaskan-speaking populations, we analyzed Y-chromosomal data from Inuvialuit, Gwich’in, and Tłįchǫ populations living in the Northwest Territories of Canada. Over 100 biallelic markers and 19 chromosome short tandem repeats (STRs) were genotyped to produce a high-resolution dataset of Y chromosomes from these groups. Among these markers is an SNP discovered in the Inuvialuit that differentiates them from other Aboriginal and Native American populations. The data suggest that Canadian Eskimoan- and Athapaskan-speaking populations are genetically distinct from one another and that the formation of these groups was the result of two population expansions that occurred after the initial movement of people into the Americas. In addition, the population history of Athapaskan speakers is complex, with the Tłįchǫ being distinct from other Athapaskan groups. The high-resolution biallelic data also make clear that Y-chromosomal diversity among the first Native Americans was greater than previously recognized.Matthew C. Dulik, Amanda C. Owings, Jill B. Gaieski, Miguel G. Vilar, Alestine Andre, Crystal Lennie, Mary Adele Mackenzie, Ingrid Kritsch, Sharon Snowshoe, Ruth Wright, James Martin, Nancy Gibson, Thomas D. Andrews, Theodore G. Schurr, and The Genographic Consortiu

    Recombination networks as genetic markers in a human variation study of the Old World

    No full text
    Christina J. Adler, Alan Cooper, Clio S. I. Der Sarkissian and Wolfgang Haak are members of The Genographic ConsortiumWe have analyzed human genetic diversity in 33 Old World populations including 23 populations obtained through Genographic Project studies. A set of 1,536 SNPs in five X chromosome regions were genotyped in 1,288 individuals (mostly males). We use a novel analysis employing subARG network construction with recombining chromosomal segments. Here, a subARG is constructed independently for each of five gene-free regions across the X chromosome, and the results are aggregated across them. For PCA, MDS and ancestry inference with STRUCTURE, the subARG is processed to obtain feature vectors of samples and pairwise distances between samples. The observed population structure, estimated from the five short X chromosomal segments, supports genome-wide frequency-based analyses: African populations show higher genetic diversity, and the general trend of shared variation is seen across the globe from Africa through Middle East, Europe, Central Asia, Southeast Asia, and East Asia in broad patterns. The recombinational analysis was also compared with established methods based on SNPs and haplotypes. For haplotypes, we also employed a fixed-length approach based on information-content optimization. Our recombinational analysis suggested a southern migration route out of Africa, and it also supports a single, rapid human expansion from Africa to East Asia through South Asia.Asif Javed, Marta Melé, Marc Pybus, Pierre Zalloua, Marc Haber, David Comas, Mihai G. Netea, Oleg Balanovsky, Elena Balanovska, Li Jin, Yajun Yang, GaneshPrasad ArunKumar, Ramasamy Pitchappan, Jaume Bertranpetit, Francesc Calafell, Laxmi Parida, The Genographic Consortiu
    corecore