105 research outputs found

    General Method to Unravel Ancient Population Structures through Surnames, Final Validation on Italian Data

    Get PDF
    We analyze the geographic location of 77,451 different Italian surnames (17,579,891 individuals) obtained from the lists of telephone subscribers of the year 1993. By using a specific neural network analysis (Self-Organizing Maps, SOMs), we automatically identify the geographic origin of 49,117 different surnames. To validate the methodology, we compare the results to a study, previously conducted, on the same database, with accurate supervised methods. By comparing the results, we find an overlap of 97%, meaning that the SOMs methodology is highly reliable and well traces back the geographic origin of surnames at the time of their introduction (Late Middle Ages/Renaissance in Italy). SOMs results enables one to distinguish monophyletic surnames from polyphyletic ones, that is surnames having had a single geographic and historic origin from those that started to be in use, with an identical spelling, in different locations (respectively, 76.06% and 21.05% of the total). As we are interested in geographic origins, polyphyletic surnames are excluded from further analyses. By comparing the present location of each monophyletic surname to its inferred geographic origin in late Middle Ages/Renaissance, we measure the extent of the migrations having occurred in Italy since that time. We find that the percentage of individuals presently living in the very area where their surname started to be in use centuries ago is extremely variable (ranging from 22.77% to 77.86% according to the province), thus meaning that self-assessed regional identities seldom correspond to the autochthony they imply. For example the upper part of the Thyrennian coast (Northern Latium, Tuscany) has a strong identity but few autochthonous inhabitants (28%) having been a passageway from the North to the South of Italy

    Mitochondrial DNA Diversity in South America and the Genetic History of Andean Highlanders

    Get PDF
    We analyzed mtDNA sequence variation in 590 individuals from 18 south Amerindian populations. The spatial pattern of mtDNA diversity in these populations fits well the model proposed on the basis of Y-chromosome data. We found evidence of a differential action of genetic drift and gene flow in western and eastern populations, which has led to genetic divergence in the latter but not in the former. Although it is not possible to identify a pattern of genetic variation common to all South America, when western and eastern populations are analyzed separately, the mtDNA diversity in both regions fits the isolation-by-distance model, suggesting independent evolutionary dynamics. Maximum-likelihood estimates of divergence times between central and south Amerindian populations fall between 13,000 and 19,000 years, which is consistent with a Pleistocenic peopling of South America. Moreover, comparison of among-population variability of mtDNA and Y-chromosome DNA seems to indicate that South America is the only continent where the levels of differentiation are similar for maternal and paternal lineages

    A Mitochondrial Haplogroup is Associated with Decreased Longevity in a Historic New World Population

    Get PDF
    Interest in mitochondrial influences on extended longevity has been mounting, as demonstrated by a growing literature. Such work has demonstrated that some haplogroups are associated with increased longevity and that such associations are population-specific. Most previous work however, suffers from the methodological shortcoming that long-lived individuals are compared with “controls” who are born decades after the aged individuals were. The only true controls of the elderly are people who were born on the same time period, but who did not have extended longevity. Here we present results of a study in which we are able to test if longevity is independent of haplogroup type, controlling for time period, by using mitochondrial DNA genealogies. Since mtDNA does not recombine, we know the mtDNA haplogroup of the maternal ancestors of our living participants. Therefore, we compare the haplogroup of people with and without extended longevity, who were born during the same time period. Our sample is an admixed New World population which has haplogroups of Amerindian, European and African origin. We show that women who belong to Amerindian, European and African haplogroups do not differ in their mean longevity. Therefore, to the extent that ethnicity was tied in this population to mtDNA make up, such ethnicity did not impact longevity. In support of previous suggestions that the link between mtDNA haplogroups and longevity is specific to the population being studied, we found an association between haplogroup C and decreased longevity. Interestingly, the lifetime reproductive success and the number of grandchildren produced via a daughter of women with haplogroup C are not reduced. Our diachronic approach to the mtDNA and longevity link allowed us to determine that the same haplogroup is associated with decreased longevity during different time periods, and allowed us to compare the haplogroup of short and long-lived individuals born during the same time period. By controlling for time period, we minimize the effect of different cultural and ecological environments on differential longevity. With our diachronic approach, we investigate the mtDNA and longevity link with a biocultural perspective

    Ethnicity and Biodemographic Structure in the Arbëreshe of the Province of Cosenza, Southern Italy, in the XIX Century

    Get PDF
    Cultural and environmental factors interact in determining the genetic structure of human populations. Bio-demographic investigations of ethnic minorities are able to disentangle the influences that these two components have on the evolution of the genetic structure of a population. The ethnic minority of the Arbëreshe of the province of Cosenza (Calabria, southern Italy) is analyzed in this paper and its bio-demographic structure in the early 1800s is compared with that of neighboring Italian populations. The data derive from surnames recorded in the birth registers of the 19 Arbëreshe municipalities of the province of Cosenza and in 5 non-Arbëreshe municipalities of the same province. Isonymy and repeated pairs of surnames are used to analyze the bio-demographic structure of these populations, while analysis of isonymic relationships is used to investigate the variability between populations. Higher values of marital isonymy and subdivision into subpopulations characterize the Arbëreshe populations with respect to their non-Arbëreshe neighbors. However, the high range of variability of these parameters suggests a strong influence of geographic location on the marriage pattern of each community. At the same time, cultural differences linked to group identity had a strong impact in limiting marriage exchanges between the different ethnic groups living in the province of Cosenza in the early 1800s. In fact, the analysis of isonymic relationships demonstrates that geographic location shaped kinship patterns among the Arbëreshe communities, but it also shows that the non-Arbëreshe neighbors formed a clearly separate reproductive cluster

    Diversity in the Glucose Transporter-4 Gene (SLC2A4) in Humans Reflects the Action of Natural Selection along the Old-World Primates Evolution

    Get PDF
    BACKGROUND: Glucose is an important source of energy for living organisms. In vertebrates it is ingested with the diet and transported into the cells by conserved mechanisms and molecules, such as the trans-membrane Glucose Transporters (GLUTs). Members of this family have tissue specific expression, biochemical properties and physiologic functions that together regulate glucose levels and distribution. GLUT4 -coded by SLC2A4 (17p13) is an insulin-sensitive transporter with a critical role in glucose homeostasis and diabetes pathogenesis, preferentially expressed in the adipose tissue, heart muscle and skeletal muscle. We tested the hypothesis that natural selection acted on SLC2A4. METHODOLOGY/PRINCIPAL FINDINGS: We re-sequenced SLC2A4 and genotyped 104 SNPs along a approximately 1 Mb region flanking this gene in 102 ethnically diverse individuals. Across the studied populations (African, European, Asian and Latin-American), all the eight common SNPs are concentrated in the N-terminal region upstream of exon 7 ( approximately 3700 bp), while the C-terminal region downstream of intron 6 ( approximately 2600 bp) harbors only 6 singletons, a pattern that is not compatible with neutrality for this part of the gene. Tests of neutrality based on comparative genomics suggest that: (1) episodes of natural selection (likely a selective sweep) predating the coalescent of human lineages, within the last 25 million years, account for the observed reduced diversity downstream of intron 6 and, (2) the target of natural selection may not be in the SLC2A4 coding sequence. CONCLUSIONS: We propose that the contrast in the pattern of genetic variation between the N-terminal and C-terminal regions are signatures of the action of natural selection and thus follow-up studies should investigate the functional importance of different regions of the SLC2A4 gene

    Formal linguistics as a cue to demographic history

    Get PDF
    Beyond its theoretical success, the development of molecular genetics has brought about the possibility of extraordinary progress in the study of classification and in the inference of the evolutionary history of many species and populations. A major step forward was represented by the availability of extremely large sets of molecular data suited to quantitative and computational treatments. In this paper, we argue that even in cognitive sciences, purely theoretical progress in a discipline such as linguistics may have analogous impact. Thus, exactly on the model of molecular biology, we propose to unify two traditionally unrelated lines of linguistic investigation: 1) the formal study of syntactic variation (parameter theory) in the biolinguistic program 2) the reconstruction of relatedness among languages (phylogenetic taxonomy) The results of our linguistic analysis have thus been plotted against data from population genetics and the correlations have turned out to be largely significant: given a non-trivial set of languages/populations, the description of their variation provided by the comparison of systematic parametric analysis and molecular anthropology informatively recapitulates their history and relationships. As a result, we can claim that the reality of some parametric model of the language faculty and language acquisition/transmission (more broadly of generative grammar) receives strong and original support from its historical heuristic power. Then, on these grounds, we can begin testing Darwin's prediction that, when properly generated, the trees of human populations and of their languages should eventually turn out to be significantly parallel

    Evidence of Polygenic Adaptation to High Altitude from Tibetan and Sherpa Genomes

    Get PDF
    Although Tibetans and Sherpa present several physiological adjustments evolved to cope with selective pressures imposed by the high-altitude environment, especially hypobaric hypoxia, few selective sweeps at a limited number of hypoxia related genes were confirmed by multiple genomic studies. Nevertheless, variants at these loci were found to be associated only with downregulation of the erythropoietic cascade, which represents an indirect aspect of the considered adaptive phenotype. Accordingly, the genetic basis of Tibetan/Sherpa adaptive traits remains to be fully elucidated, in part due to limitations of selection scans implemented so far and mostly relying on the hard sweep model.In order to overcome this issue, we used whole-genome sequence data and several selection statistics as input for gene network analyses aimed at testing for the occurrence of polygenic adaptation in these high-altitude Himalayan populations. Being able to detect also subtle genomic signatures ascribable to weak positive selection at multiple genes of the same functional subnetwork, this approach allowed us to infer adaptive evolution at loci individually showing small effect sizes, but belonging to highly interconnected biological pathways overall involved in angiogenetic processes.Therefore, these findings pinpointed a series of selective events neglected so far, which likely contributed to the augmented tissue blood perfusion observed in Tibetans and Sherpa, thus uncovering the genetic determinants of a key biological mechanism that underlies their adaptation to high altitude

    Overcoming the dichotomy between open and isolated populations using genomic data from a large European dataset

    Get PDF
    Human populations are often dichotomized into "isolated" and "open" categories using cultural and/or geographical barriers to gene flow as differential criteria. Although widespread, the use of these alternative categories could obscure further heterogeneity due to inter-population differences in effective size, growth rate, and timing or amount of gene flow. We compared intra and inter-population variation measures combining novel and literature data relative to 87,818 autosomal SNPs in 14 open populations and 10 geographic and/or linguistic European isolates. Patterns of intra-population diversity were found to vary considerably more among isolates, probably due to differential levels of drift and inbreeding. The relatively large effective size estimated for some population isolates challenges the generalized view that they originate from small founding groups. Principal component scores based on measures of intra-population variation of isolated and open populations were found to be distributed along a continuum, with an area of intersection between the two groups. Patterns of inter-population diversity were even closer, as we were able to detect some differences between population groups only for a few multidimensional scaling dimensions. Therefore, different lines of evidence suggest that dichotomizing human populations into open and isolated groups fails to capture the actual relations among their genomic features

    Complex interplay between neutral and adaptive evolution shaped differential genomic background and disease susceptibility along the Italian peninsula

    Get PDF
    The Italian peninsula has long represented a natural hub for human migrations across the Mediterranean area, being involved in several prehistoric and historical population movements. Coupled with a patchy environmental landscape entailing different ecological/cultural selective pressures, this might have produced peculiar patterns of population structure and local adaptations responsible for heterogeneous genomic background of present-day Italians. To disentangle this complex scenario, genome-wide data from 780 Italian individuals were generated and set into the context of European/Mediterranean genomic diversity by comparison with genotypes from 50 populations. To maximize possibility of pinpointing functional genomic regions that have played adaptive roles during Italian natural history, our survey included also ∌250,000 exomic markers and ∌20,000 coding/regulatory variants with well-established clinical relevance. This enabled fine-grained dissection of Italian population structure through the identification of clusters of genetically homogeneous provinces and of genomic regions underlying their local adaptations. Description of such patterns disclosed crucial implications for understanding differential susceptibility to some inflammatory/autoimmune disorders, coronary artery disease and type 2 diabetes of diverse Italian subpopulations, suggesting the evolutionary causes that made some of them particularly exposed to the metabolic and immune challenges imposed by dietary and lifestyle shifts that involved western societies in the last centuries
    • 

    corecore