7 research outputs found

    A new method to reconstruct recombination events at a genomic scale

    No full text
    Recombination is one of the main forces shaping genome diversity, but the information it generates is often overlooked. A recombination event creates a junction between two parental sequences that may be transmitted to the subsequent generations. Just like mutations, these junctions carry evidence of the shared past of the sequences. We present the IRiS algorithm, which detects past recombination events from extant sequences and specifies the place of each recombination and which are the recombinants sequences. We have validated and calibrated IRiS for the human genome using coalescent simulations replicating standard human demographic history and a variable recombination rate model, and we have fine-tuned IRiS parameters to simultaneously optimize for false discovery rate, sensitivity, and accuracy in placing the recombination events in the sequence. Newer recombinations overwrite traces of past ones and our results indicate more recent recombinations are detected by IRiS with greater sensitivity. IRiS analysis of the MS32 region, previously studied using sperm typing, showed good concordance with estimated recombination rates. We also applied IRiS to haplotypes for 18 X-chromosome regions in HapMap Phase 3 populations. Recombination events detected for each individual were recoded as binary allelic states and combined into recotypes. Principal component analysis and multidimensional scaling based on recotypes reproduced the relationships between the eleven HapMap Phase III populations that can be expected from known human population history, thus further validating IRiS. We believe that our new method will contribute to the study of the distribution of recombination events across the genomes and, for the first time, it will allow the use of recombination as genetic marker to study human genetic variation.This study is part of the Genographic project (https://genographic.nationalgeographic.com/genographic/index.html), an initiative of National Geographic and IBM. Additional support comes from the Spanish Ministry of Innovation and Research grants BFU2007-63657 and SAF-2007-63171, and a scholarship to M.M. (AP2006-03268). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscrip

    Functional implications of human-specific changes in great ape microRNAs

    No full text
    microRNAs are crucial post-transcriptional regulators of gene expression involved in a wide range of biological processes. Although microRNAs are highly conserved among species, the functional implications of existing lineage-specific changes and their role in determining differences between humans and other great apes have not been specifically addressed. We analyzed the recent evolutionary history of 1,595 human microRNAs by looking at their intra- and inter-species variation in great apes using high-coverage sequenced genomes of 82 individuals including gorillas, orangutans, bonobos, chimpanzees and humans. We explored the strength of purifying selection among microRNA regions and found that the seed and mature regions are under similar and stronger constraint than the precursor region. We further constructed a comprehensive catalogue of microRNA species-specific nucleotide substitutions among great apes and, for the first time, investigated the biological relevance that human-specific changes in microRNAs may have had in great ape evolution. Expression and functional analyses of four microRNAs (miR-299-3p, miR-503-3p, miR-508-3p and miR-541-3p) revealed that lineage-specific nucleotide substitutions and changes in the length of these microRNAs alter their expression as well as the repertoires of target genes and regulatory networks. We suggest that the studied molecular changes could have modified crucial microRNA functions shaping phenotypes that, ultimately, became human-specific. Our work provides a frame to study the impact that regulatory changes may have in the recent evolution of our species.This work was funded by Spanish National Institute for Bioinformatics (www.inab.org); “Ministerio de Ciencia e Innovación de España” [grant numbers BFU2012-38236, BFU2010-18477, BFU2009-06974, CGL2009-09013 and FPI-MINECO to ITL]; “Direcció General de Recerca de la Generalitat de Catalunya” [2009SGR-1101, 2014SGR-866 and SGR2014-1311]; European Union Seventh Framework Programme [grant number PIOF-GA-2009-236836 and PIRSES-GA-2013-612583]; “Ministerio de Educación, Cultura y Deporte de España” [grant FPU-MEC to AG]; FEDER European Regional Development Fund “A way to build Europe”. Three authors of this work were employed by the funding organizations of either the “Parc Zoológic de Barcelona” or the Copenhagen Zoo. The funder “Parc Zoológic de Barcelona” provided support in the form of salaries for authors HFB and TA and the funder Copenhagen Zoo for author CH, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript

    The landscape of expression and alternative splicing variation across human traits

    No full text
    Understanding the consequences of individual transcriptome variation is fundamental to deciphering human biology and disease. We implement a statistical framework to quantify the contributions of 21 individual traits as drivers of gene expression and alternative splicing variation across 46 human tissues and 781 individuals from the Genotype-Tissue Expression project. We demonstrate that ancestry, sex, age, and BMI make additive and tissue-specific contributions to expression variability, whereas interactions are rare. Variation in splicing is dominated by ancestry and is under genetic control in most tissues, with ribosomal proteins showing a strong enrichment of tissue-shared splicing events. Our analyses reveal a systemic contribution of types 1 and 2 diabetes to tissue transcriptome variation with the strongest signal in the nerve, where histopathology image analysis identifies novel genes related to diabetic neuropathy. Our multi-tissue and multi-trait approach provides an extensive characterization of the main drivers of human transcriptome variation in health and disease.This study was funded by the HumTranscriptom project with reference PID2019-107937GA-I00. R.G.-P. was supported by a Juan de la Cierva fellowship (FJC2020-044119-I) funded by MCIN/AEI/10.13039/501100011033 and “European Union NextGenerationEU/PRTR.” J.M.R. was supported by a predoctoral fellowship from “la Caixa” Foundation (ID 100010434) with code LCF/BQ/DR22/11950022. A.R.-C. was supported by a Formación Personal Investigador (FPI) fellowship (PRE2019-090193) funded by MCIN/AEI. R.C.-G. was supported by an FPI fellowship (PRE2020-092510) funded by MCIN/AEI. M.M. was supported by a Ramon y Cajal fellowship (RYC-2017-22249). Figures 4A and S1A and the graphical abstract were created with BioRender.com. We thank the donors and their families for their generous gifts of organ donation for transplantation and tissue donations for the GTEx research project and the GTEx consortium members

    Similarity in recombination rate estimates highly correlates with genetic differentiation in humans

    No full text
    Recombination varies greatly among species, as illustrated by the poor conservation of the recombination landscape between humans and chimpanzees. Thus, shorter evolutionary time frames are needed to understand the evolution of recombination. Here, we analyze its recent evolution in humans. We calculated the recombination rates between adjacent pairs of 636,933 common single-nucleotide polymorphism loci in 28 worldwide human populations and analyzed them in relation to genetic distances between populations. We found a strong and highly significant correlation between similarity in the recombination rates corrected for effective population size and genetic differentiation between populations. This correlation is observed at the genome-wide level, but also for each chromosome and when genetic distances and recombination similarities are calculated independently from different parts of the genome. Moreover, and more relevant, this relationship is robustly maintained when considering presence/absence of recombination hotspots. Simulations show that this correlation cannot be explained by biases in the inference of recombination rates caused by haplotype sharing among similar populations. This result indicates a rapid pace of evolution of recombination, within the time span of differentiation of modern humansThis research was funded by grants BFU2007-63657, BFU2009-13409-C02-02 and SAF-2007-63171 awarded by Ministerio de Educación y Ciencia (Spain), by the Direcció General de Recerca of Generalitat de Catalunya (Grup de Recerca Consolidat 2005SGR/00608 and 2009 SGR 1101), and by the National Institute for Bioinformatics (www.inab.org), a platform of Genoma España. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscrip

    Aboriginal Australian mitochondrial genome variation – an increased understanding of population antiquity and diversity

    No full text
    Aboriginal Australians represent one of the oldest continuous cultures outside Africa, with evidence indicating that their ancestors arrived in the ancient landmass of Sahul (present-day New Guinea and Australia) ~55 thousand years ago. Genetic studies, though limited, have demonstrated both the uniqueness and antiquity of Aboriginal Australian genomes. We have further resolved known Aboriginal Australian mitochondrial haplogroups and discovered novel indigenous lineages by sequencing the mitogenomes of 127 contemporary Aboriginal Australians. In particular, the more common haplogroups observed in our dataset included M42a, M42c, S, P5 and P12, followed by rarer haplogroups M15, M16, N13, O, P3, P6 and P8. We propose some major phylogenetic rearrangements, such as in haplogroup P where we delinked P4a and P4b and redefined them as P4 (New Guinean) and P11 (Australian), respectively. Haplogroup P2b was identified as a novel clade potentially restricted to Torres Strait Islanders. Nearly all Aboriginal Australian mitochondrial haplogroups detected appear to be ancient, with no evidence of later introgression during the Holocene. Our findings greatly increase knowledge about the geographic distribution and phylogenetic structure of mitochondrial lineages that have survived in contemporary descendants of Australia’s first settlers

    Great ape genetic diversity and population history

    No full text
    Most great ape genetic variation remains uncharacterized1, 2; however, its study is critical for understanding population history3, 4, 5, 6, recombination7, selection8 and susceptibility to disease9, 10. Here we sequence to high coverage a total of 79 wild- and captive-born individuals representing all six great ape species and seven subspecies and report 88.8 million single nucleotide polymorphisms. Our analysis provides support for genetically distinct populations within each species, signals of gene flow, and the split of common chimpanzees into two distinct groups: Nigeria–Cameroon/western and central/eastern populations. We find extensive inbreeding in almost all wild populations, with eastern gorillas being the most extreme. Inferred effective population sizes have varied radically over time in different lineages and this appears to have a profound effect on the genetic diversity at, or close to, genes in almost all species. We discover and assign 1,982 loss-of-function variants throughout the human and great ape lineages, determining that the rate of gene loss has not been different in the human branch compared to other internal branches in the great ape phylogeny. This comprehensive catalogue of great ape genome diversity provides a framework for understanding evolution and a resource for more effective management of wild and captive great ape populations.We thank the following funding agencies: ERC Starting Grant (260372) to T.M.-B.;NIH grants HG002385 to E.E.E., R01_HG005226 to K.R.V., A.E.W., M.F.H., L.S. and J.D.W., GM100233 and NSF HOMINID grant 1032255 to D.R. and He.Li.;MICINN(Spain)BFU2011-28549toT.M.-B.,BFU2010-19443toJa.Be.,Spanish Government and FEDER for grants BFU2009-13409-C02-02 and BFU2012-38236 to A.N. and J.P.-M., Direcció General de Recerca, Generalitat de Catalunya (Grup de Recerca Consolidat 2009 SGR 1101) to Ja.Be., D.C., A.N. and T.M.-B.; ERC Advanced Grant (233297) and Max Planck Society to S. Paabo; Danish Council for Independent Research Natural Sciences to H.S.; Spanish Grant (CGL-2010-20170) and Zoo de Barcelona (Beca PRIC) to A.R.-H.; EUPRIM-Net to BPRC; DP1ES022577-04 NIH grant to S.A.T.; NSF Grant 0755823 to M.K.G.; P.G. is supported by the G. Harold and Leila Y. Mathers Foundation. A.N. and T.M.-B. are ICREA Research Investigators (Institut Català d’Estudis i Recerca Avançats de la Generalitat de Catalunya).J.P.-M.is supported by the Zoo de Barcelona and l’Ajuntament de Barcelona. P.H.S. is supported by an HHMI International Student Fellowship. E.E.E. is an investigator with the Howard Hughes Medical Institute

    The genome sequencing of an albino Western lowland gorilla reveals inbreeding in the wild

    No full text
    BACKGROUND: The only known albino gorilla, named Snowflake, was a male wild born individual from Equatorial Guinea who lived at the Barcelona Zoo for almost 40 years. He was diagnosed with non-syndromic oculocutaneous albinism, i.e. white hair, light eyes, pink skin, photophobia and reduced visual acuity. Despite previous efforts to explain the genetic cause, this is still unknown. Here, we study the genetic cause of his albinism and making use of whole genome sequencing data we find a higher inbreeding coefficient compared to other gorillas./n/nRESULTS: We successfully identified the causal genetic variant for Snowflake's albinism, a non-synonymous single nucleotide variant located in a transmembrane region of SLC45A2. This transporter is known to be involved in oculocutaneous albinism type 4 (OCA4) in humans. We provide experimental evidence that shows that this amino acid replacement alters the membrane spanning capability of this transmembrane region. Finally, we provide a comprehensive study of genome-wide patterns of autozygogosity revealing that Snowflake's parents were related, being this the first report of inbreeding in a wild born Western lowland gorilla./n/nCONCLUSIONS: In this study we demonstrate how the use of whole genome sequencing can be extended to link genotype and phenotype in non-model organisms and it can be a powerful tool in conservation genetics (e.g., inbreeding and genetic diversity) with the expected decrease in sequencing cost.This work was supported by an ERC Starting Grant (StG_20091118) to TM-
    corecore