36 research outputs found
Population genomics of sub-Saharan Drosophila melanogaster: African diversity and non-African admixture
(ABRIDGED) We report the genome sequencing of 139 wild-derived strains of D.
melanogaster, representing 22 population samples from the sub-Saharan ancestral
range of this species, along with one European population. Most genomes were
sequenced above 25X depth from haploid embryos. Results indicated a pervasive
influence of non-African admixture in many African populations, motivating the
development and application of a novel admixture detection method. Admixture
proportions varied among populations, with greater admixture in urban
locations. Admixture levels also varied across the genome, with localized peaks
and valleys suggestive of a non-neutral introgression process. Genomes from the
same location differed starkly in ancestry, suggesting that isolation
mechanisms may exist within African populations. After removing putatively
admixed genomic segments, the greatest genetic diversity was observed in
southern Africa (e.g. Zambia), while diversity in other populations was largely
consistent with a geographic expansion from this potentially ancestral region.
The European population showed different levels of diversity reduction on each
chromosome arm, and some African populations displayed chromosome arm-specific
diversity reductions. Inversions in the European sample were associated with
strong elevations in diversity across chromosome arms. Genomic scans were
conducted to identify loci that may represent targets of positive selection. A
disproportionate number of candidate selective sweep regions were located near
genes with varied roles in gene regulation. Outliers for Europe-Africa FST were
found to be enriched in genomic regions of locally elevated cosmopolitan
admixture, possibly reflecting a role for some of these loci in driving the
introgression of non-African alleles into African populations
New evidence for habitat specific selection in Wadden Sea Zostera marina populations revealed by genome scanning using SNP and microsatellite markers
Eelgrass Zostera marina is an ecosystem-engineering species of outstanding importance for coastal soft sediment habitats that lives in widely diverging habitats. Our first goal was to detect divergent selection and habitat adaptation at the molecular genetic level; hence, we compared three pairs of permanently submerged versus intertidal populations using genome scans, a genetic marker-based approach. Three different statistical approaches for outlier identification revealed divergent selection at 6 loci among 46 markers (6 SNPs, 29 EST microsatellites and 11 anonymous microsatellites). These outlier loci were repeatedly detected in parallel habitat comparisons, suggesting the influence of habitat-specific selection. A second goal was to test the consistency of the general genome scan approach by doubling the number of gene-linked microsatellites and adding single nucleotide polymorphism (SNP) loci, a novel marker type for seagrasses, compared to a previous study. Reassuringly, results with respect to selection were consistent among most marker loci. Functionally interesting marker loci were linked to genes involved in osmoregulation and water balance, suggesting different osmotic stress, and reproductive processes (seed maturation), pointing to different life history strategies. The identified outlier loci are valuable candidates for further investigation into the genetic basis of natural selection
Identifying Signatures of Natural Selection in Tibetan and Andean Populations Using Dense Genome Scan Data
High-altitude hypoxia (reduced inspired oxygen tension due to decreased barometric pressure) exerts severe physiological stress on the human body. Two high-altitude regions where humans have lived for millennia are the Andean Altiplano and the Tibetan Plateau. Populations living in these regions exhibit unique circulatory, respiratory, and hematological adaptations to life at high altitude. Although these responses have been well characterized physiologically, their underlying genetic basis remains unknown. We performed a genome scan to identify genes showing evidence of adaptation to hypoxia. We looked across each chromosome to identify genomic regions with previously unknown function with respect to altitude phenotypes. In addition, groups of genes functioning in oxygen metabolism and sensing were examined to test the hypothesis that particular pathways have been involved in genetic adaptation to altitude. Applying four population genetic statistics commonly used for detecting signatures of natural selection, we identified selection-nominated candidate genes and gene regions in these two populations (Andeans and Tibetans) separately. The Tibetan and Andean patterns of genetic adaptation are largely distinct from one another, with both populations showing evidence of positive natural selection in different genes or gene regions. Interestingly, one gene previously known to be important in cellular oxygen sensing, EGLN1 (also known as PHD2), shows evidence of positive selection in both Tibetans and Andeans. However, the pattern of variation for this gene differs between the two populations. Our results indicate that several key HIF-regulatory and targeted genes are responsible for adaptation to high altitude in Andeans and Tibetans, and several different chromosomal regions are implicated in the putative response to selection. These data suggest a genetic role in high-altitude adaption and provide a basis for future genotype/phenotype association studies necessary to confirm the role of selection-nominated candidate genes and gene regions in adaptation to altitude
A Genome Scan for Positive Selection in Thoroughbred Horses
Thoroughbred horses have been selected for exceptional racing performance resulting in system-wide structural and functional adaptations contributing to elite athletic phenotypes. Because selection has been recent and intense in a closed population that stems from a small number of founder animals Thoroughbreds represent a unique population within which to identify genomic contributions to exercise-related traits. Employing a population genetics-based hitchhiking mapping approach we performed a genome scan using 394 autosomal and X chromosome microsatellite loci and identified positively selected loci in the extreme tail-ends of the empirical distributions for (1) deviations from expected heterozygosity (Ewens-Watterson test) in Thoroughbred (n = 112) and (2) global differentiation among four geographically diverse horse populations (FST). We found positively selected genomic regions in Thoroughbred enriched for phosphoinositide-mediated signalling (3.2-fold enrichment; P<0.01), insulin receptor signalling (5.0-fold enrichment; P<0.01) and lipid transport (2.2-fold enrichment; P<0.05) genes. We found a significant overrepresentation of sarcoglycan complex (11.1-fold enrichment; P<0.05) and focal adhesion pathway (1.9-fold enrichment; P<0.01) genes highlighting the role for muscle strength and integrity in the Thoroughbred athletic phenotype. We report for the first time candidate athletic-performance genes within regions targeted by selection in Thoroughbred horses that are principally responsible for fatty acid oxidation, increased insulin sensitivity and muscle strength: ACSS1 (acyl-CoA synthetase short-chain family member 1), ACTA1 (actin, alpha 1, skeletal muscle), ACTN2 (actinin, alpha 2), ADHFE1 (alcohol dehydrogenase, iron containing, 1), MTFR1 (mitochondrial fission regulator 1), PDK4 (pyruvate dehydrogenase kinase, isozyme 4) and TNC (tenascin C). Understanding the genetic basis for exercise adaptation will be crucial for the identification of genes within the complex molecular networks underlying obesity and its consequential pathologies, such as type 2 diabetes. Therefore, we propose Thoroughbred as a novel in vivo large animal model for understanding molecular protection against metabolic disease
Finishing the euchromatic sequence of the human genome
The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead
EWS-FLI1 employs an E2F switch to drive target gene expression
Cell cycle progression is orchestrated by E2F factors. We previously reported that in ETS-driven cancers of the bone and prostate, activating E2F3 cooperates with ETS on target promoters. The mechanism of target co-regulation remained unknown. Using RNAi and time-resolved chromatin-immunoprecipitation in Ewing sarcoma we report replacement of E2F3/pRB by constitutively expressed repressive E2F4/p130 complexes on target genes upon EWS-FLI1 modulation. Using mathematical modeling we interrogated four alternative explanatory models for the observed EWS-FLI1/E2F3 cooperation based on longitudinal E2F target and regulating transcription factor expression analysis. Bayesian model selection revealed the formation of a synergistic complex between EWS-FLI1 and E2F3 as the by far most likely mechanism explaining the observed kinetics of E2F target induction. Consequently we propose that aberrant cell cycle activation in Ewing sarcoma is due to the de-repression of E2F targets as a consequence of transcriptional induction and physical recruitment of E2F3 by EWS-FLI1 replacing E2F4 on their target promoters