145 research outputs found

    Allele-specific gene expression is widespread across the genome and biological processes. PLoS One 4

    Allele-specific gene expression (ASGE) appears to be an important factor in human phenotypic variability and, as a consequence, in the development of complex traits and diseases. To study ASGE across the human genome, we coupled genotyping with an analysis of ASGE, screening 11,500 SNPs with the Mapping 10 K Array to identify differential allelic expression. Of the 5,133 SNPs that were suitable for analysis (heterozygous in our sample and expressed in peripheral blood mononuclear cells), 2,934 (57%) showed differential allelic expression. Such SNPs were equally distributed along human chromosomes and biological processes. We validated the presence or absence of ASGE in 18 of 20 randomly selected SNPs (90%) by real-time PCR in 48 human subjects. In addition, we observed that SNPs close to, but not included in, segmental duplications had increased levels of ASGE. Finally, we found that transcripts of unknown function or non-coding RNAs also display ASGE: of a total of 2,308 intronic SNPs, 1,510 (65%) underwent differential allelic expression. In summary, ASGE is a widespread mechanism in the human genome whose regulation seems to be far more complex than expected.

    Population Genomic Analysis Reveals Differential Evolutionary Histories and Patterns of Diversity across Subgenomes and Subpopulations of Brassica napus L.

    The allotetraploid species Brassica napus L. is a global crop of major economic importance, providing canola oil (seed) and vegetables for human consumption and fodder and meal for livestock feed. Characterizing the genetic diversity present in the extant germplasm pool of B. napus is fundamental to better conserve, manage and utilize the genetic resources of this species. We used sequence-based genotyping to identify and genotype 30,881 SNPs in a diversity panel of 782 B. napus accessions, representing samples of winter and spring growth habits originating from 33 countries across Europe, Asia and America. We detected strong population structure broadly concordant with growth habit and geography, and identified three major genetic groups: spring (SP), winter Europe (WE), and winter Asia (WA). Subpopulation-specific polymorphism patterns suggest enriched genetic diversity within the WA group and a smaller effective breeding population for the SP group compared to WE. Interestingly, the two subgenomes of B. napus appear to have different geographic origins, with phylogenetic analysis placing WE and WA as basal clades for the other subpopulations in the C and A subgenomes, respectively. Finally, we identified 16 genomic regions where the patterns of diversity differed markedly from the genome-wide average, several of which are suggestive of genomic inversions. The results obtained in this study constitute a valuable resource for worldwide breeding efforts and the genetic dissection and prediction of complex B. napus traits

    Deleterious Mutation Burden and Its Association with Complex Traits in Sorghum (Sorghum bicolor)

    Sorghum (Sorghum bicolor L.) is a major food cereal for millions of people worldwide. The sorghum genome, like those of other species, accumulates deleterious mutations, likely impacting its fitness. The lack of recombination, drift, and the coupling with favorable loci impede the removal of deleterious mutations from the genome by selection. To study how deleterious variants impact phenotypes, we identified putative deleterious mutations among ∼5.5 M segregating variants of 229 diverse biomass sorghum lines. We provide the whole-genome estimate of the deleterious burden in sorghum, showing that ∼33% of nonsynonymous substitutions are putatively deleterious. The pattern of mutation burden varies appreciably among racial groups. Across racial groups, the mutation burden correlated negatively with biomass, plant height, specific leaf area (SLA), and tissue starch content (TSC), suggesting that deleterious burden decreases trait fitness. Putatively deleterious variants explain roughly one-half of the genetic variance. However, there is only moderate improvement in total heritable variance explained for biomass (7.6%) and plant height (average of 3.1% across all stages). There is no advantage in total heritable variance for SLA and TSC. The contribution of putatively deleterious variants to phenotypic diversity therefore appears to be dependent on the genetic architecture of traits. Overall, these results suggest that incorporating putatively deleterious variants into genomic models slightly improves prediction accuracy because of extensive linkage. Knowledge of deleterious variants could be leveraged for sorghum breeding through genome editing and/or conventional breeding that focuses on the selection of progeny with fewer deleterious alleles.

    Extensive Copy-Number Variation of Young Genes across Stickleback Populations

    MM received funding from the Max Planck innovation funds for this project. PGDF was supported by a Marie Curie European Reintegration Grant (proposal nr 270891). CE was supported by German Science Foundation grants (DFG, EI 841/4-1 and EI 841/6-1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript

    Molecular Phylogeny Restores the Supra-Generic Subdivision of Homoscleromorph Sponges (Porifera, Homoscleromorpha)

    Homoscleromorpha is the fourth major sponge lineage, recently recognized to be distinct from the Demospongiae. It contains <100 described species of exclusively marine sponges that have been traditionally subdivided into 7 genera based on morphological characters. Because some of the morphological features of the homoscleromorphs are shared with eumetazoans and are absent in other sponges, the phylogenetic position of the group has been investigated in several recent studies. However, the phylogenetic relationships within the group remain unexplored by modern methods. Here we describe the first molecular phylogeny of Homoscleromorpha based on nuclear (18S and 28S rDNA) and complete mitochondrial DNA sequence data that focuses on inter-generic relationships. Our results revealed two robust clades within this group, one containing the spiculate species (genera Plakina, Plakortis, Plakinastrella and Corticium) and the other containing aspiculate species (genera Oscarella and Pseudocorticium), thus rejecting a close relationship between Pseudocorticium and Corticium. Among the spiculate species, we found affinities between the Plakortis and Plakinastrella genera, and between Plakina and Corticium. The validity of these clades is further supported by specific morphological characters, notably the type of spicules. Furthermore, the monophyly of the Corticium genus is supported while the monophyly of Plakina is not. As a result of our study, we propose to restore the pre-1995 subdivision of Homoscleromorpha into two families: Plakinidae Schulze, 1880 for spiculate species and Oscarellidae Lendenfeld, 1887 for aspiculate species, a subdivision that had been rejected after the description of the genus Pseudocorticium. We also note that the two families of homoscleromorphs exhibit evolutionarily stable but drastically distinct mitochondrial genome organizations that differ in gene content and gene order.

    Costs of insensitive acetylcholinesterase insecticide resistance for the malaria vector Anopheles gambiae homozygous for the G119S mutation

    Background: The G119S mutation responsible for insensitive acetylcholinesterase resistance to organophosphate and carbamate insecticides has recently been reported from natural populations of Anopheles gambiae in West Africa. These reports suggest there are costs of resistance associated with this mutation for An. gambiae, especially for homozygous individuals, and these costs could be influential in determining the frequency of carbamate resistance in these populations. Methods: Life-history traits of the AcerKis and Kisumu strains of An. gambiae were compared following the manipulation of larval food availability in three separate experiments conducted in an insecticide-free laboratory environment. These two strains share the same genetic background, but differ in being homozygous for the presence or absence of the G119S mutation at the ace-1 locus, respectively. Results: Pupae of the resistant strain were significantly more likely to die during pupation than those of the susceptible strain. Ages at pupation were significantly earlier for the resistant strain and their dry starved weights were significantly lighter; this difference in weight remained when the two strains were matched for ages at pupation. Conclusions: The main cost of resistance found for An. gambiae mosquitoes homozygous for the G119S mutation was that they were significantly more likely to die during pupation than their susceptible counterparts, and they did so across a range of larval food conditions. Comparing the frequency of G119S in fourth instar larvae and adults emerging from the same populations would provide a way to test whether this cost of resistance is being expressed in natural populations of An. gambiae and influencing the dynamics of this resistance mutation.

    High mutation rates explain low population genetic divergence at copy-number-variable loci in Homo sapiens

    Copy-number-variable (CNV) loci differ from single nucleotide polymorphic (SNP) sites in size, mutation rate, and mechanisms of maintenance in natural populations. It is therefore hypothesized that population genetic divergence at CNV loci will differ from that found at SNP sites. Here, we test this hypothesis by analysing 856 CNV loci from the genomes of 1184 healthy individuals from 11 HapMap populations with a wide range of ancestry. The results show that population genetic divergence at the CNV loci is generally more than three times lower than at genome-wide SNP sites. Populations generally exhibit very small genetic divergence (G(st) = 0.05 ± 0.049). The smallest divergence is among African populations (G(st) = 0.0081 ± 0.0025), with increased divergence among non-African populations (G(st) = 0.0217 ± 0.0109) and then among African and non-African populations (G(st) = 0.0324 ± 0.0064). Genetic diversity is high in African populations (~0.13), low in Asian populations (~0.11), and intermediate in the remaining 11 populations. Few significant linkage disequilibria (LDs) occur between the genome-wide CNV loci. Patterns of gametic and zygotic LDs indicate the absence of epistasis among CNV loci. Mutation rate is about twice as large as the migration rate in the non-African populations, suggesting that the high mutation rates play dominant roles in producing the low population genetic divergence at CNV loci
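The divergence statistic quoted in this abstract can be illustrated with a small worked example. The sketch below assumes Nei's definition, G_st = (H_T - H_S) / H_T, applied to a single bi-allelic locus with equal population weights; the abstract does not say which estimator or weighting the authors used, and real CNV loci are often multi-allelic, so the function name `gst_biallelic` and the simplifications are illustrative assumptions only.

```python
def gst_biallelic(freqs):
    """Nei's G_st for one bi-allelic locus.

    freqs: allele frequency of one allele in each population (values in [0, 1]).
    Assumes equal population weights (a simplification, not the paper's method).
    """
    if not freqs:
        raise ValueError("need at least one population frequency")
    n = len(freqs)
    # H_S: mean expected heterozygosity within populations
    h_s = sum(2 * p * (1 - p) for p in freqs) / n
    # H_T: expected heterozygosity of the pooled (mean) allele frequency
    p_bar = sum(freqs) / n
    h_t = 2 * p_bar * (1 - p_bar)
    if h_t == 0:
        return 0.0  # locus is monomorphic overall, so no divergence is measurable
    return (h_t - h_s) / h_t


# Identical frequencies give no divergence; fixed differences give G_st = 1.
print(gst_biallelic([0.5, 0.5]))
print(gst_biallelic([0.0, 1.0]))
print(round(gst_biallelic([0.2, 0.4]), 4))
```

Values close to zero, like those reported above, correspond to populations whose allele frequencies at a locus are nearly the same.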

    Population genetic analysis of bi-allelic structural variants from low-coverage sequence data with an expectation-maximization algorithm

    Background: Population genetics and association studies usually rely on a set of known variable sites that are then genotyped in subsequent samples, because it is easier to genotype than to discover the variation. This is also true for structural variation detected from sequence data. However, the genotypes at known variable sites can only be inferred with uncertainty from low coverage data. Thus, statistical approaches that infer genotype likelihoods, test hypotheses, and estimate population parameters without requiring accurate genotypes are becoming popular. Unfortunately, the current implementations of these methods are intended to analyse only single nucleotide and short indel variation, and they usually assume that the two alleles in a heterozygous individual are sampled with equal probability. This is generally false for structural variants detected with paired ends or split reads. Therefore, the population genetics of structural variants cannot be studied, unless a painstaking and potentially biased genotyping is performed first. Results: We present svgem, an expectation-maximization implementation to estimate allele and genotype frequencies, calculate genotype posterior probabilities, and test for Hardy-Weinberg equilibrium and for population differences, from the numbers of times the alleles are observed in each individual. Although applicable to single nucleotide variation, it aims at bi-allelic structural variation of any type, observed by either split reads or paired ends, with arbitrarily high allele sampling bias. We test svgem with simulated and real data from the 1000 Genomes Project. Conclusions: svgem makes it possible to use low-coverage sequencing data to study the population distribution of structural variants without having to know their genotypes. Furthermore, this advance allows the combined analysis of structural and nucleotide variation within the same genotype-free statistical framework, thus preventing biases introduced by genotype imputation.
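The general idea of estimating genotype frequencies from per-individual allele counts by expectation-maximization can be sketched as follows. This is not svgem's code or interface: the function name `em_genotype_freqs`, the binomial read model, and the fixed parameters `het_alt_prob` (the probability of sampling the alternative allele in a heterozygote, which need not be 0.5) and `error` are all assumptions made for illustration.

```python
from math import comb


def em_genotype_freqs(counts, het_alt_prob=0.5, error=0.01, n_iter=200, tol=1e-9):
    """EM estimate of genotype frequencies from allele observation counts.

    counts: list of (ref_obs, alt_obs) pairs, one per individual.
    Returns (genotype_freqs, alt_allele_freq, posteriors), with genotypes
    ordered as hom-ref, het, hom-alt. All model parameters are assumed known.
    """
    if not counts:
        raise ValueError("need at least one individual")
    # Probability of observing the alternative allele under each genotype
    theta = (error, het_alt_prob, 1.0 - error)
    # Binomial likelihood of each individual's counts under each genotype
    lik = [[comb(r + a, a) * t ** a * (1 - t) ** r for t in theta]
           for r, a in counts]
    freqs = [1 / 3, 1 / 3, 1 / 3]
    post = []
    for _ in range(n_iter):
        # E-step: genotype posteriors given the current frequencies
        post = []
        for row in lik:
            weights = [f * l for f, l in zip(freqs, row)]
            total = sum(weights)
            post.append([w / total for w in weights])
        # M-step: new frequencies are the mean posteriors
        new = [sum(p[g] for p in post) / len(post) for g in range(3)]
        converged = max(abs(x - y) for x, y in zip(new, freqs)) < tol
        freqs = new
        if converged:
            break
    alt_freq = freqs[1] / 2 + freqs[2]
    return freqs, alt_freq, post


# Toy data: 5 likely hom-ref, 3 likely het, 2 likely hom-alt individuals.
toy = [(10, 0)] * 5 + [(5, 5)] * 3 + [(0, 10)] * 2
geno_freqs, alt_freq, _ = em_genotype_freqs(toy)
print([round(f, 3) for f in geno_freqs], round(alt_freq, 3))
```

Setting `het_alt_prob` away from 0.5 is how this sketch represents the allele sampling bias the abstract describes for paired-end and split-read evidence; no hard genotype calls are needed at any step, only the posteriors.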