56 research outputs found

    On the power and the systematic biases of the detection of chromosomal inversions by paired-end genome sequencing

    Get PDF
    One of the most used techniques to study structural variation at a genome level is paired-end mapping (PEM). PEM has the advantage of being able to detect balanced events, such as inversions and translocations. However, inversions are still quite difficult to predict reliably, especially from high-throughput sequencing data. We simulated realistic PEM experiments with different combinations of read and library fragment lengths, including sequencing errors and meaningful base-qualities, to quantify and track down the origin of false positives and negatives along sequencing, mapping, and downstream analysis. We show that PEM is very appropriate to detect a wide range of inversions, even with low coverage data. However, % of inversions located between segmental duplications are expected to go undetected by the most common sequencing strategies. In general, longer DNA libraries improve the detectability of inversions far better than increments of the coverage depth or the read length. Finally, we review the performance of three algorithms to detect inversions -SVDetect, GRIAL, and VariationHunter-, identify common pitfalls, and reveal important differences in their breakpoint precisions. These results stress the importance of the sequencing strategy for the detection of structural variants, especially inversions, and offer guidelines for the design of future genome sequencing projects

    Arm-specific dynamics of chromosome evolution in malaria mosquitoes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The malaria mosquito species of subgenus <it>Cellia </it>have rich inversion polymorphisms that correlate with environmental variables. Polymorphic inversions tend to cluster on the chromosomal arms 2R and 2L but not on X, 3R and 3L in <it>Anopheles gambiae </it>and homologous arms in other species. However, it is unknown whether polymorphic inversions on homologous chromosomal arms of distantly related species from subgenus <it>Cellia </it>nonrandomly share similar sets of genes. It is also unclear if the evolutionary breakage of inversion-poor chromosomal arms is under constraints.</p> <p>Results</p> <p>To gain a better understanding of the arm-specific differences in the rates of genome rearrangements, we compared gene orders and established syntenic relationships among <it>Anopheles gambiae, Anopheles funestus</it>, and <it>Anopheles stephensi</it>. We provided evidence that polymorphic inversions on the 2R arms in these three species nonrandomly captured similar sets of genes. This nonrandom distribution of genes was not only a result of preservation of ancestral gene order but also an outcome of extensive reshuffling of gene orders that created new combinations of homologous genes within independently originated polymorphic inversions. The statistical analysis of distribution of conserved gene orders demonstrated that the autosomal arms differ in their tolerance to generating evolutionary breakpoints. The fastest evolving 2R autosomal arm was enriched with gene blocks conserved between only a pair of species. In contrast, all identified syntenic blocks were preserved on the slowly evolving 3R arm of <it>An. gambiae </it>and on the homologous arms of <it>An. funestus </it>and <it>An. stephensi</it>.</p> <p>Conclusions</p> <p>Our results suggest that natural selection favors specific gene combinations within polymorphic inversions when distant species are exposed to similar environmental pressures. This knowledge could be useful for the discovery of genes responsible for an association of inversion polymorphisms with phenotypic variations in multiple species. Our data support the chromosomal arm specificity in rates of gene order disruption during mosquito evolution. We conclude that the distribution of breakpoint regions is evolutionary conserved on slowly evolving arms and tends to be lineage-specific on rapidly evolving arms.</p

    Functional impact and evolution of a novel human polymorphic inversion that disrupts a gene and creates a fusion transcript

    Get PDF
    Since the discovery of chromosomal inversions almost 100 years ago, how they are maintained in natural populations has been a highly debated issue. One of the hypotheses is that inversion breakpoints could affect genes and modify gene expression levels, although evidence of this came only from laboratory mutants. In humans, a few inversions have been shown to associate with expression differences, but in all cases the molecular causes have remained elusive. Here, we have carried out a complete characterization of a new human polymorphic inversion and determined that it is specific to East Asian populations. In addition, we demonstrate that it disrupts the ZNF257 gene and, through the translocation of the first exon and regulatory sequences, creates a previously nonexistent fusion transcript, which together are associated to expression changes in several other genes. Finally, we investigate the potential evolutionary and phenotypic consequences of the inversion, and suggest that it is probably deleterious. This is therefore the first example of a natural polymorphic inversion that has position effects and creates a new chimeric gene, contributing to answer an old question in evolutionary biology

    Variations on a theme: diversification of cuticular hydrocarbons in a clade of cactophilic Drosophila

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>We characterized variation and chemical composition of epicuticular hydrocarbons (CHCs) in the seven species of the <it>Drosophila buzzatii </it>cluster with gas chromatography/mass spectrometry. Despite the critical role of CHCs in providing resistance to desiccation and involvement in communication, such as courtship behavior, mating, and aggregation, few studies have investigated how CHC profiles evolve within and between species in a phylogenetic context. We analyzed quantitative differences in CHC profiles in populations of the <it>D. buzzatii </it>species cluster in order to assess the concordance of CHC differentiation with species divergence.</p> <p>Results</p> <p>Thirty-six CHC components were scored in single fly extracts with carbon chain lengths ranging from C<sub>29 </sub>to C<sub>39</sub>, including methyl-branched alkanes, <it>n</it>-alkenes, and alkadienes. Multivariate analysis of variance revealed that CHC amounts were significantly different among all species and canonical discriminant function (CDF) analysis resolved all species into distinct, non-overlapping groups. Significant intraspecific variation was found in different populations of <it>D. serido </it>suggesting that this taxon is comprised of at least two species. We summarized CHC variation using CDF analysis and mapped the first five CHC canonical variates (CVs) onto an independently derived <it>period </it>(<it>per</it>) gene + chromosome inversion + mtDNA COI gene for each sex. We found that the COI sequences were not phylogenetically informative due to introgression between some species, so only <it>per </it>+ inversion data were used. Positive phylogenetic signal was observed mainly for CV1 when parsimony methods and the test for serial independence (TFSI) were used. These results changed when no outgroup species were included in the analysis and phylogenetic signal was then observed for female CV3 and/or CV4 and male CV4 and CV5. Finally, removal of divergent populations of <it>D. serido </it>significantly increased the amount of phylogenetic signal as up to four out of five CVs then displayed positive phylogenetic signal.</p> <p>Conclusions</p> <p>CHCs were conserved among species while quantitative differences in CHC profiles between populations and species were statistically significant. Most CHCs were species-, population-, and sex-specific. Mapping CHCs onto an independently derived phylogeny revealed that a significant portion of CHC variation was explained by species' systematic affinities indicating phylogenetic conservatism in the evolution of these hydrocarbon arrays, presumptive waterproofing compounds and courtship signals as in many other drosophilid species.</p

    Prediction and estimation of effective population size

    Get PDF
    Effective population size (Ne) is a key parameter in population genetics. It has important applications in evolutionary biology, conservation genetics, and plant and animal breeding, because it measures the rates of genetic drift and inbreeding and affects the efficacy of systematic evolutionary forces such as mutation, selection and migration. We review the developments in predictive equations and estimation methodologies of effective size. In the prediction part, we focus on the equations for populations with different modes of reproduction, for populations under selection for unlinked or linked loci, and for the specific applications to conservation genetics. In the estimation part, we focus on methods developed for estimating the current or recent effective size from molecular marker or sequence data. We discuss some underdeveloped areas in predicting and estimating Ne for future research

    The adaptive significance of chromosomal inversion polymorphisms in Drosophila melanogaster

    Get PDF
    Chromosomal inversions, structural mutations that reverse a segment of a chromosome, cause suppression of recombination in the heterozygous state. Several studies have shown that inversion polymorphisms can form clines or fluctuate predictably in frequency over seasonal time spans. These observations prompted the hypothesis that chromosomal rearrangements might be subject to spatially and/or temporally varying selection. Here, we review what has been learned about the adaptive significance of inversion polymorphisms in the vinegar fly Drosophila melanogaster, the species in which they were first discovered by Sturtevant in 1917. A large body of work provides compelling evidence that several inversions in this system are adaptive; however, the precise selective mechanisms that maintain them polymorphic in natural populations remain poorly understood. Recent advances in population genomics, modelling and functional genetics promise to greatly improve our understanding of this long‐standing and fundamental problem in the near future
    corecore