37 research outputs found

    A Chromosome 7 Pericentric Inversion Defined at Single-Nucleotide Resolution Using Diagnostic Whole Genome Sequencing in a Patient with Hand-Foot-Genital Syndrome.

    Get PDF
    Next generation sequencing methodologies are facilitating the rapid characterisation of novel structural variants at nucleotide resolution. These approaches are particularly applicable to variants initially identified using alternative molecular methods. We report a child born with bilateral postaxial syndactyly of the feet and bilateral fifth finger clinodactyly. This was presumed to be an autosomal recessive syndrome, due to the family history of consanguinity. Karyotype analysis revealed a homozygous pericentric inversion of chromosome 7 (46,XX,inv(7)(p15q21)x2) which was confirmed to be heterozygous in both unaffected parents. Since the resolution of the karyotype was insufficient to identify any putatively causative gene, we undertook medium-coverage whole genome sequencing using paired-end reads, in order to elucidate the molecular breakpoints. In a two-step analysis, we first narrowed down the region by identifying discordant read-pairs, and then determined the precise molecular breakpoint by analysing the mapping locations of "soft-clipped" breakpoint-spanning reads. PCR and Sanger sequencing confirmed the identified breakpoints, both of which were located in intergenic regions. Significantly, the 7p15 breakpoint was located 523 kb upstream of HOXA13, the locus for hand-foot-genital syndrome. By inference from studies of HOXA locus control in the mouse, we suggest that the inversion has delocalised a HOXA13 enhancer to produce the phenotype observed in our patient. This study demonstrates how modern genetic diagnostic approach can characterise structural variants at nucleotide resolution and provide potential insights into functional regulation

    Arm-specific dynamics of chromosome evolution in malaria mosquitoes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The malaria mosquito species of subgenus <it>Cellia </it>have rich inversion polymorphisms that correlate with environmental variables. Polymorphic inversions tend to cluster on the chromosomal arms 2R and 2L but not on X, 3R and 3L in <it>Anopheles gambiae </it>and homologous arms in other species. However, it is unknown whether polymorphic inversions on homologous chromosomal arms of distantly related species from subgenus <it>Cellia </it>nonrandomly share similar sets of genes. It is also unclear if the evolutionary breakage of inversion-poor chromosomal arms is under constraints.</p> <p>Results</p> <p>To gain a better understanding of the arm-specific differences in the rates of genome rearrangements, we compared gene orders and established syntenic relationships among <it>Anopheles gambiae, Anopheles funestus</it>, and <it>Anopheles stephensi</it>. We provided evidence that polymorphic inversions on the 2R arms in these three species nonrandomly captured similar sets of genes. This nonrandom distribution of genes was not only a result of preservation of ancestral gene order but also an outcome of extensive reshuffling of gene orders that created new combinations of homologous genes within independently originated polymorphic inversions. The statistical analysis of distribution of conserved gene orders demonstrated that the autosomal arms differ in their tolerance to generating evolutionary breakpoints. The fastest evolving 2R autosomal arm was enriched with gene blocks conserved between only a pair of species. In contrast, all identified syntenic blocks were preserved on the slowly evolving 3R arm of <it>An. gambiae </it>and on the homologous arms of <it>An. funestus </it>and <it>An. stephensi</it>.</p> <p>Conclusions</p> <p>Our results suggest that natural selection favors specific gene combinations within polymorphic inversions when distant species are exposed to similar environmental pressures. This knowledge could be useful for the discovery of genes responsible for an association of inversion polymorphisms with phenotypic variations in multiple species. Our data support the chromosomal arm specificity in rates of gene order disruption during mosquito evolution. We conclude that the distribution of breakpoint regions is evolutionary conserved on slowly evolving arms and tends to be lineage-specific on rapidly evolving arms.</p

    A major genetic locus controlling natural Plasmodium falciparum infection is shared by East and West African Anopheles gambiae

    Get PDF
    Background: Genetic linkage mapping identified a region of chromosome 2L in the Anopheles gambiae genome that exerts major control over natural infection by Plasmodium falciparum. This 2L Plasmodium-resistance interval was mapped in mosquitoes from a natural population in Mali, West Africa, and controls the numbers of P. falciparum oocysts that develop on the vector midgut. An important question is whether genetic variation with respect to Plasmodium-resistance exists across Africa, and if so whether the same or multiple geographically distinct resistance mechanisms are responsible for the trait. Methods: To identify P falciparum resistance loci in pedigrees generated and infected in Kenya, East Africa, 28 microsatellite loci were typed across the mosquito genome. Genetic linkage mapping was used to detect significant linkage between genotype and numbers of midgut oocysts surviving to 7–8 days post-infection. Results: A major malaria-control locus was identified on chromosome 2L in East African mosquitoes, in the same apparent position originally identified from the West African population. Presence of this resistance locus explains 75% of parasite free mosquitoes. The Kenyan resistance locus is named EA_Pfin1 (East Africa_ Plasmodium falciparum Infection Intensity). Conclusion: Detection of a malaria-control locus at the same chromosomal location in both East and West African mosquitoes indicates that, to the level of genetic resolution of the analysis, the same mechanism of Plasmodium-resistance, or a mechanism controlled by the same genomic region, is found across Africa, and thus probably operates in A. gambiae throughout its entire range

    Evolutionary Dynamics of the Ty3/Gypsy LTR Retrotransposons in the Genome of Anopheles gambiae

    Get PDF
    Ty3/gypsy elements represent one of the most abundant and diverse LTR-retrotransposon (LTRr) groups in the Anopheles gambiae genome, but their evolutionary dynamics have not been explored in detail. Here, we conduct an in silico analysis of the distribution and abundance of the full complement of 1045 copies in the updated AgamP3 assembly. Chromosomal distribution of Ty3/gypsy elements is inversely related to arm length, with densities being greatest on the X, and greater on the short versus long arms of both autosomes. Taking into account the different heterochromatic and euchromatic compartments of the genome, our data suggest that the relative abundance of Ty3/gypsy LTRrs along each chromosome arm is determined mainly by the different proportions of heterochromatin, particularly pericentric heterochromatin, relative to total arm length. Additionally, the breakpoint regions of chromosomal inversion 2La appears to be a haven for LTRrs. These elements are underrepresented more than 7-fold in euchromatin, where 33% of the Ty3/gypsy copies are associated with genes. The euchromatin on chromosome 3R shows a faster turnover rate of Ty3/gypsy elements, characterized by a deficit of proviral sequences and the lowest average sequence divergence of any autosomal region analyzed in this study. This probably reflects a principal role of purifying selection against insertion for the preservation of longer conserved syntenyc blocks with adaptive importance located in 3R. Although some Ty3/gypsy LTRrs show evidence of recent activity, an important fraction are inactive remnants of relatively ancient insertions apparently subject to genetic drift. Consistent with these computational predictions, an analysis of the occupancy rate of putatively older insertions in natural populations suggested that the degenerate copies have been fixed across the species range in this mosquito, and also are shared with the sibling species Anopheles arabiensis

    Whole-Genome and Chromosome Evolution Associated with Host Adaptation and Speciation of the Wheat Pathogen Mycosphaerella graminicola

    Get PDF
    The fungus Mycosphaerella graminicola has been a pathogen of wheat since host domestication 10,000–12,000 years ago in the Fertile Crescent. The wheat-infecting lineage emerged from closely related Mycosphaerella pathogens infecting wild grasses. We use a comparative genomics approach to assess how the process of host specialization affected the genome structure of M. graminicola since divergence from the closest known progenitor species named M. graminicola S1. The genome of S1 was obtained by Illumina sequencing resulting in a 35 Mb draft genome sequence of 32X. Assembled contigs were aligned to the previously sequenced M. graminicola genome. The alignment covered >90% of the non-repetitive portion of the M. graminicola genome with an average divergence of 7%. The sequenced M. graminicola strain is known to harbor thirteen essential chromosomes plus eight dispensable chromosomes. We found evidence that structural rearrangements significantly affected the dispensable chromosomes while the essential chromosomes were syntenic. At the nucleotide level, the essential and dispensable chromosomes have evolved differently. The average synonymous substitution rate in dispensable chromosomes is considerably lower than in essential chromosomes, whereas the average non-synonymous substitution rate is three times higher. Differences in molecular evolution can be related to different transmission and recombination patterns, as well as to differences in effective population sizes of essential and dispensable chromosomes. In order to identify genes potentially involved in host specialization or speciation, we calculated ratios of synonymous and non-synonymous substitution rates in the >9,500 aligned protein coding genes. The genes are generally under strong purifying selection. We identified 43 candidate genes showing evidence of positive selection, one encoding a potential pathogen effector protein. We conclude that divergence of these pathogens was accompanied by structural rearrangements in the small dispensable chromosomes, while footprints of positive selection were present in only a small number of protein coding genes

    Segmental Duplication Implicated in the Genesis of Inversion 2Rj of Anopheles gambiae

    Get PDF
    The malaria vector Anopheles gambiae maintains high levels of inversion polymorphism that facilitate its exploitation of diverse ecological settings across tropical Africa. Molecular characterization of inversion breakpoints is a first step toward understanding the processes that generate and maintain inversions. Here we focused on inversion 2Rj because of its association with the assortatively mating Bamako chromosomal form of An. gambiae, whose distinctive breeding sites are rock pools beside the Niger River in Mali and Guinea. Sequence and computational analysis of 2Rj revealed the same 14.6 kb insertion between both breakpoints, which occurred near but not within predicted genes. Each insertion consists of 5.3 kb terminal inverted repeat arms separated by a 4 kb spacer. The insertions lack coding capacity, and are comprised of degraded remnants of repetitive sequences including class I and II transposable elements. Because of their large size and patchwork composition, and as no other instances of these insertions were identified in the An. gambiae genome, they do not appear to be transposable elements. The 14.6 kb modules inserted at both 2Rj breakpoint junctions represent low copy repeats (LCRs, also called segmental duplications) that are strongly implicated in the recent (∼0.4Ne generations) origin of 2Rj. The LCRs contribute to further genome instability, as demonstrated by an imprecise excision event at the proximal breakpoint of 2Rj in field isolates

    De Novo Transcriptome Sequencing in Anopheles funestus Using Illumina RNA-Seq Technology

    Get PDF
    BACKGROUND: Anopheles funestus is one of the primary vectors of human malaria, which causes a million deaths each year in sub-Saharan Africa. Few scientific resources are available to facilitate studies of this mosquito species and relatively little is known about its basic biology and evolution, making development and implementation of novel disease control efforts more difficult. The An. funestus genome has not been sequenced, so in order to facilitate genome-scale experimental biology, we have sequenced the adult female transcriptome of An. funestus from a newly founded colony in Burkina Faso, West Africa, using the Illumina GAIIx next generation sequencing platform. METHODOLOGY/PRINCIPAL FINDINGS: We assembled short Illumina reads de novo using a novel approach involving iterative de novo assemblies and "target-based" contig clustering. We then selected a conservative set of 15,527 contigs through comparisons to four Dipteran transcriptomes as well as multiple functional and conserved protein domain databases. Comparison to the Anopheles gambiae immune system identified 339 contigs as putative immune genes, thus identifying a large portion of the immune system that can form the basis for subsequent studies of this important malaria vector. We identified 5,434 1:1 orthologues between An. funestus and An. gambiae and found that among these 1:1 orthologues, the protein sequence of those with putative immune function were significantly more diverged than the transcriptome as a whole. Short read alignments to the contig set revealed almost 367,000 genetic polymorphisms segregating in the An. funestus colony and demonstrated the utility of the assembled transcriptome for use in RNA-seq based measurements of gene expression. CONCLUSIONS/SIGNIFICANCE: We developed a pipeline that makes de novo transcriptome sequencing possible in virtually any organism at a very reasonable cost ($6,300 in sequencing costs in our case). We anticipate that our approach could be used to develop genomic resources in a diversity of systems for which full genome sequence is currently unavailable. Our An. funestus contig set and analytical results provide a valuable resource for future studies in this non-model, but epidemiologically critical, vector insect

    Odorant-Binding Proteins of the Malaria Mosquito Anopheles funestus sensu stricto

    Get PDF
    is one of the major malaria vector species in sub-Saharan Africa. Olfaction is essential in guiding mosquito behaviors. Odorant-binding proteins (OBPs) are highly expressed in insect olfactory tissues and involved in the first step of odorant reception. An improved understanding of the function of malaria mosquito OBPs may contribute to identifying new attractants/repellents and assist in the development of more efficient and environmentally friendly mosquito controlling strategies. female antennae. To compare the absolute efficiency/potency of these chemicals, corrections were made for differences in volatility by determining the exact amount in a stimulus puff. Fourteen AfunOBP genes were cloned and their expression patterns were analyzed. AfunOBP1, 3, 7, 20 and 66 showed olfactory tissue specificity by reverse transcriptase PCR (RT-PCR). Quantitative real-time PCR (qRT-PCR) analysis showed that among olfactory-specific OBPs, AfunOBP1 and 3 are the most enriched OBPs in female antennae. Binding assay experiments showed that at pH 7, AfunOBP1 significantly binds to 2-undecanone, nonyl acetate, octyl acetate and 1-octen-3-ol but AfunOBP3, which shares 68% identify with AfunOBP1 at amino acid level, showed nearly no binding activity to the selected 12 EAG-active odorant compounds. olfactory system, and help developing new mosquito control strategies to reduce malaria transmission

    Comparative Genomics of the Anopheline Glutathione S-Transferase Epsilon Cluster

    Get PDF
    Enzymes of the glutathione S-transferase (GST) family play critical roles in detoxification of xenobiotics across many taxa. While GSTs are ubiquitous both in animals and plants, the GST epsilon class (GSTE) is insect-specific and has been associated with resistance to chemical insecticides. While both Aedes aegypti and Anopheles gambiae GSTE clusters consist of eight members, only four putative orthologs are identifiable between the species, suggesting independent expansions of the class in each lineage. We used a primer walking approach, sequencing almost the entire cluster from three Anopheles species (An. stephensi, An. funestus (both Cellia subgenus) and An. plumbeus (Anopheles subgenus)) and compared the sequences to putative orthologs in An. gambiae (Cellia) in an attempt to trace the evolution of the cluster within the subfamily Anophelinae. Furthermore, we measured transcript levels from the identified GSTE loci by real time reverse transcription PCR to determine if all genes were similarly transcribed at different life stages. Among the species investigated, gene order and orientation were similar with three exceptions: (i) GSTE1 was absent in An. plumbeus; (ii) GSTE2 is duplicated in An. plumbeus and (iii) an additional transcriptionally active pseudogene (ψAsGSTE2) was found in An. stephensi. Further statistical analysis and protein modelling gave evidence for positive selection on codons of the catalytic site in GSTE5 albeit its origin seems to predate the introduction of chemical insecticides. Gene expression profiles revealed differences in expression pattern among genes at different life stages. With the exception of GSTE1, ψAsGSTE2 and GSTE2b, all Anopheles species studied share orthologs and hence we assume that GSTE expansion generally predates radiation into subgenera, though the presence of GSTE1 may also suggest a recent duplication event in the Old World Cellia subgenus, instead of a secondary loss. The modifications of the catalytic site within GSTE5 may represent adaptations to new habitats

    Inferring selection in the Anopheles gambiae species complex: an example from immune-related serine protease inhibitors

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Mosquitoes of the <it>Anopheles gambiae </it>species complex are the primary vectors of human malaria in sub-Saharan Africa. Many host genes have been shown to affect <it>Plasmodium </it>development in the mosquito, and so are expected to engage in an evolutionary arms race with the pathogen. However, there is little conclusive evidence that any of these mosquito genes evolve rapidly, or show other signatures of adaptive evolution.</p> <p>Methods</p> <p>Three serine protease inhibitors have previously been identified as candidate immune system genes mediating mosquito-Plasmodium interaction, and serine protease inhibitors have been identified as hot-spots of adaptive evolution in other taxa. Population-genetic tests for selection, including a recent multi-gene extension of the McDonald-Kreitman test, were applied to 16 serine protease inhibitors and 16 other genes sampled from the <it>An. gambiae </it>species complex in both East and West Africa.</p> <p>Results</p> <p>Serine protease inhibitors were found to show a marginally significant trend towards higher levels of amino acid diversity than other genes, and display extensive genetic structuring associated with the 2La chromosomal inversion. However, although serpins are candidate targets for strong parasite-mediated selection, no evidence was found for rapid adaptive evolution in these genes.</p> <p>Conclusion</p> <p>It is well known that phylogenetic and population history in the <it>An. gambiae </it>complex can present special problems for the application of standard population-genetic tests for selection, and this may explain the failure of this study to detect selection acting on serine protease inhibitors. The pitfalls of uncritically applying these tests in this species complex are highlighted, and the future prospects for detecting selection acting on the <it>An. gambiae </it>genome are discussed.</p
    corecore