33 research outputs found

    Large introns in relation to alternative splicing and gene evolution: a case study of Drosophila bruno-3

    Get PDF
    Background: Alternative splicing (AS) of maturing mRNA can generate structurally and functionally distinct transcripts from the same gene. Recent bioinformatic analyses of available genome databases inferred a positive correlation between intron length and AS. To study the interplay between intron length and AS empirically and in more detail, we analyzed the diversity of alternatively spliced transcripts (ASTs) in the Drosophila RNA-binding Bruno-3 (Bru-3) gene. This gene was known to encode thirteen exons separated by introns of diverse sizes, ranging from 71 to 41,973 nucleotides in D. melanogaster. Although Bru-3's structure is expected to be conducive to AS, only two ASTs of this gene were previously described. Results: Cloning of RT-PCR products of the entire ORF from four species representing three diverged Drosophila lineages provided an evolutionary perspective, high sensitivity, and long-range contiguity of splice choices currently unattainable by high-throughput methods. Consequently, we identified three new exons, a new exon fragment and thirty-three previously unknown ASTs of Bru-3. All exon-skipping events in the gene were mapped to the exons surrounded by introns of at least 800 nucleotides, whereas exons split by introns of less than 250 nucleotides were always spliced contiguously in mRNA. Cases of exon loss and creation during Bru-3 evolution in Drosophila were also localized within large introns. Notably, we identified a true de novo exon gain: exon 8 was created along the lineage of the obscura group from intronic sequence between cryptic splice sites conserved among all Drosophila species surveyed. Exon 8 was included in mature mRNA by the species representing all the major branches of the obscura group. To our knowledge, the origin of exon 8 is the first documented case of exonization of intronic sequence outside vertebrates. Conclusion: We found that large introns can promote AS via exon-skipping and exon turnover during evolution likely due to frequent errors in their removal from maturing mRNA. Large introns could be a reservoir of genetic diversity, because they have a greater number of mutable sites than short introns. Taken together, gene structure can constrain and/or promote gene evolution

    Model Structure of Human APOBEC3G

    Get PDF
    BACKGROUND: APOBEC3G (apolipoprotein B mRNA-editing enzyme, catalytic polypeptide-like 3G) has antiretroviral activity associated with the hypermutation of viral DNA through cytosine deamination. APOBEC3G has two cytosine deaminase (CDA) domains; the catalytically inactive amino-terminal domain of APOBEC3G (N-CDA) carries the Vif interaction domain. There is no 3-D structure of APOBEC3G solved by X-ray or nuclear magnetic resonance. METHODOLOGY/PRINCIPAL FINDINGS: We predicted the structure of human APOBEC3G based on the crystal structure of APOBEC2. To assess the model structure, we evaluated 48 mutants of APOBEC3G N-CDA that identify novel variants altering ΔVif HIV-1 infectivity and packaging of APOBEC3G. Results indicated that the key residue D128 is exposed at the surface of the model, with a negative local electrostatic potential. Mutation D128K changes the sign of that local potential. In addition, two novel functionally relevant residues that result in defective APOBEC3G encapsidation, R122 and W127, cluster at the surface. CONCLUSIONS/SIGNIFICANCE: The structure model identifies a cluster of residues important for packaging of APOBEC3G into virions, and may serve to guide functional analysis of APOBEC3G

    A High-Resolution Map of Human Evolutionary Constraint Using 29 Mammals

    Get PDF
    The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ~4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ~60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease.National Human Genome Research Institute (U.S.)National Institute of General Medical Sciences (U.S.) (Grant number GM82901)National Science Foundation (U.S.). Postdoctural Fellowship (Award 0905968)National Science Foundation (U.S.). Career (0644282)National Institutes of Health (U.S.) (R01-HG004037)Alfred P. Sloan Foundation.Austrian Science Fund. Erwin Schrodinger Fellowshi

    An Ancient Duplication of Exon 5 in the Snap25 Gene Is Required for Complex Neuronal Development/Function

    Get PDF
    Alternative splicing is an evolutionary innovation to create functionally diverse proteins from a limited number of genes. SNAP-25 plays a central role in neuroexocytosis by bridging synaptic vesicles to the plasma membrane during regulated exocytosis. The SNAP-25 polypeptide is encoded by a single copy gene, but in higher vertebrates a duplication of exon 5 has resulted in two mutually exclusive splice variants, SNAP-25a and SNAP-25b. To address a potential physiological difference between the two SNAP-25 proteins, we generated gene targeted SNAP-25b deficient mouse mutants by replacing the SNAP-25b specific exon with a second SNAP-25a equivalent. Elimination of SNAP-25b expression resulted in developmental defects, spontaneous seizures, and impaired short-term synaptic plasticity. In adult mutants, morphological changes in hippocampus and drastically altered neuropeptide expression were accompanied by severe impairment of spatial learning. We conclude that the ancient exon duplication in the Snap25 gene provides additional SNAP-25-function required for complex neuronal processes in higher eukaryotes

    Unveiling Novel RecO Distant Orthologues Involved in Homologous Recombination

    Get PDF
    The generation of a RecA filament on single-stranded DNA is a critical step in homologous recombination. Two main pathways leading to the formation of the nucleofilament have been identified in bacteria, based on the protein complexes mediating RecA loading: RecBCD (AddAB) and RecFOR. Many bacterial species seem to lack some of the components involved in these complexes. The current annotation of the Helicobacter pylori genome suggests that this highly diverse bacterial pathogen has a reduced set of recombination mediator proteins. While it is now clear that homologous recombination plays a critical role in generating H. pylori diversity by allowing genomic DNA rearrangements and integration through transformation of exogenous DNA into the chromosome, no complete mediator complex is deduced from the sequence of its genome. Here we show by bioinformatics analysis the presence of a RecO remote orthologue that allowed the identification of a new set of RecO proteins present in all bacterial species where a RecR but not RecO was previously identified. HpRecO shares less than 15% identity with previously characterized homologues. Genetic dissection of recombination pathways shows that this novel RecO and the remote RecB homologue present in H. pylori are functional in repair and in RecA-dependent intrachromosomal recombination, defining two initiation pathways with little overlap. We found, however, that neither RecOR nor RecB contributes to transformation, suggesting the presence of a third, specialized, RecA-dependent pathway responsible for the integration of transforming DNA into the chromosome of this naturally competent bacteria. These results provide insight into the mechanisms that this successful pathogen uses to generate genetic diversity and adapt to changing environments and new hosts

    Relaxation of Selective Constraints Causes Independent Selenoprotein Extinction in Insect Genomes

    Get PDF
    BACKGROUND: Selenoproteins are a diverse family of proteins notable for the presence of the 21st amino acid, selenocysteine. Until very recently, all metazoan genomes investigated encoded selenoproteins, and these proteins had therefore been believed to be essential for animal life. Challenging this assumption, recent comparative analyses of insect genomes have revealed that some insect genomes appear to have lost selenoprotein genes. METHODOLOGY/PRINCIPAL FINDINGS: In this paper we investigate in detail the fate of selenoproteins, and that of selenoprotein factors, in all available arthropod genomes. We use a variety of in silico comparative genomics approaches to look for known selenoprotein genes and factors involved in selenoprotein biosynthesis. We have found that five insect species have completely lost the ability to encode selenoproteins and that selenoprotein loss in these species, although so far confined to the Endopterygota infraclass, cannot be attributed to a single evolutionary event, but rather to multiple, independent events. Loss of selenoproteins and selenoprotein factors is usually coupled to the deletion of the entire no-longer functional genomic region, rather than to sequence degradation and consequent pseudogenisation. Such dynamics of gene extinction are consistent with the high rate of genome rearrangements observed in Drosophila. We have also found that, while many selenoprotein factors are concomitantly lost with the selenoproteins, others are present and conserved in all investigated genomes, irrespective of whether they code for selenoproteins or not, suggesting that they are involved in additional, non-selenoprotein related functions. CONCLUSIONS/SIGNIFICANCE: Selenoproteins have been independently lost in several insect species, possibly as a consequence of the relaxation in insects of the selective constraints acting across metazoans to maintain selenoproteins. The dispensability of selenoproteins in insects may be related to the fundamental differences in antioxidant defense between these animals and other metazoans.The work described here is funded by grants from the Spanish Ministery of Education and Science and from the BioSapiens European Network of Excellence to RG. CEC is reciepient of a pre-doctoral fellowship from the Spanish Ministery of Education and Science

    Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

    Get PDF
    A finished clone-based assembly of the mouse genome reveals extensive recent sequence duplication during recent evolution and rodent-specific expansion of certain gene families. Newly assembled duplications contain protein-coding genes that are mostly involved in reproductive function
    corecore