171 research outputs found

    LaSSO, a strategy for genome-wide mapping of intronic lariats and branch points using RNA-seq

    Get PDF
    Both canonical and alternative splicing of RNAs are governed by intronic sequence elements and produce transient lariat structures fastened by branch points within introns. To map precisely the location of branch points on a genomic scale, we developed LaSSO (Lariat Sequence Site Origin), a data-driven algorithm which utilizes RNA-seq data. Using fission yeast cells lacking the debranching enzyme Dbr1, LaSSO not only accurately identified canonical splicing events, but also pinpointed novel, but rare, exon-skipping events, which may reflect aberrantly spliced transcripts. Compromised intron turnover perturbed gene regulation at multiple levels, including splicing and protein translation. Notably, Dbr1 function was also critical for the expression of mitochondrial genes and for the processing of self-spliced mitochondrial introns. LaSSO showed better sensitivity and accuracy than algorithms used for computational branch-point prediction or for empirical branch-point determination. Even when applied to a human data set acquired in the presence of debranching activity, LaSSO identified both canonical and exon-skipping branch points. LaSSO thus provides an effective approach for defining high-resolution maps of branch-site sequences and intronic elements on a genomic scale. LaSSO should be useful to validate introns and uncover branch-point sequences in any eukaryote, and it could be integrated into RNA-seq pipelines

    Who knows best? A Q methodology study to explore perspectives of professional stakeholders and community participants on health in low-income communities

    Get PDF
    Abstract Background Health inequalities in the UK have proved to be stubborn, and health gaps between best and worst-off are widening. While there is growing understanding of how the main causes of poor health are perceived among different stakeholders, similar insight is lacking regarding what solutions should be prioritised. Furthermore, we do not know the relationship between perceived causes and solutions to health inequalities, whether there is agreement between professional stakeholders and people living in low-income communities or agreement within these groups. Methods Q methodology was used to identify and describe the shared perspectives (‘subjectivities’) that exist on i) why health is worse in low-income communities (‘Causes’) and ii) the ways that health could be improved in these same communities (‘Solutions’). Purposively selected individuals (n = 53) from low-income communities (n = 25) and professional stakeholder groups (n = 28) ranked ordered sets of statements – 34 ‘Causes’ and 39 ‘Solutions’ – onto quasi-normal shaped grids according to their point of view. Factor analysis was used to identify shared points of view. ‘Causes’ and ‘Solutions’ were analysed independently, before examining correlations between perspectives on causes and perspectives on solutions. Results Analysis produced three factor solutions for both the ‘Causes’ and ‘Solutions’. Broadly summarised these accounts for ‘Causes’ are: i) ‘Unfair Society’, ii) ‘Dependent, workless and lazy’, iii) ‘Intergenerational hardships’ and for ‘Solutions’: i) ‘Empower communities’, ii) ‘Paternalism’, iii) ‘Redistribution’. No professionals defined (i.e. had a significant association with one factor only) the ‘Causes’ factor ‘Dependent, workless and lazy’ and the ‘Solutions’ factor ‘Paternalism’. No community participants defined the ‘Solutions’ factor ‘Redistribution’. The direction of correlations between the two sets of factor solutions – ‘Causes’ and ‘Solutions’ – appear to be intuitive, given the accounts identified. Conclusions Despite the plurality of views there was broad agreement across accounts about issues relating to money. This is important as it points a way forward for tackling health inequalities, highlighting areas for policy and future research to focus on

    Public Managers, Media Influence, and Governance: Three Research Traditions Empirically Explored

    Get PDF
    Nowadays, media and media logic have become important and inherent elements in everyday practices of public administration and policy making. However, the logic of the media is often very different from, and conflicting with, the logic of political and administrative life. So the question of how public managers experience and deal with media attention is more relevant than ever. An analytical sketch of the literature on the relationship between public managers and media provides three main categories of literature (public relations, agenda, and mediatization tradition). These three categories are used to develop statements (so-called Q-sort statements) to capture the way public managers experience thei

    Allele Frequency–Based and Polymorphism-Versus-Divergence Indices of Balancing Selection in a New Filtered Set of Polymorphic Genes in Plasmodium falciparum

    Get PDF
    Signatures of balancing selection operating on specific gene loci in endemic pathogens can identify candidate targets of naturally acquired immunity. In malaria parasites, several leading vaccine candidates convincingly show such signatures when subjected to several tests of neutrality, but the discovery of new targets affected by selection to a similar extent has been slow. A small minority of all genes are under such selection, as indicated by a recent study of 26 Plasmodium falciparum merozoite-stage genes that were not previously prioritized as vaccine candidates, of which only one (locus PF10_0348) showed a strong signature. Therefore, to focus discovery efforts on genes that are polymorphic, we scanned all available shotgun genome sequence data from laboratory lines of P. falciparum and chose six loci with more than five single nucleotide polymorphisms per kilobase (including PF10_0348) for in-depth frequency–based analyses in a Kenyan population (allele sample sizes >50 for each locus) and comparison of Hudson–Kreitman–Aguade (HKA) ratios of population diversity (π) to interspecific divergence (K) from the chimpanzee parasite Plasmodium reichenowi. Three of these (the msp3/6-like genes PF10_0348 and PF10_0355 and the surf4.1 gene PFD1160w) showed exceptionally high positive values of Tajima's D and Fu and Li's F indices and have the highest HKA ratios, indicating that they are under balancing selection and should be prioritized for studies of their protein products as candidate targets of immunity. Combined with earlier results, there is now strong evidence that high HKA ratio (as well as the frequency-independent ratio of Watterson's θ/K) is predictive of high values of Tajima's D. Thus, the former offers value for use in genome-wide screening when numbers of genome sequences within a species are low or in combination with Tajima's D as a 2D test on large population genomic samples

    Intron Dynamics in Ribosomal Protein Genes

    Get PDF
    The role of spliceosomal introns in eukaryotic genomes remains obscure. A large scale analysis of intron presence/absence patterns in many gene families and species is a necessary step to clarify the role of these introns. In this analysis, we used a maximum likelihood method to reconstruct the evolution of 2,961 introns in a dataset of 76 ribosomal protein genes from 22 eukaryotes and validated the results by a maximum parsimony method. Our results show that the trends of intron gain and loss differed across species in a given kingdom but appeared to be consistent within subphyla. Most subphyla in the dataset diverged around 1 billion years ago, when the “Big Bang” radiation occurred. We speculate that spliceosomal introns may play a role in the explosion of many eukaryotes at the Big Bang radiation

    Repeated evolution of self-compatibility for reproductive assurance

    Get PDF
    Sexual reproduction in eukaryotes requires the fusion of two compatible gametes of opposite sexes or mating types. To meet the challenge of finding a mating partner with compatible gametes evolutionary mechanisms such as hermaphroditism and self-fertilisation have repeatedly evolved. Combining insight from comparative genomics, computer simulations and experimental evolution in fission yeast, we shed light on the conditions promoting separate mating types or self-compatibility by mating-type switching. Analogous to multiple independent transitions between switchers and non-switchers in natural populations mediated by structural genomic changes, novel switching genotypes were readily evolving under selection in experimental populations. Detailed fitness measurements accompanied by computer simulations show the benefits and costs of switching during sexual and asexual reproduction governing the occurrence of both strategies in nature. Our findings illuminate the trade-off between the benefits of reproductive assurance and its fitness costs under benign conditions governing the evolution of self-compatibility

    Intron-loss evolution of hatching enzyme genes in Teleostei

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Hatching enzyme, belonging to the astacin metallo-protease family, digests egg envelope at embryo hatching. Orthologous genes of the enzyme are found in all vertebrate genomes. Recently, we found that exon-intron structures of the genes were conserved among tetrapods, while the genes of teleosts frequently lost their introns. Occurrence of such intron losses in teleostean hatching enzyme genes is an uncommon evolutionary event, as most eukaryotic genes are generally known to be interrupted by introns and the intron insertion sites are conserved from species to species. Here, we report on extensive studies of the exon-intron structures of teleostean hatching enzyme genes for insight into how and why introns were lost during evolution.</p> <p>Results</p> <p>We investigated the evolutionary pathway of intron-losses in hatching enzyme genes of 27 species of Teleostei. Hatching enzyme genes of basal teleosts are of only one type, which conserves the 9-exon-8-intron structure of an assumed ancestor. On the other hand, otocephalans and euteleosts possess two types of hatching enzyme genes, suggesting a gene duplication event in the common ancestor of otocephalans and euteleosts. The duplicated genes were classified into two clades, clades I and II, based on phylogenetic analysis. In otocephalans and euteleosts, clade I genes developed a phylogeny-specific structure, such as an 8-exon-7-intron, 5-exon-4-intron, 4-exon-3-intron or intron-less structure. In contrast to the clade I genes, the structures of clade II genes were relatively stable in their configuration, and were similar to that of the ancestral genes. Expression analyses revealed that hatching enzyme genes were high-expression genes, when compared to that of housekeeping genes. When expression levels were compared between clade I and II genes, clade I genes tends to be expressed more highly than clade II genes.</p> <p>Conclusions</p> <p>Hatching enzyme genes evolved to lose their introns, and the intron-loss events occurred at the specific points of teleostean phylogeny. We propose that the high-expression hatching enzyme genes frequently lost their introns during the evolution of teleosts, while the low-expression genes maintained the exon-intron structure of the ancestral gene.</p

    RSpred, a set of Hidden Markov Models to detect and classify the RIFIN and STEVOR proteins of Plasmodium falciparum

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Many parasites use multicopy protein families to avoid their host's immune system through a strategy called antigenic variation. RIFIN and STEVOR proteins are variable surface antigens uniquely found in the malaria parasites <it>Plasmodium falciparum </it>and <it>P. reichenowi</it>. Although these two protein families are different, they have more similarity to each other than to any other proteins described to date. As a result, they have been grouped together in one Pfam domain. However, a recent study has described the sub-division of the RIFIN protein family into several functionally distinct groups. These sub-groups require phylogenetic analysis to sort out, which is not practical for large-scale projects, such as the sequencing of patient isolates and meta-genomic analysis.</p> <p>Results</p> <p>We have manually curated the <it>rif </it>and <it>stevor </it>gene repertoires of two <it>Plasmodium falciparum </it>genomes, isolates DD2 and HB3. We have identified 25% of mis-annotated and ~30 missing <it>rif </it>and <it>stevor </it>genes. Using these data sets, as well as sequences from the well curated reference genome (isolate 3D7) and field isolate data from Uniprot, we have developed a tool named RSpred. The tool, based on a set of hidden Markov models and an evaluation program, automatically identifies STEVOR and RIFIN sequences as well as the sub-groups: A-RIFIN, B-RIFIN, B1-RIFIN and B2-RIFIN. In addition to these groups, we distinguish a small subset of STEVOR proteins that we named STEVOR-like, as they either differ remarkably from typical STEVOR proteins or are too fragmented to reach a high enough score. When compared to Pfam and TIGRFAMs, RSpred proves to be a more robust and more sensitive method. We have applied RSpred to the proteomes of several <it>P. falciparum </it>strains, <it>P. reichenowi, P. vivax</it>, <it>P. knowlesi </it>and the rodent malaria species. All groups were found in the <it>P. falciparum </it>strains, and also in the <it>P. reichenowi </it>parasite, whereas none were predicted in the other species.</p> <p>Conclusions</p> <p>We have generated a tool for the sorting of RIFIN and STEVOR proteins, large antigenic variant protein groups, into homogeneous sub-families. Assigning functions to such protein families requires their subdivision into meaningful groups such as we have shown for the RIFIN protein family. RSpred removes the need for complicated and time consuming phylogenetic analysis methods. It will benefit both research groups sequencing whole genomes as well as others working with field isolates. RSpred is freely accessible via <url>http://www.ifm.liu.se/bioinfo/</url>.</p

    Fitness Landscape of the Fission Yeast Genome

    Get PDF
    The relationship between DNA sequence, biochemical function and molecular evolution is relatively well-described for protein-coding regions of genomes, but far less clear in non-coding regions, particularly in eukaryote genomes. In part, this is because we lack a complete description of the essential non-coding elements in a eukaryote genome. To contribute to this challenge, we used saturating transposon mutagenesis to interrogate the Schizosaccharomyces pombe genome. We generated 31 million transposon insertions, a theoretical coverage of 2.4 insertions per genomic site. We applied a five-state hidden Markov model (HMM) to distinguish insertion-depleted regions from insertion biases. Both raw insertion-density and HMM-defined fitness estimates showed significant quantitative relationships to gene knockout fitness, genetic diversity, divergence and expected functional regions based on transcription and gene annotations. Through several analyses, we conclude that transposon insertions produced fitness effects in 66-90% of the genome, including substantial portions of the non-coding regions. Based on the HMM, we estimate that 10% of the insertion depleted sites in the genome showed no signal of conservation between species and were weakly transcribed, demonstrating limitations of comparative genomics and transcriptomics to detect functional units. In this species, 3' and 5' untranslated regions were the most prominent insertion-depleted regions that were not represented in measures of constraint from comparative genomics. We conclude that the combination of transposon mutagenesis, evolutionary and biochemical data can provide new insights into the relationship between genome function and molecular evolution
    corecore