1,848 research outputs found
An Introduction to RNA Databases
We present an introduction to RNA databases. The history and technology
behind RNA databases are briefly discussed. We examine differing methods of data
collection and curation, and discuss their impact on both the scope and
accuracy of the resulting databases. Finally, we demonstrate these principles
through detailed examination of four leading RNA databases: Noncode, miRBase,
Rfam, and SILVA.
Comment: 27 pages, 10 figures, 1 table. Submitted as a chapter for "An
introduction to RNA bioinformatics" to be published by "Methods in Molecular
Biology"
Developing and applying heterogeneous phylogenetic models with XRate
Modeling sequence evolution on phylogenetic trees is a useful technique in
computational biology. Especially powerful are models which take account of the
heterogeneous nature of sequence evolution according to the "grammar" of the
encoded gene features. However, beyond a modest level of model complexity,
manual coding of models becomes prohibitively labor-intensive. We demonstrate,
via a set of case studies, the new built-in model-prototyping capabilities of
XRate (macros and Scheme extensions). These features allow rapid implementation
of phylogenetic models which would have previously been far more
labor-intensive. XRate's new capabilities for lineage-specific models,
ancestral sequence reconstruction, and improved annotation output are also
discussed. XRate's flexible model-specification capabilities and computational
efficiency make it well-suited to developing and prototyping phylogenetic
grammar models. XRate is available as part of the DART software package:
http://biowiki.org/DART.
Comment: 34 pages, 3 figures, glossary of XRate model terminology
Evolutionary distances in the twilight zone -- a rational kernel approach
Phylogenetic tree reconstruction is traditionally based on multiple sequence
alignments (MSAs) and heavily depends on the validity of this information
bottleneck. With increasing sequence divergence, the quality of MSAs decays
quickly. Alignment-free methods, on the other hand, are based on abstract
string comparisons and avoid potential alignment problems. However, in general
they are not biologically motivated and ignore our knowledge about the
evolution of sequences. Thus, it is still a major open question how to define
an evolutionary distance metric between divergent sequences that makes use of
indel information and known substitution models without the need for a multiple
alignment. Here we propose a new evolutionary distance metric to close this
gap. It uses finite-state transducers to create a biologically motivated
similarity score which models substitutions and indels, and does not depend on
a multiple sequence alignment. The sequence similarity score is defined in
analogy to pairwise alignments and additionally has the positive semi-definite
property. We describe its derivation and show in simulation studies and
real-world examples that it is more accurate in reconstructing phylogenies than
competing methods. The result is a new and accurate way of determining
evolutionary distances in and beyond the twilight zone of sequence alignments
that is suitable for large datasets.
Comment: to appear in PLoS ONE
Adjacent Nucleotide Dependence in ncRNA and Order-1 SCFG for ncRNA Identification
Background: Non-coding RNAs (ncRNAs) are known to be involved in many critical biological processes, and identification of ncRNAs is an important task in biological research. Infernal, a popular software package, is the most successful prediction tool and exhibits high sensitivity. Applications of Infernal have mainly focused on small suspected regions. We tried to apply Infernal at the chromosome level; the results have high sensitivity, yet contain many false positives. Further enhancing Infernal for chromosome-level or genome-wide study is desirable. Methodology: Based on the conjecture that adjacent nucleotide dependence affects the stability of the secondary structure of an ncRNA, we first conduct a systematic study of human ncRNAs and find that adjacent nucleotide dependence in human ncRNA should be useful for identifying ncRNAs. We then incorporate this dependence into the SCFG model and develop a new order-1 SCFG model for identifying ncRNAs. Conclusions: In our experiments on human chromosomes, the proposed new model eliminates more than 50% of the false positives reported by Infernal while maintaining the same sensitivity. The executable and the source code of programs are freely available a
An RxLR effector from Phytophthora infestans prevents re-localisation of two plant NAC transcription factors from the endoplasmic reticulum to the nucleus
The plant immune system is activated following the perception of exposed, essential and invariant microbial molecules that are recognised as non-self. A major component of plant immunity is the transcriptional induction of genes involved in a wide array of defence responses. In turn, adapted pathogens deliver effector proteins that act either inside or outside plant cells to manipulate host processes, often through their direct action on plant protein targets. To date, few effectors have been shown to directly manipulate transcriptional regulators of plant defence. Moreover, little is known generally about the modes of action of effectors from filamentous (fungal and oomycete) plant pathogens. We describe an effector, called Pi03192, from the late blight pathogen Phytophthora infestans, which interacts with a pair of host transcription factors at the endoplasmic reticulum (ER) inside plant cells. We show that these transcription factors are released from the ER to enter the nucleus, following pathogen perception, and are important in restricting disease. Pi03192 prevents the plant transcription factors from accumulating in the host nucleus, revealing a novel means of enhancing host susceptibility.
Genome re-annotation: a wiki solution?
The annotation of most genomes becomes outdated over time, owing in part to our ever-improving knowledge of genomes and in part to improvements in bioinformatics software. Unfortunately, annotation is rarely if ever updated and resources to support routine reannotation are scarce. Wiki software, which would allow many scientists to edit each genome's annotation, offers one possible solution.
Directed acyclic graph kernels for structural RNA analysis
Background: Recent discoveries of a large variety of important roles for non-coding RNAs (ncRNAs) have been reported by numerous researchers. In order to analyze ncRNAs by kernel methods including support vector machines, we propose stem kernels as an extension of string kernels for measuring the similarities between two RNA sequences from the viewpoint of secondary structures. However, applying stem kernels directly to large data sets of ncRNAs is impractical due to their computational complexity. Results: We have developed a new technique based on directed acyclic graphs (DAGs) derived from base-pairing probability matrices of RNA sequences that significantly increases the computation speed of stem kernels. Furthermore, we propose profile-profile stem kernels for multiple alignments of RNA sequences which utilize base-pairing probability matrices for multiple alignments instead of those for individual sequences. Our kernels outperformed the existing methods with respect to the detection of known ncRNAs and kernel hierarchical clustering. Conclusion: Stem kernels can be utilized as a reliable similarity measure of structural RNAs, and can be used in various kernel-based applications.
Potential conservation of circadian clock proteins in the phylum Nematoda as revealed by bioinformatic searches
Although several circadian rhythms have been described in C. elegans, its molecular clock remains elusive. In this work we employed a novel bioinformatic approach, applying probabilistic methodologies, to search for circadian clock proteins of several of the best studied circadian model organisms of different taxa (Mus musculus, Drosophila melanogaster, Neurospora crassa, Arabidopsis thaliana and Synechococcus elongatus) in the proteomes of C. elegans and other members of the phylum Nematoda. With this approach we found that the Nematoda contain proteins most related to the core and accessory proteins of the insect and mammalian clocks, which provide new insights into the nematode clock and the evolution of the circadian system.
Affiliation: Romanowski, Andrés. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigaciones Bioquímicas de Buenos Aires. Fundación Instituto Leloir. Instituto de Investigaciones Bioquímicas de Buenos Aires; Argentina. Universidad Nacional de Quilmes. Departamento de Ciencia y Tecnología. Laboratorio de Cronobiología; Argentina
Affiliation: Garavaglia, Matías Javier. Universidad Nacional de Quilmes. Departamento de Ciencia y Tecnología. Laboratorio de Ing.genética y Biolog.molecular y Celular. Area Virus de Insectos; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina
Affiliation: Goya, María Eugenia. Universidad Nacional de Quilmes. Departamento de Ciencia y Tecnología. Laboratorio de Cronobiología; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina
Affiliation: Ghiringhelli, Pablo Daniel. Universidad Nacional de Quilmes. Departamento de Ciencia y Tecnología. Laboratorio de Ing.genética y Biolog.molecular y Celular. Area Virus de Insectos; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina
Affiliation: Golombek, Diego Andres. Universidad Nacional de Quilmes. Departamento de Ciencia y Tecnología. Laboratorio de Cronobiología; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina
Managing ethnic conflict: the menu of institutional engineering
The debate on institutional engineering offers options to manage ethnic and other conflicts. This contribution systematically assesses the logic of these institutional designs and the empirical evidence on their functioning. Generally, institutions can work on ethnic conflict by either accommodating (“consociationalists”) or denying (“integrationists”) ethnicity in politics. Looking at individual and combined institutions (e.g. state structure, electoral system, forms of government), the literature review finds that most designs are theoretically ambivalent and that empirical evidence on their effectiveness is mostly inconclusive. The following questions remain open: a) Is politicized ethnicity really a conflict risk? b) What impact does the whole “menu” (not just single institutions) have? and c) How are effects conditioned by the exact nature of conflict risks?