406 research outputs found
Combinatorial RNA Design: Designability and Structure-Approximating Algorithm
In this work, we consider the Combinatorial RNA Design problem, a minimal
instance of the RNA design problem which aims at finding a sequence that admits
a given target as its unique base pair maximizing structure. We provide
complete characterizations for the structures that can be designed using
restricted alphabets. Under a classic four-letter alphabet, we provide a
complete characterization of designable structures without unpaired bases. When
unpaired bases are allowed, we provide partial characterizations for classes of
designable/undesignable structures, and show that the class of designable
structures is closed under the stutter operation. Membership of a given
structure to any of the classes can be tested in linear time and, for positive
instances, a solution can be found in linear time. Finally, we consider a
structure-approximating version of the problem that allows to extend bands
(helices) and, assuming that the input structure avoids two motifs, we provide
a linear-time algorithm that produces a designable structure with at most twice
more base pairs than the input structure.Comment: CPM - 26th Annual Symposium on Combinatorial Pattern Matching, Jun
2015, Ischia Island, Italy. LNCS, 201
RNA secondary structure prediction from multi-aligned sequences
It has been well accepted that the RNA secondary structures of most
functional non-coding RNAs (ncRNAs) are closely related to their functions and
are conserved during evolution. Hence, prediction of conserved secondary
structures from evolutionarily related sequences is one important task in RNA
bioinformatics; the methods are useful not only to further functional analyses
of ncRNAs but also to improve the accuracy of secondary structure predictions
and to find novel functional RNAs from the genome. In this review, I focus on
common secondary structure prediction from a given aligned RNA sequence, in
which one secondary structure whose length is equal to that of the input
alignment is predicted. I systematically review and classify existing tools and
algorithms for the problem, by utilizing the information employed in the tools
and by adopting a unified viewpoint based on maximum expected gain (MEG)
estimators. I believe that this classification will allow a deeper
understanding of each tool and provide users with useful information for
selecting tools for common secondary structure predictions.Comment: A preprint of an invited review manuscript that will be published in
a chapter of the book `Methods in Molecular Biology'. Note that this version
of the manuscript may differ from the published versio
Synthetic RNA Silencing of Actinorhodin Biosynthesis in Streptomyces coelicolor A3(2)
We demonstrate the first application of synthetic RNA gene silencers in Streptomyces coelicolor A3(2). Peptide nucleic acid and expressed antisense RNA silencers successfully inhibited actinorhodin production. Synthetic RNA silencing was target-specific and is a new tool for gene regulation and metabolic engineering studies in Streptomyces.Peer reviewe
Fine-grained parallel RNAalifold algorithm for RNA secondary structure prediction on FPGA
<p>Abstract</p> <p>Background</p> <p>In the field of RNA secondary structure prediction, the RNAalifold algorithm is one of the most popular methods using free energy minimization. However, general-purpose computers including parallel computers or multi-core computers exhibit parallel efficiency of no more than 50%. Field Programmable Gate-Array (FPGA) chips provide a new approach to accelerate RNAalifold by exploiting fine-grained custom design.</p> <p>Results</p> <p>RNAalifold shows complicated data dependences, in which the dependence distance is variable, and the dependence direction is also across two dimensions. We propose a systolic array structure including one master Processing Element (PE) and multiple slave PEs for fine grain hardware implementation on FPGA. We exploit data reuse schemes to reduce the need to load energy matrices from external memory. We also propose several methods to reduce energy table parameter size by 80%.</p> <p>Conclusion</p> <p>To our knowledge, our implementation with 16 PEs is the only FPGA accelerator implementing the complete RNAalifold algorithm. The experimental results show a factor of 12.2 speedup over the RNAalifold (<it>ViennaPackage </it>– 1.6.5) software for a group of aligned RNA sequences with 2981-residue running on a Personal Computer (PC) platform with Pentium 4 2.6 GHz CPU.</p
RNAmute: RNA secondary structure mutation analysis tool
BACKGROUND: RNAMute is an interactive Java application that calculates the secondary structure of all single point mutations, given an RNA sequence, and organizes them into categories according to their similarity with respect to the wild type predicted structure. The secondary structure predictions are performed using the Vienna RNA package. Several alternatives are used for the categorization of single point mutations: Vienna's RNAdistance based on dot-bracket representation, as well as tree edit distance and second eigenvalue of the Laplacian matrix based on Shapiro's coarse grain tree graph representation. RESULTS: Selecting a category in each one of the processed tables lists all single point mutations belonging to that category. Selecting a mutation displays a graphical drawing of the single point mutation and the wild type, and includes basic information such as associated energies, representations and distances. RNAMute can be used successfully with very little previous experience and without choosing any parameter value alongside the initial RNA sequence. The package runs under LINUX operating system. CONCLUSION: RNAMute is a user friendly tool that can be used to predict single point mutations leading to conformational rearrangements in the secondary structure of RNAs. In several cases of substantial interest, notably in virology, a point mutation may lead to a loss of important functionality such as the RNA virus replication and translation initiation because of a conformational rearrangement in the secondary structure
ViennaRNA Package 2.0
<p>Abstract</p> <p>Background</p> <p>Secondary structure forms an important intermediate level of description of nucleic acids that encapsulates the dominating part of the folding energy, is often well conserved in evolution, and is routinely used as a basis to explain experimental findings. Based on carefully measured thermodynamic parameters, exact dynamic programming algorithms can be used to compute ground states, base pairing probabilities, as well as thermodynamic properties.</p> <p>Results</p> <p>The <monospace>ViennaRNA</monospace> Package has been a widely used compilation of RNA secondary structure related computer programs for nearly two decades. Major changes in the structure of the standard energy model, the <it>Turner 2004 </it>parameters, the pervasive use of multi-core CPUs, and an increasing number of algorithmic variants prompted a major technical overhaul of both the underlying <monospace>RNAlib</monospace> and the interactive user programs. New features include an expanded repertoire of tools to assess RNA-RNA interactions and restricted ensembles of structures, additional output information such as <it>centroid </it>structures and <it>maximum expected accuracy </it>structures derived from base pairing probabilities, or <it>z</it>-<it>scores </it>for locally stable secondary structures, and support for input in <monospace>fasta</monospace> format. Updates were implemented without compromising the computational efficiency of the core algorithms and ensuring compatibility with earlier versions.</p> <p>Conclusions</p> <p>The <monospace>ViennaRNA Package 2.0</monospace>, supporting concurrent computations <monospace>via OpenMP</monospace>, can be downloaded from <url>http://www.tbi.univie.ac.at/RNA</url>.</p
RNAalifold: improved consensus structure prediction for RNA alignments
<p>Abstract</p> <p>Background</p> <p>The prediction of a consensus structure for a set of related RNAs is an important first step for subsequent analyses. RNAalifold, which computes the minimum energy structure that is simultaneously formed by a set of aligned sequences, is one of the oldest and most widely used tools for this task. In recent years, several alternative approaches have been advocated, pointing to several shortcomings of the original RNAalifold approach.</p> <p>Results</p> <p>We show that the accuracy of RNAalifold predictions can be improved substantially by introducing a different, more rational handling of alignment gaps, and by replacing the rather simplistic model of covariance scoring with more sophisticated RIBOSUM-like scoring matrices. These improvements are achieved without compromising the computational efficiency of the algorithm. We show here that the new version of RNAalifold not only outperforms the old one, but also several other tools recently developed, on different datasets.</p> <p>Conclusion</p> <p>The new version of RNAalifold not only can replace the old one for almost any application but it is also competitive with other approaches including those based on SCFGs, maximum expected accuracy, or hierarchical nearest neighbor classifiers.</p
The malignant phenotype in breast cancer is driven by eIF4A1-mediated changes in the translational landscape
Human mRNA DeXD/H-box helicases are ubiquitous molecular motors that are required for the majority of cellular processes that involve RNA metabolism. One of the most abundant is eIF4A, which is required during the initiation phase of protein synthesis to unwind regions of highly structured mRNA that would otherwise impede the scanning ribosome. Dysregulation of protein synthesis is associated with tumorigenesis, but little is known about the detailed relationships between RNA helicase function and the malignant phenotype in solid malignancies. Therefore, immunohistochemical analysis was performed on over 3000 breast tumors to investigate the relationship among expression of eIF4A1, the helicase-modulating proteins eIF4B, eIF4E and PDCD4, and clinical outcome. We found eIF4A1, eIF4B and eIF4E to be independent predictors of poor outcome in ER-negative disease, while in contrast, the eIF4A1 inhibitor PDCD4 was related to improved outcome in ER-positive breast cancer. Consistent with these data, modulation of eIF4A1, eIF4B and PCDC4 expression in cultured MCF7 cells all restricted breast cancer cell growth and cycling. The eIF4A1-dependent translatome of MCF7 cells was defined by polysome profiling, and was shown to be highly enriched for several classes of oncogenic genes, including G-protein constituents, cyclins and protein kinases, and for mRNAs with G/C-rich 5′UTRs with potential to form G-quadruplexes and with 3′UTRs containing microRNA target sites. Overall, our data show that dysregulation of mRNA unwinding contributes to the malignant phenotype in breast cancer via preferential translation of a class of genes involved in pro-oncogenic signaling at numerous levels. Furthermore, immunohistochemical tests are promising biomarkers for tumors sensitive to anti-helicase therapies
The evolutionary dynamics of microRNAs in domestic mammals
MiRNAs are crucial regulators of gene expression found across both the plant and animal kingdoms. While the number of annotated miRNAs deposited in miRBase has greatly increased in recent years, few studies provided comparative analyses across sets of related species, or investigated the role of miRNAs in the evolution of gene regulation. We generated small RNA libraries across 5 mammalian species (cow, dog, horse, pig and rabbit) from 4 different tissues (brain, heart, kidney and testis). We identified 1676 miRBase and 413 novel miRNAs by manually curating the set of computational predictions obtained from miRCat and miRDeep2. Our dataset spanning five species has enabled us to investigate the molecular mechanisms and selective pressures driving the evolution of miRNAs in mammals. We highlight the important contributions of intronic sequences (366 orthogroups), duplication events (135 orthogroups) and repetitive elements (37 orthogroups) in the emergence of new miRNA loci. We use this framework to estimate the patterns of gains and losses across the phylogeny, and observe high levels of miRNA turnover. Additionally, the identification of lineage-specific losses enables the characterisation of the selective constraints acting on the associated target sites. Compared to the miRBase subset, novel miRNAs tend to be more tissue specific. 20 percent of novel orthogroups are restricted to the brain, and their target repertoires appear to be enriched for neuron activity and differentiation processes. These findings may reflect an important role for young miRNAs in the evolution of brain expression plasticity. Many seed sequences appear to be specific to either the cow or the dog. Analyses on the associated targets highlight the presence of several genes under artificial positive selection, suggesting an involvement of these miRNAs in the domestication process. Altogether, we provide an overview on the evolutionary mechanisms responsible for miRNA turnover in 5 domestic species, and their possible contribution to the evolution of gene regulation
The linked units of 5S rDNA and U1 snDNA of razor shells (Mollusca: Bivalvia: Pharidae)
[Abstract] The linkage between 5S ribosomal DNA and other multigene families has been detected in many eukaryote lineages, but whether it provides any selective advantage remains unclear. In this work, we report the occurrence of linked units of 5S ribosomal DNA (5S rDNA) and U1 small nuclear DNA (U1 snDNA) in 10 razor shell species (Mollusca: Bivalvia: Pharidae) from four different genera. We obtained several clones containing partial or complete repeats of both multigene families in which both types of genes displayed the same orientation. We provide a comprehensive collection of razor shell 5S rDNA clones, both with linked and nonlinked organisation, and the first bivalve U1 snDNA sequences. We predicted the secondary structures and characterised the upstream and downstream conserved elements, including a region at −25 nucleotides from both 5S rDNA and U1 snDNA transcription start sites. The analysis of 5S rDNA showed that some nontranscribed spacers (NTSs) are more closely related to NTSs from other species (and genera) than to NTSs from the species they were retrieved from, suggesting birth-and-death evolution and ancestral polymorphism. Nucleotide conservation within the functional regions suggests the involvement of purifying selection, unequal crossing-overs and gene conversions. Taking into account this and other studies, we discuss the possible mechanisms by which both multigene families could have become linked in the Pharidae lineage. The reason why 5S rDNA is often found linked to other multigene families seems to be the result of stochastic processes within genomes in which its high copy number is determinan
- …
