83 research outputs found

    siDirect 2.0: updated software for designing functional siRNA with reduced seed-dependent off-target effect

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>RNA interference (RNAi), mediated by 21-nucleotide (nt)-length small interfering RNAs (siRNAs), is a powerful tool not only for studying gene function but also for therapeutic applications. RNAi, requiring perfect complementarity between the siRNA guide strand and the target mRNA, was believed to be extremely specific. However, a recent growing body of evidence has suggested that siRNA could down-regulate unintended genes whose transcripts possess complementarity to the 7-nt siRNA seed region. This off-target gene silencing may often provide incongruous results obtained from knockdown experiments, leading to misinterpretation. Thus, an efficient algorithm for designing functional siRNAs with minimal off-target effect based on the mechanistic features is considered of value.</p> <p>Results</p> <p>We present siDirect 2.0, an update of our web-based software siDirect, which provides functional and off-target minimized siRNA design for mammalian RNAi. The previous version of our software designed functional siRNAs by considering the relationship between siRNA sequence and RNAi activity, and provided them along with the enumeration of potential off-target gene candidates by using a fast and sensitive homology search algorithm. In the new version, the siRNA design algorithm is extensively updated to eliminate off-target effects by reflecting our recent finding that the capability of siRNA to induce off-target effect is highly correlated to the thermodynamic stability, or the melting temperature (Tm), of the seed-target duplex, which is formed between the nucleotides positioned at 2-8 from the 5' end of the siRNA guide strand and its target mRNA. Selection of siRNAs with lower seed-target duplex stabilities (benchmark Tm < 21.5°C) followed by the elimination of unrelated transcripts with nearly perfect match should minimize the off-target effects.</p> <p>Conclusion</p> <p>siDirect 2.0 provides functional, target-specific siRNA design with the updated algorithm which significantly reduces off-target silencing. When the candidate functional siRNAs could form seed-target duplexes with Tm values below 21.5°C, and their 19-nt regions spanning positions 2-20 of both strands have at least two mismatches to any other non-targeted transcripts, siDirect 2.0 can design at least one qualified siRNA for >94% of human mRNA sequences in RefSeq. siDirect 2.0 is available at <url>http://siDirect2.RNAi.jp/</url>.</p

    Establishing LA VIDA: A Community-Based Partnership to Prevent Intimate Violence against Latina Women

    Full text link
    LA VIDA—the Southwest Detroit Partnership to Prevent Intimate Violence Against Latina Women— evolved in response to community concern about the problem of intimate partner violence (IPV) and the lack of culturally competent preventive and support services for Latino women and men in southwest Detroit. Since 1997, diverse organizations have mobilized as a community-academic partnership to ensure the availability, accessibility, and utilization of IPV services. This article describes and analyzes the evolution of LA VIDA within a community-based participatory research framework using a case study approach that draws on multiple data sources including group and individual interviews and field notes. The challenges and lessons learned in addressing a complex multifaceted problem such as IPV in an ethnic minority community are highlighted in an examination of the process of mobilizing diverse organizations, conducting community diagnosis and needs assessment activities, establishing goals and objectives within a social ecological framework, and integrating evaluation during the development phase.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/66991/2/10.1177_109019819902600606.pd

    An efficient algorithm for systematic analysis of nucleotide strings suitable for siRNA design

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The "off-target" silencing effect hinders the development of siRNA-based therapeutic and research applications. Existing solutions for finding possible locations of siRNA seats within a large database of genes are either too slow, miss a portion of the targets, or are simply not designed to handle a very large number of queries. We propose a new approach that reduces the computational time as compared to existing techniques.</p> <p>Findings</p> <p>The proposed method employs tree-based storage in a form of a modified truncated suffix tree to sort all possible short string substrings within given set of strings (i.e. transcriptome). Using the new algorithm, we pre-computed a list of the best siRNA locations within each human gene ("siRNA seats"). siRNAs designed to reside within siRNA seats are less likely to hybridize off-target. These siRNA seats could be used as an input for the traditional "set-of-rules" type of siRNA designing software. The list of siRNA seats is available through a publicly available database located at <url>http://web.cos.gmu.edu/~gmanyam/siRNA_db/search.php</url></p> <p>Conclusions</p> <p>In attempt to perform top-down prediction of the human siRNA with minimized off-target hybridization, we developed an efficient algorithm that employs suffix tree based storage of the substrings. Applications of this approach are not limited to optimal siRNA design, but can also be useful for other tasks involving selection of the characteristic strings specific to individual genes. These strings could then be used as siRNA seats, as specific probes for gene expression studies by oligonucleotide-based microarrays, for the design of molecular beacon probes for Real-Time PCR and, generally, any type of PCR primers.</p

    An Introduction to RNA Databases

    Full text link
    We present an introduction to RNA databases. The history and technology behind RNA databases is briefly discussed. We examine differing methods of data collection and curation, and discuss their impact on both the scope and accuracy of the resulting databases. Finally, we demonstrate these principals through detailed examination of four leading RNA databases: Noncode, miRBase, Rfam, and SILVA.Comment: 27 pages, 10 figures, 1 tables. Submitted as a chapter for "An introduction to RNA bioinformatics" to be published by "Methods in Molecular Biology

    Comparing Artificial Neural Networks, General Linear Models and Support Vector Machines in Building Predictive Models for Small Interfering RNAs

    Get PDF
    Exogenous short interfering RNAs (siRNAs) induce a gene knockdown effect in cells by interacting with naturally occurring RNA processing machinery. However not all siRNAs induce this effect equally. Several heterogeneous kinds of machine learning techniques and feature sets have been applied to modeling siRNAs and their abilities to induce knockdown. There is some growing agreement to which techniques produce maximally predictive models and yet there is little consensus for methods to compare among predictive models. Also, there are few comparative studies that address what the effect of choosing learning technique, feature set or cross validation approach has on finding and discriminating among predictive models.Three learning techniques were used to develop predictive models for effective siRNA sequences including Artificial Neural Networks (ANNs), General Linear Models (GLMs) and Support Vector Machines (SVMs). Five feature mapping methods were also used to generate models of siRNA activities. The 2 factors of learning technique and feature mapping were evaluated by complete 3x5 factorial ANOVA. Overall, both learning techniques and feature mapping contributed significantly to the observed variance in predictive models, but to differing degrees for precision and accuracy as well as across different kinds and levels of model cross-validation.The methods presented here provide a robust statistical framework to compare among models developed under distinct learning techniques and feature sets for siRNAs. Further comparisons among current or future modeling approaches should apply these or other suitable statistically equivalent methods to critically evaluate the performance of proposed models. ANN and GLM techniques tend to be more sensitive to the inclusion of noisy features, but the SVM technique is more robust under large numbers of features for measures of model precision and accuracy. Features found to result in maximally predictive models are not consistent across learning techniques, suggesting care should be taken in the interpretation of feature relevance. In the models developed here, there are statistically differentiable combinations of learning techniques and feature mapping methods where the SVM technique under a specific combination of features significantly outperforms all the best combinations of features within the ANN and GLM techniques

    Reconsideration of In-Silico siRNA Design Based on Feature Selection: A Cross-Platform Data Integration Perspective

    Get PDF
    RNA interference via exogenous short interference RNAs (siRNA) is increasingly more widely employed as a tool in gene function studies, drug target discovery and disease treatment. Currently there is a strong need for rational siRNA design to achieve more reliable and specific gene silencing; and to keep up with the increasing needs for a wider range of applications. While progress has been made in the ability to design siRNAs with specific targets, we are clearly at an infancy stage towards achieving rational design of siRNAs with high efficacy. Among the many obstacles to overcome, lack of general understanding of what sequence features of siRNAs may affect their silencing efficacy and of large-scale homogeneous data needed to carry out such association analyses represents two challenges. To address these issues, we investigated a feature-selection based in-silico siRNA design from a novel cross-platform data integration perspective. An integration analysis of 4,482 siRNAs from ten meta-datasets was conducted for ranking siRNA features, according to their possible importance to the silencing efficacy of siRNAs across heterogeneous data sources. Our ranking analysis revealed for the first time the most relevant features based on cross-platform experiments, which compares favorably with the traditional in-silico siRNA feature screening based on the small samples of individual platform data. We believe that our feature ranking analysis can offer more creditable suggestions to help improving the design of siRNA with specific silencing targets. Data and scripts are available at http://csbl.bmb.uga.edu/publications/materials/qiliu/siRNA.html

    Computational Design of Artificial RNA Molecules For Gene Regulation

    Get PDF
    This volume provides an overview of RNA bioinformatics methodologies, including basic strategies to predict secondary and tertiary structures, and novel algorithms based on massive RNA sequencing. Interest in RNA bioinformatics has rapidly increased thanks to the recent high-throughput sequencing technologies allowing scientists to investigate complete transcriptomes at single nucleotide resolution. Adopting advanced computational technics, scientists are now able to conduct more in-depth studies and present them to you in this book. Written in the highly successful Methods of Molecular Biology series format, chapters include introductions to their respective topics, lists of the necessary materials and equipment, step-by-step, readily reproducible bioinformatics protocols, and key tips to avoid known pitfalls.Authoritative and practical, RNA Bioinformatics seeks to aid scientists in the further study of bioinformatics and computational biology of RNA

    Dynamics of Co-Transcriptional Pre-mRNA Folding Influences the Induction of Dystrophin Exon Skipping by Antisense Oligonucleotides

    Get PDF
    Antisense oligonucleotides (AONs) mediated exon skipping offers potential therapy for Duchenne muscular dystrophy. However, the identification of effective AON target sites remains unsatisfactory for lack of a precise method to predict their binding accessibility. This study demonstrates the importance of co-transcriptional pre-mRNA folding in determining the accessibility of AON target sites for AON induction of selective exon skipping in DMD. Because transcription and splicing occur in tandem, AONs must bind to their target sites before splicing factors. Furthermore, co-transcriptional pre-mRNA folding forms transient secondary structures, which redistributes accessible binding sites. In our analysis, to approximate transcription elongation, a “window of analysis” that included the entire targeted exon was shifted one nucleotide at a time along the pre-mRNA. Possible co-transcriptional secondary structures were predicted using the sequence in each step of transcriptional analysis. A nucleotide was considered “engaged” if it formed a complementary base pairing in all predicted secondary structures of a particular step. Correlation of frequency and localisation of engaged nucleotides in AON target sites accounted for the performance (efficacy and efficiency) of 94% of 176 previously reported AONs. Four novel insights are inferred: (1) the lowest frequencies of engaged nucleotides are associated with the most efficient AONs; (2) engaged nucleotides at 3′ or 5′ ends of the target site attenuate AON performance more than at other sites; (3) the performance of longer AONs is less attenuated by engaged nucleotides at 3′ or 5′ ends of the target site compared to shorter AONs; (4) engaged nucleotides at 3′ end of a short target site attenuates AON efficiency more than at 5′ end

    Improving model predictions for RNA interference activities that use support vector machine regression by combining and filtering features

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>RNA interference (RNAi) is a naturally occurring phenomenon that results in the suppression of a target RNA sequence utilizing a variety of possible methods and pathways. To dissect the factors that result in effective siRNA sequences a regression kernel Support Vector Machine (SVM) approach was used to quantitatively model RNA interference activities.</p> <p>Results</p> <p>Eight overall feature mapping methods were compared in their abilities to build SVM regression models that predict published siRNA activities. The primary factors in predictive SVM models are position specific nucleotide compositions. The secondary factors are position independent sequence motifs (<it>N</it>-grams) and guide strand to passenger strand sequence thermodynamics. Finally, the factors that are least contributory but are still predictive of efficacy are measures of intramolecular guide strand secondary structure and target strand secondary structure. Of these, the site of the 5' most base of the guide strand is the most informative.</p> <p>Conclusion</p> <p>The capacity of specific feature mapping methods and their ability to build predictive models of RNAi activity suggests a relative biological importance of these features. Some feature mapping methods are more informative in building predictive models and overall <it>t</it>-test filtering provides a method to remove some noisy features or make comparisons among datasets. Together, these features can yield predictive SVM regression models with increased predictive accuracy between predicted and observed activities both within datasets by cross validation, and between independently collected RNAi activity datasets. Feature filtering to remove features should be approached carefully in that it is possible to reduce feature set size without substantially reducing predictive models, but the features retained in the candidate models become increasingly distinct. Software to perform feature prediction and SVM training and testing on nucleic acid sequences can be found at the following site: <url>ftp://scitoolsftp.idtdna.com/SEQ2SVM/</url>.</p
    corecore