Search CORE

3,448 research outputs found

Novel graph based algorithms for transcriptome sequence analysis

Author: Durai Dilip
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/01/2020
Field of study

RNA-sequencing (RNA-seq) is one of the most-widely used techniques in molecular biology. A key bioinformatics task in any RNA-seq workflow is the assembling the reads. As the size of transcriptomics data sets is constantly increasing, scalable and accurate assembly approaches have to be developed.Here, we propose several approaches to improve assembling of RNA-seq data generated by second-generation sequencing technologies. We demonstrated that the systematic removal of irrelevant reads from a high coverage dataset prior to assembly, reduces runtime and improves the quality of the assembly. Further, we propose a novel RNA-seq assembly work- flow comprised of read error correction, normalization, assembly with informed parameter selection and transcript-level expression computation. In recent years, the popularity of third-generation sequencing technologies in- creased as long reads allow for accurate isoform quantification and gene-fusion detection, which is essential for biomedical research. We present a sequence-to-graph alignment method to detect and to quantify transcripts for third-generation sequencing data. Also, we propose the first gene-fusion prediction tool which is specifically tailored towards long-read data and hence achieves accurate expression estimation even on complex data sets. Moreover, our method predicted experimentally verified fusion events along with some novel events, which can be validated in the future

Universaar

Acronym

MPG.PuRe

PuFFIN--a parameter-free method to build nucleosome maps from paired-end reads.

Author: Bunnik Evelien M
Le Roch Karine G
Lonardi Stefano
Polishko Anton
Publication venue: eScholarship, University of California
Publication date: 01/01/2014
Field of study

BackgroundWe introduce a novel method, called PuFFIN, that takes advantage of paired-end short reads to build genome-wide nucleosome maps with larger numbers of detected nucleosomes and higher accuracy than existing tools. In contrast to other approaches that require users to optimize several parameters according to their data (e.g., the maximum allowed nucleosome overlap or legal ranges for the fragment sizes) our algorithm can accurately determine a genome-wide set of non-overlapping nucleosomes without any user-defined parameter. This feature makes PuFFIN significantly easier to use and prevents users from choosing the "wrong" parameters and obtain sub-optimal nucleosome maps.ResultsPuFFIN builds genome-wide nucleosome maps using a multi-scale (or multi-resolution) approach. Our algorithm relies on a set of nucleosome "landscape" functions at different resolution levels: each function represents the likelihood of each genomic location to be occupied by a nucleosome for a particular value of the smoothing parameter. After a set of candidate nucleosomes is computed for each function, PuFFIN produces a consensus set that satisfies non-overlapping constraints and maximizes the number of nucleosomes.ConclusionsWe report comprehensive experimental results that compares PuFFIN with recently published tools (NOrMAL, TEMPLATE FILTERING, and NucPosSimulator) on several synthetic datasets as well as real data for S. cerevisiae and P. falciparum. Experimental results show that our approach produces more accurate nucleosome maps with a higher number of non-overlapping nucleosomes than other tools

CiteSeerX

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli.

Author: Kim Minseung
Rai Navneet
Tagkopoulos Ilias
Zorraquino Violeta
Publication venue: eScholarship, University of California
Publication date: 01/10/2016
Field of study

A significant obstacle in training predictive cell models is the lack of integrated data sources. We develop semi-supervised normalization pipelines and perform experimental characterization (growth, transcriptional, proteome) to create Ecomics, a consistent, quality-controlled multi-omics compendium for Escherichia coli with cohesive meta-data information. We then use this resource to train a multi-scale model that integrates four omics layers to predict genome-wide concentrations and growth dynamics. The genetic and environmental ontology reconstructed from the omics data is substantially different and complementary to the genetic and chemical ontologies. The integration of different layers confers an incremental increase in the prediction performance, as does the information about the known gene regulatory and protein-protein interactions. The predictive performance of the model ranges from 0.54 to 0.87 for the various omics layers, which far exceeds various baselines. This work provides an integrative framework of omics-driven predictive modelling that is broadly applicable to guide biological discovery

PubMed Central

eScholarship - University of California

Ranking strategies to support toxicity prediction: a case study on potential LXR binders

Author: Al Sharif
Berman
Bernotas
Bernotas
Breiman
Breiman
Breiman
Brereton
Chao
Dixon
Duan
Fradera
Friesner
Friesner
Färnegårdh
Glide
Halgren
Hoerer
Hu
Hu
Hu
Hu
Hu
Jiao
Kopecky
Landesmann
Liu
Loving
Mellor
MOSES.Descriptors
National Research Council (NRC)
Pavan
Phase
Salam
Sastry
Singhaus
Steinmetz
Svensson
Szewczyk
Travins
Triantaphyllou
Truchon
Ullrich
Walker
Wrobel
Zhao
Zhu
Zuercher
Publication venue: 'Elsevier BV'
Publication date: 19/01/2019
Field of study

The current paradigm of toxicity testing is set within a framework of Mode-of-Action (MoA)/Adverse Outcome Pathway (AOP) investigations, where novel methodologies alternative to animal testing play a crucial role, and allow to consider causal links between molecular initiating events (MIEs), further key events and an adverse outcome. In silico (computational) models are developed to support toxicity assessment within the MoA/AOP framework. This paper focuses on the evaluation of potential binding to the Liver X Receptor (LXR), as this has been identified among the MIEs leading to liver steatosis within an AOP framework addressing repeated dose and target-organ toxicity

Crossref

Bradford Scholars

Leeds Beckett Repository

Recommended from our members

High-resolution microbial community reconstruction by integrating short reads from multiple 16S rRNA regions

Author: Amir Amnon
Elgart Michael
Shamir Ohad
Shental Noam
Soen Yoav
Stern Shay
Turnbaugh Peter J.
Zeisel Amit
Zuk Or
Publication venue: 'Oxford University Press (OUP)'
Publication date: 11/03/2014
Field of study

The emergence of massively parallel sequencing technology has revolutionized microbial profiling, allowing the unprecedented comparison of microbial diversity across time and space in a wide range of host-associated and environmental ecosystems. Although the high-throughput nature of such methods enables the detection of low-frequency bacteria, these advances come at the cost of sequencing read length, limiting the phylogenetic resolution possible by current methods. Here, we present a generic approach for integrating short reads from large genomic regions, thus enabling phylogenetic resolution far exceeding current methods. The approach is based on a mapping to a statistical model that is later solved as a constrained optimization problem. We demonstrate the utility of this method by analyzing human saliva and Drosophila samples, using Illumina single-end sequencing of a 750 bp amplicon of the 16S rRNA gene. Phylogenetic resolution is significantly extended while reducing the number of falsely detected bacteria, as compared with standard single-region Roche 454 Pyrosequencing. Our approach can be seamlessly applied to simultaneous sequencing of multiple genes providing a higher resolution view of the composition and activity of complex microbial communities

Harvard University - DASH

Toward a Standardized Strategy of Clinical Metabolomics for the Advancement of Precision Medicine

Author: Kang Yun Pyo
Kim Hyung Min
Kwon Sung Won
NGHI TRAN
Nguyen Hoang Anh
Nguyen Phuoc Long
PARK SANG KI
Publication venue: 'MDPI AG'
Publication date: 01/01/2020
Field of study

Despite the tremendous success, pitfalls have been observed in every step of a clinical metabolomics workflow, which impedes the internal validity of the study. Furthermore, the demand for logistics, instrumentations, and computational resources for metabolic phenotyping studies has far exceeded our expectations. In this conceptual review, we will cover inclusive barriers of a metabolomics-based clinical study and suggest potential solutions in the hope of enhancing study robustness, usability, and transferability. The importance of quality assurance and quality control procedures is discussed, followed by a practical rule containing five phases, including two additional "pre-pre-" and "post-post-" analytical steps. Besides, we will elucidate the potential involvement of machine learning and demonstrate that the need for automated data mining algorithms to improve the quality of future research is undeniable. Consequently, we propose a comprehensive metabolomics framework, along with an appropriate checklist refined from current guidelines and our previously published assessment, in the attempt to accurately translate achievements in metabolomics into clinical and epidemiological research. Furthermore, the integration of multifaceted multi-omics approaches with metabolomics as the pillar member is in urgent need. When combining with other social or nutritional factors, we can gather complete omics profiles for a particular disease. Our discussion reflects the current obstacles and potential solutions toward the progressing trend of utilizing metabolomics in clinical research to create the next-generation healthcare system.11Ysciescopu

Multidisciplinary Digital Publishing Institute

SNU Open Repository and Archive

포항공과대학교

Manually curated transcriptomics data collection for toxicogenomic assessment of engineered nanomaterials

Author: Afantitis Antreas
Federico Antonio
Greco Dario
Lynch Iseult
Melagraki Georgia
Papadiamantis Anastasios G.
Saarimäki Laura Aliisa
Serra Angela
Tsoumanis Andreas
Publication venue
Publication date: 22/07/2020
Field of study

Toxicogenomics (TGx) approaches are increasingly applied to gain insight into the possible toxicity mechanisms of engineered nanomaterials (ENMs). Omics data can be valuable to elucidate the mechanism of action of chemicals and to develop predictive models in toxicology. While vast amounts of transcriptomics data from ENM exposures have already been accumulated, a unified, easily accessible and reusable collection of transcriptomics data for ENMs is currently lacking. In an attempt to improve the FAIRness of already existing transcriptomics data for ENMs, we curated a collection of homogenized transcriptomics data from human, mouse and rat ENM exposures in vitro and in vivo including the physicochemical characteristics of the ENMs used in each study.Peer reviewe

University of Birmingham Research Portal

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Helsingin yliopiston digitaalinen arkisto