291 research outputs found

Network-Based Interpretation of Diverse High-Throughput Datasets through the Omics Integrator Software Package

Author: Fraenkel Ernest
Gitter Anthony
Gosline Sara Calafell
Kedaigle Amanda Joy
Soltis Anthony Robert
Tuncbag Nurcan
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/07/2015

High-throughput, ‘omic’ methods provide sensitive measures of biological responses to perturbations. However, inherent biases in high-throughput assays make it difficult to interpret experiments in which more than one type of data is collected. In this work, we introduce Omics Integrator, a software package that takes a variety of ‘omic’ data as input and identifies putative underlying molecular pathways. The approach applies advanced network optimization algorithms to a network of thousands of molecular interactions to find high-confidence, interpretable subnetworks that best explain the data. These subnetworks connect changes observed in gene expression, protein abundance or other global assays to proteins that may not have been measured in the screens due to inherent bias or noise in measurement. This approach reveals unannotated molecular pathways that would not be detectable by searching pathway databases. Omics Integrator also provides an elegant framework to incorporate not only positive data, but also negative evidence. Incorporating negative evidence allows Omics Integrator to avoid unexpressed genes and avoid being biased toward highly-studied hub proteins, except when they are strongly implicated by the data. The software comprises two individual tools, Garnet and Forest, that can be run together or independently to allow a user to perform advanced integration of multiple types of high-throughput data as well as create condition-specific subnetworks of protein interactions that best connect the observed changes in various datasets. It is available at http://fraenkel.mit.edu/omicsintegrator and on GitHub at https://github.com/fraenkel-lab/OmicsIntegrator.

Funding: National Institutes of Health (U.S.) (grant U54CA112967); National Institutes of Health (U.S.) (grant U01CA184898); National Institutes of Health (U.S.) (grant U54NS091046); National Institutes of Health (U.S.) (grant R01GM089903)

DSpace@MIT

PCSF: An R-package for network-based interpretation of high-throughput data

Author: Akhmedov Murodzhon
Bertoni Francesco
Chong Renan Escalante
Fraenkel Ernest
Kedaigle Amanda
Kwee Ivo
Montemanni Roberto
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2017

With recent technological developments, a vast amount of high-throughput data has been profiled to understand the mechanisms of complex diseases. The current bioinformatics challenge is to interpret the data and underlying biology, where efficient algorithms for analyzing heterogeneous high-throughput data using biological networks are becoming increasingly valuable. In this paper, we propose a software package based on the Prize-collecting Steiner Forest graph optimization approach. The PCSF package performs fast and user-friendly network analysis of high-throughput data by mapping the data onto biological networks such as protein-protein interaction, gene-gene interaction or any other correlation- or coexpression-based networks. Using the interaction networks as a template, it determines high-confidence subnetworks relevant to the data, which potentially leads to predictions of functional units. It also interactively visualizes the resulting subnetwork with functional enrichment analysis.

DSpace@MIT

Directory of Open Access Journals

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

FigShare

Graph algorithms for predicting subcellular localization at the pathway level

Author: Gitter Anthony
Magnano Chris S.
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 12/12/2022

Protein subcellular localization is an important factor in normal cellular processes and disease. While many protein localization resources treat it as static, protein localization is dynamic and heavily influenced by biological context. Biological pathways are graphs that represent a specific biological context and can be inferred from large-scale data. We develop graph algorithms to predict the localization of all interactions in a biological pathway as an edge-labeling task. We compare a variety of models including graph neural networks, probabilistic graphical models, and discriminative classifiers for predicting localization annotations from curated pathway databases. We also perform a case study where we construct biological pathways and predict localizations of human fibroblasts undergoing viral infection. Pathway localization prediction is a promising approach for integrating publicly available localization data into the analysis of large-scale biological data.

Comment: 35 pages, 14 figures

arXiv.org e-Print Archive

ANIMA: Association Network Integration for Multiscale Analysis

Author: Deffur A
Mayosi B
Mulder N
Wilkinson R
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 10/02/2018

Contextual functional interpretation of -omics data derived from clinical samples is a classical and difficult problem in computational systems biology. The measurement of thousands of datapoints on single samples has become routine, but relating ‘big data’ datasets to the complexities of human pathobiology is an area of ongoing research. Complicating this is the fact that many publicly available datasets use bulk transcriptomics data from complex tissues like blood. The most prevalent analytic approaches derive molecular ‘signatures’ of disease states or apply modular analysis frameworks to the data. Here we show, with a network-based data integration method that takes clinical phenotype and microarray data as inputs, that we can reconstruct multiple features (or endophenotypes) of disease states at various scales of organization, from transcript abundance patterns of individual genes through co-expression patterns of groups of genes to patterns of cellular behavior in whole blood samples, both in single experiments and in a meta-analysis of multiple datasets.

Spiral - Imperial College Digital Repository

Computational approaches for network-based integrative multi-omics analysis

Author: 't Hoen Peter A. C.
Agamah Francis E.
Bayjanov Jumamurat R.
Chimusa Emile Rugamika
Ederveen Thomas H. A.
Mazandu Gaston K.
Mulder Nicola
Niehues Anna
Njoku Kelechi F.
Skelton Michelle
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2022

Advances in omics technologies allow for holistic studies into biological systems. These studies rely on integrative data analysis techniques to obtain a comprehensive view of the dynamics of cellular processes and molecular mechanisms. Network-based integrative approaches have revolutionized multi-omics analysis by providing the framework to represent interactions between multiple different omics-layers in a graph, which may faithfully reflect the molecular wiring in a cell. Here we review network-based multi-omics/multi-modal integrative analytical approaches. We classify these approaches according to the type of omics data supported, the methods and/or algorithms implemented, their node and/or edge weighting components, and their ability to identify key nodes and subnetworks. We show how these approaches can be used to identify biomarkers, disease subtypes, crosstalk, causality, and molecular drivers of physiological and pathological mechanisms. We provide insight into the most appropriate methods and tools for research questions as showcased around the aetiology and treatment of COVID-19 that can be informed by multi-omics data integration. We conclude with an overview of challenges associated with multi-omics network-based analysis, such as reproducibility, heterogeneity, and (biological) interpretability of the results, and we highlight some future directions for network-based integration.

Northumbria Research Link

PubMed Central

Computational Methods for the Analysis of Genomic Data and Biological Processes

Author
Publication venue: 'MDPI AG'
Publication date: 01/05/2021

In recent decades, new technologies have made remarkable progress in helping to understand biological systems. Rapid advances in genomic profiling techniques such as microarrays or high-performance sequencing have brought new opportunities and challenges in the fields of computational biology and bioinformatics. Such genetic sequencing techniques allow large amounts of data to be produced, whose analysis and cross-integration could provide a complete view of organisms. As a result, it is necessary to develop new techniques and algorithms that analyze these data reliably and efficiently. This Special Issue collected the latest advances in the field of computational methods for the analysis of gene expression data, and, in particular, the modeling of biological processes. Here we present eleven works selected to be published in this Special Issue due to their interest, quality, and originality.

Directory of Open Access Books (DOAB)

An integrated multi-omic analysis of iPSC-derived motor neurons from C9ORF72 ALS patients

Author: Adam M.
Banuelos M.G.
Casale M.
Cheng A.
Cox V.
Coyne A.N.
Daigle J.G.
Dardov V.
Escalante-Chong R.
Eyk J.E. van
Finkbeiner S.
Fraenkel E.
Frank A.
Gomez E.
Hayes L.
Holewenski R.
Kaye J.A.
Lei S.S.
Lenail A.
Li J.
Lim R.G.
Lima L.
Lloyd T.E.
Mandefro B.
Matlock A.
Milani P.
NeuroLINCS Consortium
NYGC ALS Consortium
Ornelas L.
Panther L.
Patel-Murray N.L.
Pham J.
Ramamoorthy D.
Rothstein J.D.
Sachs K.
Sareen D.
Shelley B.
Stocksdale J.
Svendsen C.N.
Thompson L.M.
Thompson T.G.
Trost H.
Venkatraman V.
Wassie B.T.
Wilhelm M.
Wu J.
Wyman S.
Yang S.
Publication venue: 'Elsevier BV'
Publication date: 19/11/2021

Neurodegenerative diseases are challenging for systems biology because of the lack of reliable animal models or patient samples at early disease stages. Induced pluripotent stem cells (iPSCs) could address these challenges. We investigated DNA, RNA, epigenetics, and proteins in iPSC-derived motor neurons from patients with ALS carrying hexanucleotide expansions in C9ORF72. Using integrative computational methods combining all omics datasets, we identified novel and known dysregulated pathways. We used a C9ORF72 Drosophila model to distinguish pathways contributing to disease phenotypes from compensatory ones and confirmed alterations in some pathways in postmortem spinal cord tissue of patients with ALS. A different differentiation protocol was used to derive a separate set of C9ORF72 and control motor neurons. Many individual -omics differed by protocol, but some core dysregulated pathways were consistent. This strategy of analyzing patient-specific neurons provides disease-related outcomes with small numbers of heterogeneous lines and reduces variation from single-omics to elucidate network-based signatures.

Genetics of disease, diagnosis and treatment

Leiden University Scholarly Publications