Search CORE

5,223 research outputs found

Scalable high-throughput identification of genetic targets by network filtering

Author: A de la Fuente
AB Parsons
AM Deutschbauer
BKH Chia
C Zheng
CC Chuang
D di Bernardo
EJ Cosgrove
F Hormozdiari
HQ Wang
HQ Wang
HW Ma
I Guyon
JJ Faith
JM Freudenberg
KM Mani
L Perlman
M Bansal
M Hall
O Alter
Paolo Pannarale
S Falcon
S Keerthi
S Mnaimneh
S Wang
T Van den Bulcke
TR Hughes
Vitoantonio Bevilacqua
Y Yamanishi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Recommended from our members

Clinical metagenomics.

Author: Chiu Charles Y
Miller Steven A
Publication venue: eScholarship, University of California
Publication date: 01/06/2019
Field of study

Clinical metagenomic next-generation sequencing (mNGS), the comprehensive analysis of microbial and host genetic material (DNA and RNA) in samples from patients, is rapidly moving from research to clinical laboratories. This emerging approach is changing how physicians diagnose and treat infectious disease, with applications spanning a wide range of areas, including antimicrobial resistance, the microbiome, human host gene expression (transcriptomics) and oncology. Here, we focus on the challenges of implementing mNGS in the clinical laboratory and address potential solutions for maximizing its impact on patient care and public health

eScholarship - University of California

Distributed gene clinical decision support system based on cloud computing

Author: Li Changlong
Wang Chao
Wang Jiali
Wang Qingfeng
Xu Bo
Zhou Xuehai
Zhuang Hang
Publication venue: The Research Repository @ WVU
Publication date: 01/01/2018
Field of study

Background: The clinical decision support system can effectively break the limitations of doctors’ knowledge and reduce the possibility of misdiagnosis to enhance health care. The traditional genetic data storage and analysis methods based on stand-alone environment are hard to meet the computational requirements with the rapid genetic data growth for the limited scalability. Methods: In this paper, we propose a distributed gene clinical decision support system, which is named GCDSS. And a prototype is implemented based on cloud computing technology. At the same time, we present CloudBWA which is a novel distributed read mapping algorithm leveraging batch processing strategy to map reads on Apache Spark. Results: Experiments show that the distributed gene clinical decision support system GCDSS and the distributed read mapping algorithm CloudBWA have outstanding performance and excellent scalability. Compared with state-of-the-art distributed algorithms, CloudBWA achieves up to 2.63 times speedup over SparkBWA. Compared with stand-alone algorithms, CloudBWA with 16 cores achieves up to 11.59 times speedup over BWA-MEM with 1 core. Conclusions: GCDSS is a distributed gene clinical decision support system based on cloud computing techniques. In particular, we incorporated a distributed genetic data analysis pipeline framework in the proposed GCDSS system. To boost the data processing of GCDSS, we propose CloudBWA, which is a novel distributed read mapping algorithm to leverage batch processing technique in mapping stage using Apache Spark platform. Keywords: Clinical decision support system, Cloud computing, Spark, Alluxio, Genetic data analysis, Read mappin

The Research Repository @ WVU (West Virginia University)

Integrative OMICS Data-Driven Procedure Using a Derivatized Meta-Analysis Approach

Author: Cervantes-Gracia Karla
Chahwan Richard
Husi Holger
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2022
Field of study

The wealth of high-throughput data has opened up new opportunities to analyze and describe biological processes at higher resolution, ultimately leading to a significant acceleration of scientific output using high-throughput data from the different omics layers and the generation of databases to store and report raw datasets. The great variability among the techniques and the heterogeneous methodologies used to produce this data have placed meta-analysis methods as one of the approaches of choice to correlate the resultant large-scale datasets from different research groups. Through multi-study meta-analyses, it is possible to generate results with greater statistical power compared to individual analyses. Gene signatures, biomarkers and pathways that provide new insights of a phenotype of interest have been identified by the analysis of large-scale datasets in several fields of science. However, despite all the efforts, a standardized regulation to report large-scale data and to identify the molecular targets and signaling networks is still lacking. Integrative analyses have also been introduced as complementation and augmentation for meta-analysis methodologies to generate novel hypotheses. Currently, there is no universal method established and the different methods available follow different purposes. Herein we describe a new unifying, scalable and straightforward methodology to meta-analyze different omics outputs, but also to integrate the significant outcomes into novel pathways describing biological processes of interest. The significance of using proper molecular identifiers is highlighted as well as the potential to further correlate molecules from different regulatory levels. To show the methodology's potential, a set of transcriptomic datasets are meta-analyzed as an example

ZORA

NAViGaTing the Micronome – Using Multiple MicroRNA Prediction Databases to Identify Signalling Pathway-Associated MicroRNAs

Author: A Arvey
A Grimson
A Krek
A Ruepp
A Stark
AJ Enright
AM Duursma
B John
B Rhead
B Wightman
BJ Reinhart
BP Lewis
BP Lewis
D Baek
D Betel
D Blankenberg
D Grun
D Karolchik
DP Bartel
Elize A. Shirdel
Esteban Ballestar
EW Dijkstra
G Hutvagner
G Joshi-Tope
G Tang
GL Papadopoulos
HR Horvitz
I Vastrik
Igor Jurisica
IL Hofacker
J Taylor
JA Engelman
JD Han
JE Abrahante
JG Doench
JG Doench
JJ Forman
JS McCaskill
K Chen
K Seggerson
KC Miranda
KD Pruitt
KR Brown
KR Brown
KR Brown
L He
L Matthews
LP Lim
M Ceppi
M Chalfie
M Cully
M Kanehisa
M Kanehisa
M Kertesz
M Lagos-Quintana
M Maragkakis
M Maragkakis
M Rehmsmeier
M Selbach
M Zuker
MA Batzer
MS Waterman
MW Rhoades
NC Lau
P Landgraf
P Liu
P Saetrom
PH Olsen
PS Linsley
PT Hawkins
Q Huang
R Gentleman
R Lee
R Schneider
RC Friedman
RC Lee
RC Lee
RJ Webster
S Griffiths-Jones
S Griffiths-Jones
S Lall
S Nam
S Wuchty
SF Tavazoie
SY Lin
Tak W. Mak
TF Smith
TJ Hubbard
U Brandes
UA Orom
V Ambros
V Carey
VA Gennarino
Wing Xie
WJ Kent
WP Kloosterman
X Wang
Y Zeng
Y Zeng
Publication venue: Public Library of Science
Publication date: 01/02/2011
Field of study

MicroRNAs are a class of small RNAs known to regulate gene expression at the transcript level, the protein level, or both. Since microRNA binding is sequence-based but possibly structure-specific, work in this area has resulted in multiple databases storing predicted microRNA:target relationships computed using diverse algorithms. We integrate prediction databases, compare predictions to in vitro data, and use cross-database predictions to model the microRNA:transcript interactome--referred to as the micronome--to study microRNA involvement in well-known signalling pathways as well as associations with disease. We make this data freely available with a flexible user interface as our microRNA Data Integration Portal--mirDIP (http://ophid.utoronto.ca/mirDIP).mirDIP integrates prediction databases to elucidate accurate microRNA:target relationships. Using NAViGaTOR to produce interaction networks implicating microRNAs in literature-based, KEGG-based and Reactome-based pathways, we find these signalling pathway networks have significantly more microRNA involvement compared to chance (p<0.05), suggesting microRNAs co-target many genes in a given pathway. Further examination of the micronome shows two distinct classes of microRNAs; universe microRNAs, which are involved in many signalling pathways; and intra-pathway microRNAs, which target multiple genes within one signalling pathway. We find universe microRNAs to have more targets (p<0.0001), to be more studied (p<0.0002), and to have higher degree in the KEGG cancer pathway (p<0.0001), compared to intra-pathway microRNAs.Our pathway-based analysis of mirDIP data suggests microRNAs are involved in intra-pathway signalling. We identify two distinct classes of microRNAs, suggesting a hierarchical organization of microRNAs co-targeting genes both within and between pathways, and implying differential involvement of universe and intra-pathway microRNAs at the disease level

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Cancer immunogenomics: Computational neoantigen identification and vaccine design

Author: Coffman Adam
Graubert Aaron
Griffith Malachi
Griffith Obi L
Hundal Jasreet
Kiwala Susanna
Mardis Elaine R
McMichael Joshua
Miller Christopher J
Walker Jason
Publication venue: Digital Commons@Becker
Publication date: 01/01/2016
Field of study

Digital Commons@Becker

WholePathwayScope: a comprehensive pathway-based analysis tool for high-throughput data

Author: Cohen Jonathan C
Hobbs Helen H
Horton Jay D
Stephens Robert M
Yi Ming
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Analysis of High Throughput (HTP) Data such as microarray and proteomics data has provided a powerful methodology to study patterns of gene regulation at genome scale. A major unresolved problem in the post-genomic era is to assemble the large amounts of data generated into a meaningful biological context. We have developed a comprehensive software tool, WholePathwayScope (WPS), for deriving biological insights from analysis of HTP data. RESULT: WPS extracts gene lists with shared biological themes through color cue templates. WPS statistically evaluates global functional category enrichment of gene lists and pathway-level pattern enrichment of data. WPS incorporates well-known biological pathways from KEGG (Kyoto Encyclopedia of Genes and Genomes) and Biocarta, GO (Gene Ontology) terms as well as user-defined pathways or relevant gene clusters or groups, and explores gene-term relationships within the derived gene-term association networks (GTANs). WPS simultaneously compares multiple datasets within biological contexts either as pathways or as association networks. WPS also integrates Genetic Association Database and Partial MedGene Database for disease-association information. We have used this program to analyze and compare microarray and proteomics datasets derived from a variety of biological systems. Application examples demonstrated the capacity of WPS to significantly facilitate the analysis of HTP data for integrative discovery. CONCLUSION: This tool represents a pathway-based platform for discovery integration to maximize analysis power. The tool is freely available at

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Posterior Association Networks and Functional Modules Inferred from Rich Phenotypes of Gene Perturbations

Author: A Baryshnikova
A Battle
A Dempster
A Rzhetsky
A Subramanian
A Tong
Andrey Rzhetsky
B Szappanos
C Bakal
C Echeverri
C Echeverri
C Stark
D Schmidt
F Fuchs
Florian Markowetz
G Marsaglia
G McLachlan
H Dadgostar
H Jeffeys
H Shimodaira
I Lee
J Felsenstein
J Flint
K Mulder
K Schadler
Klaas W. Mulder
M Booker
M Boutros
M Castro
M Costanzo
M de Hoon
M Eisen
M Farha
M Gilsdorf
Mauro A. Castro
N Le Meur
P Luc
R Green
R Kelley
R Mani
R Suzuki
R Tibes
S Arora
S Benini
S Collins
S DuBois
S Wong
T Horn
T Kawamoto
T Prasad
W Pan
W Zhong
WN Venables
X Wang
Xin Wang
Y Ji
Y Qi
Publication venue: Public Library of Science
Publication date: 28/06/2012
Field of study

Combinatorial gene perturbations provide rich information for a systematic exploration of genetic interactions. Despite successful applications to bacteria and yeast, the scalability of this approach remains a major challenge for higher organisms such as humans. Here, we report a novel experimental and computational framework to efficiently address this challenge by limiting the ‘search space’ for important genetic interactions. We propose to integrate rich phenotypes of multiple single gene perturbations to robustly predict functional modules, which can subsequently be subjected to further experimental investigations such as combinatorial gene silencing. We present posterior association networks (PANs) to predict functional interactions between genes estimated using a Bayesian mixture modelling approach. The major advantage of this approach over conventional hypothesis tests is that prior knowledge can be incorporated to enhance predictive power. We demonstrate in a simulation study and on biological data, that integrating complementary information greatly improves prediction accuracy. To search for significant modules, we perform hierarchical clustering with multiscale bootstrap resampling. We demonstrate the power of the proposed methodologies in applications to Ewing's sarcoma and human adult stem cells using publicly available and custom generated data, respectively. In the former application, we identify a gene module including many confirmed and highly promising therapeutic targets. Genes in the module are also significantly overrepresented in signalling pathways that are known to be critical for proliferation of Ewing's sarcoma cells. In the latter application, we predict a functional network of chromatin factors controlling epidermal stem cell fate. Further examinations using ChIP-seq, ChIP-qPCR and RT-qPCR reveal that the basis of their genetic interactions may arise from transcriptional cross regulation. A Bioconductor package implementing PAN is freely available online at http://bioconductor.org/packages/release/bioc/html/PANR.html

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare