Search CORE

4 research outputs found

Application of Machine Learning to Mapping and Simulating Gene Regulatory Networks

Author: Liow Hien-haw
Publication venue: Washington University Open Scholarship
Publication date: 15/05/2015
Field of study

This dissertation explores, proposes, and examines methods of applying modernmachine learning and Bayesian statistics in the quantitative and qualitative modeling of gene regulatory networks using high-throughput gene expression data. A semi-parametric Bayesian model based on random forest is developed to infer quantitative aspects of gene regulation relations; a parametric model is developed to predict geneexpression levels solely from genotype information. Simulation of network behavior is shown to complement regression analysis greatly in capturing the dynamics of gene regulatory networks. Finally, as an application and extension of novel approaches in gene expression analysis, new methods of discovering topological structure of gene regulatory networks are developed and shown to provide improvement over existing methods

Washington University St. Louis: Open Scholarship

NetProphet 2.0: Mapping transcription factor networks by exploiting scalable data resources

Author: Brent Michael R
Kang Yiming
Liow Hien-Haw
Maier Ezekiel J
Publication venue: 'Oxford University Press (OUP)'
Publication date: 15/01/2018
Field of study

MOTIVATION: Cells process information, in part, through transcription factor (TF) networks, which control the rates at which individual genes produce their products. A TF network map is a graph that indicates which TFs bind and directly regulate each gene. Previous work has described network mapping algorithms that rely exclusively on gene expression data and \u27integrative\u27 algorithms that exploit a wide range of data sources including chromatin immunoprecipitation sequencing (ChIP-seq) of many TFs, genome-wide chromatin marks, and binding specificities for many TFs determined in vitro. However, such resources are available only for a few major model systems and cannot be easily replicated for new organisms or cell types. RESULTS: We present NetProphet 2.0, a \u27data light\u27 algorithm for TF network mapping, and show that it is more accurate at identifying direct targets of TFs than other, similarly data light algorithms. In particular, it improves on the accuracy of NetProphet 1.0, which used only gene expression data, by exploiting three principles. First, combining multiple approaches to network mapping from expression data can improve accuracy relative to the constituent approaches. Second, TFs with similar DNA binding domains bind similar sets of target genes. Third, even a noisy, preliminary network map can be used to infer DNA binding specificities from promoter sequences and these inferred specificities can be used to further improve the accuracy of the network map. AVAILABILITY AND IMPLEMENTATION: Source code and comprehensive documentation are freely available at https://github.com/yiming-kang/NetProphet_2.0. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online

Digital Commons@Becker

Predicting protein-ligand interactions based on bow-pharmacological space and Bayesian additive regression trees

Author: Brown J. B.
Chen Luonan
Dai Hao
Fan Huai-Meng
Koh Ching Chiek
Lee Nicholas Keone
Li Li
Liow Hien-haw
Reker Daniel
Wang Haishuai
Wei Dong-Qing
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Identifying potential protein-ligand interactions is central to the field of drug discovery as it facilitates the identification of potential novel drug leads, contributes to advancement from hits to leads, predicts potential off-target explanations for side effects of approved drugs or candidates, as well as de-orphans phenotypic hits. For the rapid identification of protein-ligand interactions, we here present a novel chemogenomics algorithm for the prediction of protein-ligand interactions using a new machine learning approach and novel class of descriptor. The algorithm applies Bayesian Additive Regression Trees (BART) on a newly proposed proteochemical space, termed the bow-pharmacological space. The space spans three distinctive sub-spaces that cover the protein space, the ligand space, and the interaction space. Thereby, the model extends the scope of classical target prediction or chemogenomic modelling that relies on one or two of these subspaces. Our model demonstrated excellent prediction power, reaching accuracies of up to 94.5-98.4% when evaluated on four human target datasets constituting enzymes, nuclear receptors, ion channels, and G-protein-coupled receptors . BART provided a reliable probabilistic description of the likelihood of interaction between proteins and ligands, which can be used in the prioritization of assays to be performed in both discovery and vigilance phases of small molecule development

Kölner UniversitätsPublikationsServer

Digital Commons@Becker

Fairfield University: DigitalCommons@Fairfield

NetProphet 2.0: mapping transcription factor networks by exploiting scalable data resources

Author: Abdulrehman
Andersson
Babu
Bonke
Boorsma
Boulesteix
Brent
Cahan
Cenk Sahinalp
Chipman
Clough
Core
Cuellar-Partida
Cusanovich
D'alessio
Danko
Elemento
Ezekiel J Maier
Faith
Fuxman Bass
Georlette
Ghanbari
Gitter
Gordân
Grant
Greenfield
Gupta
Hadzic
Harbison
Haury
Haynes
Heinaniemi
Hien-Haw Liow
Hu
Hughes
Huynh-Thu
Ikmi
Jolma
Kao
Kemmeren
Lam
Liu
Marbach
Marbach
Margolin
Michael
Michael R Brent
Page
Rackham
Robasky
Sievers
Teleman
Tran
Van Nostrand
Weirauch
Yiming Kang
Zhong
Publication venue: 'Oxford University Press (OUP)'
Publication date: 15/01/2018
Field of study

Crossref

Digital Commons@Becker