166 research outputs found
Steady-State Analysis of Genetic Regulatory Networks Modelled by Probabilistic Boolean Networks
Probabilistic Boolean networks (PBNs) have recently been introduced as a promising class of models of genetic regulatory networks. The dynamic behaviour of PBNs can
be analysed in the context of Markov chains. A key goal is the determination of the
steady-state (long-run) behaviour of a PBN by analysing the corresponding Markov
chain. This allows one to compute the long-term influence of a gene on another
gene or determine the long-term joint probabilistic behaviour of a few selected genes.
Because matrix-based methods quickly become prohibitive for large sizes of networks,
we propose the use of Monte Carlo methods. However, the rate of convergence to
the stationary distribution becomes a central issue. We discuss several approaches
for determining the number of iterations necessary to achieve convergence of the
Markov chain corresponding to a PBN. Using a recently introduced method based on
the theory of two-state Markov chains, we illustrate the approach on a sub-network
designed from human glioma gene expression data and determine the joint steadystate
probabilities for several groups of genes
Genomic applications of statistical signal processing
Biological phenomena in the cells can be explained in terms of the interactions among
biological macro-molecules, e.g., DNAs, RNAs and proteins. These interactions can
be modeled by genetic regulatory networks (GRNs). This dissertation proposes to
reverse engineering the GRNs based on heterogeneous biological data sets, including
time-series and time-independent gene expressions, Chromatin ImmunoPrecipatation
(ChIP) data, gene sequence and motifs and other possible sources of knowledge. The
objective of this research is to propose novel computational methods to catch pace
with the fast evolving biological databases.
Signal processing techniques are exploited to develop computationally efficient,
accurate and robust algorithms, which deal individually or collectively with various
data sets. Methods of power spectral density estimation are discussed to identify
genes participating in various biological processes. Information theoretic methods are
applied for non-parametric inference. Bayesian methods are adopted to incorporate several sources with prior knowledge. This work aims to construct an inference system
which takes into account different sources of information such that the absence of some
components will not interfere with the rest of the system.
It has been verified that the proposed algorithms achieve better inference accuracy
and higher computational efficiency compared with other state-of-the-art schemes,
e.g. REVEAL, ARACNE, Bayesian Networks and Relevance Networks, at presence
of artificial time series and steady state microarray measurements. The proposed algorithms
are especially appealing when the the sample size is small. Besides, they are
able to integrate multiple heterogeneous data sources, e.g. ChIP and sequence data,
so that a unified GRN can be inferred. The analysis of biological literature and in
silico experiments on real data sets for fruit fly, yeast and human have corroborated
part of the inferred GRN. The research has also produced a set of potential control
targets for designing gene therapy strategies
Analysis of Gene Coexpression by B-Spline Based CoD Estimation
The gene coexpression study has emerged as a novel holistic approach for microarray data analysis. Different indices have been used in exploring coexpression relationship, but each is associated with certain pitfalls. The Pearson's correlation coefficient, for example, is not capable of uncovering nonlinear pattern and directionality of coexpression. Mutual information can detect nonlinearity but fails to show directionality. The coefficient of determination (CoD) is unique in exploring different patterns of gene coexpression, but so far only applied to discrete data and the conversion of continuous microarray data to the discrete format could lead to information loss. Here, we proposed an effective algorithm, CoexPro, for gene coexpression analysis. The new algorithm is based on B-spline approximation of coexpression between a pair of genes, followed by CoD estimation. The algorithm was justified by simulation studies and by functional semantic similarity analysis. The proposed algorithm is capable of uncovering both linear and a specific class of nonlinear relationships from continuous microarray data. It can also provide suggestions for possible directionality of coexpression to the researchers. The new algorithm presents a novel model for gene coexpression and will be a valuable tool for a variety of gene expression and network studies. The application of the algorithm was demonstrated by an analysis on ligand-receptor coexpression in cancerous and noncancerous cells. The software implementing the algorithm is available upon request to the authors
Comparison of Probabilistic Boolean Network and Dynamic Bayesian Network Approaches for Inferring Gene Regulatory Networks
Background: The regulation of gene expression is achieved through gene regulatory networks (GRNs) in which collections of genes interact with one another and other substances in a cell. In order to understand the underlying function of organisms, it is necessary to study the behavior of genes in a gene regulatory network context. Several computational approaches are available for modeling gene regulatory networks with different datasets. In order to optimize modeling of GRN, these approaches must be compared and evaluated in terms of accuracy and efficiency.
Results: In this paper, two important computational approaches for modeling gene regulatory networks, probabilistic Boolean network methods and dynamic Bayesian network methods, are compared using a biological time-series dataset from the Drosophila Interaction Database to construct a Drosophila gene network. A subset of time points and gene samples from the whole dataset is used to evaluate the performance of these two approaches.
Conclusions: The comparison indicates that both approaches had good performance in modeling the gene regulatory networks. The accuracy in terms of recall and precision can be improved if a smaller subset of genes is selected for inferring GRNs. The accuracy of both approaches is dependent upon the number of selected genes and time points of gene samples. In all tested cases, DBN identified more gene interactions and gave better recall than PBN
Uncovering Gene Regulatory Networks from Time-Series Microarray Data with Variational Bayesian Structural Expectation Maximization
We investigate in this paper reverse engineering of gene regulatory networks from time-series microarray data. We apply dynamic Bayesian networks (DBNs) for modeling cell cycle regulations. In developing a network inference algorithm, we focus on soft solutions that can provide a posteriori probability (APP) of network topology. In particular, we propose a variational Bayesian structural expectation maximization algorithm that can learn the posterior distribution of the network model parameters and topology jointly. We also show how the obtained APPs of the network topology can be used in a Bayesian data integration strategy to integrate two different microarray data sets. The proposed VBSEM algorithm has been tested on yeast cell cycle data sets. To evaluate the confidence of the inferred networks, we apply a moving block bootstrap method. The inferred network is validated by comparing it to the KEGG pathway map
Parallel mutual information estimation for inferring gene regulatory networks on GPUs
<p>Abstract</p> <p>Background</p> <p>Mutual information is a measure of similarity between two variables. It has been widely used in various application domains including computational biology, machine learning, statistics, image processing, and financial computing. Previously used simple histogram based mutual information estimators lack the precision in quality compared to kernel based methods. The recently introduced B-spline function based mutual information estimation method is competitive to the kernel based methods in terms of quality but at a lower computational complexity.</p> <p>Results</p> <p>We present a new approach to accelerate the B-spline function based mutual information estimation algorithm with commodity graphics hardware. To derive an efficient mapping onto this type of architecture, we have used the Compute Unified Device Architecture (CUDA) programming model to design and implement a new parallel algorithm. Our implementation, called CUDA-MI, can achieve speedups of up to 82 using double precision on a single GPU compared to a multi-threaded implementation on a quad-core CPU for large microarray datasets. We have used the results obtained by CUDA-MI to infer gene regulatory networks (GRNs) from microarray data. The comparisons to existing methods including ARACNE and TINGe show that CUDA-MI produces GRNs of higher quality in less time.</p> <p>Conclusions</p> <p>CUDA-MI is publicly available open-source software, written in CUDA and C++ programming languages. It obtains significant speedup over sequential multi-threaded implementation by fully exploiting the compute capability of commonly used CUDA-enabled low-cost GPUs.</p
Untangling hotel industry’s inefficiency: An SFA approach applied to a renowned Portuguese hotel chain
The present paper explores the technical efficiency of four hotels from Teixeira Duarte Group - a renowned Portuguese hotel chain. An efficiency ranking is established from these four hotel units located in Portugal using Stochastic Frontier Analysis. This methodology allows to discriminate between measurement error and systematic inefficiencies in the estimation process enabling to investigate the main inefficiency causes. Several suggestions concerning efficiency improvement are undertaken for each hotel studied.info:eu-repo/semantics/publishedVersio
- …