182,930 research outputs found
Patterns and Complexity in Biological Systems: A Study of Sequence Structure and Ontology-based Networks
Biological information can be explored at many different levels, with the most basic information encoded in patterns within the DNA sequence. Through molecular level processes, these patterns are capable of controlling the states of genes, resulting in a complex network of interactions between genes. Key features of biological systems can be determined by evaluating properties of this gene regulatory network. More specifically, a network-based approach helps us to understand how the collective behavior of genes corresponds to patterns in genetic function.
We combine Chromatin-Immunoprecipitation microarray (ChIP-chip) data with genomic sequence data to determine how DNA sequence works to recruit various proteins. We quantify this information using a value termed "nmer-association.'' "Nmer-association'' measures how strongly individual DNA sequences are associated with a protein in a given ChIP-chip experiment. We also develop the "split-motif'' algorithm to study the underlying structural properties of DNA sequence independent of wet-lab data. The "split-motif'' algorithm finds pairs of DNA motifs which preferentially localize relative to one another. These pairs are primarily composed of known transcription factor binding sites and their co-occurrence is indicative of higher-order structure. This kind of structure has largely been missed in standard motif-finding algorithms despite emerging evidence of the importance of complex regulation.
In both simple and complex regulation, two genes that are connected in a regulatory fashion are likely to have shared functions. The Gene Ontology (GO) provides biologists with a controlled terminology with which to describe how genes are associated with function and how those functional terms are related to each other. We introduce a method for processing functional information in GO to produce a gene network. We find that the edges in this network are correlated with known regulatory interactions and that the strength of the functional relationship between two genes can be used as an indicator of how informationally important that link is in the regulatory network. We also investigate the network structure of gene-term annotations found in GO and use these associations to establish an alternate natural way to group the functional terms. These groups of terms are drastically different from the hierarchical structure established by the Gene Ontology and provide an alternative framework with which to describe and predict the functions of experimentally identified groups of genes
Stochastic dynamic modeling of short gene expression time-series data
Copyright [2008] IEEE. This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of Brunel University's products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to [email protected]. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.In this paper, the expectation maximization (EM) algorithm is applied for modeling the gene regulatory network from gene time-series data. The gene regulatory network is viewed as a stochastic dynamic model, which consists of the noisy gene measurement from microarray and the gene regulation first-order autoregressive (AR) stochastic dynamic process. By using the EM algorithm, both the model parameters and the actual values of the gene expression levels can be identified simultaneously. Moreover, the algorithm can deal with the sparse parameter identification and the noisy data in an efficient way. It is also shown that the EM algorithm can handle the microarray gene expression data with large number of variables but a small number of observations. The gene expression stochastic dynamic models for four real-world gene expression data sets are constructed to demonstrate the advantages of the introduced algorithm. Several indices are proposed to evaluate the models of inferred gene regulatory networks, and the relevant biological properties are discussed
A statistical method for revealing form-function relations in biological networks
Over the past decade, a number of researchers in systems biology have sought
to relate the function of biological systems to their network-level
descriptions -- lists of the most important players and the pairwise
interactions between them. Both for large networks (in which statistical
analysis is often framed in terms of the abundance of repeated small subgraphs)
and for small networks which can be analyzed in greater detail (or even
synthesized in vivo and subjected to experiment), revealing the relationship
between the topology of small subgraphs and their biological function has been
a central goal. We here seek to pose this revelation as a statistical task,
illustrated using a particular setup which has been constructed experimentally
and for which parameterized models of transcriptional regulation have been
studied extensively. The question "how does function follow form" is here
mathematized by identifying which topological attributes correlate with the
diverse possible information-processing tasks which a transcriptional
regulatory network can realize. The resulting method reveals one form-function
relationship which had earlier been predicted based on analytic results, and
reveals a second for which we can provide an analytic interpretation. Resulting
source code is distributed via http://formfunction.sourceforge.net.Comment: To appear in Proc. Natl. Acad. Sci. USA. 17 pages, 9 figures, 2
table
Patterns of subnet usage reveal distinct scales of regulation in the transcriptional regulatory network of Escherichia coli
The set of regulatory interactions between genes, mediated by transcription
factors, forms a species' transcriptional regulatory network (TRN). By
comparing this network with measured gene expression data one can identify
functional properties of the TRN and gain general insight into transcriptional
control. We define the subnet of a node as the subgraph consisting of all nodes
topologically downstream of the node, including itself. Using a large set of
microarray expression data of the bacterium Escherichia coli, we find that the
gene expression in different subnets exhibits a structured pattern in response
to environmental changes and genotypic mutation. Subnets with less changes in
their expression pattern have a higher fraction of feed-forward loop motifs and
a lower fraction of small RNA targets within them. Our study implies that the
TRN consists of several scales of regulatory organization: 1) subnets with more
varying gene expression controlled by both transcription factors and
post-transcriptional RNA regulation, and 2) subnets with less varying gene
expression having more feed-forward loops and less post-transcriptional RNA
regulation.Comment: 14 pages, 8 figures, to be published in PLoS Computational Biolog
A quantitative comparison of sRNA-based and protein-based gene regulation
Small, non-coding RNAs (sRNAs) play important roles as genetic regulators in
prokaryotes. sRNAs act post-transcriptionally via complementary pairing with
target mRNAs to regulate protein expression. We use a quantitative approach to
compare and contrast sRNAs with conventional transcription factors (TFs) to
better understand the advantages of each form of regulation. In particular, we
calculate the steady-state behavior, noise properties, frequency-dependent gain
(amplification), and dynamical response to large input signals of both forms of
regulation. While the mean steady-state behavior of sRNA-regulated proteins
exhibits a distinctive tunable threshold-linear behavior, our analysis shows
that transcriptional bursting leads to significantly higher intrinsic noise in
sRNA-based regulation than in TF-based regulation in a large range of
expression levels and limits the ability of sRNAs to perform quantitative
signaling. Nonetheless, we find that sRNAs are better than TFs at filtering
noise in input signals. Additionally, we find that sRNAs allow cells to respond
rapidly to large changes in input signals. These features suggest a niche for
sRNAs in allowing cells to transition quickly yet reliably between distinct
states. This functional niche is consistent with the widespread appearance of
sRNAs in stress-response and quasi-developmental networks in prokaryotes.Comment: 26 pages, 8 figures; accepted for publication in Molecular Systems
Biolog
- …