5,128 research outputs found

    Prediction of Human Disease Genes by Human-Mouse Conserved Coexpression Analysis

    Get PDF
    One of the most limiting aspects of biological research in the post-genomic era is the capability to integrate massive datasets on gene structure and function for producing useful biological knowledge. In this report we have applied an integrative approach to address the problem of identifying likely candidate genes within loci associated with human genetic diseases. Despite the recent progress in sequencing technologies, approaching this problem from an experimental perspective still represents a very demanding task, because the critical region may typically contain hundreds of positional candidates. We found that by concentrating only on genes sharing similar expression profiles in both human and mouse, massive microarray datasets can be used to reliably identify disease-relevant relationships among genes. Moreover, we found that integrating the coexpression criterion with systematic phenome analysis allows efficient identification of disease genes in large genomic regions. Using this approach on 850 OMIM loci characterized by unknown molecular basis, we propose high-probability candidates for 81 genetic diseases

    Positively Correlated miRNA-miRNA Regulatory Networks in Mouse Frontal Cortex During Early Stages of Alcohol Dependence

    Get PDF
    Although the study of gene regulation via the action of specific microRNAs (miRNAs) has experienced a boom in recent years, the analysis of genome-wide interaction networks among miRNAs and respective targeted mRNAs has lagged behind. MicroRNAs simultaneously target many transcripts and fine-tune the expression of genes through cooperative/combinatorial targeting. Therefore, they have a large regulatory potential that could widely impact development and progression of diseases, as well as contribute unpredicted collateral effects due to their natural, pathophysiological, or treatment-induced modulation. We support the viewpoint that whole mirnome-transcriptome interaction analysis is required to better understand the mechanisms and potential consequences of miRNA regulation and/or deregulation in relevant biological models. In this study, we tested the hypotheses that ethanol consumption induces changes in miRNA-mRNA interaction networks in the mouse frontal cortex and that some of the changes observed in the mouse are equivalent to changes in similar brain regions from human alcoholics. Results: miRNA-mRNA interaction networks responding to ethanol insult were identified by differential expression analysis and weighted gene coexpression network analysis (WGCNA). Important pathways (coexpressed modular networks detected by WGCNA) and hub genes central to the neuronal response to ethanol are highlighted, as well as key miRNAs that regulate these processes and therefore represent potential therapeutic targets for treating alcohol addiction. Importantly, we discovered a conserved signature of changing miRNAs between ethanol-treated mice and human alcoholics, which provides a valuable tool for future biomarker/diagnostic studies in humans. We report positively correlated miRNA-mRNA expression networks that suggest an adaptive, targeted miRNA response due to binge ethanol drinking. Conclusions: This study provides new evidence for the role of miRNA regulation in brain homeostasis and sheds new light on current understanding of the development of alcohol dependence. To our knowledge this is the first report that activated expression of miRNAs correlates with activated expression of mRNAs rather than with mRNA downregulation in an in vivo model. We speculate that early activation of miRNAs designed to limit the effects of alcohol-induced genes may be an essential adaptive response during disease progression.NIAAA 5R01AA012404, 5P20AA017838, 5U01AA013520, P01AA020683, 5T32AA007471-24/25Waggoner Center for Alcohol and Addiction Researc

    Genetic and Neuroanatomical Support for Functional Brain Network Dynamics in Epilepsy

    Full text link
    Focal epilepsy is a devastating neurological disorder that affects an overwhelming number of patients worldwide, many of whom prove resistant to medication. The efficacy of current innovative technologies for the treatment of these patients has been stalled by the lack of accurate and effective methods to fuse multimodal neuroimaging data to map anatomical targets driving seizure dynamics. Here we propose a parsimonious model that explains how large-scale anatomical networks and shared genetic constraints shape inter-regional communication in focal epilepsy. In extensive ECoG recordings acquired from a group of patients with medically refractory focal-onset epilepsy, we find that ictal and preictal functional brain network dynamics can be accurately predicted from features of brain anatomy and geometry, patterns of white matter connectivity, and constraints complicit in patterns of gene coexpression, all of which are conserved across healthy adult populations. Moreover, we uncover evidence that markers of non-conserved architecture, potentially driven by idiosyncratic pathology of single subjects, are most prevalent in high frequency ictal dynamics and low frequency preictal dynamics. Finally, we find that ictal dynamics are better predicted by white matter features and more poorly predicted by geometry and genetic constraints than preictal dynamics, suggesting that the functional brain network dynamics manifest in seizures rely on - and may directly propagate along - underlying white matter structure that is largely conserved across humans. Broadly, our work offers insights into the generic architectural principles of the human brain that impact seizure dynamics, and could be extended to further our understanding, models, and predictions of subject-level pathology and response to intervention

    Conserved host response to highly pathogenic avian influenza virus infection in human cell culture, mouse and macaque model systems

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Understanding host response to influenza virus infection will facilitate development of better diagnoses and therapeutic interventions. Several different experimental models have been used as a proxy for human infection, including cell cultures derived from human cells, mice, and non-human primates. Each of these systems has been studied extensively in isolation, but little effort has been directed toward systematically characterizing the conservation of host response on a global level beyond known immune signaling cascades.</p> <p>Results</p> <p>In the present study, we employed a multivariate modeling approach to characterize and compare the transcriptional regulatory networks between these three model systems after infection with a highly pathogenic avian influenza virus of the H5N1 subtype. Using this approach we identified functions and pathways that display similar behavior and/or regulation including the well-studied impact on the interferon response and the inflammasome. Our results also suggest a primary response role for airway epithelial cells in initiating hypercytokinemia, which is thought to contribute to the pathogenesis of H5N1 viruses. We further demonstrate that we can use a transcriptional regulatory model from the human cell culture data to make highly accurate predictions about the behavior of important components of the innate immune system in tissues from whole organisms.</p> <p>Conclusions</p> <p>This is the first demonstration of a global regulatory network modeling conserved host response between <it>in vitro </it>and <it>in vivo </it>models.</p

    The Impact of Multifunctional Genes on "Guilt by Association" Analysis

    Get PDF
    Many previous studies have shown that by using variants of “guilt-by-association”, gene function predictions can be made with very high statistical confidence. In these studies, it is assumed that the “associations” in the data (e.g., protein interaction partners) of a gene are necessary in establishing “guilt”. In this paper we show that multifunctionality, rather than association, is a primary driver of gene function prediction. We first show that knowledge of the degree of multifunctionality alone can produce astonishingly strong performance when used as a predictor of gene function. We then demonstrate how multifunctionality is encoded in gene interaction data (such as protein interactions and coexpression networks) and how this can feed forward into gene function prediction algorithms. We find that high-quality gene function predictions can be made using data that possesses no information on which gene interacts with which. By examining a wide range of networks from mouse, human and yeast, as well as multiple prediction methods and evaluation metrics, we provide evidence that this problem is pervasive and does not reflect the failings of any particular algorithm or data type. We propose computational controls that can be used to provide more meaningful control when estimating gene function prediction performance. We suggest that this source of bias due to multifunctionality is important to control for, with widespread implications for the interpretation of genomics studies

    Global gene expression profiling of healthy human brain and its application in studying neurological disorders

    Get PDF
    The human brain is the most complex structure known to mankind and one of the greatest challenges in modern biology is to understand how it is built and organized. The power of the brain arises from its variety of cells and structures, and ultimately where and when different genes are switched on and off throughout the brain tissue. In other words, brain function depends on the precise regulation of gene expression in its sub-anatomical structures. But, our understanding of the complexity and dynamics of the transcriptome of the human brain is still incomplete. To fill in the need, we designed a gene expression model that accurately defines the consistent blueprint of the brain transcriptome; thereby, identifying the core brain specific transcriptional processes conserved across individuals. Functionally characterizing this model would provide profound insights into the transcriptional landscape, biological pathways and the expression distribution of neurotransmitter systems. Here, in this dissertation we developed an expression model by capturing the similarly expressed gene patterns across congruently annotated brain structures in six individual brains by using data from the Allen Brain Atlas (ABA). We found that 84% of genes are expressed in at least one of the 190 brain structures. By employing hierarchical clustering we were able to show that distinct structures of a bigger brain region can cluster together while still retaining their expression identity. Further, weighted correlation network analysis identified 19 robust modules of coexpressing genes in the brain that demonstrated a wide range of functional associations. Since signatures of local phenomena can be masked by larger signatures, we performed local analysis on each distinct brain structure. Pathway and gene ontology enrichment analysis on these structures showed, striking enrichment for brain region specific processes. Besides, we also mapped the structural distribution of the gene expression profiles of genes associated with major neurotransmission systems in the human. We also postulated the utility of healthy brain tissue gene expression to predict potential genes involved in a neurological disorder, in the absence of data from diseased tissues. To this end, we developed a supervised classification model, which achieved an accuracy of 84% and an AUC (Area Under the Curve) of 0.81 from ROC plots, for predicting autism-implicated genes using the healthy expression model as the baseline. This study represents the first use of healthy brain gene expression to predict the scope of genes in autism implication and this generic methodology can be applied to predict genes involved in other neurological disorders

    Modular Algorithms for Biomolecular Network Alignment

    Get PDF
    Comparative analysis of biomolecular networks constructed using measurements from different conditions, tissues, and organisms offer a powerful approach to understanding the structure, function, dynamics, and evolution of complex biological systems. The rapidly advancing field of systems biology aims to understand the structure, function, dynamics, and evolution of complex biological systems in terms of the underlying networks of interactions among the large number of molecular participants involved including genes, proteins, and metabolites. In particular, the comparative analysis of network models representing biomolecular interactions in different species or tissues offers an important tool for identifying conserved modules, predicting functions of specific genes or proteins and studying the evolution of biological processes, among other applications. The primary focus of this dissertation is on the biomolecular network alignment problem: Given two or more network models, the problem is to optimally match the nodes and links in one network with the nodes and links of the other. The Biomolecular Network Alignment (BiNA) Toolkit developed as part of this dissertation provides a set of efficient (in terms of the running time complexity) and accurate (in terms of various evaluation criteria discussed in the literature) network alignment algorithms for biomolecular networks. BiNA is scalable, user-friendly, modular, and extensible for performing alignments on diverse types of biomolecular networks. The algorithm is applicable to (1) undirected graphs in their weighted and unweighted variations (2) undirected graphs in their labeled and unlabeled variations (3) and has been applied to align multiple networks from hundreds of nodes with a few thousand edges to networks with tens of thousands of nodes with millions of edges. The dissertation provides various applications of network comparison tools including how results from such alignments have been utilized to (1) construct phylogenetic trees based on protein-protein interaction networks, and (2) find biochemical pathways involved in ligand recognition in B cells

    In silico prediction of lncRNA function using tissue specific and evolutionary conserved expression

    Get PDF
    BACKGROUND: In recent years long non coding RNAs (lncRNAs) have been the subject of increasing interest. Thanks to many recent functional studies, the existence of a large class of lncRNAs with potential regulatory functions is now widely accepted. Although an increasing number of lncRNAs is being characterized and shown to be involved in many biological processes, the functions of the vast majority lncRNA genes is still unknown. Therefore computational methods able to take advantage of the increasing amount of publicly available data to predict lncRNA functions could be very useful. RESULTS: Since coding genes are much better annotated than lncRNAs, we attempted to project known functional information regarding proteins onto non coding genes using the guilt by association principle: if a gene shows an expression profile that correlates with those of a set of coding genes involved in a given function, that gene is probably involved in the same function. We computed gene coexpression for 30 human tissues and 9 vertebrates and mined the resulting networks with a methodology inspired by the rank product algorithm used to identify differentially expressed genes. Using different types of reference data we can predict putative new annotations for thousands of lncRNAs and proteins, ranging from cellular localization to relevance for disease and cancer. CONCLUSIONS: New function of coding genes and lncRNA can be profitably predicted using tissue specific coexpression, as well as expression of orthologous genes in different species. The data are available for download and through a user-friendly web interface at www.funcpred.com. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1535-x) contains supplementary material, which is available to authorized users. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1535-x) contains supplementary material, which is available to authorized users

    Meta-coexpression conservation analysis of microarray data: a "subset" approach provides insight into brain-derived neurotrophic factor regulation

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Alterations in brain-derived neurotrophic factor (<it>BDNF</it>) gene expression contribute to serious pathologies such as depression, epilepsy, cancer, Alzheimer's, Huntington and Parkinson's disease. Therefore, exploring the mechanisms of <it>BDNF </it>regulation represents a great clinical importance. Studying <it>BDNF </it>expression remains difficult due to its multiple neural activity-dependent and tissue-specific promoters. Thus, microarray data could provide insight into the regulation of this complex gene. Conventional microarray co-expression analysis is usually carried out by merging the datasets or by confirming the re-occurrence of significant correlations across datasets. However, co-expression patterns can be different under various conditions that are represented by subsets in a dataset. Therefore, assessing co-expression by measuring correlation coefficient across merged samples of a dataset or by merging datasets might not capture all correlation patterns.</p> <p>Results</p> <p>In our study, we performed meta-coexpression analysis of publicly available microarray data using <it>BDNF </it>as a "guide-gene" introducing a "subset" approach. The key steps of the analysis included: dividing datasets into subsets with biologically meaningful sample content (e.g. tissue, gender or disease state subsets); analyzing co-expression with the <it>BDNF </it>gene in each subset separately; and confirming co- expression links across subsets. Finally, we analyzed conservation in co-expression with <it>BDNF </it>between human, mouse and rat, and sought for conserved over-represented TFBSs in <it>BDNF </it>and BDNF-correlated genes. Correlated genes discovered in this study regulate nervous system development, and are associated with various types of cancer and neurological disorders. Also, several transcription factor identified here have been reported to regulate <it>BDNF </it>expression <it>in vitro </it>and <it>in vivo</it>.</p> <p>Conclusion</p> <p>The study demonstrates the potential of the "subset" approach in co-expression conservation analysis for studying the regulation of single genes and proposes novel regulators of <it>BDNF </it>gene expression.</p
    corecore