
    Conserved noncoding sequences highlight shared components of regulatory networks in dicotyledonous plants

    Conserved noncoding sequences (CNSs) in DNA are reliable pointers to regulatory elements controlling gene expression. Using a comparative genomics approach with four dicotyledonous plant species (Arabidopsis thaliana, papaya [Carica papaya], poplar [Populus trichocarpa], and grape [Vitis vinifera]), we detected hundreds of CNSs upstream of Arabidopsis genes. Distinct positioning, length, and enrichment for transcription factor binding sites suggest these CNSs play a functional role in transcriptional regulation. The enrichment of transcription factors within the set of genes associated with CNSs is consistent with the hypothesis that together they form part of a conserved transcriptional network whose function is to regulate other transcription factors and control development. We identified a set of promoters where regulatory mechanisms are likely to be shared between the model organism Arabidopsis and other dicots, providing areas of focus for further research.

    Global Network Alignment

    Motivation: High-throughput methods for detecting molecular interactions have led to a plethora of biological network data with much more yet to come, stimulating the development of techniques for biological network alignment. Analogous to sequence alignment, efficient and reliable network alignment methods will improve our understanding of biological systems. Network alignment is computationally hard. Hence, devising efficient network alignment heuristics is currently one of the foremost challenges in computational biology.

Results: We present a superior heuristic network alignment algorithm, called Matching-based GRAph ALigner (M-GRAAL), which can process and integrate any number and type of similarity measures between network nodes (e.g., proteins), including, but not limited to, any topological network similarity measure, sequence similarity, functional similarity, and structural similarity. This is efficient in resolving ties in similarity measures and in finding a combination of similarity measures yielding the largest biologically sound alignments. When used to align protein-protein interaction (PPI) networks of various species, M-GRAAL exposes the largest known functional and contiguous regions of network similarity. Hence, we use M-GRAAL’s alignments to predict functions of un-annotated proteins in yeast, human, and the bacteria _C. jejuni_ and _E. coli_. Furthermore, using M-GRAAL to compare PPI networks of different herpes viruses, we reconstruct their phylogenetic relationship, and our phylogenetic tree is the same as the sequence-based one.
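
The idea of integrating several node-similarity measures and then matching nodes one-to-one can be illustrated with a minimal sketch. This is not the M-GRAAL implementation: the abstract does not say how scores are combined or matched, so the weighted sum, the greedy matching step, the function names, and the toy scores below are all assumptions made for illustration.

```python
def combine_scores(measures, weights):
    """Weighted sum of similarity dictionaries keyed by (node_in_A, node_in_B)."""
    combined = {}
    for scores, weight in zip(measures, weights):
        for pair, value in scores.items():
            combined[pair] = combined.get(pair, 0.0) + weight * value
    return combined


def greedy_align(combined):
    """Greedy one-to-one matching: take the best remaining pair until none fit."""
    alignment = {}
    used_targets = set()
    # Highest score first; ties are broken by pair name so the result is deterministic.
    for (u, v), _ in sorted(combined.items(), key=lambda kv: (-kv[1], kv[0])):
        if u not in alignment and v not in used_targets:
            alignment[u] = v
            used_targets.add(v)
    return alignment


# Hypothetical scores between nodes of network A (a1, a2) and network B (b1, b2).
topological = {("a1", "b1"): 0.9, ("a1", "b2"): 0.2,
               ("a2", "b1"): 0.5, ("a2", "b2"): 0.5}
sequence = {("a1", "b1"): 0.4, ("a1", "b2"): 0.1,
            ("a2", "b1"): 0.1, ("a2", "b2"): 0.8}

alignment = greedy_align(combine_scores([topological, sequence], [0.5, 0.5]))
print(alignment)
```

In this toy input, topology alone cannot decide between b1 and b2 for node a2 (both score 0.5); adding the sequence score breaks that tie, which is the kind of tie resolution the abstract refers to.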

    Phylogenetic analysis of modularity in protein interaction networks

    Background: In systems biology, comparative analyses of molecular interactions across diverse species indicate that conservation and divergence of networks can be used to understand functional evolution from a systems perspective. A key characteristic of these networks is their modularity, which contributes significantly to their robustness, as well as adaptability. Consequently, analysis of modular network structures from a phylogenetic perspective may be useful in understanding the emergence, conservation, and diversification of functional modularity. Results: In this paper, we propose a phylogenetic framework for analyzing network modules, with applications that extend well beyond network-based phylogeny reconstruction. Our approach is based on identification of modular network components from each network separately, followed by projection of these modules onto the networks of other species to compare different networks. Subsequently, we use the conservation of various modules in each network to assess the similarity between different networks. Compared to traditional methods that rely on topological comparisons, our approach has key advantages in (i) avoiding intractable graph comparison problems in comparative network analysis, (ii) accounting for noise and missing data through flexible treatment of network conservation, and (iii) providing insights on the evolution of biological systems through investigation of the evolutionary trajectories of network modules. We test our method, MOPHY, on synthetic data generated by simulation of network evolution, as well as existing protein-protein interaction data for seven diverse species. Comprehensive experimental results show that MOPHY is promising in reconstructing evolutionary histories of extant networks based on conservation of modularity, is highly robust to noise, and outperforms existing methods that quantify network similarity in terms of conservation of network topology. Conclusion: These results establish modularity and network proximity as useful features in comparative network analysis and motivate detailed studies of the evolutionary histories of network modules.
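
Projecting a module onto another species' network and scoring its conservation can be sketched as follows. This is only an illustration under stated assumptions, not the authors' method: the ortholog map, the edge-fraction score, and every name in the code are hypothetical, and the abstract does not define how conservation is measured.

```python
def module_conservation(module_edges, ortholog_map, target_edges):
    """Fraction of a module's edges that survive projection onto the target network."""
    if not module_edges:
        return 0.0
    # Store target interactions as unordered pairs so edge direction does not matter.
    target = {frozenset(edge) for edge in target_edges}
    conserved = 0
    for u, v in module_edges:
        pu, pv = ortholog_map.get(u), ortholog_map.get(v)
        # An edge is conserved only if both endpoints project and the projections interact.
        if pu is not None and pv is not None and frozenset((pu, pv)) in target:
            conserved += 1
    return conserved / len(module_edges)


# Hypothetical module in species X, projected onto the network of species Y.
module_edges = [("x1", "x2"), ("x2", "x3"), ("x1", "x3")]
ortholog_map = {"x1": "y1", "x2": "y2", "x3": "y3"}
target_edges = [("y1", "y2"), ("y3", "y2")]

score = module_conservation(module_edges, ortholog_map, target_edges)
print(round(score, 3))
```

Scores of this kind, computed for many modules and species pairs, could then feed a distance-based tree-building step; that step is outside the scope of this sketch.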

    Ancient Pbx-Hox signatures define hundreds of vertebrate developmental enhancers

    Background: Gene regulation through cis-regulatory elements plays a crucial role in development and disease. A major aim of the post-genomic era is to be able to read the function of cis-regulatory elements through scrutiny of their DNA sequence. Whilst comparative genomics approaches have identified thousands of putative regulatory elements, our knowledge of their mechanism of action is poor and very little progress has been made in systematically de-coding them. Results: Here, we identify ancient functional signatures within vertebrate conserved non-coding elements (CNEs) through a combination of phylogenetic footprinting and functional assay, using genomic sequence from the sea lamprey as a reference. We uncover a striking enrichment within vertebrate CNEs for conserved binding-site motifs of the Pbx-Hox hetero-dimer. We further show that these predict reporter gene expression in a segment-specific manner in the hindbrain and pharyngeal arches during zebrafish development. Conclusions: These findings evoke an evolutionary scenario in which many CNEs evolved early in the vertebrate lineage to co-ordinate Hox-dependent gene-regulatory interactions that pattern the vertebrate head. In a broader context, our evolutionary analyses reveal that CNEs are composed of tightly linked transcription-factor binding-sites (TFBSs), which can be systematically identified through phylogenetic footprinting approaches. By placing a large number of ancient vertebrate CNEs into a developmental context, our findings promise to have a significant impact on efforts toward de-coding gene-regulatory elements that underlie vertebrate development, and will facilitate building general models of regulatory element evolution.

    Exploring the function and evolution of proteins using domain families

    Proteins are frequently composed of multiple domains which fold independently. These are often evolutionarily distinct units which can be adapted and reused in other proteins. The classification of protein domains into evolutionary families facilitates the study of their evolution and function. In this thesis such classifications are used firstly to examine methods for identifying evolutionary relationships (homology) between protein domains. Secondly a specific approach for predicting their function is developed. Lastly they are used in studying the evolution of protein complexes. Tools for identifying evolutionary relationships between proteins are central to computational biology. They aid in classifying families of proteins, giving clues about the function of proteins and the study of molecular evolution. The first chapter of this thesis concerns the effectiveness of cutting-edge methods in identifying evolutionary relationships between protein domains. The identification of evolutionary relationships between proteins can give clues as to their function. The second chapter of this thesis concerns the development of a method to identify proteins involved in the same biological process. This method is based on the concept of domain fusion, whereby pairs of proteins from one organism with a concerted function are sometimes found fused into single proteins in a different organism. Using protein domain classifications it is possible to identify these relationships. Most proteins do not act in isolation but carry out their function by binding to other proteins in complexes; little is understood about the evolution of such complexes. In the third chapter of this thesis the evolution of complexes is examined in two representative model organisms using protein domain families. In this work, protein domain superfamilies allow distantly related parts of complexes to be identified in order to determine how homologous units are reused.
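
The domain fusion idea described above can be sketched in a few lines. This is a simplified illustration, not the method developed in the thesis: the protein names, the family identifiers, and the rule for skipping pairs that already share a family are assumptions.

```python
from itertools import combinations


def fusion_linked_pairs(query_proteins, reference_proteins):
    """Pairs of query proteins whose domain families co-occur in one reference protein."""
    linked = set()
    for name_a, name_b in combinations(sorted(query_proteins), 2):
        doms_a, doms_b = query_proteins[name_a], query_proteins[name_b]
        # Skip pairs that already share a family; a shared family would match trivially.
        if doms_a & doms_b:
            continue
        for ref_doms in reference_proteins.values():
            # A candidate fusion protein carries at least one family from each partner.
            if doms_a & ref_doms and doms_b & ref_doms:
                linked.add((name_a, name_b))
                break
    return linked


# Hypothetical domain-family assignments for two organisms.
query = {"P1": {"famA"}, "P2": {"famB"}, "P3": {"famC"}}
reference = {"R1": {"famA", "famB"}, "R2": {"famC"}}

pairs = fusion_linked_pairs(query, reference)
print(pairs)
```

Here P1 and P2 are linked because their families appear together in the single reference protein R1, suggesting that they may act in the same biological process; P3 has no fusion partner.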

    12-h clock regulation of genetic information flow by XBP1s

    © The Author(s), 2020. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Pan, Y., Ballance, H., Meng, H., Gonzalez, N., Kim, S., Abdurehman, L., York, B., Chen, X., Schnytzer, Y., Levy, O., Dacso, C. C., McClung, C. A., O'Malley, B. W., Liu, S., & Zhu, B. 12-h clock regulation of genetic information flow by XBP1s. Plos Biology, 18(1), (2020): e3000580, doi:10.1371/journal.pbio.3000580. Our group recently characterized a cell-autonomous mammalian 12-h clock independent from the circadian clock, but its function and mechanism of regulation remain poorly understood. Here, we show that in mouse liver, transcriptional regulation significantly contributes to the establishment of 12-h rhythms of mRNA expression in a manner dependent on Spliced Form of X-box Binding Protein 1 (XBP1s). Mechanistically, the motif stringency of XBP1s promoter binding sites dictates XBP1s’s ability to drive 12-h rhythms of nascent mRNA transcription at dawn and dusk, which are enriched for basal transcription regulation, mRNA processing and export, ribosome biogenesis, translation initiation, and protein processing/sorting in the Endoplasmic Reticulum (ER)-Golgi in a temporal order consistent with the progressive molecular processing sequence described by the central dogma information flow (CEDIF). We further identified GA-binding proteins (GABPs) as putative novel transcriptional regulators driving 12-h rhythms of gene expression with more diverse phases. These 12-h rhythms of gene expression are cell autonomous and evolutionarily conserved in marine animals possessing a circatidal clock. Our results demonstrate an evolutionarily conserved, intricate network of transcriptional control of the mammalian 12-h clock that mediates diverse biological pathways.
We speculate that the 12-h clock is coopted to accommodate elevated gene expression and processing in mammals at the two rush hours, with the particular genes processed at each rush hour regulated by the circadian and/or tissue-specific pathways. This study was supported by the American Diabetes Association junior faculty development award 1-18-JDF-025 to B.Z., by funding from National Institute of Health HD07879 and 1P01DK113954 to B.W.O, by funding from National Science Foundation award 1703170 to C.C.D. and B.Z., and by funding from Brockman Foundation to C.C.D and B.W.O. This work was further supported by the UPMC Genome Center with funding from UPMC’s Immunotherapy and Transplant Center. This research was supported in part by the University of Pittsburgh Center for Research Computing through the resources provided. Research reported in this publication was further supported by the National Institute of Diabetes And Digestive And Kidney Diseases of the National Institutes of Health under award number P30DK120531 to Pittsburgh Liver Research Center, in which both S.L. and B.Z. are members. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.