72 research outputs found

    Bridging topological and functional information in protein interaction networks by short loops profiling

    Get PDF
    Protein-protein interaction networks (PPINs) have been employed to identify potential novel interconnections between proteins as well as crucial cellular functions. In this study we identify fundamental principles of PPIN topologies by analysing network motifs of short loops, which are small cyclic interactions of between 3 and 6 proteins. We compared 30 PPINs with corresponding randomised null models and examined the occurrence of common biological functions in loops extracted from a cross-validated high-confidence dataset of 622 human protein complexes. We demonstrate that loops are an intrinsic feature of PPINs and that specific cell functions are predominantly performed by loops of different lengths. Topologically, we find that loops are strongly related to the accuracy of PPINs and define a core of interactions with high resilience. The identification of this core and the analysis of loop composition are promising tools to assess PPIN quality and to uncover possible biases from experimental detection methods. More than 96% of loops share at least one biological function, with enrichment of cellular functions related to mRNA metabolic processing and the cell cycle. Our analyses suggest that these motifs can be used in the design of targeted experiments for functional phenotype detection.This research was supported by the Biotechnology and Biological Sciences Research Council (BB/H018409/1 to AP, ACCC and FF, and BB/J016284/1 to NSBT) and by the Leukaemia & Lymphoma Research (to NSBT and FF). SSC is funded by a Leukaemia & Lymphoma Research Gordon Piller PhD Studentship

    Nano Random Forests to mine protein complexes and their relationships in quantitative proteomics data

    Get PDF
    Ever-increasing numbers of quantitative proteomics data sets constitute an underexploited resource for investigating protein function. Multiprotein complexes often follow consistent trends in these experiments, which could provide insights about their biology. Yet, as more experiments are considered, a complex’s signature may become conditional and less identifiable. Previously we successfully distinguished the general proteomic signature of genuine chromosomal proteins from hitchhikers using the Random Forests (RF) machine learning algorithm. Here we test whether small protein complexes can define distinguishable signatures of their own, despite the assumption that machine learning needs large training sets. We show, with simulated and real proteomics data, that RF can detect small protein complexes and relationships between them. We identify several complexes in quantitative proteomics results of wild-type and knockout mitotic chromosomes. Other proteins covary strongly with these complexes, suggesting novel functional links for later study. Integrating the RF analysis for several complexes reveals known interdependences among kinetochore subunits and a novel dependence between the inner kinetochore and condensin. Ribosomal proteins, although identified, remained independent of kinetochore subcomplexes. Together these results show that this complex-oriented RF (NanoRF) approach can integrate proteomics data to uncover subtle protein relationships. Our NanoRF pipeline is available online

    Identification of DHX9 as a cell cycle regulated nucleolar recruitment factor for CIZ1

    Get PDF
    CIP1-interacting zinc finger protein 1 (CIZ1) is a nuclear matrix associated protein that facilitates a number of nuclear functions including initiation of DNA replication, epigenetic maintenance and associates with the inactive X-chromosome. Here, to gain more insight into the protein networks that underpin this diverse functionality, molecular panning and mass spectrometry are used to identify protein interaction partners of CIZ1, and CIZ1 replication domain (CIZ1-RD). STRING analysis of CIZ1 interaction partners identified 2 functional clusters: ribosomal subunits and nucleolar proteins including the DEAD box helicases, DHX9, DDX5 and DDX17. DHX9 shares common functions with CIZ1, including interaction with XIST long-non-coding RNA, epigenetic maintenance and regulation of DNA replication. Functional characterisation of the CIZ1-DHX9 complex showed that CIZ1-DHX9 interact in vitro and dynamically colocalise within the nucleolus from early to mid S-phase. CIZ1-DHX9 nucleolar colocalisation is dependent upon RNA polymerase I activity and is abolished by depletion of DHX9. In addition, depletion of DHX9 reduced cell cycle progression from G1 to S-phase in mouse fibroblasts. The data suggest that DHX9-CIZ1 are required for efficient cell cycle progression at the G1/S transition and that nucleolar recruitment is integral to their mechanism of action

    Rationalising the role of Keratin 9 as a biomarker for Alzheimer’s disease

    Get PDF
    Keratin 9 was recently identified as an important component of a biomarker panel which demonstrated a high diagnostic accuracy (87%) for Alzheimer’s disease (AD). Understanding how a protein which is predominantly expressed in palmoplantar epidermis is implicated in AD may shed new light on the mechanisms underlying the disease. Here we use immunoassays to examine blood plasma expression patterns of Keratin 9 and its relationship to other AD-associated proteins. We correlate this with the use of an in silico analysis tool VisANT to elucidate possible pathways through which the involvement of Keratin 9 may take place. We identify possible links with Dickkopf-1, a negative regulator of the wnt pathway, and propose that the abnormal expression of Keratin 9 in AD blood and cerebrospinal fluid may be a result of blood brain barrier dysregulation and disruption of the ubiquitin proteasome system. Our findings suggest that dysregulated Keratin 9 expression is a consequence of AD pathology but, as it interacts with a broad range of proteins, it may have other, as yet uncharacterized, downstream effects which could contribute to AD onset and progression

    Co-regulation map of the human proteome enables identification of protein functions

    Get PDF
    This is the author accepted manuscript. The final version is available from Nature Research via the DOI in this recordData availability: All mass spectrometry raw files generated in-house have been deposited in the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository36 with the dataset identifier PXD008888. The co-regulation map is hosted on our website at www.proteomeHD.net, and pair-wise co-regulation scores are available through STRING (https://string-db.org). A network of the top 0.5% co-regulated protein pairs can be explored interactively on NDEx (https://doi.org/10.18119/N9N30Q).Code availability: Data analysis was performed in R 3.5.1. R scripts and input files required to reproduce the results of this manuscript are available in the following GitHub repository: https://github.com/Rappsilber-Laboratory/ProteomeHD. R scripts related specifically to the benchmarking of the treeClust algorithm using synthetic data are available in the following GitHub repository: https://github.com/Rappsilber-Laboratory/treeClust-benchmarking. The R package data.table was used for fast data processing. Figures were prepared using ggplot2, gridExtra, cowplot and viridis.Note that the title of the AAM is different from the published versionThe annotation of protein function is a longstanding challenge of cell biology that suffers from the sheer magnitude of the task. Here we present ProteomeHD, which documents the response of 10,323 human proteins to 294 biological perturbations, measured by isotope-labelling mass spectrometry. We reveal functional associations between human proteins using the treeClust machine learning algorithm, which we show to improve protein co-regulation analysis due to robust selectivity for close linear relationships. Our co-regulation map identifies a functional context for many uncharacterized proteins, including microproteins that are difficult to study with traditional methods. Co-regulation also captures relationships between proteins which do not physically interact or co-localize. For example, co-regulation of the peroxisomal membrane protein PEX11β with mitochondrial respiration factors led us to discover a novel organelle interface between peroxisomes and mitochondria in mammalian cells. The co-regulation map can be explored at www.proteomeHD.net .Biotechnology & Biological Sciences Research Council (BBSRC)European Commissio

    Primate-specific endogenous retrovirus-driven transcription defines naive-like stem cells

    Get PDF
    Naive embryonic stem cells hold great promise for research and therapeutics as they have broad and robust developmental potential. While such cells are readily derived from mouse blastocysts it has not been possible to isolate human equivalents easily, although human naive-like cells have been artificially generated (rather than extracted) by coercion of human primed embryonic stem cells by modifying culture conditions or through transgenic modification. Here we show that a sub-population within cultures of human embryonic stem cells (hESCs) and induced pluripotent stem cells (hiPSCs) manifests key properties of naive state cells. These naive-like cells can be genetically tagged, and are associated with elevated transcription of HERVH, a primate-specific endogenous retrovirus. HERVH elements provide functional binding sites for a combination of naive pluripotency transcription factors, including LBP9, recently recognized as relevant to naivety in mice. LBP9-HERVH drives hESC-specific alternative and chimaeric transcripts, including pluripotency-modulating long non-coding RNAs. Disruption of LBP9, HERVH and HERVH-derived transcripts compromises self-renewal. These observations define HERVH expression as a hallmark of naive-like hESCs, and establish novel primate-specific transcriptional circuitry regulating pluripotency
    corecore