Search CORE

22 research outputs found

Recommended from our members

A Bioconductor workflow for processing and analysing spatial proteomics data

Author: Breckels Lisa M
Gatto Laurent
Lilley Kathryn S
Mulvey Claire M
Publication venue: F1000Research
Publication date: 28/12/2016
Field of study

Spatial proteomics is the systematic study of protein sub-cellular localisation. In this workflow, we describe the analysis of a typical quantitative mass spectrometry-based spatial proteomics experiment using the MSnbase and pRoloc Bioconductor package suite. To walk the user through the computational pipeline, we use a recently published experiment predicting protein sub-cellular localisation in pluripotent embryonic mouse stem cells. We describe the software infrastructure at hand, importing and processing data, quality control, sub-cellular marker definition, visualisation and interactive exploration. We then demonstrate the application and interpretation of statistical learning methods, including novelty detection using semi-supervised learning, classification, clustering and transfer learning and conclude the pipeline with data export. The workflow is aimed at beginners who are familiar with proteomics in general and spatial proteomics in particular.LMB and CMM are supported by a Wellcome Trust Technology Development Grant (grant number 108441/Z/15/Z). KSL is a Wellcome Trust Joint Investigator (110170/Z/15/Z). LG is supported by the BBSRC Strategic Longer and Larger grant (Award BB/L002817/1)

Apollo (Cambridge)

A Bioconductor workflow for processing and analysing spatial proteomics data [version 2; referees: 2 approved]

Author: Claire M. Mulvey
Kathryn S. Lilley
Laurent Gatto
Lisa M. Breckels
Publication venue: 'F1000 Research Ltd'
Publication date: 01/07/2018
Field of study

Directory of Open Access Journals

A Bayesian mixture modelling approach for spatial proteomics.

Author: Crook Oliver M
Gatto Laurent
Kirk Paul DW
Lilley Kathryn S
Mulvey Claire M
Publication venue: PLoS Comput Biol
Publication date: 01/01/2018
Field of study

Analysis of the spatial sub-cellular distribution of proteins is of vital importance to fully understand context specific protein function. Some proteins can be found with a single location within a cell, but up to half of proteins may reside in multiple locations, can dynamically re-localise, or reside within an unknown functional compartment. These considerations lead to uncertainty in associating a protein to a single location. Currently, mass spectrometry (MS) based spatial proteomics relies on supervised machine learning algorithms to assign proteins to sub-cellular locations based on common gradient profiles. However, such methods fail to quantify uncertainty associated with sub-cellular class assignment. Here we reformulate the framework on which we perform statistical analysis. We propose a Bayesian generative classifier based on Gaussian mixture models to assign proteins probabilistically to sub-cellular niches, thus proteins have a probability distribution over sub-cellular locations, with Bayesian computation performed using the expectation-maximisation (EM) algorithm, as well as Markov-chain Monte-Carlo (MCMC). Our methodology allows proteome-wide uncertainty quantification, thus adding a further layer to the analysis of spatial proteomics. Our framework is flexible, allowing many different systems to be analysed and reveals new modelling opportunities for spatial proteomics. We find our methods perform competitively with current state-of-the art machine learning methods, whilst simultaneously providing more information. We highlight several examples where classification based on the support vector machine is unable to make any conclusions, while uncertainty quantification using our approach provides biologically intriguing results. To our knowledge this is the first Bayesian model of MS-based spatial proteomics data.LG was supported by the BBSRC Strategic Longer and Larger grant (Award BB/L002817/1) and the Wellcome Trust Senior Investigator Award 110170/Z/15/Z awarded to KSL. PDWK was supported by the MRC (project reference MC_UP_0801/1). CMM was supported by a Wellcome Trust Technology Development Grant (Grant number 108467/Z/15/Z). OMC is a Wellcome Trust Mathematical Genomics and Medicine student supported financially by the School of Clinical Medicine, University of Cambridge. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript

Directory of Open Access Journals

Apollo (Cambridge)

DIAL UCLouvain

FigShare

Learning from Heterogeneous Data Sources: An Application in Spatial Proteomics.

Author: Breckels Lisa M
Christoforou Andy
Gatto Laurent
Groen Arnoud
Holden Sean B
Kohlbacher Oliver
Lilley Kathryn S
Mulvey Claire M
Trotter Matthew WB
Wojnar David
Publication venue: PLoS Comput Biol
Publication date: 01/01/2016
Field of study

Sub-cellular localisation of proteins is an essential post-translational regulatory mechanism that can be assayed using high-throughput mass spectrometry (MS). These MS-based spatial proteomics experiments enable us to pinpoint the sub-cellular distribution of thousands of proteins in a specific system under controlled conditions. Recent advances in high-throughput MS methods have yielded a plethora of experimental spatial proteomics data for the cell biology community. Yet, there are many third-party data sources, such as immunofluorescence microscopy or protein annotations and sequences, which represent a rich and vast source of complementary information. We present a unique transfer learning classification framework that utilises a nearest-neighbour or support vector machine system, to integrate heterogeneous data sources to considerably improve on the quantity and quality of sub-cellular protein assignment. We demonstrate the utility of our algorithms through evaluation of five experimental datasets, from four different species in conjunction with four different auxiliary data sources to classify proteins to tens of sub-cellular compartments with high generalisation accuracy. We further apply the method to an experiment on pluripotent mouse embryonic stem cells to classify a set of previously unknown proteins, and validate our findings against a recent high resolution map of the mouse stem cell proteome. The methodology is distributed as part of the open-source Bioconductor pRoloc suite for spatial proteomics data analysis.LMB was supported by a BBSRC Tools and Resources Development Fund (Award BB/K00137X/1) and a Wellcome Trust Technology Development Grant (108441/Z/15/Z). LG was supported by the European Union 7th Framework Program (PRIME-XS project, grant agreement number 262067) and a BBSRC Strategic Longer and Larger Award (Award BB/L002817/1). DW and OK acknowledge funding from the European Union (PRIME-XS, GA 262067) and Deutsche Forschungsgemeinschaft (KO-2313/6-1).This is the final version of the article. It first appeared from PLOS via https://doi.org/10.1371/journal.pcbi.100492

Directory of Open Access Journals

Recommended from our members

Spatial proteomics defines the content of trafficking vesicles captured by golgin tethers.

Author: Borgeaud Alicia C
Breckels Lisa M
Cattin-Ortolá Jérôme
Chadwick Jessica
Crook Oliver M
Gillingham Alison K
Lilley Kathryn S
Munro Sean
Peak-Chew Sew Y
Shin John JH
Publication venue: Nat Commun
Publication date: 25/11/2020
Field of study

Intracellular traffic between compartments of the secretory and endocytic pathways is mediated by vesicle-based carriers. The proteomes of carriers destined for many organelles are ill-defined because the vesicular intermediates are transient, low-abundance and difficult to purify. Here, we combine vesicle relocalisation with organelle proteomics and Bayesian analysis to define the content of different endosome-derived vesicles destined for the trans-Golgi network (TGN). The golgin coiled-coil proteins golgin-97 and GCC88, shown previously to capture endosome-derived vesicles at the TGN, were individually relocalised to mitochondria and the content of the subsequently re-routed vesicles was determined by organelle proteomics. Our findings reveal 45 integral and 51 peripheral membrane proteins re-routed by golgin-97, evidence for a distinct class of vesicles shared by golgin-97 and GCC88, and various cargoes specific to individual golgins. These results illustrate a general strategy for analysing intracellular sub-proteomes by combining acute cellular re-wiring with high-resolution spatial proteomics

Apollo (Cambridge)

Spatiotemporal proteomic profiling of the pro-inflammatory response to lipopolysaccharide in the THP-1 human leukaemia cell line.

Author: Breckels Lisa M
Britovšek Nina Kočevar
Christoforou Andy
Crook Oliver M
Deery Michael J
Gatto Laurent
Geladaki Aikaterini
Hurrell Tracey
Lilley Kathryn S
Mulvey Claire M
Ribeiro Andre L R
Sanders David J
Smith Andrew M
Publication venue: Nature communications
Publication date: 01/01/2021
Field of study

Protein localisation and translocation between intracellular compartments underlie almost all physiological processes. The hyperLOPIT proteomics platform combines mass spectrometry with state-of-the-art machine learning to map the subcellular location of thousands of proteins simultaneously. We combine global proteome analysis with hyperLOPIT in a fully Bayesian framework to elucidate spatiotemporal proteomic changes during a lipopolysaccharide (LPS)-induced inflammatory response. We report a highly dynamic proteome in terms of both protein abundance and subcellular localisation, with alterations in the interferon response, endo-lysosomal system, plasma membrane reorganisation and cell migration. Proteins not previously associated with an LPS response were found to relocalise upon stimulation, the functional consequences of which are still unclear. By quantifying proteome-wide uncertainty through Bayesian modelling, a necessary role for protein relocalisation and the importance of taking a holistic overview of the LPS-driven immune response has been revealed. The data are showcased as an interactive application freely available for the scientific community

UCL Discovery

Apollo (Cambridge)

DIAL UCLouvain

A foundation for reliable spatial proteomics data analysis.

Author: Breckels Lisa M
Burger Thomas
Campbell Callum
Christoforou Andy
Ferro Myriam
Gatto Laurent
Groen Arnoud J
Lilley Kathryn S
Mulvey Claire M
Nightingale Daniel JH
Nikolovski Nino
Publication venue: Mol Cell Proteomics
Publication date: 20/05/2014
Field of study

Quantitative mass-spectrometry-based spatial proteomics involves elaborate, expensive, and time-consuming experimental procedures, and considerable effort is invested in the generation of such data. Multiple research groups have described a variety of approaches for establishing high-quality proteome-wide datasets. However, data analysis is as critical as data production for reliable and insightful biological interpretation, and no consistent and robust solutions have been offered to the community so far. Here, we introduce the requirements for rigorous spatial proteomics data analysis, as well as the statistical machine learning methodologies needed to address them, including supervised and semi-supervised machine learning, clustering, and novelty detection. We present freely available software solutions that implement innovative state-of-the-art analysis pipelines and illustrate the use of these tools through several case studies involving multiple organisms, experimental designs, mass spectrometry platforms, and quantitation techniques. We also propose sound analysis strategies for identifying dynamic changes in subcellular localization by comparing and contrasting data describing different biological conditions. We conclude by discussing future needs and developments in spatial proteomics data analysis..G., C.M.M., and M.F. were supported by the European Union 7th Framework Program (PRIME-XS Project, Grant No. 262067). L.M.B. was supported by a BBSRC Tools and Resources Development Fund (Award No. BB/K00137X/1). T.B. was supported by the Proteomics French Infrastructure (ProFI, ANR-10-INBS-08). A.C. was supported by BBSRC Grant No. BB/D526088/1. A.J.G. was supported by BBSRC Grant No. BB/E024777/ and a generous gift from King Abdullah University for Science and Technology, Saudi Arabia. D.J.N.H. was supported by a BBSRC CASE studentship (BB/I016147/1)

ZENODO

PubMed Central

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Apollo (Cambridge)

HAL-CEA

Visualization of proteomics data using R and bioconductor.

Author: Adler D.
Adler D.
Balcome S.
Bemis K. D.
Carey V.
Carlson M.
Chang W.
Chen H.
Claesen J.
Csardi G.
Futschik M.
Galili T.
Gatto L.
Gatto L.
Gatto L.
Gatto L.
Gatto L.
Gatto L.
Gentleman R.
Gentleman R.
Gregori J.
Gregori J.
Hahne F.
Hansen K. D.
Husson F.
Leisch F.
Li X.
Murrell P.
Naake T.
Nyakas A.
Panse C.
Pedersen T. L.
Ploner A.
R Core Team 2014 R: A Language and Environment for Statistical Computing
Sauteraud R.
Shannon P.
Tarca A. L.
Turewicz M.
Warnes G. R.
Wei T.
Wen B.
Wilkinson L.
Xie Y.
Xie Y.
Publication venue: Proteomics
Publication date: 01/04/2015
Field of study

Data visualization plays a key role in high-throughput biology. It is an essential tool for data exploration allowing to shed light on data structure and patterns of interest. Visualization is also of paramount importance as a form of communicating data to a broad audience. Here, we provided a short overview of the application of the R software to the visualization of proteomics data. We present a summary of R's plotting systems and how they are used to visualize and understand raw and processed MS-based proteomics data.LG was supported by the European Union 7th Framework Program (PRIME-XS project, grant agreement number 262067) and a BBSRC Strategic Longer and Larger grant (Award BB/L002817/1). LMB was supported by a BBSRC Tools and Resources Development Fund (Award BB/K00137X/1). TN was supported by a ERASMUS Placement scholarship.This is the final published version of the article. It was originally published in Proteomics (PROTEOMICS Special Issue: Proteomics Data Visualisation Volume 15, Issue 8, pages 1375–1389, April 2015. DOI: 10.1002/pmic.201400392). The final version is available at http://onlinelibrary.wiley.com/doi/10.1002/pmic.201400392/abstract

Crossref

PubMed Central

Apollo (Cambridge)

Dynamic proteomic profiling of extra-embryonic endoderm differentiation in mouse embryonic stem cells

Author: Mulvey Claire M
Schröter Christian
Gatto Laurent
Dikicioglu Duygu
Fidaner Isik Baris
Christoforou Andy
Deery Michael
Cho Lily TY
Niakan Kathy K
Martinez-Arias Alfonso
Lilley Kathryn
Publication venue: Stem Cells
Publication date: 01/01/2013
Field of study

During mammalian pre-implantation development, the cells of the blastocyst’s inner cell mass differentiate into the epiblast and primitive endoderm lineages, which give rise to the fetus and extra-embryonic tissues, respectively. Extra-embryonic endoderm differentiation can be modeled in vitro by induced expression of GATA transcription factors in mouse embryonic stem cells. Here we use this GATA-inducible system to quantitatively monitor the dynamics of global proteomic changes during the early stages of this differentiation event and also investigate the fully differentiated phenotype, as represented by embryo-derived extra-embryonic endoderm (XEN) cells. Using mass spectrometry-based quantitative proteomic profiling with multivariate data analysis tools, we reproducibly quantified 2,336 proteins across three biological replicates and have identified clusters of proteins characterized by distinct, dynamic temporal abundance profiles. We first used this approach to highlight novel marker candidates of the pluripotent state and extra-embryonic endoderm differentiation. Through functional annotation enrichment analysis, we have shown that the downregulation of chromatin-modifying enzymes, the re-organization of membrane trafficking machinery and the breakdown of cell-cell adhesion are successive steps of the extra-embryonic differentiation process. Thus, applying a range of sophisticated clustering approaches to a time-resolved proteomic dataset has allowed the elucidation of complex biological processes which characterize stem cell differentiation and could establish a general paradigm for the investigation of these processes.This work was supported by the European Union 7th Framework Program (PRIME-XS project grant number 262067 to K.S.L., L.G and C.M.M), the Biotechnology and Biological Sciences Research Council (BBSRC grant number BB/L002817/1 to K.S.L and L.G.), as well as a HFSP grant (RGP0029/2010) and a European Research Council (ERC) Advanced Investigator grant to A.M.A.. C.S was supported by an EMBO long term fellowship and a Marie Curie IEF. L.T.Y.C. and K.K.N. were supported by the Medical Research Council (MRC, UK, MC_UP_1202/9) and the March of Dimes Foundation (FY11-436). We also thank Professor Steve Oliver and Dr. A.K.Hadjantonakis for helpful discussions and advice.This is the author accepted manuscript. The final version is available from Wiley via http://dx.doi.org/10.1002/stem.206

Publikationer från KTH

Crossref

ZENODO

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Apollo (Cambridge)