Search CORE

6,652 research outputs found

Prediction of peptides observable by mass spectrometry applied at the experimental set level

Author: B Kuster
B Nanduri
B Zhang
Bindu Nanduri
E Gasteiger
E Richard
Fiona M McCarthy
FM McCarthy
H Hernandez
IH Witten
JJ Buza
MP Washburn
P Lu
P Mallick
R Aebersold
S Kawashima
SC Burgess
SC Burgess
Shane C Burgess
Susan M Bridges
TD Veenstra
William S Sanders
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Machine learning and mapping algorithms applied to proteomics problems

Author: Sanders William Shane
Publication venue: Scholars Junction
Publication date: 30/04/2011
Field of study

Proteins provide evidence that a given gene is expressed, and machine learning algorithms can be applied to various proteomics problems in order to gain information about the underlying biology. This dissertation applies machine learning algorithms to proteomics data in order to predict whether or not a given peptide is observable by mass spectrometry, whether a given peptide can serve as a cell penetrating peptide, and then utilizes the peptides observed through mass spectrometry to aid in the structural annotation of the chicken genome. Peptides observed by mass spectrometry are used to identify proteins, and being able to accurately predict which peptides will be seen can allow researchers to analyze to what extent a given protein is observable. Cell penetrating peptides can possibly be utilized to allow targeted small molecule delivery across cellular membranes and possibly serve a role as drug delivery peptides. Peptides and proteins identified through mass spectrometry can help refine computational gene models and improve structural genome annotations

Scholars Junction - Mississippi State University Institutional Repository

MRM screening/biomarker discovery with linear ion trap MS: a library of human cancer-specific peptides

Author: Lazar Iulia M
Yang Xu
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background The discovery of novel protein biomarkers is essential in the clinical setting to enable early disease diagnosis and increase survivability rates. To facilitate differential expression analysis and biomarker discovery, a variety of tandem mass spectrometry (MS/MS)-based protein profiling techniques have been developed. For achieving sensitive detection and accurate quantitation, targeted MS screening approaches, such as multiple reaction monitoring (MRM), have been implemented. Methods MCF-7 breast cancer protein cellular extracts were analyzed by 2D-strong cation exchange (SCX)/reversed phase liquid chromatography (RPLC) separations interfaced to linear ion trap MS detection. MS data were interpreted with the Sequest-based Bioworks software (Thermo Electron). In-house developed Perl-scripts were used to calculate the spectral counts and the representative fragment ions for each peptide. Results In this work, we report on the generation of a library of 9,677 peptides (p < 0.001), representing ~1,572 proteins from human breast cancer cells, that can be used for MRM/MS-based biomarker screening studies. For each protein, the library provides the number and sequence of detectable peptides, the charge state, the spectral count, the molecular weight, the parameters that characterize the quality of the tandem mass spectrum (p-value, DeltaM, Xcorr, DeltaCn, Sp, no. of matching <it>a</it>, <it>b</it>, <it>y </it>ions in the spectrum), the retention time, and the top 10 most intense product ions that correspond to a given peptide. Only proteins identified by at least two spectral counts are listed. The experimental distribution of protein frequencies, as a function of molecular weight, closely matched the theoretical distribution of proteins in the human proteome, as provided in the SwissProt database. The amino acid sequence coverage of the identified proteins ranged from 0.04% to 98.3%. The highest-abundance proteins in the cellular extract had a molecular weight (MW)<50,000. Conclusion Preliminary experiments have demonstrated that putative biomarkers, that are not detectable by conventional data dependent MS acquisition methods in complex un-fractionated samples, can be reliable identified with the information provided in this library. Based on the spectral count, the quality of a tandem mass spectrum and the m/z values for a parent peptide and its most abundant daughter ions, MRM conditions can be selected to enable the detection of target peptides and proteins.</p

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Mass Spectrometry-Based Approaches Toward Absolute Quantitative Proteomics

Author: Ito Takashi
Kito Keiji
Publication venue: Bentham Science Publishers Ltd.
Publication date
Field of study

Mass spectrometry has served as a major tool for the discipline of proteomics to catalogue proteins in an unprecedented scale. With chemical and metabolic techniques for stable isotope labeling developed over the past decade, it is now routinely used as a method for relative quantification to provide valuable information on alteration of protein abundance in a proteome-wide scale. More recently, absolute or stoichiometric quantification of proteome is becoming feasible, in particular, with the development of strategies with isotope-labeled standards composed of concatenated peptides. On the other hand, remarkable progress has been also made in label-free quantification methods based on the number of identified peptides. Here we review these mass spectrometry-based approaches for absolute quantification of proteome and discuss their implications

Crossref

PubMed Central

The steady-state repertoire of human SCF Ubiquitin ligase complexes does not require ongoing Nedd8 conjugation

Author: Cronan
Geoffrey T. Smith
J. Eugene Lee
Michael J. Sweredoski
Natalie J. Kolawa
Raymond J. Deshaies
Robert L. J. Graham
Sonja Hess
Publication venue: 'American Society for Biochemistry & Molecular Biology (ASBMB)'
Publication date: 01/05/2011
Field of study

The human genome encodes 69 different F-box proteins (FBPs), each of which can potentially assemble with Skp1-Cul1-RING to serve as the substrate specificity subunit of an SCF ubiquitin ligase complex. SCF activity is switched on by conjugation of the ubiquitin- like protein Nedd8 to Cul1. Cycles of Nedd8 conjugation and deconjugation acting in conjunction with the Cul1-sequestering factor Cand1 are thought to control dynamic cycles of SCF assembly and disassembly, which would enable a dynamic equilibrium between the Cul1- RING catalytic core of SCF and the cellular repertoire of FBPs. To test this hypothesis, we determined the cellular composition of SCF complexes and evaluated the impact of Nedd8 conjugation on this steady-state. At least 42 FBPs assembled with Cul1 in HEK 293 cells, and the levels of Cul1-bound FBPs varied by over two orders of magnitude. Unexpectedly, quantitative mass spectrometry revealed that blockade of Nedd8 conjugation led to a modest increase, rather than a decrease, in the overall level of most SCF complexes. We suggest that multiple mechanisms including FBP dissociation and turnover cooperate to maintain the cellular pool of SCF ubiquitin ligases

Queen's University Belfast Research Portal

Crossref

PubMed Central

Caltech Authors

The University of Manchester - Institutional Repository

Current challenges in software solutions for mass spectrometry-based quantitative proteomics

Author: A Alexandridou
A Gruhler
A Leitner
A Michalski
A Panchaud
A Thompson
A Wolf-Yadlin
AD Polpitiya
AHP America
AI Nesvizhskii
AI Nesvizhskii
AI Nesvizhskii
AJR Heck
Albert J. R. Heck
AM Mayampurath
B Breukelen van
B Carrillo
B Ma
B Macek
B Schwanhäusser
B Zybailov
BAP Roxas
Bas van Breukelen
BO Keller
C Christin
C Ji
C Kumar
CH Becker
CK Frese
C–C Tsou
D Chelius
D Hoof Van
D MacDougall
D Tsur
D Valkenborg
DH Lundgren
DK Han
DL Swaney
DL Tabb
DL Tabb
DM Good
DN Perkins
E Deutsch
E Qeli
EL Hendrickson
G Audi
GL Finney
H Lam
H Lam
H Liu
H Steen
H Steen
I Beer
IP Shadforth
J Cox
J Cox
J Elias
J Gouw
J Grossmann
J Klimek
J Listgarten
J Meija
J Rappsilber
J Seidler
J Zhang
JC Silva
JF Kellie
JF Timms
JV Olsen
K Flikka
K Kultima
K Podwojski
KA Neilson
KC Hansen
KL Simpson
L Martens
L Ting
LF Waanders
LK Iwai
LN Mueller
M Bantscheff
M Bantscheff
M Bern
M Junqueira
M Kohl
M Mann
M Sandin
M Senko
M Unlü
M Vandenbogaert
MA Baldwin
MA Grobei
MA Kuzyk
MC Codrea
ME Belov
ME Sardiu
MH Elliott
MJ MacCoss
MM Savitski
MW Duncan
N Colaert
N Mischerikow
N Wang
NM Griffin
OA Mirgorodskaya
P Lu
P Mallick
P Mortensen
Pedro R. Cutillas
Peter R. Baker
PJ Boersema
PL Ross
PR Baker
PR Cutillas
PR Cutillas
R Aebersold
R Clarke
R Matthiesen
R Matthiesen
R Purves
R Usaite
R Zhang
RA Bradshaw
RD Smith
RE Moore
RJ Chalkley
RJ Chalkley
RJ Jacob
S Cappadona
S Cappadona
S Carr
S Dasari
S Houel
S Julka
S Ong
S-E Ong
SA Beausoleil
SA Gerber
Salvatore Cappadona
SJ Callister
SK Park
SP Gygi
SY Ow
T Shinkawa
TM Annesley
TS Collier
TT Aye
V Faca
V Lange
VG Tusher
VP Andreev
W Weiss
W Yan
W Zhu
WM Old
WX Schulze
X Yang
Y Ishihama
Y Oda
YJ Kim
Z Khan
Z Khan
Z-Q Ma
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

This work was in part supported by the PRIME-XS project, grant agreement number 262067, funded by the European Union seventh Framework Programme; The Netherlands Proteomics Centre, embedded in The Netherlands Genomics Initiative; The Netherlands Bioinformatics Centre; and the Centre for Biomedical Genetics (to S.C., B.B. and A.J.R.H); by NIH grants NCRR RR001614 and RR019934 (to the UCSF Mass Spectrometry Facility, director: A.L. Burlingame, P.B.); and by grants from the MRC, CR-UK, BBSRC and Barts and the London Charity (to P.C.

Crossref

Springer - Publisher Connector

Queen Mary Research Online

Protein abundance profiling of the Escherichia coli cytosol

Author: Frishman Dmitrij
Hartl F Ulrich
Ishihama Yasushi
Kerner Michael J
Mann Matthias
Rappsilber Juri
Schmidt Thorsten
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Knowledge about the abundance of molecular components is an important prerequisite for building quantitative predictive models of cellular behavior. Proteins are central components of these models, since they carry out most of the fundamental processes in the cell. Thus far, protein concentrations have been difficult to measure on a large scale, but proteomic technologies have now advanced to a stage where this information becomes readily accessible. Results Here, we describe an experimental scheme to maximize the coverage of proteins identified by mass spectrometry of a complex biological sample. Using a combination of LC-MS/MS approaches with protein and peptide fractionation steps we identified 1103 proteins from the cytosolic fraction of the <it>Escherichia coli </it>strain MC4100. A measure of abundance is presented for each of the identified proteins, based on the recently developed emPAI approach which takes into account the number of sequenced peptides per protein. The values of abundance are within a broad range and accurately reflect independently measured copy numbers per cell. As expected, the most abundant proteins were those involved in protein synthesis, most notably ribosomal proteins. Proteins involved in energy metabolism as well as those with binding function were also found in high copy number while proteins annotated with the terms metabolism, transcription, transport, and cellular organization were rare. The barrel-sandwich fold was found to be the structural fold with the highest abundance. Highly abundant proteins are predicted to be less prone to aggregation based on their length, pI values, and occurrence patterns of hydrophobic stretches. We also find that abundant proteins tend to be predominantly essential. Additionally we observe a significant correlation between protein and mRNA abundance in <it>E. coli </it>cells. Conclusion Abundance measurements for more than 1000 <it>E. coli </it>proteins presented in this work represent the most complete study of protein abundance in a bacterial cell so far. We show significant associations between the abundance of a protein and its properties and functions in the cell. In this way, we provide both data and novel insights into the role of protein concentration in this model organism.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

PuSH

University of Southern Denmark Research Output

Online Research Database In Technology

Protein abundance profiling of the Escherichia coli cytosol

Author: Ishihama Yasushi
Schmidt Thorsten
Rappsilber Juri
Mann Matthias
Hartl F Ulrich
Kerner Michael J
Frishman Dmitrij
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

PuSH

University of Southern Denmark Research Output

Online Research Database In Technology

University of Hertfordshire Research Archive

Tandem mass spectrometry data quality assessment by self-convolution

Author: A Shevchenko
AA Bharath
AL McCormack
AL McCormack
Andrew Keller
Bin Ma
BJ Cargile
C Yu
CG Herbert
D Fenyo
DC Barbacci
DL Tabb
DN Perkins
F Desiere
HI Field
JE Elias
JE Syka
Jimmy K Eng
JK Eng
JV Puymbrouck
K Biemann
K Biemann
Keng Wah Choo
KR Clauser
LY Geer
M Kinter
M Mann
Marshall Bern
N Zhang
P Roepstorff
PA Pevzner
Purvine Samuel
RA Zubarev
Randy J Arnold
Richard S Johnson
RS Johnson
S Sunyaev
Salmi Jussi
VH Wysocki
Wai Mun Tham
Wu Fang-Xiang
Wu Yik-Chung
Z Zhang
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Many algorithms have been developed for deciphering the tandem mass spectrometry (MS) data sets. They can be essentially clustered into two classes. The first performs searches on theoretical mass spectrum database, while the second based itself on <it>de novo </it>sequencing from raw mass spectrometry data. It was noted that the quality of mass spectra affects significantly the protein identification processes in both instances. This prompted the authors to explore ways to measure the quality of MS data sets before subjecting them to the protein identification algorithms, thus allowing for more meaningful searches and increased confidence level of proteins identified. Results The proposed method measures the qualities of MS data sets based on the symmetric property of b- and y-ion peaks present in a MS spectrum. Self-convolution on MS data and its time-reversal copy was employed. Due to the symmetric nature of b-ions and y-ions peaks, the self-convolution result of a good spectrum would produce a highest mid point intensity peak. To reduce processing time, self-convolution was achieved using Fast Fourier Transform and its inverse transform, followed by the removal of the "DC" (Direct Current) component and the normalisation of the data set. The quality score was defined as the ratio of the intensity at the mid point to the remaining peaks of the convolution result. The method was validated using both theoretical mass spectra, with various permutations, and several real MS data sets. The results were encouraging, revealing a high percentage of positive prediction rates for spectra with good quality scores. Conclusion We have demonstrated in this work a method for determining the quality of tandem MS data set. By pre-determining the quality of tandem MS data before subjecting them to protein identification algorithms, spurious protein predictions due to poor tandem MS data are avoided, giving scientists greater confidence in the predicted results. We conclude that the algorithm performs well and could potentially be used as a pre-processing for all mass spectrometry based protein identification tools.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Machine learning applications in proteomics research: How the past can boost the future

Author: Barsnes Harald
Bittremieux Wout
De Grave Kurt
Degroeve S
Kelchtermans Pieter
Laukens Kris
Martens Lennart
Ramon Jan
Valkenborg Dirk
Publication venue: 'Wiley'
Publication date: 06/09/2017
Field of study

Machine learning is a subdiscipline within artificial intelligence that focuses on algorithms that allow computers to learn solving a (complex) problem from existing data. This ability can be used to generate a solution to a particularly intractable problem, given that enough data are available to train and subsequently evaluate an algorithm on. Since MS-based proteomics has no shortage of complex problems, and since publicly available data are becoming available in ever growing amounts, machine learning is fast becoming a very popular tool in the field. We here therefore present an overview of the different applications of machine learning in proteomics that together cover nearly the entire wet- and dry-lab workflow, and that address key bottlenecks in experiment planning and design, as well as in data processing and analysis.acceptedVersio

University of Bergen