Search CORE

109 research outputs found

A Tandem Mass Spectrometry Sequence Database Search Method for Identification of O-Fucosylated Proteins by Mass Spectrometry.

Author: Deutsch Eric W
Eng Jimmy K
Kappe Stefan H I
Mendoza Luis
Moritz Robert L
Sather D Noah
Shteynberg David
Springer Timothy A
Swearingen Kristian E
Vigdorovich Vladimir
Publication venue: Providence St. Joseph Health Digital Commons
Publication date: 21/12/2018
Field of study

Thrombospondin type 1 repeats (TSRs), small adhesive protein domains with a wide range of functions, are usually modified with O-linked fucose, which may be extended to O-fucose-β1,3-glucose. Collision-induced dissociation (CID) spectra of O-fucosylated peptides cannot be sequenced by standard tandem mass spectrometry (MS/MS) sequence database search engines because O-linked glycans are highly labile in the gas phase and are effectively absent from the CID peptide fragment spectra, resulting in a large mass error. Electron transfer dissociation (ETD) preserves O-linked glycans on peptide fragments, but only a subset of tryptic peptides with low m/ z can be reliably sequenced from ETD spectra compared to CID. Accordingly, studies to date that have used MS to identify O-fucosylated TSRs have required manual interpretation of CID mass spectra even when ETD was also employed. In order to facilitate high-throughput, automatic identification of O-fucosylated peptides from CID spectra, we re-engineered the MS/MS sequence database search engine Comet and the MS data analysis suite Trans-Proteomic Pipeline to enable automated sequencing of peptides exhibiting the neutral losses characteristic of labile O-linked glycans. We used our approach to reanalyze published proteomics data from Plasmodium parasites and identified multiple glycoforms of TSR-containing proteins

Providence St. Joseph Health Digital Commons

FigShare

System-based proteomic analysis of the interferon response in human liver cells

Author: Aebersold Ruedi
Coito Carlos
Eng Jimmy
Galitski Timothy
Goodlett David R
Katze Michael G
Keller Andrew
Kwieciszewski Bartlomiej K
Lee Hookeun
Li Xiao-jun
Reiss David
Shannon Paul
Yan Wei
Yi Eugene C
Publication venue: BioMed Central
Publication date: 01/01/2004
Field of study

BACKGROUND: Interferons (IFNs) play a critical role in the host antiviral defense and are an essential component of current therapies against hepatitis C virus (HCV), a major cause of liver disease worldwide. To examine liver-specific responses to IFN and begin to elucidate the mechanisms of IFN inhibition of virus replication, we performed a global quantitative proteomic analysis in a human hepatoma cell line (Huh7) in the presence and absence of IFN treatment using the isotope-coded affinity tag (ICAT) method and tandem mass spectrometry (MS/MS). RESULTS: In three subcellular fractions from the Huh7 cells treated with IFN (400 IU/ml, 16 h) or mock-treated, we identified more than 1,364 proteins at a threshold that corresponds to less than 5% false-positive error rate. Among these, 54 were induced by IFN and 24 were repressed by more than two-fold, respectively. These IFN-regulated proteins represented multiple cellular functions including antiviral defense, immune response, cell metabolism, signal transduction, cell growth and cellular organization. To analyze this proteomics dataset, we utilized several systems-biology data-mining tools, including Gene Ontology via the GoMiner program and the Cytoscape bioinformatics platform. CONCLUSIONS: Integration of the quantitative proteomics with global protein interaction data using the Cytoscape platform led to the identification of several novel and liver-specific key regulatory components of the IFN response, which may be important in regulating the interplay between HCV, interferon and the host response to virus infection

Springer - Publisher Connector

PubMed Central

Human Plasma PeptideAtlas

Author: Alexey I Nesvizhskii
Biaoyang Lin
Eric W Deutsch
Eugene C Yi
Hookeun Lee
Hui Zhang
Jimmy K Eng
Nichole L King
Reto Ossola
Ruedi Aebersold
Publication venue
Publication date: 01/01/2005
Field of study

Peptide identifications of high probability from 28 LC-MS/MS human serum and plasma experiments from eight different laboratories, carried out in the context of the HUPO Plasma Proteome Project, were combined and mapped to the EnsEMBL human genome. The 6929 distinct observed peptides were mapped to approximately 960 different proteins. The resulting compendium of peptides and their associated samples, proteins, and genes is made publicly available as a reference for future research on human plasma

CiteSeerX

Systemic Proteome Alterations Linked to Early Stage Pancreatic Cancer in Diabetic Patients

Author: Brand Randall E.
Brentnall Teresa A.
Chari Suresh T.
Chen Ru
Eng Jimmy K.
Lai Lisa A.
Pan Sheng
Peng Hong
Petersen Gloria M.
Yan Yuanqing
Publication venue: 'MDPI AG'
Publication date: 11/06/2020
Field of study

Background: Diabetes is a risk factor associated with pancreatic ductal adenocarcinoma (PDAC), and new adult-onset diabetes can be an early sign of pancreatic malignancy. Development of blood-based biomarkers to identify diabetic patients who warrant imaging tests for cancer detection may represent a realistic approach to facilitate earlier diagnosis of PDAC in a risk population. Methods: A spectral library-based proteomic platform was applied to interrogate biomarker candidates in plasma samples from clinically well-defined diabetic cohorts with and without PDAC. Random forest algorithm was used for prediction model building and receiver operating characteristic (ROC) curve analysis was applied to evaluate the prediction probability of potential biomarker panels. Results: Several biomarker panels were cross-validated in the context of detection of PDAC within a diabetic background. In combination with carbohydrate antigen 19-9 (CA19-9), the panel, which consisted of apolipoprotein A-IV (APOA4), monocyte differentiation antigen CD14 (CD14), tetranectin (CLEC3B), gelsolin (GSN), histidine-rich glycoprotein (HRG), inter-alpha-trypsin inhibitor heavy chain H3 (ITIH3), plasma kallikrein (KLKB1), leucine-rich alpha-2-glycoprotein (LRG1), pigment epithelium-derived factor (SERPINF1), plasma protease C1 inhibitor (SERPING1), and metalloproteinase inhibitor 1 (TIMP1), demonstrated an area under curve (AUC) of 0.85 and a two-fold increase in detection accuracy compared to CA19-9 alone. The study further evaluated the correlations of protein candidates and their influences on the performance of biomarker panels. Conclusions: Proteomics-based multiplex biomarker panels improved the detection accuracy for diagnosis of early stage PDAC in diabetic patients

D-Scholarship@Pitt

Tandem mass spectrometry data quality assessment by self-convolution

Author: A Shevchenko
AA Bharath
AL McCormack
AL McCormack
Andrew Keller
Bin Ma
BJ Cargile
C Yu
CG Herbert
D Fenyo
DC Barbacci
DL Tabb
DN Perkins
F Desiere
HI Field
JE Elias
JE Syka
Jimmy K Eng
JK Eng
JV Puymbrouck
K Biemann
K Biemann
Keng Wah Choo
KR Clauser
LY Geer
M Kinter
M Mann
Marshall Bern
N Zhang
P Roepstorff
PA Pevzner
Purvine Samuel
RA Zubarev
Randy J Arnold
Richard S Johnson
RS Johnson
S Sunyaev
Salmi Jussi
VH Wysocki
Wai Mun Tham
Wu Fang-Xiang
Wu Yik-Chung
Z Zhang
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Many algorithms have been developed for deciphering the tandem mass spectrometry (MS) data sets. They can be essentially clustered into two classes. The first performs searches on theoretical mass spectrum database, while the second based itself on <it>de novo </it>sequencing from raw mass spectrometry data. It was noted that the quality of mass spectra affects significantly the protein identification processes in both instances. This prompted the authors to explore ways to measure the quality of MS data sets before subjecting them to the protein identification algorithms, thus allowing for more meaningful searches and increased confidence level of proteins identified. Results The proposed method measures the qualities of MS data sets based on the symmetric property of b- and y-ion peaks present in a MS spectrum. Self-convolution on MS data and its time-reversal copy was employed. Due to the symmetric nature of b-ions and y-ions peaks, the self-convolution result of a good spectrum would produce a highest mid point intensity peak. To reduce processing time, self-convolution was achieved using Fast Fourier Transform and its inverse transform, followed by the removal of the "DC" (Direct Current) component and the normalisation of the data set. The quality score was defined as the ratio of the intensity at the mid point to the remaining peaks of the convolution result. The method was validated using both theoretical mass spectra, with various permutations, and several real MS data sets. The results were encouraging, revealing a high percentage of positive prediction rates for spectra with good quality scores. Conclusion We have demonstrated in this work a method for determining the quality of tandem MS data set. By pre-determining the quality of tandem MS data before subjecting them to protein identification algorithms, spurious protein predictions due to poor tandem MS data are avoided, giving scientists greater confidence in the predicted results. We conclude that the algorithm performs well and could potentially be used as a pre-processing for all mass spectrometry based protein identification tools.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Integration with the human genome of peptide sequences obtained by high-throughput mass spectrometry

Author: Aderem Alan
Aebersold Ruedi
Boyle Rose
Brunner Erich
Desiere Frank
Deutsch Eric W
Donohoe Samuel
Eng Jimmy K
Fausto Nelson
Hafen Ernst
Hood Lee
Katze Michael G
Kennedy Kathleen A
King Nichole L
Kregenow Floyd
Lee Hookeun
Lin Biaoyang
Mallick Parag
Martin Dan
Nesvizhskii Alexey I
Ranish Jeffrey A
Rawlings David J
Samelson Lawrence E
Shiio Yuzuru
Watts Julian D
Wollscheid Bernd
Wright Michael E
Yan Wei
Yang Lihong
Yi Eugene C
Zhang Hui
Publication venue: BioMed Central
Publication date: 10/12/2004
Field of study

A crucial aim upon the completion of the human genome is the verification and functional annotation of all predicted genes and their protein products. Here we describe the mapping of peptides derived from accurate interpretations of protein tandem mass spectrometry (MS) data to eukaryotic genomes and the generation of an expandable resource for integration of data from many diverse proteomics experiments. Furthermore, we demonstrate that peptide identifications obtained from high-throughput proteomics can be integrated on a large scale with the human genome. This resource could serve as an expandable repository for MS-derived proteome information

Repository for Publications and Research Data

Springer - Publisher Connector

PubMed Central

ZORA

Head-to-Head Comparison of Serum Fractionation Techniques

Author: Amanda G. Paulovich
Brian D. Piening
David G. Camp
Heidi Zhang
Hui Zhang
Jeffrey R. Whiteaker
Jimmy K. Eng
John F. Keane
Julian Watts
Kelly Cooke
Leigh Anderson
Li-Chia Feng
Lin
Martin W. McIntosh
Matthew Fitzgibbon
Regine M. Schoenherr
Richard D. Smith
Ruihua Fang
Tao Liu
Ted Holzman
Travis D. Lorentzen
Publication venue: 'American Chemical Society (ACS)'
Publication date
Field of study

Crossref

Extending Comet for Global Amino Acid Variant and Post-Translational Modification Analysis using the PSI Extended FASTA Format (PEFF).

Author: Deutsch Eric W
Eng Jimmy K
Publication venue: Providence St. Joseph Health Digital Commons
Publication date: 27/02/2020
Field of study

Protein identification by tandem mass spectrometry sequence database searching is a standard practice in many proteomics laboratories. The de facto standard for the representation of sequence databases used as input to sequence database search tools is the FASTA format. The Human Proteome Organization\u27s Proteomics Standards Initiative has developed an extension to the FASTA format termed the PSI extended FASTA format or PEFF where additional information such as structural annotations are encoded in the protein description lines. Comet has been extended to automatically analyze the post translational modifications and amino acid substitutions encoded in PEFF databases. Comet\u27s PEFF implementation and example analysis results searching a HEK293 dataset against the neXtProt PEFF database are presented. This article is protected by copyright. All rights reserved

Crossref

Providence St. Joseph Health Digital Commons

Identification of Proteins in Complexes by Solid-Phase Microextraction/Multistep Elution/Capillary Electrophoresis/Tandem Mass Spectrometry

Author: Andrew Link
Jimmy K. Eng
John R. Yates
Wei Tong
Publication venue: 'American Chemical Society (ACS)'
Publication date
Field of study

Crossref