6 research outputs found
μ§λλΆμκΈ° λ°μ΄ν° μμ λ²μ ν λ³ν μλ³ ν₯μμ μν μ§λ¨ μ΄μ¨μ νμ©ν 체κ³μ μΈ μ κ·Όλ² μ°κ΅¬
Post-translational modifications (PTMs) play indispensable roles in a wide array of cellular regulatory
events. More than 300 types of PTMs have been reported to occur in vivo, each with potentially different sets of
substrate proteins, dynamics, and biological consequences. Due to the enormous complexity of PTMs, systems wide study of PTMs is an active area of research in the field of proteomics. For a more comprehensive
understanding of the human PTM proteome, a taxonomy of the types of PTMs and their exact substrate
proteins/sites ought to be carried out above all else. This, in turn, requires a large-scale and confident identification
of PTMs. Mass spectrometry (MS)-based proteomics has enabled a systems-wide identification of proteins and
their amino acid residues that are affected by various PTMs. However, several important limitations and
challenges in sample preparation, MS analysis, and bioinformatics have impeded a deeper and wider
characterization of PTMs. To tackle some of the major challenges in bioinformatic analysis of PTMs including
the high false positive rate of PTMs and the heavy computational burden of database search, we developed
methods using diagnostic ions for PTMs.
First, we developed a statistical prediction model for the confident identification of citrullination. We
systematically identified diagnostic ions for citrullination, and used these diagnostic ions to build a prediction
model for assessing the validity of citrullinated PSMs identified by database searching. Application of our model
to real biological data showed significantly alleviated false positive rate. We further extended our approach to find
false negative citrullination and successfully identified additional citrullinated peptides that the database searching
failed to identify.
Second, we proposed a database search strategy for the large-scale identification of PTMs using a conventional
standard search tool. We introduced a post-acquisition spectra filtering approach to effectively reduce the size of
input MS data by retaining only the spectra that contain diagnostic ions of certain PTMs, thus rendering the use
of standard search approach for identifying hundreds of PTMs practical.
In summary, we proposed methods utilizing PTM diagnostic ions for the large-scale and confident identification
of PTMs. Constant improvement of the suggested frameworks will enable achieving a more comprehensive and
accurate identification of PTMs in the human proteome.|λ³Έ λ
Όλ¬Έμ μ§λλΆμ λ°μ΄ν° μμ λ²μ ν λ³ν μλ³μ νμ© κ°λ₯ν μ§λ¨ μ΄μ¨ κΈ°λ° μμΈ‘ λͺ¨λΈ λ° λ°μ΄ν° νν°λ§ νλ‘ν μ½μ λν΄ λ€λ£¬λ€. λ²μ ν λ³νμ μΈν¬λ΄ μ¬λ¬ μ‘°μ μμ©μ κ΄μ¬νλ κ²μΌλ‘ μλ €μ Έ μλ€. 300 μ¬ μ’
μ λ²μ ν λ³νμ΄ λ³΄κ³ λμ΄μκ³ , κ°κ°μ μλ‘ λ€λ₯Έ μμ© λ¨λ°±μ§κ³Ό λ€μ΄λλ―Ήμ€, κ·Έλ¦¬κ³ μλ¬Όνμ μΈ ν¨κ³Όλ₯Ό κ°μ§λ€. μ΄λ¬ν 볡μ‘μ± λλ¬Έμ, μ¬λμ λ²μ ν λ³ν λ¨λ°±μ²΄μ κ΄ν μ°κ΅¬λ μ¬μ ν μ΄κΈ° λ¨κ³μ μλ€. μ΄λ₯Ό μ€νμν€κΈ° μν΄μλ λ¨Όμ μΈν¬λ΄ λ²μ ν λ³νμ μ’
λ₯μ κ·Έκ²λ€μ μμ© λ¨λ°±μ§ λ° μλ―Έλ
Έμ° μμΉλ₯Ό κ΄λ²μνκ³ μ ννκ² νμ
νλ κ²μ΄ μ€μνλ€. μ§λλΆμκΈ° κΈ°λ° λ¨λ°±μ²΄ μ°κ΅¬λ μμ€ν
μ μΈ λ²μ ν λ³ν μ°κ΅¬λ₯Ό κ°λ₯νκ² λ§λ€μλ€. νμ§λ§ μν μ€λΉ κ³Όμ , μ§λλΆμ κ³Όμ , κ·Έλ¦¬κ³ μλ¬Όμ 보ν λΆμ κ³Όμ μμμ μ¬λ¬κ°μ§ λ¬Έμ μ κ³Ό νκ³μ λλ¬Έμ λ²μ ν λ³νμ λν μμ€ν
μ μΈ μ°κ΅¬λ μ¬μ ν λͺλͺ μ μλ €μ§ λ²μ ν λ³νμ κ΅νλμ΄μλ€. κ·Έ μ€μμλ μλ¬Όμ 보ν λΆμ κ³Όμ μμμ μ¬λ¬ λ¬Έμ μ λ€μ ν΄κ²°νκΈ° μν΄, μ°λ¦¬λ λ²μ ν λ³νμ μ§λ¨ μ΄μ¨μ νμ©ν λ°©λ²λ‘ μ κ°λ°νμλ€. 첫째, μ°λ¦¬λ μ§λλΆμ λ°μ΄ν° μμ λ²μ ν λ³νμ μΌμ’
μΈ μνΈλ£°λ¦°νμ μ νν μλ³μ μν΄ ν΅κ³μ μΈ μμΈ‘ λͺ¨λΈμ κ°λ°νμλ€. λ¨Όμ μνΈλ£°λ¦°νμ μ§λ¨ μ΄μ¨μ 체κ³μ μΌλ‘ μ°Ύμλ΄μκ³ , κ·Έκ²λ€μ κΈ°λ°μΌλ‘ μμΈ‘ λͺ¨λΈμ λ§λ€μ΄ λ°μ΄ν°λ² μ΄μ€ μμΉκ° μ°ΎμλΈ μνΈλ£°λ¦°ν κ²°κ³Όλ₯Ό νλ¨νλλ° μ¬μ©νμλ€. λν, μ€μ μλ¬Όν μνμμ μ λλ μ§λλΆμ λ°μ΄ν°λ€μ μ°λ¦¬κ° κ°λ°ν μμΈ‘ λͺ¨λΈμ μ μ©νμ¬ κ±°μ§ μμ±κ³Ό κ±°μ§ μμ± λ¬Έμ λ₯Ό μ±κ³΅μ μΌλ‘ μνμμΌ°λ€. λμ§Έ, μ°λ¦¬λ ν΅μμ μΌλ‘ μ¬μ©λλ μ€ν λ€λ λ°μ΄ν°λ² μ΄μ€ μμΉ ν΄μ μ΄μ©ν κ΄λ²μν λ²μ ν λ³ν μλ³μ κ°λ₯μΌνλ μμΉ λ°©λ²μ κ³ μνμλ€. μ§λλΆμ λ°μ΄ν°μμ νΉμ λ²μ ν λ³ν μ§λ¨ μ΄μ¨μ ν¬ν¨νλ λ°μ΄ν°λ§ νν°λ§νμ¬ μ΄κ²λ€μ λ°μ΄ν°λ² μ΄μ€ μμΉμ μ¬μ©νλ κ²μΌλ‘, μλ°± μ’
μ λ²μ ν λ³νμ λν μμΉλ₯Ό κ°λ₯μΌνμλ€. μ’
ν©νλ©΄, μ°λ¦¬λ λ²μ ν λ³ν μ§λ¨ μ΄μ¨μ νμ©νμ¬ μ§λλΆμ λ°μ΄ν°μμ λ²μ ν λ³νμ κ΄λ²μνκ³ μ νν μλ³μ κ°λ₯μΌνλ λ°©λ²λ€μ κ°λ°νμλ€. μ¬κΈ°μ μκ°λ λ°©λ²λ€μ μ§μμ μΈ ν₯μμ΄ νμνλ©°, μ΄λ μ¬λμ λ²μ ν λ³ν λ¨λ°±μ²΄λ₯Ό μ΄ν΄νλλ° μ μ©νκ² νμ©λ κ²μΌλ‘ μμνλ€.YAbstract i
List of Contents ii
List of Tables and Figures iii
Chapter 1. Introduction 1
1.1 Post-search PSM evaluation for the confident identification of authentic modification 4
1.2 Pre-search PTM screening for the large-scale identification of PTMs 5
Chapter 2. Systematic search for diagnostic ions for citrullination 6
2.1 Introduction 6
2.2 Results 8
2.3 Discussion 14
2.4 Methods 15
Chapter 3. Development and application of a statistical model for the confident identification of citrullination 27
3.1 Introduction 27
3.2 Results 27
3.3 Discussion 33
3.4 Methods 33
Chapter 4. Development of a search strategy for the large-scale identification of >200 types of PTMs 44
4.1 Introduction 44
4.2 Results 46
4.3 Discussion 49
4.4 Methods 49
Chapter 5. Conclusion 58
REFERENCES 60
μ μ½ λ¬Έ 63
CURRICULUM VITAE 64
ACKNOWLEDGMENT 66DoctordCollectio
A UserβFriendly Visualization Tool for MultiβOmics Data
The Clinical Proteomic Tumor Analysis Consortium (CPTAC) initiative has generated large multi-omic datasets for various cancers. Each dataset consists of common and differential data types, including genomics, epigenomics, transcriptomics, proteomics, and post-translational modifications data. They together make up a rich resource for researchers and clinicians interested in understanding cancer biology to draw from. Nevertheless, the complexity of these multi-omic datasets and a lack of an easily accessible analytical and visualization tool for exploring them continue to be a hurdle for those who are not trained in bioinformatics. In this issue, Calinawan et al. describe a user-friendly, web-based visualization platform named ProTrack for exploring the CPTAC clear cell renal cell carcinoma (ccRCC) dataset. Compared to other available visualization tools, ProTrack offers an easy yet powerful customization interface, solely dedicated to the CPTAC ccRCC dataset. Their tool enables ready inspection of potential associations between different data types within a single gene or across multiple genes without any need to code. Specific mutation types or phosphosites can also be easily looked up for any gene of interest. Calinawan et al. aim to extend their work into other CPTAC datasets, which will greatly contribute to the CPTAC as well as cancer biology community in general. Β© 2020 Wiley-VCH GmbH1
Statistical Modeling for Enhancing the Discovery Power of Citrullination from Tandem Mass Spectrometry Data
Citrullination is a post-translational modification implicated in various human diseases including rheumatoid arthritis, Alzheimer's disease, multiple sclerosis, and cancers. Due to a relatively low concentration of citrullinated proteins in the total proteome, confident identification of citrullinated proteome is challenging in mass spectrometry (MS)-based proteomic analysis. From these MS-based analyses, MS features that characterize citrullination, such as immonium ions (IMs) and neutral losses (NLs), called diagnostic ions, have been reported. However, there has been a lack of systematic approaches to comprehensively search for diagnostic ions and no statistical methods for the identification of citrullinated proteome based on these diagnostic ions. Here, we present a systematic approach to identify diagnostic IMs, internal ions (INTs), and NLs for citrullination from tandem mass (MS/MS) spectra. Diagnostic INTs mainly consisted of internal fragment ions for di- and tripeptides that contained two and three amino acids with at least one citrullinated arginine, respectively. A statistical logistic regression model was built for a confident assessment of citrullinated peptides that database searches identified (true positives) and prediction of citrullinated peptides that database searches failed to identify (false negatives) using the diagnostic IMs, INTs, and NLs. Applications of our model to complex global proteome data sets demonstrated the increased accuracy in the identification of citrullinated peptides, thereby enhancing the size and functional interpretation of citrullinated proteomes. Copyright Β© 2020 American Chemical Society.1
Novel Online Three-Dimensional Separation Expands the Detectable Functional Landscape of Cellular Phosphoproteome
Protein phosphorylation is a prevalent post-translational modification that regulates essentially every aspect of cellular processes. Currently, liquid chromatography-tandem mass spectrometry (LC-MS/MS) with an extensive offline sample fractionation and a phosphopeptide enrichment method is a best practice for deep phosphoproteome profiling, but balancing throughput and profiling depth remains a practical challenge. We present an online three-dimensional separation method for ultradeep phosphoproteome profiling that combines an online two-dimensional liquid chromatography separation and an additional gas-phase separation. This method identified over 100,000 phosphopeptides (>60,000 phosphosites) in HeLa cells during 1.5 days of data acquisition, and the largest HeLa cell phosphoproteome significantly expanded the detectable functional landscape of cellular phosphoproteome.N
2-Undecanone derived from Pseudomonas aeruginosa modulates the neutrophil activity
Pseudomonas aeruginosa (P. aeruginosa) is a well-known Gramnegative opportunistic pathogen. Neutrophils play key roles in mediating host defense against P. aeruginosa infection. In this study, we identified a metabolite derived from P. aeruginosa that regulates neutrophil activities. Using gas chromatography-mass spectrometry, a markedly increased level of 2-undecanone was identified in the peritoneal fluid of P. aeruginosa-infected mice. 2-Undecanone elicited the activation of neutrophils in a GΞ±i-phospholipase C pathway. However, 2-undecanone strongly inhibited responses to lipopolysaccharide and bactericidal activity of neutrophils against P. aeruginosa by inducing apoptosis. Our results demonstrate that 2-undecanone from P. aeruginosa limits the innate defense activity of neutrophils, suggesting that the production of inhibitory metabolites is a strategy of P. aeruginosa for escaping the host immune system. [BMB Reports 2022; 55(8): 395-400] Β© 2022 by the The Korean Society for Biochemistry and Molecular Biology1
Novel Diagnostic Biomarkers for High-Grade Serous Ovarian Cancer Uncovered by Data-Independent Acquisition Mass Spectrometry
High-grade serous ovarian cancer (HGSOC) represents the major histological type of ovarian cancer, and the lack of effective screening tools and early detection methods significantly contributes to the poor prognosis of HGSOC. Currently, there are no reliable diagnostic biomarkers for HGSOC. In this study, we performed liquid chromatography data-independent acquisition tandem mass spectrometry (MS) on depleted serum samples from 26 HGSOC cases and 24 healthy controls (HCs) to discover potential HGSOC diagnostic biomarkers. A total of 1,847 proteins were identified across all samples, among which 116 proteins showed differential expressions between HGSOC patients and HCs. Network modeling showed activations of coagulation and complement cascades, platelet activation and aggregation, neutrophil extracellular trap formation, toll-like receptor 4, insulin-like growth factor, and transforming growth factor beta signaling, as well as suppression of lipoprotein assembly and Fc gamma receptor activation in HGSOC. Based on the network model, we prioritized 28 biomarker candidates and validated 18 of them using targeted MS assays in an independent cohort. Predictive modeling showed a sensitivity of 1 and a specificity of 0.91 in the validation cohort. Finally, in vitro functional assays on four potential biomarkers (FGA, VWF, ARHGDIB, and SERPINF2) suggested that they may play an important role in cancer cell proliferation and migration in HGSOC. All raw data were deposited in PRIDE (PXD033169).N