6 research outputs found

    μ§ˆλŸ‰λΆ„μ„κΈ° 데이터 μƒμ˜ λ²ˆμ—­ ν›„ λ³€ν˜• 식별 ν–₯상을 μœ„ν•œ 진단 μ΄μ˜¨μ„ ν™œμš©ν•œ 체계적인 접근법 연ꡬ

    No full text
    Post-translational modifications (PTMs) play indispensable roles in a wide array of cellular regulatory events. More than 300 types of PTMs have been reported to occur in vivo, each with potentially different sets of substrate proteins, dynamics, and biological consequences. Due to the enormous complexity of PTMs, systems wide study of PTMs is an active area of research in the field of proteomics. For a more comprehensive understanding of the human PTM proteome, a taxonomy of the types of PTMs and their exact substrate proteins/sites ought to be carried out above all else. This, in turn, requires a large-scale and confident identification of PTMs. Mass spectrometry (MS)-based proteomics has enabled a systems-wide identification of proteins and their amino acid residues that are affected by various PTMs. However, several important limitations and challenges in sample preparation, MS analysis, and bioinformatics have impeded a deeper and wider characterization of PTMs. To tackle some of the major challenges in bioinformatic analysis of PTMs including the high false positive rate of PTMs and the heavy computational burden of database search, we developed methods using diagnostic ions for PTMs. First, we developed a statistical prediction model for the confident identification of citrullination. We systematically identified diagnostic ions for citrullination, and used these diagnostic ions to build a prediction model for assessing the validity of citrullinated PSMs identified by database searching. Application of our model to real biological data showed significantly alleviated false positive rate. We further extended our approach to find false negative citrullination and successfully identified additional citrullinated peptides that the database searching failed to identify. Second, we proposed a database search strategy for the large-scale identification of PTMs using a conventional standard search tool. We introduced a post-acquisition spectra filtering approach to effectively reduce the size of input MS data by retaining only the spectra that contain diagnostic ions of certain PTMs, thus rendering the use of standard search approach for identifying hundreds of PTMs practical. In summary, we proposed methods utilizing PTM diagnostic ions for the large-scale and confident identification of PTMs. Constant improvement of the suggested frameworks will enable achieving a more comprehensive and accurate identification of PTMs in the human proteome.|λ³Έ 논문은 μ§ˆλŸ‰λΆ„μ„ 데이터 μƒμ˜ λ²ˆμ—­ ν›„ λ³€ν˜• 식별에 ν™œμš© κ°€λŠ₯ν•œ 진단 이온 기반 예츑 λͺ¨λΈ 및 데이터 필터링 ν”„λ‘œν† μ½œμ— λŒ€ν•΄ 닀룬닀. λ²ˆμ—­ ν›„ λ³€ν˜•μ€ 세포내 μ—¬λŸ¬ 쑰절 μž‘μš©μ— κ΄€μ—¬ν•˜λŠ” κ²ƒμœΌλ‘œ μ•Œλ €μ Έ μžˆλ‹€. 300 μ—¬ μ’…μ˜ λ²ˆμ—­ ν›„ λ³€ν˜•μ΄ λ³΄κ³ λ˜μ–΄μžˆκ³ , 각각은 μ„œλ‘œ λ‹€λ₯Έ μž‘μš© λ‹¨λ°±μ§ˆκ³Ό λ‹€μ΄λ‚˜λ―ΉμŠ€, 그리고 생물학적인 효과λ₯Ό 가진닀. μ΄λŸ¬ν•œ λ³΅μž‘μ„± λ•Œλ¬Έμ—, μ‚¬λžŒμ˜ λ²ˆμ—­ ν›„ λ³€ν˜• 단백체에 κ΄€ν•œ μ—°κ΅¬λŠ” μ—¬μ „νžˆ 초기 단계에 μžˆλ‹€. 이λ₯Ό μ‹€ν˜„μ‹œν‚€κΈ° μœ„ν•΄μ„œλŠ” λ¨Όμ € 세포내 λ²ˆμ—­ ν›„ λ³€ν˜•μ˜ μ’…λ₯˜μ™€ κ·Έκ²ƒλ“€μ˜ μž‘μš© λ‹¨λ°±μ§ˆ 및 μ•„λ―Έλ…Έμ‚° μœ„μΉ˜λ₯Ό κ΄‘λ²”μœ„ν•˜κ³  μ •ν™•ν•˜κ²Œ νŒŒμ•…ν•˜λŠ” 것이 μ€‘μš”ν•˜λ‹€. μ§ˆλŸ‰λΆ„μ„κΈ° 기반 단백체 μ—°κ΅¬λŠ” μ‹œμŠ€ν…œμ μΈ λ²ˆμ—­ ν›„ λ³€ν˜• 연ꡬλ₯Ό κ°€λŠ₯ν•˜κ²Œ λ§Œλ“€μ—ˆλ‹€. ν•˜μ§€λ§Œ μƒ˜ν”Œ μ€€λΉ„ κ³Όμ •, μ§ˆλŸ‰λΆ„μ„ κ³Όμ •, 그리고 생물정보학 뢄석 κ³Όμ •μ—μ„œμ˜ μ—¬λŸ¬κ°€μ§€ 문제점과 ν•œκ³„μ  λ•Œλ¬Έμ— λ²ˆμ—­ ν›„ λ³€ν˜•μ— λŒ€ν•œ μ‹œμŠ€ν…œμ μΈ μ—°κ΅¬λŠ” μ—¬μ „νžˆ λͺ‡λͺ‡ 잘 μ•Œλ €μ§„ λ²ˆμ—­ ν›„ λ³€ν˜•μ— κ΅­ν•œλ˜μ–΄μ™”λ‹€. κ·Έ μ€‘μ—μ„œλ„ 생물정보학 뢄석 κ³Όμ •μ—μ„œμ˜ μ—¬λŸ¬ λ¬Έμ œμ λ“€μ„ ν•΄κ²°ν•˜κΈ° μœ„ν•΄, μš°λ¦¬λŠ” λ²ˆμ—­ ν›„ λ³€ν˜•μ˜ 진단 μ΄μ˜¨μ„ ν™œμš©ν•œ 방법둠을 κ°œλ°œν•˜μ˜€λ‹€. 첫째, μš°λ¦¬λŠ” μ§ˆλŸ‰λΆ„μ„ 데이터 μƒμ˜ λ²ˆμ—­ ν›„ λ³€ν˜•μ˜ 일쒅인 μ‹œνŠΈλ£°λ¦°ν™”μ˜ μ •ν™•ν•œ 식별을 μœ„ν•΄ 톡계적인 예츑 λͺ¨λΈμ„ κ°œλ°œν•˜μ˜€λ‹€. λ¨Όμ € μ‹œνŠΈλ£°λ¦°ν™”μ˜ 진단 μ΄μ˜¨μ„ μ²΄κ³„μ μœΌλ‘œ μ°Ύμ•„λ‚΄μ—ˆκ³ , 그것듀을 기반으둜 예츑 λͺ¨λΈμ„ λ§Œλ“€μ–΄ λ°μ΄ν„°λ² μ΄μŠ€ μ„œμΉ˜κ°€ μ°Ύμ•„λ‚Έ μ‹œνŠΈλ£°λ¦°ν™” κ²°κ³Όλ₯Ό νŒλ‹¨ν•˜λŠ”λ° μ‚¬μš©ν•˜μ˜€λ‹€. λ˜ν•œ, μ‹€μ œ 생물학 μƒ˜ν”Œμ—μ„œ 유래된 μ§ˆλŸ‰λΆ„μ„ 데이터듀에 μš°λ¦¬κ°€ κ°œλ°œν•œ 예츑 λͺ¨λΈμ„ μ μš©ν•˜μ—¬ 거짓 μ–‘μ„±κ³Ό 거짓 μŒμ„± 문제λ₯Ό μ„±κ³΅μ μœΌλ‘œ μ™„ν™”μ‹œμΌ°λ‹€. λ‘˜μ§Έ, μš°λ¦¬λŠ” ν†΅μƒμ μœΌλ‘œ μ‚¬μš©λ˜λŠ” μŠ€νƒ λ‹€λ“œ λ°μ΄ν„°λ² μ΄μŠ€ μ„œμΉ˜ νˆ΄μ„ μ΄μš©ν•œ κ΄‘λ²”μœ„ν•œ λ²ˆμ—­ ν›„ λ³€ν˜• 식별을 κ°€λŠ₯μΌ€ν•˜λŠ” μ„œμΉ˜ 방법을 κ³ μ•ˆν•˜μ˜€λ‹€. μ§ˆλŸ‰λΆ„μ„ λ°μ΄ν„°μ—μ„œ νŠΉμ • λ²ˆμ—­ ν›„ λ³€ν˜• 진단 μ΄μ˜¨μ„ ν¬ν•¨ν•˜λŠ” λ°μ΄ν„°λ§Œ ν•„ν„°λ§ν•˜μ—¬ 이것듀을 λ°μ΄ν„°λ² μ΄μŠ€ μ„œμΉ˜μ— μ‚¬μš©ν•˜λŠ” κ²ƒμœΌλ‘œ, 수백 μ’…μ˜ λ²ˆμ—­ ν›„ λ³€ν˜•μ— λŒ€ν•œ μ„œμΉ˜λ₯Ό κ°€λŠ₯μΌ€ν•˜μ˜€λ‹€. μ’…ν•©ν•˜λ©΄, μš°λ¦¬λŠ” λ²ˆμ—­ ν›„ λ³€ν˜• 진단 μ΄μ˜¨μ„ ν™œμš©ν•˜μ—¬ μ§ˆλŸ‰λΆ„μ„ λ°μ΄ν„°μƒμ˜ λ²ˆμ—­ ν›„ λ³€ν˜•μ˜ κ΄‘λ²”μœ„ν•˜κ³  μ •ν™•ν•œ 식별을 κ°€λŠ₯μΌ€ν•˜λŠ” 방법듀을 κ°œλ°œν•˜μ˜€λ‹€. μ—¬κΈ°μ„œ μ†Œκ°œλœ 방법듀은 지속적인 ν–₯상이 ν•„μš”ν•˜λ©°, μ΄λŠ” μ‚¬λžŒμ˜ λ²ˆμ—­ ν›„ λ³€ν˜• 단백체λ₯Ό μ΄ν•΄ν•˜λŠ”λ° μœ μš©ν•˜κ²Œ ν™œμš©λ  κ²ƒμœΌλ‘œ μ˜ˆμƒν•œλ‹€.YAbstract i List of Contents ii List of Tables and Figures iii Chapter 1. Introduction 1 1.1 Post-search PSM evaluation for the confident identification of authentic modification 4 1.2 Pre-search PTM screening for the large-scale identification of PTMs 5 Chapter 2. Systematic search for diagnostic ions for citrullination 6 2.1 Introduction 6 2.2 Results 8 2.3 Discussion 14 2.4 Methods 15 Chapter 3. Development and application of a statistical model for the confident identification of citrullination 27 3.1 Introduction 27 3.2 Results 27 3.3 Discussion 33 3.4 Methods 33 Chapter 4. Development of a search strategy for the large-scale identification of >200 types of PTMs 44 4.1 Introduction 44 4.2 Results 46 4.3 Discussion 49 4.4 Methods 49 Chapter 5. Conclusion 58 REFERENCES 60 μš” μ•½ λ¬Έ 63 CURRICULUM VITAE 64 ACKNOWLEDGMENT 66DoctordCollectio

    A User‐Friendly Visualization Tool for Multi‐Omics Data

    No full text
    The Clinical Proteomic Tumor Analysis Consortium (CPTAC) initiative has generated large multi-omic datasets for various cancers. Each dataset consists of common and differential data types, including genomics, epigenomics, transcriptomics, proteomics, and post-translational modifications data. They together make up a rich resource for researchers and clinicians interested in understanding cancer biology to draw from. Nevertheless, the complexity of these multi-omic datasets and a lack of an easily accessible analytical and visualization tool for exploring them continue to be a hurdle for those who are not trained in bioinformatics. In this issue, Calinawan et al. describe a user-friendly, web-based visualization platform named ProTrack for exploring the CPTAC clear cell renal cell carcinoma (ccRCC) dataset. Compared to other available visualization tools, ProTrack offers an easy yet powerful customization interface, solely dedicated to the CPTAC ccRCC dataset. Their tool enables ready inspection of potential associations between different data types within a single gene or across multiple genes without any need to code. Specific mutation types or phosphosites can also be easily looked up for any gene of interest. Calinawan et al. aim to extend their work into other CPTAC datasets, which will greatly contribute to the CPTAC as well as cancer biology community in general. Β© 2020 Wiley-VCH GmbH1

    Statistical Modeling for Enhancing the Discovery Power of Citrullination from Tandem Mass Spectrometry Data

    No full text
    Citrullination is a post-translational modification implicated in various human diseases including rheumatoid arthritis, Alzheimer's disease, multiple sclerosis, and cancers. Due to a relatively low concentration of citrullinated proteins in the total proteome, confident identification of citrullinated proteome is challenging in mass spectrometry (MS)-based proteomic analysis. From these MS-based analyses, MS features that characterize citrullination, such as immonium ions (IMs) and neutral losses (NLs), called diagnostic ions, have been reported. However, there has been a lack of systematic approaches to comprehensively search for diagnostic ions and no statistical methods for the identification of citrullinated proteome based on these diagnostic ions. Here, we present a systematic approach to identify diagnostic IMs, internal ions (INTs), and NLs for citrullination from tandem mass (MS/MS) spectra. Diagnostic INTs mainly consisted of internal fragment ions for di- and tripeptides that contained two and three amino acids with at least one citrullinated arginine, respectively. A statistical logistic regression model was built for a confident assessment of citrullinated peptides that database searches identified (true positives) and prediction of citrullinated peptides that database searches failed to identify (false negatives) using the diagnostic IMs, INTs, and NLs. Applications of our model to complex global proteome data sets demonstrated the increased accuracy in the identification of citrullinated peptides, thereby enhancing the size and functional interpretation of citrullinated proteomes. Copyright Β© 2020 American Chemical Society.1

    Novel Online Three-Dimensional Separation Expands the Detectable Functional Landscape of Cellular Phosphoproteome

    No full text
    Protein phosphorylation is a prevalent post-translational modification that regulates essentially every aspect of cellular processes. Currently, liquid chromatography-tandem mass spectrometry (LC-MS/MS) with an extensive offline sample fractionation and a phosphopeptide enrichment method is a best practice for deep phosphoproteome profiling, but balancing throughput and profiling depth remains a practical challenge. We present an online three-dimensional separation method for ultradeep phosphoproteome profiling that combines an online two-dimensional liquid chromatography separation and an additional gas-phase separation. This method identified over 100,000 phosphopeptides (>60,000 phosphosites) in HeLa cells during 1.5 days of data acquisition, and the largest HeLa cell phosphoproteome significantly expanded the detectable functional landscape of cellular phosphoproteome.N

    2-Undecanone derived from Pseudomonas aeruginosa modulates the neutrophil activity

    No full text
    Pseudomonas aeruginosa (P. aeruginosa) is a well-known Gramnegative opportunistic pathogen. Neutrophils play key roles in mediating host defense against P. aeruginosa infection. In this study, we identified a metabolite derived from P. aeruginosa that regulates neutrophil activities. Using gas chromatography-mass spectrometry, a markedly increased level of 2-undecanone was identified in the peritoneal fluid of P. aeruginosa-infected mice. 2-Undecanone elicited the activation of neutrophils in a GΞ±i-phospholipase C pathway. However, 2-undecanone strongly inhibited responses to lipopolysaccharide and bactericidal activity of neutrophils against P. aeruginosa by inducing apoptosis. Our results demonstrate that 2-undecanone from P. aeruginosa limits the innate defense activity of neutrophils, suggesting that the production of inhibitory metabolites is a strategy of P. aeruginosa for escaping the host immune system. [BMB Reports 2022; 55(8): 395-400] Β© 2022 by the The Korean Society for Biochemistry and Molecular Biology1

    Novel Diagnostic Biomarkers for High-Grade Serous Ovarian Cancer Uncovered by Data-Independent Acquisition Mass Spectrometry

    No full text
    High-grade serous ovarian cancer (HGSOC) represents the major histological type of ovarian cancer, and the lack of effective screening tools and early detection methods significantly contributes to the poor prognosis of HGSOC. Currently, there are no reliable diagnostic biomarkers for HGSOC. In this study, we performed liquid chromatography data-independent acquisition tandem mass spectrometry (MS) on depleted serum samples from 26 HGSOC cases and 24 healthy controls (HCs) to discover potential HGSOC diagnostic biomarkers. A total of 1,847 proteins were identified across all samples, among which 116 proteins showed differential expressions between HGSOC patients and HCs. Network modeling showed activations of coagulation and complement cascades, platelet activation and aggregation, neutrophil extracellular trap formation, toll-like receptor 4, insulin-like growth factor, and transforming growth factor beta signaling, as well as suppression of lipoprotein assembly and Fc gamma receptor activation in HGSOC. Based on the network model, we prioritized 28 biomarker candidates and validated 18 of them using targeted MS assays in an independent cohort. Predictive modeling showed a sensitivity of 1 and a specificity of 0.91 in the validation cohort. Finally, in vitro functional assays on four potential biomarkers (FGA, VWF, ARHGDIB, and SERPINF2) suggested that they may play an important role in cancer cell proliferation and migration in HGSOC. All raw data were deposited in PRIDE (PXD033169).N
    corecore