307 research outputs found
Nonsynonymous Single-Nucleotide Variations on Some Posttranslational Modifications of Human Proteins and the Association with Diseases
Protein posttranslational modifications (PTMs) play key roles in a variety of protein activities and cellular processes. Different PTMs show distinct impacts on protein functions, and normal protein activities are consequences of all kinds of PTMs working together. With the development of high throughput technologies such as tandem mass spectrometry (MS/MS) and next generation sequencing, more and more nonsynonymous single-nucleotide variations (nsSNVs) that cause variation of amino acids have been identified, some of which result in the damage of PTMs. The damaged PTMs could be the reason of the development of some human diseases. In this study, we elucidated the proteome wide relationship of eight damaged PTMs to human inherited diseases and cancers. Some human inherited diseases or cancers may be the consequences of the interactions of damaged PTMs, rather than the result of single damaged PTM site
νλ‘ν μ€μ§λ Έλ―Ήμ€ κΈ°λ²μ μ΄μ©ν νμ λ°μ΄μ€λ§μ»€ μ°κ΅¬
νμλ
Όλ¬Έ (λ°μ¬)-- μμΈλνκ΅ λνμ : μμκ³Όλν μμνκ³Ό, 2019. 2. μ‘°μ μ΄.μ λ°μλ£ ν¨λ¬λ€μμ λ±μ₯ μ΄ν, μ§νμ μ§λ¨ λ° μΉλ£λ₯Ό μν΄μ λ°μ΄μ€λ§μ»€μ λν μμλ λμμ§κ³ μλ€. κ°μ€κΈ°λ°μ°κ΅¬λ μ ν΅μ μΌλ‘ λΉμ°νκ² μ¬μ©λμ€λ μ°κ΅¬μν체κ³μ΄μ§λ§, λ°μ΄μ€λ§μ»€ λ°κ΅΄μμ νμ°μ μΌλ‘ λ§μ£ΌμΉκ²λλ κ΄λ²μν μ€ν¬λ¦¬λ μμ
μμλ ν¨μ¨μ±μ νκ³λ₯Ό λλ¬λΈλ€. μ€λ―Ήμ€κΈ°μ μ λ±μ₯κ³Ό ν¨κ» μ§νμ°κ΅¬μ ν¨λ¬λ€μμ μ¦κ±°κΈ°λ° λκ·λͺ¨ νκ²λ°κ΅΄λ°©μμΌλ‘ λ³ννκ³ μλ€. λ¨λ°±μ§μ μ체 κΈ°λ₯μ‘°μ μ μ§μ μ μΌλ‘ κ΄μ¬νλ λ¬Όμ§μ΄κΈ° λλ¬Έμ λ°μ΄μ€λ§μ»€λ‘ νμ©ν μ μλ κ°μ₯ μ΄μμ μΈ λ¬Όμ§λ‘ μ¬κ²¨μ§λ€. μ§λλΆμκΈ°λ₯Ό μ΄μ©ν λ¨λ°±μ²΄λΆμμ λ¨λ°±μ§μ μ§μ μ μ± λ° μ λν μ μμ λΏλ§ μλλΌ λ§€μ° μμ°μ±μ΄ λμ μ§ν λ°μ΄μ€λ§μ»€ λ°κ΅΄μ μ μ©νλ€. μ΄ λ
Όλ¬Έμμλ μ§λλΆμκΈ°λ₯Ό μ΄μ©νμ¬ νμ λ°μ΄μ€λ§μ»€μ λ°κ΅΄μ μν κ³ λνλ λΆμκΈ°λ²μΈ νλ‘ν
μ€μ§λ
Έλ―Ήμ€ κΈ°λ²μ μ μ©κ³Ό, μ€ν¬λ¦¬λλ λ°μ΄μ€λ§μ»€ν보 λ¨λ°±μ§μ μ λκ²μ¦ λ° νμ κ°λ³μ§λ¨ μ‘°ν©λ§μ»€μ μμ±μ°κ΅¬μ λνμ¬ μμλ³Έλ€.
CHAPTER Iμμλ μΈκ°μΌμμ²΄κΈ°λ° λ¨λ°±μ²΄νλ‘μ νΈ (C-HPP)μ μΌνμΌλ‘ μνλ μΌμ체9λ²μ λν λ¨λ°±μ²΄μ°κ΅¬κ° ν¬ν¨λμ΄μλ€. λ―ΈνμΈ λ¨λ°±μ§κ³Ό μ μ λ¨λ°±μ²΄μμ λ°κ²¬λμ§ μμλ μκ·Έλμ²λ₯Ό λ°νλ΄κΈ° μν΄ LC-MS/MSλΆμκ³Ό RNA-seq μ°¨μΈλμΌκΈ°λΆμκΈ°λ²μ μ μ©νμ¬ μμμ’
νμνμ 5λͺ
μ μ μ-μ’
μμ‘°μ§μ λΆμνμλ€. μΌμ체μ€μ¬-μΈκ°λ¨λ°±μ²΄νλ‘μ νΈμ 2013λ
리ν¬νΈμμλ neXtProt μΈκ°λ¨λ°±μ²΄ λ°μ΄ν°λ² μ΄μ€λ₯Ό κΈ°μ€μΌλ‘ μΌμ체 9λ²μμ 170κ°μ λ―ΈνμΈ λ¨λ°±μ§μ΄ μλ κ²μΌλ‘ μλ €μ‘μΌλ©°, λ³Έ λ
Όλ¬Έμ μ°κ΅¬κ° μ§νλ 2015λ
μλ 133κ°κ° κ³μ λ―ΈνμΈμνλ‘ λ¨μμμλ€. λ³Έ λ
Όλ¬Έμ λ¨λ°±μ²΄λΆμμμλ 19κ°μ λ―ΈνμΈ λ¨λ°±μ§μ λμ ν μ μμμΌλ©°, κ·Έ μ€μμ μΌμ체9λ²μ ν΄λΉνλ λ¨λ°±μ§μ SPATA31A4 ν κ° μλ€. RNA-seqλΆμμΌλ‘λ μμ’
νμμ‘°μ§ 5κ°μμ 곡ν΅μ μΌλ‘ κ²μΆλλ©΄μ μ μμ‘°μ§μμλ κ²μΆλμ§ μλnonsynonymous SNP 5κ° (CDH17, HIST1H1T, SAPCD2, ZNF695) κ·Έλ¦¬κ³ synonymous SNP 3κ°λ₯Ό λ°κ΅΄ν μ μμλ€.
νλ‘ν
μ€μ§λ
Έλ―Ή λΆμμ μν΄μ κ° μλ£λ³ RNA-seqλ°μ΄ν°λ₯Ό κ°κ³΅νμ¬ λ§μΆ€ν λ°μ΄ν°λ² μ΄μ€λ₯Ό ꡬμΆνμλ€. μ΄λ κ² μμ±λ μλ£λ§μΆ€ν λ°μ΄ν°λ² μ΄μ€λ₯Ό λ¨λ°±μ²΄ μ§λλΆμλ°μ΄ν° κ²μμ νμ©νμ¬ 5κ° μ μ μ(LTF, HDLBP, TF, HBD, HLA-DRB5)μ ν΄λΉνλ 7κ°μ λμ°λ³μ΄λ₯Ό κ²μΆνμλ€. λ κ°μ λμ°λ³μ΄λ μ μμ‘°μ§μμλ κ²μΆλμ§ μκ³ μμ‘°μ§μμλ§ κ²μΆλμλ€. λν, μ΄ κ²°κ³Όμμλ μ μ-μμ‘°μ§ λͺ¨λμμ μμ μ μ (EEF1A1P5) ν©ν°λλ₯Ό κ²μΆν μ μμλ€.
CHAPTER IIμμλ λ€μ€λ°μκ²μ§λ² (MRM) μ μ΄μ©ν λ¨λ°±μ§ λ°μ΄μ€λ§μ»€ κ²μ¦κ³Ό μ‘°ν©λ§μ»€ ꡬμ±μ λν μ°κ΅¬λ₯Ό μμ νμλ€. νμκ³Ό λ€λ₯Έ νμ§νμ κ°λ³μ΄ μ΄λ ΅κΈ° λλ¬Έμ νμμ μ€μ§λ¨ μνμ΄ ν° μ§λ³μ΄λ€. λ°λΌμ νμ²κΈ°λ°μ νμκ°λ³μ§λ¨ λ°μ΄μ€λ§μ»€κ°λ°μ νμμ±μ λ리 μΈμ λκ³ μλ€. μ΄ λ¨μμμλ νμνμμ λμ‘°κ΅°νμ§ν νμ 198λͺ
μ νμ²μλ£λ₯Ό νμ©νμ¬ μΌκ³±κ°μ νμλ°μ΄μ€λ§μ»€ ν보λ¨λ°±μ§μ λλ
Έμ μ μ‘체ν¬λ‘λ§ν κ·ΈλνΌ-λ€μ€λ°μκ²μ§λ²μΌλ‘ μ λνμλ€.
ν보λ¨λ°±μ§μ κ°λ³λ‘ λΆμνμμ λμλ SERPINA4λ§μ΄ ν΅κ³μ μΌλ‘ μ μμ±μκ² νμ€λλκ° κ°μνλ κ²μΌλ‘ λνλ¬λ€. λ€μ€λ°μκ²μ§λ² μ 체λ°μ΄ν°λ₯Ό μμμ 보μ ν¨κ» λ‘μ§μ€ν±νκ·λͺ¨λΈμ μ μ©νμ¬ νλμ μ‘°ν©λ§μ»€λ‘ λ§λ€ μ μμλ€. μ΄ κ³Όμ μμ κ°λ³λ§μ»€λ‘λ ν΅κ³μ μΈ μ μμ±μ΄ λλλ¬μ§μ§ μμ§λ§ κ°μν¨κ³Όλ₯Ό λ§λ€μ΄λΌ μ μλ λ³μλ₯Ό κ³ λ €νμ¬ λͺ¨λΈλ§μ μ§ννμλ€. μ΅μ’
μ μΌλ‘ SERPINA4, PON1, λμ΄λ₯Ό μ‘°ν©νμμ λ κ°μ₯ μ΅μ μ μ‘°ν©λ§μ»€κ° μμ±λμλ€. μ΄ μ‘°ν©λ§μ»€λ AUC 0.915 μ κ°λ³μ§λ¨ μ±λ₯μ 보μ¬μ£ΌμμΌλ©°, λͺ¨λΈμ λ§λλλ° μ¬μ©λμλ μλ£μλ λ³κ°μ κ²μ¦κ΅°μμλ μ±λ₯μ μ μ§λμλ€. μ΄μ κ°μ΄ ν΅κ³λͺ¨λΈμ μ΄μ©νμ¬ μμ±ν μ‘°ν©λ§μ»€λ κ°λ³ λΆμλ§μ»€λ₯Ό μ΄μ©νμ κ²½μ°λ³΄λ€ κ°μ λ νμ κ°λ³μ§λ¨λ₯λ ₯μ 보μ¬μ€ μ μμμ μ μνλ€.Biomarkers have been in high demand for disease diagnosis and therapeutics. Traditional hypothesis-based research has been challenging due to massive screening studies. Together with the emergence of omics technologies, currently, the paradigm for disease research has been moving toward evidence-based large-scale discovery studies. Proteins, as key effector molecules, can serve as ideal biomarkers for various diseases because they catalyze every biological function. Proteomics, which is represented by mass spectrometry (MS) technologies, stands as a solution for disease diagnosis and drug target discovery.
CHAPTER I includes a portion of a report from of the human proteome project (HPP) related to chromosome 9 (Chr 9). To identify missing proteins (MPs) and their potential features in regard to proteogenomic view, both LC-MS/MS analysis and next-generation RNA sequencing (RNA-seq)-based tools were used for the clinical samples including adjacent non-tumor tissues. When the Chr 9 working group of the Chromosome-Centric Human Proteome Project (C-HPP) began this project, there were 170 remaining MPs encoded by Chr 9 (neXtProt 2013.09.26 rel.)currently, 133 MPs remain unidentified at present (neXtProt 2015.04.28 rel.). Proteome analysis in this study identified 19 missing proteins across all chromosomes and one MP (SPATA31A4) from Chr 9. RNA-seq analysis enable detection of RNA expression of 4 nonsynonymous (NS) SNPs (in CDH17, HIST1H1T, SAPCD2, and ZNF695) and 3 synonymous SNPs (in CDH17, CST1, and HNF1A) in all 5 tumor tissues but not in any of the adjacent normal tissues. By constructing a cancer patient sample-specific protein database based on individual RNA-seq data, and by searching the proteomics data from the same sample, 7 missense mutations in 5 genes (LTF, HDLBP, TF, HBD, and HLA-DRB5) were identified. Two of these mutations were found in tumor tissues but not in the paired normal tissues. Additionally, this study discovered peptides that were derived from the expression of a pseudogene (EEF1A1P5) in both tumor and normal tissues. In summary, this proteogenomic study of human primary lung tumor tissues enabled detection of additional missing proteins and revealed novel missense mutations and synonymous SNP signatures, some of which are predicted to be specific to lung cancer.
CHAPTER II describes a study of the combination marker model using multiple reaction monitoring (MRM) quantitative data. Misdiagnosis of lung cancer remains a serious problem due to the difficulty of distinguishing lung cancer from other respiratory lung diseases. As a result, the development of serum-based differential diagnostic biomarkers is in high demand. In this study, 198 serum samples from non-cancer lung disease and lung cancer patients were analyzed using nLC-QqQ-MS to examine the diagnostic efficacy of seven lung cancer biomarker candidates. When the candidates were assessed individually, only SERPINEA4 showed statistically significant changes in the sera of cancer patient compared to those of control samples. The MRM results and clinical information were analyzed using logistic regression analysis to a select model for the best meta-marker, or combination of biomarkers for the differential diagnosis. Additionally, under consideration of statistical interaction, variables having a low significance as a single factor but statistically influencing the meta-marker model were selected. Using this probabilistic classification, the best meta-marker was determined to comprise two proteins SERPINA4 and PON1, with an age factor. This meta-marker showed an enhanced differential diagnostic capability (AUC=0.915) to distinguish the lung cancer from lung disease patient groups. These results suggest that a statistical model can determine optimal meta-markers, which may have better specificity and sensitivity than a single biomarker and may thus improve the differential diagnosis of lung cancer and lung disease patients.ABSTRACT_i
CONTENTS_v
LIST OF FIGURES_vii
LIST OF TABLES_x
ABBREVIATIONS_xii
BACKGROUND_1
1. LUNG CANCER_1
2. BIOMARKER_7
3. MASS SPECTROMETRY BASED PROTEOMICS_12
4. PROTEOGENOMICS_24
5. TARGETED PROTEOMICS_33
CHAPTER I
Proteogenomic Study: Variant Proteome and Transcriptome in Human Lung Adenocarcinoma Tissues_41
1. INTRODUCTION_42
2. MATERIALS AND METHODS_45
3. RESULTS AND DISCUSSION_53
4. CONCLUSION_81
CHAPTER II
Multi-Panel Biomarker Development for the Efficient Discrimination of Lung Cancer for Other Lung Diseases_84
1. INTRODUCTION_85
2. MATERIALS AND METHODS_88
3. RESULTS_93
4. DISCUSSION_120
5. CONCLUSION_127
GENERAL CONCLUSION_129
REFERENCES_131
ABSTRACT IN KOREAN_154Docto
Viral Modulation of Host Translation and Implications for Vaccine Development
Translation of mRNAs into protein is an essential mechanism of regulating gene expressionβand a step exploited by viruses for their own propagation. In this article, we review mechanisms that govern translation and provide an overview of the translation machinery, discuss some of the components involved in this process, and discuss how viruses modulate host translational controls and implications in vaccine design
Human germline and pan-cancer variomes and their distinct functional profiles
Identification of non-synonymous single nucleotide variations (nsSNVs) has exponentially increased due to advances in Next-Generation Sequencing technologies. The functional impacts of these variations have been difficult to ascertain because the corresponding knowledge about sequence functional sites is quite fragmented. It is clear that mapping of variations to sequence functional features can help us better understand the pathophysiological role of variations. In this study, we investigated the effect of nsSNVs on more than 17 common types of post-translational modification (PTM) sites, active sites and binding sites. Out of 1 705 285 distinct nsSNVs on 259 216 functional sites we identified 38 549 variations that significantly affect 10 major functional sites. Furthermore, we found distinct patterns of site disruptions due to germline and somatic nsSNVs. Pan-cancer analysis across 12 different cancer types led to the identification of 51 genes with 106 nsSNV affected functional sites found in 3 or more cancer types. 13 of the 51 genes overlap with previously identified Significantly Mutated Genes (Nature. 2013 Oct 17;502(7471)). 62 mutations in these 13 genes affecting functional sites such as DNA, ATP binding and various PTM sites occur across several cancers and can be prioritized for additional validation and investigations
UGT pharmacogenomics in drug metabolism and diseases
Glucuronidation, mediated by UDP-glucuronosyltransferase enzymes (UGTs), is a major phase
II biotransformation pathway and, complementary to phase I metabolism and membrane
transport, one of the most important cellular defense mechanism responsible for the inactivation
of therapeutic drugs, other xenobiotics and numerous endogenous molecules. Individual
variability in UGT enzymatic pathways is significant and may have profound pharmacological
and toxicological implications. Several genetic and genomic processes are underlying this
variability and are discussed in the context of drug metabolism and diseases such as cancer
Identification and functional characterization of genetic variants in the human indoleamine 2, 3-dioxygenase (INDO) gene
Indiana University-Purdue University Indianapolis (IUPUI)Indoleamine 2,3-dioxygenase (IDO) is a rate limiting enzyme in tryptophan
catabolism that has been implicated in the pathogenesis of a number of diseases.
Large interindividual variability in IDO activity in the absence of stimuli and as the
result of therapy induced changes has been reported. This variability has the potential
to contribute to susceptibility to disease and to interindividual variability in
therapeutic response.
To identify genetic variations that might contribute to interindividual
variability in IDO activity, we resequenced the exons, intron/exon borders and 1.3 kb
of the 5β-flanking region of the INDO gene in 48 African-American (AA) and 48
Caucasian-American (CA) subjects from the Coriell DNA Repository. A total of 24
INDO variants were identified. Seventeen of these were in exons, introns, or
exon/intron boundries, while seven were within 1.3 kb upstream of the translation
start site. Seventeen are novel and 7 were previously identified.
When transiently expressed in COS-7 or HEK293 cells the amino acid
sequence change in Arg77His resulted in significant decrease in activity, and it
reduced the Vmax of IDO. The Arg77His variant and the 9 bp deletion resulted in
nearly complete loss of enzyme activity and a lack of detectable protein expression.
The function of the Arg77His variant IDO was restored in a dose dependent
manner by the heme analog hemin; but there was no associated increase in IDO
protein. Cellular heme concentration was higher in cells transfected with the wild
type and Ala4Thr variant constructs, but not in cells transfected with the Arg77His
variant. The heme synthesis inhibitor, succinylacetone (SA), blocked IDO activity in
cells transfected with Arg77His.
We identified 22 putative transcription binding sites within the 1.3 kb
upstream of the translation start site. Two of the SNPs were located in GATA3 and
FOXC1 sites. A specific 3-SNP haplotype reduced promoter activity when transiently
transfected into 2 different cell lines.
We conclude that there are naturally occurring genetic variants in the INDO
gene which affect both expression and activity. These results make clear that
interindividual variability in IDO activity at baseline or in response to therapy may be
in part due to inherited genetic variability
Graves' disease: introducing new genetic and epigenetic contributors
Autoimmune thyroid disease (AITD) accounts for 90% of all thyroid diseases and affects 2-5% of the population with remarkable familial clustering. Among AITDs, Graves' disease (GD) is a complex disease affecting thyroid function. Over the l ast two decades, casecontrol studies using cutting-edge gene sequencing techniques have detected various susceptible loci that may predispose individuals to GD. It has been presumed that all likely associated genes, variants, and polymorphisms might be responsible for 75-80% of the heritability of GD. As a result, there are implications concerning the potential contribution of environmental and epigenetic factors in the pathogenesis of GD, including its initiation, progression, and development. Numerous review studies have summarized the contribution of genetic factors in GD until now, but there are still some key questions and notions that have not been discussed concerning the interplay of genetic, epigenetic, and immunological factors. With this in mind, this review discusses some newly-identified loci and their potential roles in the pathogenicity of GD. This may lead to the identification of new, promising therapeutic targets. Here, we emphasized principles, listed all the reported disease-associated genes and polymorphisms, and also summarized the current understanding of the epigenetic basis of GD
- β¦