156 research outputs found

    Network enrichment analysis: extension of gene-set enrichment analysis to gene networks

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Gene-set enrichment analyses (GEA or GSEA) are commonly used for biological characterization of an experimental gene-set. This is done by finding known functional categories, such as pathways or Gene Ontology terms, that are over-represented in the experimental set; the assessment is based on an overlap statistic. Rich biological information in terms of gene interaction network is now widely available, but this topological information is not used by GEA, so there is a need for methods that exploit this type of information in high-throughput data analysis.</p> <p>Results</p> <p>We developed a method of network enrichment analysis (NEA) that extends the overlap statistic in GEA to network links between genes in the experimental set and those in the functional categories. For the crucial step in statistical inference, we developed a fast network randomization algorithm in order to obtain the distribution of any network statistic under the null hypothesis of no association between an experimental gene-set and a functional category. We illustrate the NEA method using gene and protein expression data from a lung cancer study.</p> <p>Conclusions</p> <p>The results indicate that the NEA method is more powerful than the traditional GEA, primarily because the relationships between gene sets were more strongly captured by network connectivity rather than by simple overlaps.</p

    A novel method for sample preparation of fresh lung cancer tissue for proteomics analysis by tumor cell enrichment and removal of blood contaminants

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In-depth proteomics analyses of tumors are frequently biased by the presence of blood components and stromal contamination, which leads to large experimental variation and decreases the proteome coverage. We have established a reproducible method to prepare freshly collected lung tumors for proteomics analysis, aiming at tumor cell enrichment and reduction of plasma protein contamination. We obtained enriched tumor-cell suspensions (ETS) from six lung cancer cases (two adenocarcinomas, two squamous-cell carcinomas, two large-cell carcinomas) and from two normal lung samples. The cell content of resulting ETS was evaluated with immunocytological stainings and compared with the histologic pattern of the original specimens. By means of a quantitative mass spectrometry-based method we evaluated the reproducibility of the sample preparation protocol and we assessed the proteome coverage by comparing lysates from ETS samples with the direct lysate of corresponding fresh-frozen samples.</p> <p>Results</p> <p>Cytological analyses on cytospin specimens showed that the percentage of tumoral cells in the ETS samples ranged from 20% to 70%. In the normal lung samples the percentage of epithelial cells was less then 10%. The reproducibility of the sample preparation protocol was very good, with coefficient of variation at the peptide level and at the protein level of 13% and 7%, respectively. Proteomics analysis led to the identification of a significantly higher number of proteins in the ETS samples than in the FF samples (244 vs 109, respectively). Albumin and hemoglobin were among the top 5 most abundant proteins identified in the FF samples, showing a high contamination with blood and plasma proteins, whereas ubiquitin and the mitochondrial ATP synthase 5A1 where among the top 5 most abundant proteins in the ETS samples.</p> <p>Conclusion</p> <p>The method is feasible and reproducible. We could obtain a fair enrichment of cells but the major benefit of the method was an effective removal of contaminants from red blood cells and plasma proteins resulting in larger proteome coverage compared to the direct lysis of frozen samples. This sample preparation method may be successfully implemented for the discovery of lung cancer biomarkers on tissue samples using mass spectrometry-based proteomics.</p

    Glioblastoma stem cells express non-canonical proteins and exclusive mesenchymal-like or non-mesenchymal-like protein signatures

    Get PDF
    Glioblastoma (GBM) cancer stem cells (GSCs) contribute to GBM's origin, recurrence, and resistance to treatment. However, the understanding of how mRNA expression patterns of GBM subtypes are reflected at global proteome level in GSCs is limited. To characterize protein expression in GSCs, we performed in-depth proteogenomic analysis of patient-derived GSCs by RNA-sequencing and mass-spectrometry. We quantified &gt; 10 000 proteins in two independent GSC panels and propose a GSC-associated proteomic signature characterizing two distinct phenotypic conditions; one defined by proteins upregulated in proneural and classical GSCs (GPC-like), and another by proteins upregulated in mesenchymal GSCs (GM-like). The GM-like protein set in GBM tissue was associated with necrosis, recurrence, and worse overall survival. Through proteogenomics, we discovered 252 non-canonical peptides in the GSCs, i.e., protein sequences that are variant or derive from genome regions previously considered non-protein-coding, including variants of the heterogeneous ribonucleoproteins implicated in RNA splicing. In summary, GSCs express two protein sets that have an inverse association with clinical outcomes in GBM. The discovery of non-canonical protein sequences questions existing gene models and pinpoints new protein targets for research in GBM

    A systematic analysis of the effects of increasing degrees of serum immunodepletion in terms of depth of coverage and other key aspects in top-down and bottom-up proteomic analyses

    Get PDF
    Immunodepletion of clinical fluids to overcome the dominance by a few very abundant proteins has been explored but studies are few, commonly examining only limited aspects with one analytical platform. We have systematically compared immunodepletion of 6, 14, or 20 proteins using serum from renal transplant patients, analysing reproducibility, depth of coverage, efficiency, and specificity using 2-D DIGE (‘top-down’) and LC-MS/MS (‘bottom-up’). A progressive increase in protein number (≥2 unique peptides) was found from 159 in unfractionated serum to 301 following 20 protein depletion using a relatively high-throughput 1-D-LC-MS/MS approach, including known biomarkers and moderate–lower abundance proteins such as NGAL and cytokine/growth factor receptors. On the contrary, readout by 2-D DIGE demonstrated good reproducibility of immunodepletion, but additional proteins seen tended to be isoforms of existing proteins. Depletion of 14 or 20 proteins followed by LC-MS/MS showed excellent reproducibility of proteins detected and a significant overlap between columns. Using label-free analysis, greater run-to-run variability was seen with the Prot20 column compared with the MARS14 column (median %CVs of 30.9 versus 18.2%, respectively) and a corresponding wider precision profile for the Prot20. These results illustrate the potential of immunodepletion followed by 1-D nano-LC-LTQ Orbitrap Velos analysis in a moderate through-put biomarker discovery process

    The viral protein corona directs viral pathogenesis and amyloid aggregation

    Get PDF
    Artificial nanoparticles accumulate a protein corona layer in biological fluids, which significantly influences their bioactivity. As nanosized obligate intracellular parasites, viruses share many biophysical properties with artificial nanoparticles in extracellular environments and here we show that respiratory syncytial virus (RSV) and herpes simplex virus type 1 (HSV-1) accumulate a rich and distinctive protein corona in different biological fluids. Moreover, we show that corona pre-coating differentially affects viral infectivity and immune cell activation. In addition, we demonstrate that viruses bind amyloidogenic peptides in their corona and catalyze amyloid formation via surface-assisted heterogeneous nucleation. Importantly, we show that HSV-1 catalyzes the aggregation of the amyloid beta-peptide (A beta(42)), a major constituent of amyloid plaques in Alzheimer's disease, in vitro and in animal models. Our results highlight the viral protein corona as an acquired structural layer that is critical for viral-host interactions and illustrate a mechanistic convergence between viral and amyloid pathologies.Peer reviewe

    Quantitative iTRAQ-Based Proteomic Identification of Candidate Biomarkers for Diabetic Nephropathy in Plasma of Type 1 Diabetic Patients

    Get PDF
    # The Author(s) 2010. This article is published with open access at Springerlink.com Introduction As part of a clinical proteomics programme focused on diabetes and its complications, it was our goal to investigate the proteome of plasma in order to find improved candidate biomarkers to predict diabetic nephropathy. Methods Proteins derived from plasma from a crosssectiona

    Proteomics: Clinical and research applications in respiratory diseases

    Full text link
    The proteome is the study of the protein content of a definable component of an organism in biology. However, the tissue‐specific expression of proteins and the varied post‐translational modifications, splice variants and protein–protein complexes that may form, make the study of protein a challenging yet vital tool in answering many of the unanswered questions in medicine and biology to date. Indeed, the spatial, temporal and functional composition of proteins in the human body has proven difficult to elucidate for many years. Given the effect of microRNA and epigenetic regulation on silencing and enhancing gene transcription, the study of protein arguably provides more accurate information on homeostasis and perturbation in health and disease. There have been significant advances in the field of proteomics in recent years, with new technologies and platforms available to the research community. In this review, we briefly discuss some of these new technologies and developments in the context of respiratory disease. We also discuss the types of data science approaches to analyses and interpretation of the large volumes of data generated in proteomic studies. We discuss the application of these technologies with regard to respiratory disease and highlight the potential for proteomics in generating major advances in the understanding of respiratory pathophysiology into the future.Peer Reviewedhttps://deepblue.lib.umich.edu/bitstream/2027.42/146450/1/resp13383_am.pdfhttps://deepblue.lib.umich.edu/bitstream/2027.42/146450/2/resp13383.pd

    High Abundance Proteins Depletion vs Low Abundance Proteins Enrichment: Comparison of Methods to Reduce the Plasma Proteome Complexity

    Get PDF
    BACKGROUND: To date, the complexity of the plasma proteome exceeds the analytical capacity of conventional approaches to isolate lower abundance proteins that may prove to be informative biomarkers. Only complex multistep separation strategies have been able to detect a substantial number of low abundance proteins (<100 ng/ml). The first step of these protocols is generally the depletion of high abundance proteins by the use of immunoaffinity columns or, alternatively, the enrichment of by the use of solid phase hexapeptides ligand libraries. METHODOLOGY/PRINCIPAL FINDINGS: Here we present a direct comparison of these two approaches. Following either approach, the plasma sample was further fractionated by SCX chromatography and analyzed by RP-LC-MS/MS with a Q-TOF mass spectrometer. The depletion of the 20 most abundant plasma proteins allowed the identification of about 25% more proteins than those detectable following low abundance proteins enrichment. The two datasets are partially overlapping and the identified proteins belong to the same order of magnitude in terms of plasma concentration. CONCLUSIONS/SIGNIFICANCE: Our results show that the two approaches give complementary results. However, the enrichment of low abundance proteins has the great advantage of obtaining much larger amount of material that can be used for further fractionations and analyses and emerges also as a cheaper and technically simpler approach. Collectively, these data indicate that the enrichment approach seems more suitable as the first stage of a complex multi-step fractionation protocol
    corecore