18 research outputs found
Comparison of normalisation methods for surface-enhanced laser desorption and ionisation (SELDI) time-of-flight (TOF) mass spectrometry data
<p>Abstract</p> <p>Background</p> <p>Mass spectrometry for biological data analysis is an active field of research, providing an efficient way of high-throughput proteome screening. A popular variant of mass spectrometry is SELDI, which is often used to measure sample populations with the goal of developing (clinical) classifiers. Unfortunately, not only is the data resulting from such measurements quite noisy, variance between replicate measurements of the same sample can be high as well. Normalisation of spectra can greatly reduce the effect of this technical variance and further improve the quality and interpretability of the data. However, it is unclear which normalisation method yields the most informative result.</p> <p>Results</p> <p>In this paper, we describe the first systematic comparison of a wide range of normalisation methods, using two objectives that should be met by a good method. These objectives are minimisation of inter-spectra variance and maximisation of signal with respect to class separation. The former is assessed using an estimation of the coefficient of variation, the latter using the classification performance of three types of classifiers on real-world datasets representing two-class diagnostic problems. To obtain a maximally robust evaluation of a normalisation method, both objectives are evaluated over multiple datasets and multiple configurations of baseline correction and peak detection methods. Results are assessed for statistical significance and visualised to reveal the performance of each normalisation method, in particular with respect to using no normalisation. The normalisation methods described have been implemented in the freely available MASDA R-package.</p> <p>Conclusion</p> <p>In the general case, normalisation of mass spectra is beneficial to the quality of data. The majority of methods we compared performed significantly better than the case in which no normalisation was used. We have shown that normalisation methods that scale spectra by a factor based on the dispersion (e.g., standard deviation) of the data clearly outperform those where a factor based on the central location (e.g., mean) is used. Additional improvements in performance are obtained when these factors are estimated locally, using a sliding window within spectra, instead of globally, over full spectra. The underperforming category of methods using a globally estimated factor based on the central location of the data includes the method used by the majority of SELDI users.</p
Recommended from our members
Biomarker discovery and redundancy reduction towards classification using a multi-factorial MALDI-TOF MS T2DM mouse model dataset
Diabetes like many diseases and biological processes is not mono-causal. On the one hand multifactorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics
Expression of the phosphorylated MEK5 protein is associated with TNM staging of colorectal cancer
<p>Abstract</p> <p>Background</p> <p>Activation of MEK5 in many cancers is associated with carcinogenesis through aberrant cell proliferation. In this study, we determined the level of phosphorylated MEK5 (pMEK5) expression in human colorectal cancer (CRC) tissues and correlated it with clinicopathologic data.</p> <p>Methods</p> <p>pMEK5 expression was examined by immunohistochemistry in a tissue microarray (TMA) containing 335 clinicopathologic characterized CRC cases and 80 cases of nontumor colorectal tissues. pMEK5 expression of 19 cases of primary CRC lesions and paired with normal mucosa was examined by Western blotting. The relationship between pMEK5 expression in CRC and clinicopathologic parameters, and the association of pMEK5 expression with CRC survival were analyzed respectively.</p> <p>Results</p> <p>pMEK5 expression was significantly higher in CRC tissues (185 out of 335, 55.2%) than in normal tissues (6 out of 80, 7.5%; <it>P </it>< 0.001). Western blotting demonstrated that pMEK5 expression was upregulated in 12 of 19 CRC tissues (62.1%) compared to the corresponding adjacent nontumor colorectal tissues. Overexpression of pMEK5 in CRC tissues was significantly correlated to the depth of invasion (<it>P </it>= 0.001), lymph node metastasis (<it>P </it>< 0.001), distant metastasis (<it>P </it>< 0.001) and high preoperative CEA level (<it>P </it>< 0.001). Consistently, the pMEK5 level in CRC tissues was increased following stage progression of the disease (<it>P </it>< 0.001). Analysis of the survival curves showed a significantly worse 5-year disease-free (<it>P </it>= 0.002) and 5-year overall survival rate (<it>P </it>< 0.001) for patients whose tumors overexpressed pMEK5. However, in multivariate analysis, pMEK5 was not an independent prognostic factor for CRC (DFS: <it>P </it>= 0.139; OS: <it>P </it>= 0.071).</p> <p>Conclusions</p> <p>pMEK5 expression is correlated with the staging of CRC and its expression might be helpful to the TNM staging system of CRC.</p
Genomic and oncoproteomic advances in detection and treatment of colorectal cancer
<p>Abstract</p> <p>Aims</p> <p>We will examine the latest advances in genomic and proteomic laboratory technology. Through an extensive literature review we aim to critically appraise those studies which have utilized these latest technologies and ascertain their potential to identify clinically useful biomarkers.</p> <p>Methods</p> <p>An extensive review of the literature was carried out in both online medical journals and through the Royal College of Surgeons in Ireland library.</p> <p>Results</p> <p>Laboratory technology has advanced in the fields of genomics and oncoproteomics. Gene expression profiling with DNA microarray technology has allowed us to begin genetic profiling of colorectal cancer tissue. The response to chemotherapy can differ amongst individual tumors. For the first time researchers have begun to isolate and identify the genes responsible. New laboratory techniques allow us to isolate proteins preferentially expressed in colorectal cancer tissue. This could potentially lead to identification of a clinically useful protein biomarker in colorectal cancer screening and treatment.</p> <p>Conclusion</p> <p>If a set of discriminating genes could be used for characterization and prediction of chemotherapeutic response, an individualized tailored therapeutic regime could become the standard of care for those undergoing systemic treatment for colorectal cancer. New laboratory techniques of protein identification may eventually allow identification of a clinically useful biomarker that could be used for screening and treatment. At present however, both expression of different gene signatures and isolation of various protein peaks has been limited by study size. Independent multi-centre correlation of results with larger sample sizes is needed to allow translation into clinical practice.</p
Mass spectrometry and multivariate analysis to classify cervical intraepithelial neoplasia from blood plasma: an untargeted lipidomic study
Cervical cancer is still an important issue of public health since it is the fourth most frequent type of cancer in women worldwide. Much effort has been dedicated to combating this cancer, in particular by the early detection of cervical pre-cancerous lesions. For this purpose, this paper reports the use of mass spectrometry coupled with multivariate analysis as an untargeted lipidomic approach to classifying 76 blood plasma samples into negative for intraepithelial lesion or malignancy (NILM, nâ=â42) and squamous intraepithelial lesion (SIL, nâ=â34). The crude lipid extract was directly analyzed with mass spectrometry for untargeted lipidomics, followed by multivariate analysis based on the principal component analysis (PCA) and genetic algorithm (GA) with support vector machines (SVM), linear (LDA) and quadratic (QDA) discriminant analysis. PCA-SVM models outperformed LDA and QDA results, achieving sensitivity and specificity values of 80.0% and 83.3%, respectively. Five types of lipids contributing to the distinction between NILM and SIL classes were identified, including prostaglandins, phospholipids, and sphingolipids for the former condition and Tetranor-PGFM and hydroperoxide lipid for the latter. These findings highlight the potentiality of using mass spectrometry associated with chemometrics to discriminate between healthy women and those suffering from cervical pre-cancerous lesions