32 research outputs found
Effects of Pooling Samples on the Performance of Classification Algorithms: A Comparative Study
A pooling design can be used as a powerful strategy to compensate for limited amounts of samples or high biological variation. In this paper, we perform a comparative study to model and quantify the effects of virtual pooling on the performance of the widely applied classifiers, support vector machines (SVMs), random forest (RF), k-nearest neighbors (k-NN), penalized logistic regression (PLR), and prediction analysis for microarrays (PAMs). We evaluate a variety of experimental designs using mock omics datasets with varying levels of pool sizes and considering effects from feature selection. Our results show that feature selection significantly improves classifier performance for non-pooled and pooled data. All investigated classifiers yield lower misclassification rates with smaller pool sizes. RF mainly outperforms other investigated algorithms, while accuracy levels are comparable among all the remaining ones. Guidelines are derived to identify an optimal pooling scheme for obtaining adequate predictive power and, hence, to motivate a study design that meets best experimental objectives and budgetary conditions, including time constraints
Lipocalin 2 expression is associated with aggressive features of endometrial cancer
Background: Increased expression of lipocalin 2 (LCN2) has been observed in several cancers. The aim of the present study was to investigate LCN2 in endometrial cancer in relation to clinico-pathologic phenotype, angiogenesis, markers of epithelial-mesenchymal transition (EMT), and patient survival. Methods: Immunohistochemical staining was performed using a human LCN2 antibody on a population-based series of endometrial cancer patients collected in Hordaland County (Norway) during 1981-1990 (n = 256). Patients were followed from the time of primary surgery until death or last follow-up in 2007. The median follow-up time for survivors was 17 years. Gene expression data from a prospectively collected endometrial cancer series (n = 76) and a publicly available endometrial cancer series (n = 111) was used for gene correlation studies. Results: Expression of LCN2 protein, found in 49% of the cases, was associated with non-endometrioid histologic type (p = 0.001), nuclear grade 3 (p = 0.001), >50% solid tumor growth (p = 0.001), ER and PR negativity (p = 0.028 and 0.006), and positive EZH2 expression (p < 0.001). LCN2 expression was significantly associated with expression of VEGF-A (p = 0.021), although not with other angiogenesis markers examined (vascular proliferation index, glomeruloid microvascular proliferation, VEGF-C, VEGF-D or bFGF2 expression). Further, LCN2 was not associated with several EMT-related markers (E-cadherin, N-cadherin, P-cadherin, β-catenin), nor with vascular invasion (tumor cells invading lymphatic or blood vessels). Notably, LCN2 was significantly associated with distant tumor recurrences, as well as with the S100A family of metastasis related genes. Patients with tumors showing no LCN2 expression had the best outcome with 81% 5-year survival, compared to 73% for intermediate and 38% for the small subgroup with strong LCN2 staining (p = 0.007). In multivariate analysis, LCN2 expression was an independent prognostic factor in addition to histologic grade and FIGO stage. Conclusion: Increased LCN2 expression is associated with aggressive features and poor prognosis in endometrial cancer
Genetic aberration analysis in thai colorectal adenoma and early-stage adenocarcinoma patients by whole-exome sequencing
Colorectal adenomas are precursor lesions of colorectal adenocarcinoma. The transition from adenoma to carcinoma in patients with colorectal cancer (CRC) has been associated with an accumulation of genetic aberrations. However, criteria that can screen adenoma progression to adenocarcinoma are still lacking. This present study is the first attempt to identify genetic aberrations, such as the somatic mutations, copy number variations (CNVs), and high-frequency mutated genes, found in Thai patients. In this study, we identified the genomic abnormality of two sample groups. In the first group, five cases matched normal-colorectal adenoma-colorectal adenocarcinoma. In the second group, six cases matched normal-colorectal adenomas. For both groups, whole-exome sequencing was performed. We compared the genetic aberration of the two sample groups. In both normal tissues compared with colorectal adenoma and colorectal adenocarcinoma analyses, somatic mutations were observed in the tumor suppressor gene APC (Adenomatous polyposis coli) in eight out of ten patients. In the group of normal tissue comparison with colorectal adenoma tissue, somatic mutations were also detected in Catenin Beta 1 (CTNNB1), Family With Sequence Similarity 123B (FAM123B), F-Box And WD Repeat Domain Containing 7 (FBXW7), Sex-Determining Region Y-Box 9 (SOX9), Low-Density Lipoprotein Receptor-Related Protein 5 (LRP5), Frizzled Class Receptor 10 (FZD10), and AT-Rich Interaction Domain 1A (ARID1A) genes, which are involved in the Wingless-related integration site (Wnt) signaling pathway. In the normal tissue comparison with colorectal adenocarcinoma tissue, Kirsten retrovirus-associated DNA sequences (KRAS), Tumor Protein 53 (TP53), and Ataxia-Telangiectasia Mutated (ATM) genes are found in the receptor tyrosine kinase-RAS (RTK–RAS) signaling pathway and p53 signaling pathway, respectively. These results suggest that APC and TP53 may act as a potential screening marker for colorectal adenoma and early-stage CRC. This preliminary study may help identify patients with adenoma and early-stage CRC and may aid in establishing prevention and surveillance strategies to reduce the incidence of CRC
The cientificWorldJOURNAL Research Article Effects of Pooling Samples on the Performance of Classification Algorithms: A Comparative Study
A pooling design can be used as a powerful strategy to compensate for limited amounts of samples or high biological variation. In this paper, we perform a comparative study to model and quantify the effects of virtual pooling on the performance of the widely applied classifiers, support vector machines (SVMs), random forest (RF), k-nearest neighbors (k-NN), penalized logistic regression (PLR), and prediction analysis for microarrays (PAMs). We evaluate a variety of experimental designs using mock omics datasets with varying levels of pool sizes and considering effects from feature selection. Our results show that feature selection significantly improves classifier performance for non-pooled and pooled data. All investigated classifiers yield lower misclassification rates with smaller pool sizes. RF mainly outperforms other investigated algorithms, while accuracy levels are comparable among all the remaining ones. Guidelines are derived to identify an optimal pooling scheme for obtaining adequate predictive power and, hence, to motivate a study design that meets best experimental objectives and budgetary conditions, including time constraints
Switch in FOXA1 status associates with endometrial cancer progression
Background: The transcription factor Forkhead box A1 (FOXA1) is suggested to be important in hormone dependent cancers, although with little data for endometrial cancer. We investigated expression levels of FOXA1 in primary and metastatic endometrial cancer in relation to clinical phenotype, and transcriptional alterations related to FOXA1 status. Methods: Protein expression of FOXA1 was explored by immunohistochemistry in 529 primary and 199 metastatic endometrial carcinoma lesions. mRNA levels from corresponding 158 fresh frozen primary and 42 metastatic lesions were analyzed using Agilent Microarrays (44k) in parallel. Results: Low FOXA1 protein expression in primary tumors significantly correlated with low FOXA1 mRNA, high age, non-endometrioid histology, high grade, loss of ERα and PR and poor survival (all p-values <0.05). Through a Connectivity Map search, HDAC inhibitors were suggested as potential treatment for patients with low FOXA1 expression. An increase in FOXA1 expression was observed from primary to metastatic lesions and it correlated with CDKN2A expression in metastases. Conclusion: Low FOXA1 is associated with poor survival and suggests a potential for HDAC inhibitors in endometrial carcinoma. A switch in FOXA1 expression from primary to metastatic lesions is observed and gene expression indicates a link between FOXA1 and CDKN2A in metastatic lesions
High-Throughput Mutation Profiling of Primary and Metastatic Endometrial Cancers Identifies KRAS, FGFR2 and PIK3CA to Be Frequently Mutated
Background: Despite being the most common pelvic gynecologic malignancy in industrialized countries, no targeted therapies are available for patients with metastatic endometrial carcinoma. In order to improve treatment, underlying molecular characteristics of primary and metastatic disease must be explored. Methodology/Principal Findings: We utilized the mass spectrometric-based mutation detection technology OncoMap to define the types and frequency of point somatic mutations in endometrial cancer. 67 primary tumors, 15 metastases corresponding to 7 of the included primary tumors and 11 endometrial cancer cell lines were screened for point mutations in 28 known oncogenes. We found that 27 (40.3%) of 67 primary tumors harbored one or more mutations with no increase in metastatic lesions. FGFR2, KRAS and PIK3CA were consistently the most frequently mutated genes in primary tumors, metastatic lesions and cell lines. Conclusions/Significance: Our results emphasize the potential for targeting FGFR2, KRAS and PIK3CA mutations in endometrial cancer for development of novel therapeutic strategies