42 research outputs found

    A Novel Statistical Method to Diagnose, Quantify and Correct Batch Effects in Genomic Studies.

    Get PDF
    Genome projects now generate large-scale data often produced at various time points by different laboratories using multiple platforms. This increases the potential for batch effects. Currently there are several batch evaluation methods like principal component analysis (PCA; mostly based on visual inspection), and sometimes they fail to reveal all of the underlying batch effects. These methods can also lead to the risk of unintentionally correcting biologically interesting factors attributed to batch effects. Here we propose a novel statistical method, finding batch effect (findBATCH), to evaluate batch effect based on probabilistic principal component and covariates analysis (PPCCA). The same framework also provides a new approach to batch correction, correcting batch effect (correctBATCH), which we have shown to be a better approach to traditional PCA-based correction. We demonstrate the utility of these methods using two different examples (breast and colorectal cancers) by merging gene expression data from different studies after diagnosing and correcting for batch effects and retaining the biological effects. These methods, along with conventional visual inspection-based PCA, are available as a part of an R package exploring batch effect (exploBATCH; https://github.com/syspremed/exploBATCH )

    Predicting Patterns of Customer Usage, with Niftecash

    Get PDF
    Report is the result of the working during 93rd European Study Group with Industry in Limerick

    Analytical Validation of Multiplex Biomarker Assay to Stratify Colorectal Cancer into Molecular Subtypes.

    Get PDF
    Previously, we classified colorectal cancers (CRCs) into five CRCAssigner (CRCA) subtypes with different prognoses and potential treatment responses, later consolidated into four consensus molecular subtypes (CMS). Here we demonstrate the analytical development and validation of a custom NanoString nCounter platform-based biomarker assay (NanoCRCA) to stratify CRCs into subtypes. To reduce costs, we switched from the standard nCounter protocol to a custom modified protocol. The assay included a reduced 38-gene panel that was selected using an in-house machine-learning pipeline. We applied NanoCRCA to 413 samples from 355 CRC patients. From the fresh frozen samples (n = 237), a subset had matched microarray/RNAseq profiles (n = 47) or formalin-fixed paraffin-embedded (FFPE) samples (n = 58). We also analyzed a further 118 FFPE samples. We compared the assay results with the CMS classifier, different platforms (microarrays/RNAseq) and gene-set classifiers (38 and the original 786 genes). The standard and modified protocols showed high correlation (> 0.88) for gene expression. Technical replicates were highly correlated (> 0.96). NanoCRCA classified fresh frozen and FFPE samples into all five CRCA subtypes with consistent classification of selected matched fresh frozen/FFPE samples. We demonstrate high and significant subtype concordance across protocols (100%), gene sets (95%), platforms (87%) and with CMS subtypes (75%) when evaluated across multiple datasets. Overall, our NanoCRCA assay with further validation may facilitate prospective validation of CRC subtypes in clinical trials and beyond

    Predicting Patterns of Customer Usage, with Niftecash

    Get PDF
    Report is the result of the working during 93rd European Study Group with Industry in Limerick

    A seven-Gene Signature assay improves prognostic risk stratification of perioperative chemotherapy treated gastroesophageal cancer patients from the MAGIC trial

    Get PDF
    BACKGROUND: Following neoadjuvant chemotherapy for operable gastroesophageal cancer, lymph node metastasis is the only validated prognostic variable; however, within lymph node groups there is still heterogeneity with risk of relapse. We hypothesized that gene profiles from neoadjuvant chemotherapy treated resection specimens from gastroesophageal cancer patients can be used to define prognostic risk groups to identify patients at risk for relapse. PATIENTS AND METHODS: The Medical Research Council Adjuvant Gastric Infusional Chemotherapy (MAGIC) trial (n = 202 with high quality RNA) samples treated with perioperative chemotherapy were profiled for a custom gastric cancer gene panel using the NanoString platform. Genes associated with overall survival (OS) were identified using penalized and standard Cox regression, followed by generation of risk scores and development of a NanoString biomarker assay to stratify patients into risk groups associated with OS. An independent dataset served as a validation cohort. RESULTS: Regression and clustering analysis of MAGIC patients defined a seven-Gene Signature and two risk groups with different OS [hazard ratio (HR) 5.1; P < 0.0001]. The median OS of high- and low-risk groups were 10.2 [95% confidence interval (CI) of 6.5 and 13.2 months] and 80.9 months (CI: 43.0 months and not assessable), respectively. Risk groups were independently prognostic of lymph node metastasis by multivariate analysis (HR 3.6 in node positive group, P = 0.02; HR 3.6 in high-risk group, P = 0.0002), and not prognostic in surgery only patients (n = 118; log rank P = 0.2). A validation cohort independently confirmed these findings. CONCLUSIONS: These results suggest that gene-based risk groups can independently predict prognosis in gastroesophageal cancer patients treated with neoadjuvant chemotherapy. This signature and associated assay may help risk stratify these patients for post-surgery chemotherapy in future perioperative chemotherapy-based clinical trials
    corecore