9 research outputs found

    Clinical Value of Prognosis Gene Expression Signatures in Colorectal Cancer: A Systematic Review

    Get PDF
    <div><h3>Introduction</h3><p>The traditional staging system is inadequate to identify those patients with stage II colorectal cancer (CRC) at high risk of recurrence or with stage III CRC at low risk. A number of gene expression signatures to predict CRC prognosis have been proposed, but none is routinely used in the clinic. The aim of this work was to assess the prediction ability and potential clinical usefulness of these signatures in a series of independent datasets.</p> <h3>Methods</h3><p>A literature review identified 31 gene expression signatures that used gene expression data to predict prognosis in CRC tissue. The search was based on the PubMed database and was restricted to papers published from January 2004 to December 2011. Eleven CRC gene expression datasets with outcome information were identified and downloaded from public repositories. Random Forest classifier was used to build predictors from the gene lists. Matthews correlation coefficient was chosen as a measure of classification accuracy and its associated p-value was used to assess association with prognosis. For clinical usefulness evaluation, positive and negative post-tests probabilities were computed in stage II and III samples.</p> <h3>Results</h3><p>Five gene signatures showed significant association with prognosis and provided reasonable prediction accuracy in their own training datasets. Nevertheless, all signatures showed low reproducibility in independent data. Stratified analyses by stage or microsatellite instability status showed significant association but limited discrimination ability, especially in stage II tumors. From a clinical perspective, the most predictive signatures showed a minor but significant improvement over the classical staging system.</p> <h3>Conclusions</h3><p>The published signatures show low prediction accuracy but moderate clinical usefulness. Although gene expression data may inform prognosis, better strategies for signature validation are needed to encourage their widespread use in the clinic.</p> </div

    Heatmap showing Matthews Correlation Coefficient (MCC) values for each signature in each dataset as result of analyses with Random Forest.

    No full text
    <p>Rows correspond to signatures and columns to datasets. Last column shows a pooled MCC across datasets using sample size as weights. Black lines delimit the first five signatures for which training datasets were available (cells highlighted in black). Cells representing signatures and datasets used to validate them are highlighted in blue. Color scale represents the MCC values: the darker the color, the higher MCC (see the legend). Negative values were collapsed to zero.</p

    Description of signatures used in this work.

    No full text
    <p><b>Signature</b>: signature name; <b>Training dataset</b>: public training data set if used in this work; <b>Validation dataset</b>: public test data set if used in this work<b>; Signature size</b>: reported signature size in the original paper (genes or features):; <b>Training sample size (good + poor)</b>: sample size of training data set, separating good and poor prognosis when reported; <b>Training outcome</b>: outcome used to derive the signature; <b>Training platform</b>: platform used for the training data set; <b>Signature validation</b>: type of validation for signature if performed; <b>Independent validation outcome</b>: outcome used for independent validation if performed; V<b>alidation results:</b> for each validation performed, accuracy classification measures or association assessing if provided; <b>Reference</b>: PMID and reference for publishing paper. <b>*</b> Frequencies of subgroups were not available. <b>Abbreviations</b>: <b>TMA</b>: tissue microarray; <b>cv</b>: cross-validation; <b>ns</b>: not specified.</p

    Datasets description.

    No full text
    <p><b>Dataset:</b> GEO or Array Express dataset identifier; <b>Trained signatures:</b> signatures which used that dataset as training sample, if any; <b>Validation signatures:</b> signatures which used that dataset as independent validation sample; <b>Outcome:</b> type of relapse used for that dataset; <b>Minimum follow up:</b> minimum follow up required for that dataset, when this info was available; <b>Number of samples:</b> number of samples contained in that dataset, showing good and bad prognosis’ separately between brackets; <b>Clinical info</b>: samples ranges of stage and microsatellite status when this information was available; <b>Platform:</b> datasets’ hybridization platform. <b>*</b> NA: the authors do not provide clinical information about MSI and/or stage. No info: Although authors provide clinical information in the paper, samples are not labelled with this information in GEO or ArrayExpress. <b>a.</b> Stage II and III samples from data sets GSE17536 and GSE17537 were jointly used to derive signature VL10, but the later did not include enough events at these stage subgroups. <b>b.</b> Signature JS09 was built with Duke’s A and D and validated with Duke’s B and C samples.</p

    Additional file 1 of Colorectal cancer incidences in Lynch syndrome: a comparison of results from the prospective lynch syndrome database and the international mismatch repair consortium

    No full text
    Additional file 1
    corecore