15 research outputs found

    Human participants research checklist.

    No full text
    BackgroundAnalysis of omics data that contain multidimensional biological and clinical information can be complex and make it difficult to deduce significance of specific biomarker factors.MethodsWe explored the utility of propensity score matching (PSM), a statistical technique for minimizing confounding factors and simplifying the examination of specific factors. We tested two datasets generated from cohorts of colorectal cancer (CRC) patients, one comprised of immunohistochemical analysis of 12 protein markers in 544 CRC tissues and another consisting of RNA-seq profiles of 163 CRC cases. We examined the efficiency of PSM by comparing pre- and post-PSM analytical results.ResultsUnlike conventional analysis which typically compares randomized cohorts of cancer and normal tissues, PSM enabled direct comparison between patient characteristics uncovering new prognostic biomarkers. By creating optimally matched groups to minimize confounding effects, our study demonstrates that PSM enables robust extraction of significant biomarkers while requiring fewer cancer cases and smaller overall patient cohorts.ConclusionPSM may emerge as an efficient and cost-effective strategy for multiomic data analysis and clinical trial design for biomarker discovery.</div

    Fig 5 -

    No full text
    (A–B) RNA–seq volcano plot comparing good prognosis group vs. poor prognosis group. Green dots (N = 93) represent genes that are significant in both pre–and post–PSM comparison between the good–and poor–prognosis groups. Blue dots (N = 217) represent genes that are significant only in the pre–PSM comparison between the good–and poor–prognosis groups. Red dots (N = 29) represent genes that are significant only in the post–PSM comparison between the good–and poor–prognosis groups. Grey dots (N = 12,121) represent genes that did not show significant differences. (C) The Venn diagram of significant genes before and after PSM. The blue circle represents before PSM, and the yellow represents after PSM.</p

    Distribution of propensity scores of the IHC CRC dataset.

    No full text
    Distribution of propensity scores of the IHC CRC dataset.</p

    Comparison of protein marker expression between groups before and after PSM.

    No full text
    Comparison of protein marker expression between groups before and after PSM.</p

    Genes commonly selected before and after PSM.

    No full text
    BackgroundAnalysis of omics data that contain multidimensional biological and clinical information can be complex and make it difficult to deduce significance of specific biomarker factors.MethodsWe explored the utility of propensity score matching (PSM), a statistical technique for minimizing confounding factors and simplifying the examination of specific factors. We tested two datasets generated from cohorts of colorectal cancer (CRC) patients, one comprised of immunohistochemical analysis of 12 protein markers in 544 CRC tissues and another consisting of RNA-seq profiles of 163 CRC cases. We examined the efficiency of PSM by comparing pre- and post-PSM analytical results.ResultsUnlike conventional analysis which typically compares randomized cohorts of cancer and normal tissues, PSM enabled direct comparison between patient characteristics uncovering new prognostic biomarkers. By creating optimally matched groups to minimize confounding effects, our study demonstrates that PSM enables robust extraction of significant biomarkers while requiring fewer cancer cases and smaller overall patient cohorts.ConclusionPSM may emerge as an efficient and cost-effective strategy for multiomic data analysis and clinical trial design for biomarker discovery.</div

    Distribution of propensity scores of the IHC CRC dataset with a 2-to-1 matching arrangement.

    No full text
    Distribution of propensity scores of the IHC CRC dataset with a 2-to-1 matching arrangement.</p

    Flow chart of propensity score matching in this study.

    No full text
    It is crucial that the number of cases in group A is not larger than in group B. In this study, group A meant the good prognosis group, and group B meant the poor prognosis group in both datasets.</p

    Comparison of IHC scoring between good and poor prognosis groups after 2-to-1 PSM.

    No full text
    Comparison of IHC scoring between good and poor prognosis groups after 2-to-1 PSM.</p

    Random forest rankings of prognostic factors in the CRC proteomic marker dataset.

    No full text
    (A) Ranking of variable importance (VIMP). The blue bars represent positive values of VIMP, indicating that the corresponding factor is positively associated with prognostic prediction. While the red bars represent negative values of VIMP, indicating that the factor is negatively associated with prognostic prediction. (B) Ranking of minimal depth. The small minimal depth indicates that the factor plays an important role in prognostic prediction. The vertical dashed line indicates the minimal depth threshold where smaller minimal depth values indicate higher importance and larger indicate lower importance as calculated by the “gg_minimal_depth” function of the “ggRandomForests” R package (version 4.7–1.1). (C) The combination of variable importance (VIMP) and minimal depth. The blue dots represent positive values of VIMP, while red dots represent negative values of VIMP. The threshold represented by the vertical red dashed line indicates VIMP = 0. The threshold represented by the horizontal red dashed line is equal to (B).</p

    Clinicopathological features of the proteomic CRC cohort before and after PSM.

    No full text
    Clinicopathological features of the proteomic CRC cohort before and after PSM.</p
    corecore