A comparative study of variable selection procedures applied in high dimensional medical problems

Abstract

In health studies, many potential factors are usually introduced to determine an outcome variable. In our study, different statistical methods are applied to analyze trauma annual data, collected by 30 General Hospitals in Greece. The first dataset consists of 1681 observations and 76 factors and the second of 6334 observations and 131 factors, that include demographic, transport and intrahospital data. The statistical methods employed in this work were the nonconcave penalized likelihood methods, SCAD, LASSO, and Hard, the generalized linear logistic regression, and the best subset variable selection, used to detect possible risk factors of death. A variety of different statistical models are considered, with respect to the combinations of factors and the number of observations. A comparative survey reveals differences between results and execution times of each method, and the analysis produces models that identify the significant prognostic factors affecting death from trauma

    Similar works

    Full text

    thumbnail-image

    Available Versions