2,373 research outputs found

    Survival ensembles by the sum of pairwise differences with application to lung cancer microarray studies

    Full text link
    Lung cancer is among the most common cancers in the United States, in terms of incidence and mortality. In 2009, it is estimated that more than 150,000 deaths will result from lung cancer alone. Genetic information is an extremely valuable data source in characterizing the personal nature of cancer. Over the past several years, investigators have conducted numerous association studies where intensive genetic data is collected on relatively few patients compared to the numbers of gene predictors, with one scientific goal being to identify genetic features associated with cancer recurrence or survival. In this note, we propose high-dimensional survival analysis through a new application of boosting, a powerful tool in machine learning. Our approach is based on an accelerated lifetime model and minimizing the sum of pairwise differences in residuals. We apply our method to a recent microarray study of lung adenocarcinoma and find that our ensemble is composed of 19 genes, while a proportional hazards (PH) ensemble is composed of nine genes, a proper subset of the 19-gene panel. In one of our simulation scenarios, we demonstrate that PH boosting in a misspecified model tends to underfit and ignore moderately-sized covariate effects, on average. Diagnostic analyses suggest that the PH assumption is not satisfied in the microarray data and may explain, in part, the discrepancy in the sets of active coefficients. Our simulation studies and comparative data analyses demonstrate how statistical learning by PH models alone is insufficient.Comment: Published in at http://dx.doi.org/10.1214/10-AOAS426 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

    The Red Stockings

    Get PDF

    Risk prediction for prostate cancer recurrence through regularized estimation with simultaneous adjustment for nonlinear clinical effects

    Full text link
    In biomedical studies it is of substantial interest to develop risk prediction scores using high-dimensional data such as gene expression data for clinical endpoints that are subject to censoring. In the presence of well-established clinical risk factors, investigators often prefer a procedure that also adjusts for these clinical variables. While accelerated failure time (AFT) models are a useful tool for the analysis of censored outcome data, it assumes that covariate effects on the logarithm of time-to-event are linear, which is often unrealistic in practice. We propose to build risk prediction scores through regularized rank estimation in partly linear AFT models, where high-dimensional data such as gene expression data are modeled linearly and important clinical variables are modeled nonlinearly using penalized regression splines. We show through simulation studies that our model has better operating characteristics compared to several existing models. In particular, we show that there is a nonnegligible effect on prediction as well as feature selection when nonlinear clinical effects are misspecified as linear. This work is motivated by a recent prostate cancer study, where investigators collected gene expression data along with established prognostic clinical variables and the primary endpoint is time to prostate cancer recurrence.Comment: Published in at http://dx.doi.org/10.1214/11-AOAS458 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Nonparametric estimation of a covariate-adjusted counterfactual treatment regimen response curve

    Full text link
    Flexible estimation of the mean outcome under a treatment regimen (i.e., value function) is the key step toward personalized medicine. We define our target parameter as a conditional value function given a set of baseline covariates which we refer to as a stratum based value function. We focus on semiparametric class of decision rules and propose a sieve based nonparametric covariate adjusted regimen-response curve estimator within that class. Our work contributes in several ways. First, we propose an inverse probability weighted nonparametrically efficient estimator of the smoothed regimen-response curve function. We show that asymptotic linearity is achieved when the nuisance functions are undersmoothed sufficiently. Asymptotic and finite sample criteria for undersmoothing are proposed. Second, using Gaussian process theory, we propose simultaneous confidence intervals for the smoothed regimen-response curve function. Third, we provide consistency and convergence rate for the optimizer of the regimen-response curve estimator; this enables us to estimate an optimal semiparametric rule. The latter is important as the optimizer corresponds with the optimal dynamic treatment regimen. Some finite-sample properties are explored with simulations

    Acoustic Probing of the Jamming Transition in an Unconsolidated Granular Medium

    Get PDF
    Experiments with acoustic waves guided along the mechanically free surface of an unconsolidated granular packed structure provide information on the elasticity of granular media at very low pressures that are naturally controlled by the gravitational acceleration and the depth beneath the surface. Comparison of the determined dispersion relations for guided surface acoustic modes with a theoretical model reveals the dependencies of the elastic moduli of the granular medium on pressure. The experiments confirm recent theoretical predictions that relaxation of the disordered granular packing through non-affine motion leads to a peculiar scaling of shear rigidity with pressure near the jamming transition corresponding to zero pressure. Unexpectedly, and in disagreement with the most of the available theories, the bulk modulus depends on pressure in a very similar way to the shear modulus

    Genitofemoral and Perineal Neuralgia After Transobturator Midurethral Sling

    Get PDF
    Midurethral slings successfully treat stress urinary incontinence through a minimally invasive vaginal approach. Postoperative pain related to sling placement can occur and poses both diagnostic and treatment dilemmas

    Penalized Estimating Functions and Variable Selection in Semiparametric Regression Models

    Get PDF
    We propose a general strategy for variable selection in semiparametric regression models by penalizing appropriate estimating functions. Important applications include semiparametric linear regression with censored responses and semiparametric regression with missing predictors. Unlike the existing penalized maximum likelihood estimators, the proposed penalized estimating functions may not pertain to the derivatives of any objective functions and may be discrete in the regression coefficients. We establish a general asymptotic theory for penalized estimating functions and present suitable numerical algorithms to implement the proposed estimators. In addition, we develop a resampling technique to estimate the variances of the estimated regression coefficients when the asymptotic variances cannot be evaluated directly. Simulation studies demonstrate that the proposed methods perform well in variable selection and variance estimation. We illustrate our methods using data from the Paul Coverdell Stroke Registry

    The Solvent–Solid Interface of Acid Catalysts Studied by High Resolution MAS NMR

    Get PDF
    High-resolution magic angle spinning (HRMAS) NMR spectroscopy was used to study the effect of mixed solvent systems on the acidity at the solid−liquid interface of solid acid catalysts. A method was developed that can exploit benefits of both solution and solid-state NMR (SSNMR) by wetting porous solids with small volumes of liquids (μL/mg) to create an interfacial liquid that exhibits unique motional dynamics intermediate to an isotropic liquid and a rigid solid. Results from these experiments provide information about the influence of the solvent mixtures on the acidic properties at a solid−liquid interface. Importantly, use of MAS led to spectra with full resolution between water in an acidic environment and that of bulk water. Using mixed solvent systems, the chemical shift of water was used to compare the relative acidity as a function of the hydration level of the DMSO-d6 solvent. Nonlinear increasing acidity was observed as the DMSO-d6 became more anhydrous. 1H HR-MAS NMR experiments on a variety of supported sulfonic acid functionalized materials, suggest that the acid strength and number of acid sites correlates to the degree of broadening of the peaks in the 1H NMR spectra. When the amount of liquid added to the solid is increased (corresponding to a thicker liquid layer), fully resolved water phases were observed. This suggests that the acidic proton was localized predominantly within a 2 nm distance from the solid. EXSY 1H−1H 2D experiments of the thin layers were used to determine the rate of proton exchange for different catalytic materials. These results demonstrated the utility of using (SSNMR) on solid−liquid mixtures to selectively probe catalyst surfaces under realistic reaction conditions for condensed phase systems
    corecore