Search CORE

102 research outputs found

Score Test Variable Screening

Author: Agarwal
Avet-Loiseau
Bühlmann
Cai
Cheng
Daubechies
Decaux
Donoho
Duchi
Fan
Fan
Fan
Fan
Gorst-Rasmussen
He
He
Hoeffding
Honore
Jin
Li
Li
Meinshausen
Peng
Portnoy
Shaughnessy
Shi
Shor
Song
Tibshirani
Tsiatis
Uno
Wang
Wei
Zhan
Zhao
Zhu
Publication venue: Collection of Biostatistics Research Archive
Publication date: 01/01/2013
Field of study

Variable screening has emerged as a crucial first step in the analysis of high-throughput data, but existing procedures can be computationally cumbersome, difficult to justify theoretically, or inapplicable to certain types of analyses. Motivated by a high-dimensional censored quantile regression problem in multiple myeloma genomics, this paper makes three contributions. First, we establish a score test-based screening framework, which is widely applicable, extremely computationally efficient, and relatively simple to justify. Secondly, we propose a resampling-based procedure for selecting the number of variables to retain after screening according to the principle of reproducibility. Finally, we propose a new iterative score test screening method which is closely related to sparse regression. In simulations we apply our methods to four dierent regression models and show that they can outperform existing procedures. We also apply score test screening to an analysis of gene expression data from multiple myeloma patients using a censored quantile regression model to identify high-risk genes

Crossref

PubMed Central

Collection Of Biostatistics Research Archive

Deep Blue Documents at the University of Michigan

Independent screening for single-index hazard rate models with ultra-high dimensional features

Author: Aalen
Aalen
Bair
Benner
Bradic
Bradic
Breiman
Brillinger
Cheng
Fan
Fan
Fan
Fan
Fan
Fan
Fan
Friedman
Gorst-Rasmussen
Hall
Hardin
Hattori
He
Hoeffding
Leng
Leng
Li
Lin
Ma
Martinussen
Martinussen
Massart
McKeague
Meinshausen
Metzeler
Pollard
Robert
Struthers
Tibshirani
Tibshirani
Wang
Zhu
Zhu
Zou
Zou
Zou
Publication venue
Publication date: 01/05/2011
Field of study

In data sets with many more features than observations, independent screening based on all univariate regression models leads to a computationally convenient variable selection method. Recent efforts have shown that in the case of generalized linear models, independent screening may suffice to capture all relevant features with high probability, even in ultra-high dimension. It is unclear whether this formal sure screening property is attainable when the response is a right-censored survival time. We propose a computationally very efficient independent screening method for survival data which can be viewed as the natural survival equivalent of correlation screening. We state conditions under which the method admits the sure screening property within a general class of single-index hazard rate models with ultra-high dimensional features. An iterative variant is also described which combines screening with penalized regression in order to handle more complex feature covariance structures. The methods are evaluated through simulation studies and through application to a real gene expression dataset.Comment: 32 pages, 3 figure

arXiv.org e-Print Archive

Crossref

Copenhagen University Research Information System

VBN

Combining Pathway Identification and Breast Cancer Survival Prediction via Screening-Network Methods.

Author: Annalisa Occhipinti
Antonella Iuliano
Antonella Iuliano
Claudia Angelini
Italia De Feis
Pietro Liò
Publication venue: Front Genet
Publication date: 01/01/2018
Field of study

Breast cancer is one of the most common invasive tumors causing high mortality among women. It is characterized by high heterogeneity regarding its biological and clinical characteristics. Several high-throughput assays have been used to collect genome-wide information for many patients in large collaborative studies. This knowledge has improved our understanding of its biology and led to new methods of diagnosing and treating the disease. In particular, system biology has become a valid approach to obtain better insights into breast cancer biological mechanisms. A crucial component of current research lies in identifying novel biomarkers that can be predictive for breast cancer patient prognosis on the basis of the molecular signature of the tumor sample. However, the high dimension and low sample size of data greatly increase the difficulty of cancer survival analysis demanding for the development of ad-hoc statistical methods. In this work, we propose novel screening-network methods that predict patient survival outcome by screening key survival-related genes and we assess the capability of the proposed approaches using METABRIC dataset. In particular, we first identify a subset of genes by using variable screening techniques on gene expression data. Then, we perform Cox regression analysis by incorporating network information associated with the selected subset of genes. The novelty of this work consists in the improved prediction of survival responses due to the different types of screenings (i.e., a biomedical-driven, data-driven and a combination of the two) before building the network-penalized model. Indeed, the combination of the two screening approaches allows us to use the available biological knowledge on breast cancer and complement it with additional information emerging from the data used for the analysis. Moreover, we also illustrate how to extend the proposed approaches to integrate an additional omic layer, such as copy number aberrations, and we show that such strategies can further improve our prediction capabilities. In conclusion, our approaches allow to discriminate patients in high-and low-risk groups using few potential biomarkers and therefore, can help clinicians to provide more precise prognoses and to facilitate the subsequent clinical management of patients at risk of disease

Directory of Open Access Journals

Frontiers - Publisher Connector

Teeside University's Research Repository

Apollo (Cambridge)

FigShare