252 research outputs found
Sparse inverse covariance estimation with the lasso
We consider the problem of estimating sparse graphs by a lasso penalty
applied to the inverse covariance matrix. Using a coordinate descent procedure
for the lasso, we develop a simple algorithm that is remarkably fast: in the
worst cases, it solves a 1000 node problem (~500,000 parameters) in about a
minute, and is 50 to 2000 times faster than competing methods. It also provides
a conceptual link between the exact problem and the approximation suggested by
Meinhausen and Buhlmann (2006). We illustrate the method on some cell-signaling
data from proteomics.Comment: submitte
Pathwise coordinate optimization
We consider ``one-at-a-time'' coordinate-wise descent algorithms for a class
of convex optimization problems. An algorithm of this kind has been proposed
for the -penalized regression (lasso) in the literature, but it seems to
have been largely ignored. Indeed, it seems that coordinate-wise algorithms are
not often used in convex optimization. We show that this algorithm is very
competitive with the well-known LARS (or homotopy) procedure in large lasso
problems, and that it can be applied to related methods such as the garotte and
elastic net. It turns out that coordinate-wise descent does not work in the
``fused lasso,'' however, so we derive a generalized algorithm that yields the
solution in much less time that a standard convex optimizer. Finally, we
generalize the procedure to the two-dimensional fused lasso, and demonstrate
its performance on some image smoothing problems.Comment: Published in at http://dx.doi.org/10.1214/07-AOAS131 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Regularization Paths for Generalized Linear Models via Coordinate Descent
We develop fast algorithms for estimation of generalized linear models with convex penalties. The models include linear regression, two-class logistic regression, and multi- nomial regression problems while the penalties include âÂÂ_1 (the lasso), âÂÂ_2 (ridge regression) and mixtures of the two (the elastic net). The algorithms use cyclical coordinate descent, computed along a regularization path. The methods can handle large problems and can also deal efficiently with sparse features. In comparative timings we find that the new algorithms are considerably faster than competing methods.
Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent
We introduce a pathwise algorithm for the Cox proportional hazards model, regularized by convex combinations of l_1 and l_2 penalties (elastic net). Our algorithm fits via cyclical coordinate descent, and employs warm starts to find a solution along a regularization path. We demonstrate the efficacy of our algorithm on real and simulated data sets, and find considerable speedup between our algorithm and competing methods.
Strong rules for discarding predictors in lasso-type problems
We consider rules for discarding predictors in lasso regression and related
problems, for computational efficiency. El Ghaoui et al (2010) propose "SAFE"
rules that guarantee that a coefficient will be zero in the solution, based on
the inner products of each predictor with the outcome. In this paper we propose
strong rules that are not foolproof but rarely fail in practice. These can be
complemented with simple checks of the Karush- Kuhn-Tucker (KKT) conditions to
provide safe rules that offer substantial speed and space savings in a variety
of statistical convex optimization problems.Comment:
Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent
We introduce a pathwise algorithm for the Cox proportional hazards model, regularized by convex combinations of l1 and l2 penalties (elastic net). Our algorithm fits via cyclical coordinate descent, and employs warm starts to find a solution along a regularization path. We demonstrate the efficacy of our algorithm on real and simulated data sets, and find considerable speedup between our algorithm and competing methods
Recommended from our members
The role of prefrontal cortex in cognitive control and executive function.
Funder: U.S. Department of Health & Human Services | NIH | National Institute on Drug Abuse (NIDA)Funder: National Research Foundation (Singapore) CREATE Grant to the University of Cambridge and Nanyang Technological University for the Centre of Lifelong Learning and Individualised Cognition (CLIC).Concepts of cognitive control (CC) and executive function (EF) are defined in terms of their relationships with goal-directed behavior versus habits and controlled versus automatic processing, and related to the functions of the prefrontal cortex (PFC) and related regions and networks. A psychometric approach shows unity and diversity in CC constructs, with 3 components in the most commonly studied constructs: general or common CC and components specific to mental set shifting and working memory updating. These constructs are considered against the cellular and systems neurobiology of PFC and what is known of its functional neuroanatomical or network organization based on lesioning, neurochemical, and neuroimaging approaches across species. CC is also considered in the context of motivation, as "cool" and "hot" forms. Its Common CC component is shown to be distinct from general intelligence (g) and closely related to response inhibition. Impairments in CC are considered as possible causes of psychiatric symptoms and consequences of disorders. The relationships of CC with the general factor of psychopathology (p) and dimensional constructs such as impulsivity in large scale developmental and adult populations are considered, as well as implications for genetic studies and RDoC approaches to psychiatric classification
Urinary MicroRNA Profiling in the Nephropathy of Type 1 Diabetes
Background: Patients with Type 1 Diabetes (T1D) are particularly vulnerable to development of Diabetic nephropathy (DN) leading to End Stage Renal Disease. Hence a better understanding of the factors affecting kidney disease progression in T1D is urgently needed. In recent years microRNAs have emerged as important post-transcriptional regulators of gene expression in many different health conditions. We hypothesized that urinary microRNA profile of patients will differ in the different stages of diabetic renal disease. Methods and Findings: We studied urine microRNA profiles with qPCR in 40 T1D with >20 year follow up 10 who never developed renal disease (N) matched against 10 patients who went on to develop overt nephropathy (DN), 10 patients with intermittent microalbuminuria (IMA) matched against 10 patients with persistent (PMA) microalbuminuria. A Bayesian procedure was used to normalize and convert raw signals to expression ratios. We applied formal statistical techniques to translate fold changes to profiles of microRNA targets which were then used to make inferences about biological pathways in the Gene Ontology and REACTOME structured vocabularies. A total of 27 microRNAs were found to be present at significantly different levels in different stages of untreated nephropathy. These microRNAs mapped to overlapping pathways pertaining to growth factor signaling and renal fibrosis known to be targeted in diabetic kidney disease. Conclusions: Urinary microRNA profiles differ across the different stages of diabetic nephropathy. Previous work using experimental, clinical chemistry or biopsy samples has demonstrated differential expression of many of these microRNAs in a variety of chronic renal conditions and diabetes. Combining expression ratios of microRNAs with formal inferences about their predicted mRNA targets and associated biological pathways may yield useful markers for early diagnosis and risk stratification of DN in T1D by inferring the alteration of renal molecular processes. © 2013 Argyropoulos et al
The Sorcerer II Global Ocean Sampling Expedition: Northwest Atlantic through Eastern Tropical Pacific
The world's oceans contain a complex mixture of micro-organisms that are for the most part, uncharacterized both genetically and biochemically. We report here a metagenomic study of the marine planktonic microbiota in which surface (mostly marine) water samples were analyzed as part of the Sorcerer II Global Ocean Sampling expedition. These samples, collected across a several-thousand km transect from the North Atlantic through the Panama Canal and ending in the South Pacific yielded an extensive dataset consisting of 7.7 million sequencing reads (6.3 billion bp). Though a few major microbial clades dominate the planktonic marine niche, the dataset contains great diversity with 85% of the assembled sequence and 57% of the unassembled data being unique at a 98% sequence identity cutoff. Using the metadata associated with each sample and sequencing library, we developed new comparative genomic and assembly methods. One comparative genomic method, termed “fragment recruitment,” addressed questions of genome structure, evolution, and taxonomic or phylogenetic diversity, as well as the biochemical diversity of genes and gene families. A second method, termed “extreme assembly,” made possible the assembly and reconstruction of large segments of abundant but clearly nonclonal organisms. Within all abundant populations analyzed, we found extensive intra-ribotype diversity in several forms: (1) extensive sequence variation within orthologous regions throughout a given genome; despite coverage of individual ribotypes approaching 500-fold, most individual sequencing reads are unique; (2) numerous changes in gene content some with direct adaptive implications; and (3) hypervariable genomic islands that are too variable to assemble. The intra-ribotype diversity is organized into genetically isolated populations that have overlapping but independent distributions, implying distinct environmental preference. We present novel methods for measuring the genomic similarity between metagenomic samples and show how they may be grouped into several community types. Specific functional adaptations can be identified both within individual ribotypes and across the entire community, including proteorhodopsin spectral tuning and the presence or absence of the phosphate-binding gene PstS
Preventing AVF thrombosis: the rationale and design of the Omega-3 fatty acids (Fish Oils) and Aspirin in Vascular access OUtcomes in REnal Disease (FAVOURED) study
Background: Haemodialysis (HD) is critically dependent on the availability of adequate access to the systemic circulation, ideally via a native arteriovenous fistula (AVF). The Primary failure rate of an AVF ranges between 20-54%, due to thrombosis or failure of maturation. There remains limited evidence for the use of anti-platelet agents and uncertainty as to choice of agent(s) for the prevention of AVF thrombosis. We present the study protocol for a randomised, double-blind, placebo-controlled, clinical trial examining whether the use of the anti-platelet agents, aspirin and omega-3 fatty acids, either alone or in combination, will effectively reduce the risk of early thrombosis in de novo AVF
- …