232 research outputs found

    Don't Fall for Tuning Parameters: Tuning-Free Variable Selection in High Dimensions With the TREX

    Full text link
    Lasso is a seminal contribution to high-dimensional statistics, but it hinges on a tuning parameter that is difficult to calibrate in practice. A partial remedy for this problem is Square-Root Lasso, because it inherently calibrates to the noise variance. However, Square-Root Lasso still requires the calibration of a tuning parameter to all other aspects of the model. In this study, we introduce TREX, an alternative to Lasso with an inherent calibration to all aspects of the model. This adaptation to the entire model renders TREX an estimator that does not require any calibration of tuning parameters. We show that TREX can outperform cross-validated Lasso in terms of variable selection and computational efficiency. We also introduce a bootstrapped version of TREX that can further improve variable selection. We illustrate the promising performance of TREX both on synthetic data and on a recent high-dimensional biological data set that considers riboflavin production in B. subtilis

    Non-convex Global Minimization and False Discovery Rate Control for the TREX

    Full text link
    The TREX is a recently introduced method for performing sparse high-dimensional regression. Despite its statistical promise as an alternative to the lasso, square-root lasso, and scaled lasso, the TREX is computationally challenging in that it requires solving a non-convex optimization problem. This paper shows a remarkable result: despite the non-convexity of the TREX problem, there exists a polynomial-time algorithm that is guaranteed to find the global minimum. This result adds the TREX to a very short list of non-convex optimization problems that can be globally optimized (principal components analysis being a famous example). After deriving and developing this new approach, we demonstrate that (i) the ability of the preexisting TREX heuristic to reach the global minimum is strongly dependent on the difficulty of the underlying statistical problem, (ii) the new polynomial-time algorithm for TREX permits a novel variable ranking and selection scheme, (iii) this scheme can be incorporated into a rule that controls the false discovery rate (FDR) of included features in the model. To achieve this last aim, we provide an extension of the results of Barber & Candes (2015) to establish that the knockoff filter framework can be applied to the TREX. This investigation thus provides both a rare case study of a heuristic for non-convex optimization and a novel way of exploiting non-convexity for statistical inference

    Gradient-robust hybrid DG discretizations for the compressible Stokes equations

    Full text link
    This paper studies two hybrid discontinuous Galerkin (HDG) discretizations for the velocity-density formulation of the compressible Stokes equations with respect to several desired structural properties, namely provable convergence, the preservation of non-negativity and mass constraints for the density, and gradient-robustness. The later property dramatically enhances the accuracy in well-balanced situations, such as the hydrostatic balance where the pressure gradient balances the gravity force. One of the studied schemes employs an H(div)-conforming velocity ansatz space which ensures all mentioned properties, while a fully discontinuous method is shown to satisfy all properties but the gradient-robustness. Also higher-order schemes for both variants are presented and compared in three numerical benchmark problems. The final example shows the importance also for non-hydrostatic well-balanced states for the compressible Navier-Stokes equations

    Guaranteed upper bounds for the velocity error of pressure-robust Stokes discretisations

    Get PDF
    This paper improves guaranteed error control for the Stokes problem with a focus on pressure-robustness, i.e. for discretisations that compute a discrete velocity that is independent of the exact pressure. A Prager-Synge type result relates the errors of divergence-free primal and H(div)-conforming dual mixed methods (for the velocity gradient) with an equilibration constraint that needs special care when discretised. To relax the constraints on the primal and dual method, a more general result is derived that enables the use of a recently developed mass conserving mixed stress discretisation to design equilibrated fluxes that yield pressure-independent guaranteed upper bounds for any pressure-robust (but not necessarily divergence-free) primal discretisation. Moreover, a provably efficient local design of the equilibrated fluxes is presented that reduces the numerical costs of the error estimator. All theoretical findings are verified by numerical examples which also show that the efficiency indices of our novel guaranteed upper bounds for the velocity error are close to 1

    Long-term gait measurements in daily life: Results from the Berlin Aging Study II (BASE-II)

    Get PDF
    BACKGROUND: Walking ability is an important prerequisite for activity, social participation and independent living. While in most healthy adults, this ability can be assumed as given, limitations in walking ability occur with increasing age. Furthermore, slow walking speed is linked to several chronic conditions and overall morbidity. Measurements of gait parameters can be used as a proxy to detect functional decline and onset of chronic conditions. Up to now, gait characteristics used for this purpose are measured in standardized laboratory settings. There is some evidence, however, that long-term measurements of gait parameters in the living environment have some advantages over short-term laboratory measurements. METHODS: We evaluated cross-sectional data from an accelerometric sensor worn in a subgroup of 554 participants of the Berlin Aging Study II (BASE-II). Data from the two BASE-II age groups (age between 22-36 years and 60-79 years) were used for the current analysis of accelerometric data for a minimum of two days and a maximum of ten days were available. Real world walking speed, number of steps, maximum coherent distance and total distance were derived as average data per day. Linear regression analyses were performed on the different gait parameters in order to identify significant determinants. Additionally, Mann-Whitney-U-tests were performed to detect sex-specific differences. RESULTS: Age showed to be significantly associated with real world walking speed and with the total distance covered per day, while BMI contributed negatively to the number of walking steps, maximum coherent distance and total distance walked. Additionally, sex was associated with walking steps. However, R2-values for all models were low. Overall, women had significantly more walking steps and a larger coherent distance per day when compared to men. When separated by age group, this difference was significant only in the older participants. Additionally, walking speed was significantly higher in women compared to men in the subgroup of older people. CONCLUSIONS: Age- and sex-specific differences have to be considered when objective gait parameters are measured, e.g. in the context of clinical risk assessment. For this purpose normative data, differentiating for age and sex would have to be established to allow reliable classification of long-term measurements of gait

    Divergence-free reconstruction operators for pressure-robust Stokes discretizations with continuous pressure finite elements

    Get PDF
    Classical inf-sup stable mixed finite elements for the incompressible (Navier--)Stokes equations are not pressure-robust, i.e., their velocity errors depend on the continuous pressure. How-ever, a modification only in the right hand side of a Stokes discretization is able to reestablish pressure-robustness, as shown recently for several inf-sup stable Stokes elements with discontinuous discrete pressures. In this contribution, this idea is extended to low and high order Taylor--Hood and mini elements, which have continuous discrete pressures. For the modification of the right hand side a velocity reconstruction operator is constructed that maps discretely divergence-free test functions to exactly divergence-free ones. The reconstruction is based on local H(div)-conforming flux equilibration on vertex patches, and fulfills certain orthogonality properties to provide consistency and optimal a-priori error estimates. Numerical examples for the incompressible Stokes and Navier--Stokes equations confirm that the new pressure-robust Taylor--Hood and mini elements converge with optimal order and outperform signi--cantly the classical versions of those elements when the continuous pressure is comparably large

    Automatic identification of chemical moieties

    Get PDF
    In recent years, the prediction of quantum mechanical observables with machine learning methods has become increasingly popular. Message-passing neural networks (MPNNs) solve this task by constructing atomic representations, from which the properties of interest are predicted. Here, we introduce a method to automatically identify chemical moieties (molecular building blocks) from such representations, enabling a variety of applications beyond property prediction, which otherwise rely on expert knowledge. The required representation can either be provided by a pretrained MPNN, or be learned from scratch using only structural information. Beyond the data-driven design of molecular fingerprints, the versatility of our approach is demonstrated by enabling the selection of representative entries in chemical databases, the automatic construction of coarse-grained force fields, as well as the identification of reaction coordinates
    corecore