1,841 research outputs found

    A Fast Algorithm for Robust Regression with Penalised Trimmed Squares

    Full text link
    The presence of groups containing high leverage outliers makes linear regression a difficult problem due to the masking effect. The available high breakdown estimators based on Least Trimmed Squares often do not succeed in detecting masked high leverage outliers in finite samples. An alternative to the LTS estimator, called Penalised Trimmed Squares (PTS) estimator, was introduced by the authors in \cite{ZiouAv:05,ZiAvPi:07} and it appears to be less sensitive to the masking problem. This estimator is defined by a Quadratic Mixed Integer Programming (QMIP) problem, where in the objective function a penalty cost for each observation is included which serves as an upper bound on the residual error for any feasible regression line. Since the PTS does not require presetting the number of outliers to delete from the data set, it has better efficiency with respect to other estimators. However, due to the high computational complexity of the resulting QMIP problem, exact solutions for moderately large regression problems is infeasible. In this paper we further establish the theoretical properties of the PTS estimator, such as high breakdown and efficiency, and propose an approximate algorithm called Fast-PTS to compute the PTS estimator for large data sets efficiently. Extensive computational experiments on sets of benchmark instances with varying degrees of outlier contamination, indicate that the proposed algorithm performs well in identifying groups of high leverage outliers in reasonable computational time.Comment: 27 page

    The evolution of the actin binding NET superfamily.

    Get PDF
    This is the final version of the article. Available from Frontiers Media via the DOI in this record.The Arabidopsis Networked (NET) superfamily are plant-specific actin binding proteins which specifically label different membrane compartments and identify specialized sites of interaction between actin and membranes unique to plants. There are 13 members of the superfamily in Arabidopsis, which group into four distinct clades or families. NET homologs are absent from the genomes of metazoa and fungi; furthermore, in plantae, NET sequences are also absent from the genome of mosses and more ancient extant plant clades. A single family of the NET proteins is found encoded in the club moss genome, an extant species of the earliest vascular plants. Gymnosperms have examples from families 4 and 3, with a hybrid form of NET1 and 2 which shows characteristics of both NET1 and NET2. In addition to NET3 and 4 families, the NET1 and pollen-expressed NET2 families are found only as independent sequences in Angiosperms. This is consistent with the divergence of reproductive actin. The four families are conserved across Monocots and Eudicots, with the numbers of members of each clade expanding at this point, due, in part, to regions of genome duplication. Since the emergence of the NET superfamily at the dawn of vascular plants, they have continued to develop and diversify in a manner which has mirrored the divergence and increasing complexity of land-plant species

    Visualizing 1D Regression

    Get PDF
    Regression is the study of the conditional distribution of the response y given the predictors x. In a 1D regression, y is independent of x given a single linear combination βTx of the predictors. Special cases of 1D regression include multiple linear regression, binary regression and generalized linear models. If a good estimate ˆb of some non-zero multiple cβ of β can be constructed, then the 1D regression can be visualized with a scatterplot of ˆbTx versus y. A resistant method for estimating cβ is presented along with applications

    Can programme theory be used as a 'translational tool’ to optimise health service delivery in a national early years’ initiative in Scotland: a case study

    Get PDF
    Background Theory-based evaluation (TBE) approaches are heralded as supporting formative evaluation by facilitating increased use of evaluative findings to guide programme improvement. It is essential that learning from programme implementation is better used to improve delivery and to inform other initiatives, if interventions are to be as effective as they have the potential to be. Nonetheless, few studies describe formative feedback methods, or report direct instrumental use of findings resulting from TBE. This paper uses the case of Scotland’s, National Health Service, early years’, oral health improvement initiative (Childsmile) to describe the use of TBE as a framework for providing feedback on delivery to programme staff and to assess its impact on programmatic action.<p></p> Methods In-depth, semi-structured interviews and focus groups with key stakeholders explored perceived deviations between the Childsmile programme 'as delivered’ and its Programme Theory (PT). The data was thematically analysed using constant comparative methods. Findings were shared with key programme stakeholders and discussions around likely impact and necessary actions were facilitated by the authors. Documentary review and ongoing observations of programme meetings were undertaken to assess the extent to which learning was acted upon.<p></p> Results On the whole, the activities documented in Childsmile’s PT were implemented as intended. This paper purposefully focuses on those activities where variation in delivery was evident. Differences resulted from the stage of roll-out reached and the flexibility given to individual NHS boards to tailor local implementation. Some adaptations were thought to have diverged from the central features of Childsmile’s PT, to the extent that there was a risk to achieving outcomes. The methods employed prompted national service improvement action, and proposals for local action by individual NHS boards to address this.<p></p> Conclusions The TBE approach provided a platform, to direct attention to areas of risk within a national health initiative, and to agree which intervention components were 'core’ to its hypothesised success. The study demonstrates that PT can be used as a 'translational tool’ to facilitate instrumental use of evaluative findings to optimise implementation within a complex health improvement programme.<p></p&gt

    The North Wyke Farm Platform: effect of temperate grassland farming systems on soil moisture contents, runoff and associated water quality dynamics

    Get PDF
    This is the final version of the article. Available from Wiley via the DOI in this record.The North Wyke Farm Platform was established as a United Kingdom national capability for collaborative research, training and knowledge exchange in agro-environmental sciences. Its remit is to research agricultural productivity and ecosystem responses to different management practices for beef and sheep production in lowland grasslands. A system based on permanent pasture was implemented on three 21-ha farmlets to obtain baseline data on hydrology, nutrient cycling and productivity for 2 years. Since then two farmlets have been modified by either (i) planned reseeding with grasses that have been bred for enhanced sugar content or deep-rooting traits or (ii) sowing grass and legume mixtures to reduce nitrogen fertilizer inputs. The quantities of nutrients that enter, cycle within and leave the farmlets were evaluated with data recorded from sensor technologies coupled with more traditional field study methods. We demonstrate the potential of the farm platform approach with a case study in which we investigate the effects of the weather, field topography and farm management activity on surface runoff and associated pollutant or nutrient loss from soil. We have the opportunity to do a full nutrient cycling analysis, taking account of nutrient transformations in soil, and flows to water and losses to air. The NWFP monitoring system is unique in both scale and scope for a managed land-based capability that brings together several technologies that allow the effect of temperate grassland farming systems on soil moisture levels, runoff and associated water quality dynamics to be studied in detail. HIGHLIGHTS: Can meat production systems be developed that are productive yet minimize losses to the environment?The data are from an intensively instrumented capability, which is globally unique and topical.We use sensing technologies and surveys to show the effect of pasture renewal on nutrient losses.Platforms provide evidence of the effect of meteorology, topography and farm activity on nutrient loss.The North Wyke Farm Platform is a UK National Capability supported by the Biotechnology and Biological Sciences Research Council (BBSRC BB/J004308/1)

    The merger that led to the formation of the Milky Way's inner stellar halo and thick disk

    Get PDF
    The assembly process of our Galaxy can be retrieved using the motions and chemistry of individual stars. Chemo-dynamical studies of the nearby halo have long hinted at the presence of multiple components such as streams, clumps, duality and correlations between the stars' chemical abundances and orbital parameters. More recently, the analysis of two large stellar surveys have revealed the presence of a well-populated chemical elemental abundance sequence, of two distinct sequences in the colour-magnitude diagram, and of a prominent slightly retrograde kinematic structure all in the nearby halo, which may trace an important accretion event experienced by the Galaxy. Here report an analysis of the kinematics, chemistry, age and spatial distribution of stars in a relatively large volume around the Sun that are mainly linked to two major Galactic components, the thick disk and the stellar halo. We demonstrate that the inner halo is dominated by debris from an object which at infall was slightly more massive than the Small Magellanic Cloud, and which we refer to as Gaia-Enceladus. The stars originating in Gaia-Enceladus cover nearly the full sky, their motions reveal the presence of streams and slightly retrograde and elongated trajectories. Hundreds of RR Lyrae stars and thirteen globular clusters following a consistent age-metallicity relation can be associated to Gaia-Enceladus on the basis of their orbits. With an estimated 4:1 mass-ratio, the merger with Gaia-Enceladus must have led to the dynamical heating of the precursor of the Galactic thick disk and therefore contributed to the formation of this component approximately 10 Gyr ago. These findings are in line with simulations of galaxy formation, which predict that the inner stellar halo should be dominated by debris from just a few massive progenitors.Comment: 19 pages, 8 figures. Published in Nature in the issue of Nov. 1st, 2018. This is the authors' version before final edit

    Common ADRB2 Haplotypes Derived from 26 Polymorphic Sites Direct β2-Adrenergic Receptor Expression and Regulation Phenotypes

    Get PDF
    The beta2-adrenergic receptor (beta2AR) is expressed on numerous cell-types including airway smooth muscle cells and cardiomyocytes. Drugs (agonists or antagonists) acting at these receptors for treatment of asthma, chronic obstructive pulmonary disease, and heart failure show substantial interindividual variability in response. The ADRB2 gene is polymorphic in noncoding and coding regions, but virtually all ADRB2 association studies have utilized the two common nonsynonymous coding SNPs, often reaching discrepant conclusions.We constructed the 8 common ADRB2 haplotypes derived from 26 polymorphisms in the promoter, 5'UTR, coding, and 3'UTR of the intronless ADRB2 gene. These were cloned into an expression construct lacking a vector-based promoter, so that beta2AR expression was driven by its promoter, and steady state expression could be modified by polymorphisms throughout ADRB2 within a haplotype. "Whole-gene" transfections were performed with COS-7 cells and revealed 4 haplotypes with increased cell surface beta2AR protein expression compared to the others. Agonist-promoted downregulation of beta2AR protein expression was also haplotype-dependent, and was found to be increased for 2 haplotypes. A phylogenetic tree of the haplotypes was derived and annotated by cellular phenotypes, revealing a pattern potentially driven by expression.Thus for obstructive lung disease, the initial bronchodilator response from intermittent administration of beta-agonist may be influenced by certain beta2AR haplotypes (expression phenotypes), while other haplotypes may influence tachyphylaxis during the response to chronic therapy (downregulation phenotypes). An ideal clinical outcome of high expression and less downregulation was found for two haplotypes. Haplotypes may also affect heart failure antagonist therapy, where beta2AR increase inotropy and are anti-apoptotic. The haplotype-specific expression and regulation phenotypes found in this transfection-based system suggest that the density of genetic information in the form of these haplotypes, or haplotype-clusters with similar phenotypes can potentially provide greater discrimination of phenotype in human disease and pharmacogenomic association studies

    An exploratory randomised controlled trial of a premises-level intervention to reduce alcohol-related harm including violence in the United Kingdom

    Get PDF
    <b>Background</b><p></p> To assess the feasibility of a randomised controlled trial of a licensed premises intervention to reduce severe intoxication and disorder; to establish effect sizes and identify appropriate approaches to the development and maintenance of a rigorous research design and intervention implementation.<p></p> <b>Methods</b><p></p> An exploratory two-armed parallel randomised controlled trial with a nested process evaluation. An audit of risk factors and a tailored action plan for high risk premises, with three month follow up audit and feedback. Thirty-two premises that had experienced at least one assault in the year prior to the intervention were recruited, match paired and randomly allocated to control or intervention group. Police violence data and data from a street survey of study premises’ customers, including measures of breath alcohol concentration and surveyor rated customer intoxication, were used to assess effect sizes for a future definitive trial. A nested process evaluation explored implementation barriers and the fidelity of the intervention with key stakeholders and senior staff in intervention premises using semi-structured interviews.<p></p> <b>Results</b><p></p> The process evaluation indicated implementation barriers and low fidelity, with a reluctance to implement the intervention and to submit to a formal risk audit. Power calculations suggest the intervention effect on violence and subjective intoxication would be raised to significance with a study size of 517 premises.<p></p> <b>Conclusions</b><p></p> It is methodologically feasible to conduct randomised controlled trials where licensed premises are the unit of allocation. However, lack of enthusiasm in senior premises staff indicates the need for intervention enforcement, rather than voluntary agreements, and on-going strategies to promote sustainability
    • …
    corecore