606 research outputs found

    Multiple Imputation Ensembles (MIE) for dealing with missing data

    Get PDF
    Missing data is a significant issue in many real-world datasets, yet there are no robust methods for dealing with it appropriately. In this paper, we propose a robust approach to dealing with missing data in classification problems: Multiple Imputation Ensembles (MIE). Our method integrates two approaches: multiple imputation and ensemble methods and compares two types of ensembles: bagging and stacking. We also propose a robust experimental set-up using 20 benchmark datasets from the UCI machine learning repository. For each dataset, we introduce increasing amounts of data Missing Completely at Random. Firstly, we use a number of single/multiple imputation methods to recover the missing values and then ensemble a number of different classifiers built on the imputed data. We assess the quality of the imputation by using dissimilarity measures. We also evaluate the MIE performance by comparing classification accuracy on the complete and imputed data. Furthermore, we use the accuracy of simple imputation as a benchmark for comparison. We find that our proposed approach combining multiple imputation with ensemble techniques outperform others, particularly as missing data increases

    Extinction times in the subcritical stochastic SIS logistic epidemic

    Get PDF
    Many real epidemics of an infectious disease are not straightforwardly super- or sub-critical, and the understanding of epidemic models that exhibit such complexity has been identified as a priority for theoretical work. We provide insights into the near-critical regime by considering the stochastic SIS logistic epidemic, a well-known birth-and-death chain used to model the spread of an epidemic within a population of a given size NN. We study the behaviour of the process as the population size NN tends to infinity. Our results cover the entire subcritical regime, including the "barely subcritical" regime, where the recovery rate exceeds the infection rate by an amount that tends to 0 as NN \to \infty but more slowly than N1/2N^{-1/2}. We derive precise asymptotics for the distribution of the extinction time and the total number of cases throughout the subcritical regime, give a detailed description of the course of the epidemic, and compare to numerical results for a range of parameter values. We hypothesise that features of the course of the epidemic will be seen in a wide class of other epidemic models, and we use real data to provide some tentative and preliminary support for this theory.Comment: Revised; 34 pages; 6 figure

    Potential climatic transitions with profound impact on Europe

    Get PDF
    We discuss potential transitions of six climatic subsystems with large-scale impact on Europe, sometimes denoted as tipping elements. These are the ice sheets on Greenland and West Antarctica, the Atlantic thermohaline circulation, Arctic sea ice, Alpine glaciers and northern hemisphere stratospheric ozone. Each system is represented by co-authors actively publishing in the corresponding field. For each subsystem we summarize the mechanism of a potential transition in a warmer climate along with its impact on Europe and assess the likelihood for such a transition based on published scientific literature. As a summary, the ‘tipping’ potential for each system is provided as a function of global mean temperature increase which required some subjective interpretation of scientific facts by the authors and should be considered as a snapshot of our current understanding. <br/

    The Pioneer Anomaly

    Get PDF
    Radio-metric Doppler tracking data received from the Pioneer 10 and 11 spacecraft from heliocentric distances of 20-70 AU has consistently indicated the presence of a small, anomalous, blue-shifted frequency drift uniformly changing with a rate of ~6 x 10^{-9} Hz/s. Ultimately, the drift was interpreted as a constant sunward deceleration of each particular spacecraft at the level of a_P = (8.74 +/- 1.33) x 10^{-10} m/s^2. This apparent violation of the Newton's gravitational inverse-square law has become known as the Pioneer anomaly; the nature of this anomaly remains unexplained. In this review, we summarize the current knowledge of the physical properties of the anomaly and the conditions that led to its detection and characterization. We review various mechanisms proposed to explain the anomaly and discuss the current state of efforts to determine its nature. A comprehensive new investigation of the anomalous behavior of the two Pioneers has begun recently. The new efforts rely on the much-extended set of radio-metric Doppler data for both spacecraft in conjunction with the newly available complete record of their telemetry files and a large archive of original project documentation. As the new study is yet to report its findings, this review provides the necessary background for the new results to appear in the near future. In particular, we provide a significant amount of information on the design, operations and behavior of the two Pioneers during their entire missions, including descriptions of various data formats and techniques used for their navigation and radio-science data analysis. As most of this information was recovered relatively recently, it was not used in the previous studies of the Pioneer anomaly, but it is critical for the new investigation.Comment: 165 pages, 40 figures, 16 tables; accepted for publication in Living Reviews in Relativit

    Epilepsy, hippocampal sclerosis and febrile seizures linked by common genetic variation around SCN1A

    Get PDF
    Epilepsy comprises several syndromes, amongst the most common being mesial temporal lobe epilepsy with hippocampal sclerosis. Seizures in mesial temporal lobe epilepsy with hippocampal sclerosis are typically drug-resistant, and mesial temporal lobe epilepsy with hippocampal sclerosis is frequently associated with important co-morbidities, mandating the search for better understanding and treatment. The cause of mesial temporal lobe epilepsy with hippocampal sclerosis is unknown, but there is an association with childhood febrile seizures. Several rarer epilepsies featuring febrile seizures are caused by mutations in SCN1A, which encodes a brain-expressed sodium channel subunit targeted by many anti-epileptic drugs. We undertook a genome-wide association study in 1018 people with mesial temporal lobe epilepsy with hippocampal sclerosis and 7552 control subjects, with validation in an independent sample set comprising 959 people with mesial temporal lobe epilepsy with hippocampal sclerosis and 3591 control subjects. To dissect out variants related to a history of febrile seizures, we tested cases with mesial temporal lobe epilepsy with hippocampal sclerosis with (overall n = 757) and without (overall n = 803) a history of febrile seizures. Meta-analysis revealed a genome-wide significant association for mesial temporal lobe epilepsy with hippocampal sclerosis with febrile seizures at the sodium channel gene cluster on chromosome 2q24.3 [rs7587026, within an intron of the SCN1A gene, P = 3.36 × 10(-9), odds ratio (A) = 1.42, 95% confidence interval: 1.26-1.59]. In a cohort of 172 individuals with febrile seizures, who did not develop epilepsy during prospective follow-up to age 13 years, and 6456 controls, no association was found for rs7587026 and febrile seizures. These findings suggest SCN1A involvement in a common epilepsy syndrome, give new direction to biological understanding of mesial temporal lobe epilepsy with hippocampal sclerosis with febrile seizures, and open avenues for investigation of prognostic factors and possible prevention of epilepsy in some children with febrile seizures

    Endophytes vs tree pathogens and pests: can they be used as biological control agents to improve tree health?

    Get PDF
    Like all other plants, trees are vulnerable to attack by a multitude of pests and pathogens. Current control measures for many of these diseases are limited and relatively ineffective. Several methods, including the use of conventional synthetic agro-chemicals, are employed to reduce the impact of pests and diseases. However, because of mounting concerns about adverse effects on the environment and a variety of economic reasons, this limited management of tree diseases by chemical methods is losing ground. The use of biological control, as a more environmentally friendly alternative, is becoming increasingly popular in plant protection. This can include the deployment of soil inoculants and foliar sprays, but the increased knowledge of microbial ecology in the phytosphere, in particular phylloplane microbes and endophytes, has stimulated new thinking for biocontrol approaches. Endophytes are microbes that live within plant tissues. As such, they hold potential as biocontrol agents against plant diseases because they are able to colonize the same ecological niche favoured by many invading pathogens. However, the development and exploitation of endophytes as biocontrol agents will have to overcome numerous challenges. The optimization and improvement of strategies employed in endophyte research can contribute towards discovering effective and competent biocontrol agents. The impact of environment and plant genotype on selecting potentially beneficial and exploitable endophytes for biocontrol is poorly understood. How endophytes synergise or antagonise one another is also an important factor. This review focusses on recent research addressing the biocontrol of plant diseases and pests using endophytic fungi and bacteria, alongside the challenges and limitations encountered and how these can be overcome. We frame this review in the context of tree pests and diseases, since trees are arguably the most difficult plant species to study, work on and manage, yet they represent one of the most important organisms on Earth

    Taxonomic and functional turnover are decoupled in European peat bogs

    Get PDF
    In peatland ecosystems, plant communities mediate a globally significant carbon store. The effects of global environmental change on plant assemblages are expected to be a factor in determining how ecosystem functions such as carbon uptake will respond. Using vegetation data from 56 Sphagnum-dominated peat bogs across Europe, we show that in these ecosystems plant species aggregate into two major clusters that are each defined by shared response to environmental conditions. Across environmental gradients, we find significant taxonomic turnover in both clusters. However, functional identity and functional redundancy of the community as a whole remain unchanged. This strongly suggests that in peat bogs, species turnover across environmental gradients is restricted to functionally similar species. Our results demonstrate that plant taxonomic and functional turnover are decoupled, which may allow these peat bogs to maintain ecosystem functioning when subject to future environmental change
    corecore