744 research outputs found

    Multiple Imputation Ensembles (MIE) for dealing with missing data

    Get PDF
    Missing data is a significant issue in many real-world datasets, yet there are no robust methods for dealing with it appropriately. In this paper, we propose a robust approach to dealing with missing data in classification problems: Multiple Imputation Ensembles (MIE). Our method integrates two approaches: multiple imputation and ensemble methods and compares two types of ensembles: bagging and stacking. We also propose a robust experimental set-up using 20 benchmark datasets from the UCI machine learning repository. For each dataset, we introduce increasing amounts of data Missing Completely at Random. Firstly, we use a number of single/multiple imputation methods to recover the missing values and then ensemble a number of different classifiers built on the imputed data. We assess the quality of the imputation by using dissimilarity measures. We also evaluate the MIE performance by comparing classification accuracy on the complete and imputed data. Furthermore, we use the accuracy of simple imputation as a benchmark for comparison. We find that our proposed approach combining multiple imputation with ensemble techniques outperform others, particularly as missing data increases

    Combining ‘‘real effort’’ with induced effort costs: the ball-catching task

    Get PDF
    We introduce the “ball-catching task”, a novel computerized task, which combines a tangible action (“catching balls”) with induced material cost of effort. The central feature of the ball-catching task is that it allows researchers to manipulate the cost of effort function as well as the production function, which permits quantitative predictions on effort provision. In an experiment with piece-rate incentives we find that the comparative static and the point predictions on effort provision are remarkably accurate. We also present experimental findings from three classic experiments, namely, team production, gift exchange and tournament, using the task. All of the results are closely in line with the stylized facts from experiments using purely induced values. We conclude that the ball-catching task combines the advantages of real effort tasks with the use of induced values, which is useful for theory-testing purposes as well as for applications

    Measurement of the cross-section and charge asymmetry of WW bosons produced in proton-proton collisions at s=8\sqrt{s}=8 TeV with the ATLAS detector

    Get PDF
    This paper presents measurements of the W+μ+νW^+ \rightarrow \mu^+\nu and WμνW^- \rightarrow \mu^-\nu cross-sections and the associated charge asymmetry as a function of the absolute pseudorapidity of the decay muon. The data were collected in proton--proton collisions at a centre-of-mass energy of 8 TeV with the ATLAS experiment at the LHC and correspond to a total integrated luminosity of 20.2~\mbox{fb^{-1}}. The precision of the cross-section measurements varies between 0.8% to 1.5% as a function of the pseudorapidity, excluding the 1.9% uncertainty on the integrated luminosity. The charge asymmetry is measured with an uncertainty between 0.002 and 0.003. The results are compared with predictions based on next-to-next-to-leading-order calculations with various parton distribution functions and have the sensitivity to discriminate between them.Comment: 38 pages in total, author list starting page 22, 5 figures, 4 tables, submitted to EPJC. All figures including auxiliary figures are available at https://atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/PAPERS/STDM-2017-13

    De novo Assembly of a 40 Mb Eukaryotic Genome from Short Sequence Reads: Sordaria macrospora, a Model Organism for Fungal Morphogenesis

    Get PDF
    Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30–90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in ∼4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for comparative studies to address basic questions of fungal biology

    Measurement of the W±Z boson pair-production cross section in pp collisions at √s=13TeV with the ATLAS detector

    Get PDF
    published_or_final_versio

    Measurement of the View the tt production cross-section using eμ events with b-tagged jets in pp collisions at √s = 13 TeV with the ATLAS detector

    Get PDF
    This paper describes a measurement of the inclusive top quark pair production cross-section (σtt¯) with a data sample of 3.2 fb−1 of proton–proton collisions at a centre-of-mass energy of √s = 13 TeV, collected in 2015 by the ATLAS detector at the LHC. This measurement uses events with an opposite-charge electron–muon pair in the final state. Jets containing b-quarks are tagged using an algorithm based on track impact parameters and reconstructed secondary vertices. The numbers of events with exactly one and exactly two b-tagged jets are counted and used to determine simultaneously σtt¯ and the efficiency to reconstruct and b-tag a jet from a top quark decay, thereby minimising the associated systematic uncertainties. The cross-section is measured to be: σtt¯ = 818 ± 8 (stat) ± 27 (syst) ± 19 (lumi) ± 12 (beam) pb, where the four uncertainties arise from data statistics, experimental and theoretical systematic effects, the integrated luminosity and the LHC beam energy, giving a total relative uncertainty of 4.4%. The result is consistent with theoretical QCD calculations at next-to-next-to-leading order. A fiducial measurement corresponding to the experimental acceptance of the leptons is also presented

    Search for High-Mass Resonances Decaying to τν in pp Collisions at √s=13 TeV with the ATLAS Detector

    Get PDF
    A search for high-mass resonances decaying to τν using proton-proton collisions at √s=13 TeV produced by the Large Hadron Collider is presented. Only τ-lepton decays with hadrons in the final state are considered. The data were recorded with the ATLAS detector and correspond to an integrated luminosity of 36.1 fb−1. No statistically significant excess above the standard model expectation is observed; model-independent upper limits are set on the visible τν production cross section. Heavy W′ bosons with masses less than 3.7 TeV in the sequential standard model and masses less than 2.2–3.8 TeV depending on the coupling in the nonuniversal G(221) model are excluded at the 95% credibility level

    Search for the direct production of charginos and neutralinos in final states with tau leptons in √s=13 TeV collisions with the ATLAS detector

    Get PDF
    A search for the direct production of charginos and neutralinos in final states with at least two hadronically decaying tau leptons is presented. The analysis uses a dataset of pp collisions corresponding to an integrated luminosity of 36.1 fb−1, recorded with the ATLAS detector at the Large Hadron Collider at a centre-of-mass energy of 13TeV.Nosignificant deviation from the expected Standard Model background is observed. Limits are derived in scenarios of ˜χ+1 ˜χ−1 pair production and of ˜χ±1 ˜χ02 and ˜χ+1 ˜χ−1 production in simplified models where the neutralinos and charginos decay solely via intermediate left-handed staus and tau sneutrinos, and the mass of the ˜ τL state is set to be halfway between the masses of the ˜χ±1 and the ˜χ01. Chargino masses up to 630 GeV are excluded at 95% confidence level in the scenario of direct production of ˜χ+1 ˜χ−1 for a massless ˜χ01. Common ˜χ±1 and ˜χ02 masses up to 760 GeV are excluded in the case of production of ˜χ±1 ˜χ02 and ˜χ+1 ˜χ−1 assuming a massless ˜χ01. Exclusion limits for additional benchmark scenarios with large and small mass-splitting between the ˜χ±1 and the ˜χ01 are also studied by varying the ˜ τL mass between the masses of the ˜χ±1 and the ˜χ01

    Combined measurement of differential and total cross sections in the H → γγ and the H → ZZ* → 4ℓ decay channels at s=13 TeV with the ATLAS detector

    Get PDF
    A combined measurement of differential and inclusive total cross sections of Higgs boson production is performed using 36.1 fb−1 of 13 TeV proton–proton collision data produced by the LHC and recorded by the ATLAS detector in 2015 and 2016. Cross sections are obtained from measured H→γγ and H→ZZ*(→4ℓ event yields, which are combined taking into account detector efficiencies, resolution, acceptances and branching fractions. The total Higgs boson production cross section is measured to be 57.0−5.9 +6.0 (stat.) −3.3 +4.0 (syst.) pb, in agreement with the Standard Model prediction. Differential cross-section measurements are presented for the Higgs boson transverse momentum distribution, Higgs boson rapidity, number of jets produced together with the Higgs boson, and the transverse momentum of the leading jet. The results from the two decay channels are found to be compatible, and their combination agrees with the Standard Model predictions

    Measurement of jet fragmentation in Pb+Pb and pp collisions at √s NN =5.02 TeV with the ATLAS detector

    Get PDF
    This paper presents a measurement of jet fragmentation functions in 0.49 nb −1 of Pb+Pb collisions and 25 pb −1 of pp collisions at √ sNN =5.02 TeV collected in 2015 with the ATLAS detector at the LHC. These measurements provide insight into the jet quenching process in the quark-gluon plasma created in the aftermath of ultra-relativistic collisions between two nuclei. The modifications to the jet fragmentation functions are quantified by dividing the measurements in Pb+Pb collisions by baseline measurements in pp collisions. This ratio is studied as a function of the transverse momentum of the jet, the jet rapidity, and the centrality of the collision. In both collision systems, the jet fragmentation functions are measured for jets with transverse momentum between 126 GeV and 398 GeV and with an absolute value of jet rapidity less than 2.1. An enhancement of particles carrying a small fraction of the jet momentum is observed, which increases with centrality and with increasing jet transverse momentum. Yields of particles carrying a very large fraction of the jet momentum are also observed to be enhanced. Between these two enhancements of the fragmentation functions a suppression of particles carrying an intermediate fraction of the jet momentum is observed in Pb+Pb collisions. A small dependence of the modifications on jet rapidity is observed
    corecore