Search CORE

4,214 research outputs found

Classification without labels: Learning from mixed samples in high energy physics

Author: Metodiev Eric M.
Nachman Benjamin
Thaler Jesse
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2017
Field of study

Modern machine learning techniques can be used to construct powerful models for difficult collider physics problems. In many applications, however, these models are trained on imperfect simulations due to a lack of truth-level information in the data, which risks the model learning artifacts of the simulation. In this paper, we introduce the paradigm of classification without labels (CWoLa) in which a classifier is trained to distinguish statistical mixtures of classes, which are common in collider physics. Crucially, neither individual labels nor class proportions are required, yet we prove that the optimal classifier in the CWoLa paradigm is also the optimal classifier in the traditional fully-supervised case where all label information is available. After demonstrating the power of this method in an analytical toy example, we consider a realistic benchmark for collider physics: distinguishing quark- versus gluon-initiated jets using mixed quark/gluon training samples. More generally, CWoLa can be applied to any classification problem where labels or class proportions are unknown or simulations are unreliable, but statistical mixtures of the classes are available.Comment: 18 pages, 5 figures; v2: intro extended and references added; v3: additional discussion to match JHEP versio

arXiv.org e-Print Archive

DSpace@MIT

Crossref

Directory of Open Access Journals

eScholarship - University of California

On similarity solutions of a boundary layer problem with an upstream moving wall

Author: Hussaini M. Y.
Lakin W. D.
Nachman A.
Publication venue
Publication date
Field of study

The problem of a boundary layer on a flat plate which has a constant velocity opposite in direction to that of the uniform mainstream is examined. It was previously shown that the solution of this boundary value problem is crucially dependent on the parameter which is the ratio of the velocity of the plate to the velocity of the free stream. In particular, it was proved that a solution exists only if this parameter does not exceed a certain critical value, and numerical evidence was adduced to show that this solution is nonunique. Using Crocco formulation the present work proves this nonuniqueness. Also considered are the analyticity of solutions and the derivation of upper bounds on the critical value of wall velocity parameter

NASA Technical Reports Server

Recommended from our members

Exploring the Relationship Between School Organizational Health, Advice Seeking Networks, and Student Behavior

Author: Nachman Abbey M
Publication venue: ScholarWorks@UMass Amherst
Publication date: 01/05/2021
Field of study

The purpose of this study was to explore the relationship between organizational health and advice seeking behavior of school staff around students exhibiting social, emotional, or behavioral concerns. School staff are front line responders to mental/behavioral health issues and it would benefit schools to better understand the organizational factors that influence advice seeking behavior and the affect that school climate amongst teachers has on student behavior. This study investigated the climates and communication patterns of two urban elementary schools. Social network analysis was used to visualize and analyze both schools’ respective networks. School staff completed the Organizational Health Inventory as well as provided data regarding which staff members they have sought out and received helpful advice in regards to students social, emotional, and behavioral functioning. Findings suggest that high levels of organizational health were associated with frequent advice seeking behavior. Individuals were more likely to reach out to staff who had longer tenure, held administrative positions, and those working closely together (e.g., grade level). Implications include creating system norms and overcoming other barriers to enhance advice seeking behavior among staff

ScholarWorks@UMass Amherst

Learning to Classify from Impure Samples with High-Dimensional Data

Author: Komiske Patrick T.
Metodiev Eric M.
Nachman Benjamin
Schwartz Matthew D.
Publication venue: 'American Physical Society (APS)'
Publication date: 01/07/2018
Field of study

A persistent challenge in practical classification tasks is that labeled training sets are not always available. In particle physics, this challenge is surmounted by the use of simulations. These simulations accurately reproduce most features of data, but cannot be trusted to capture all of the complex correlations exploitable by modern machine learning methods. Recent work in weakly supervised learning has shown that simple, low-dimensional classifiers can be trained using only the impure mixtures present in data. Here, we demonstrate that complex, high-dimensional classifiers can also be trained on impure mixtures using weak supervision techniques, with performance comparable to what could be achieved with pure samples. Using weak supervision will therefore allow us to avoid relying exclusively on simulations for high-dimensional classification. This work opens the door to a new regime whereby complex models are trained directly on data, providing direct access to probe the underlying physics.Comment: 6 pages, 2 tables, 2 figures. v2: updated to match PRD versio

arXiv.org e-Print Archive

DSpace@MIT

eScholarship - University of California

Pileup Mitigation with Machine Learning (PUMML)

Author: Komiske Patrick T.
Metodiev Eric M.
Nachman Benjamin
Schwartz Matthew D.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2017
Field of study

Pileup involves the contamination of the energy distribution arising from the primary collision of interest (leading vertex) by radiation from soft collisions (pileup). We develop a new technique for removing this contamination using machine learning and convolutional neural networks. The network takes as input the energy distribution of charged leading vertex particles, charged pileup particles, and all neutral particles and outputs the energy distribution of particles coming from leading vertex alone. The PUMML algorithm performs remarkably well at eliminating pileup distortion on a wide range of simple and complex jet observables. We test the robustness of the algorithm in a number of ways and discuss how the network can be trained directly on data.Comment: 20 pages, 8 figures, 2 tables. Updated to JHEP versio

arXiv.org e-Print Archive

DSpace@MIT

Crossref

Directory of Open Access Journals

eScholarship - University of California

OmniFold: A Method to Simultaneously Unfold All Observables

Author: Andreassen Anders
Komiske Patrick T.
Metodiev Eric M.
Nachman Benjamin
Thaler Jesse
Publication venue: 'American Physical Society (APS)'
Publication date: 16/04/2020
Field of study

Collider data must be corrected for detector effects ("unfolded") to be compared with many theoretical calculations and measurements from other experiments. Unfolding is traditionally done for individual, binned observables without including all information relevant for characterizing the detector response. We introduce OmniFold, an unfolding method that iteratively reweights a simulated dataset, using machine learning to capitalize on all available information. Our approach is unbinned, works for arbitrarily high-dimensional data, and naturally incorporates information from the full phase space. We illustrate this technique on a realistic jet substructure example from the Large Hadron Collider and compare it to standard binned unfolding methods. This new paradigm enables the simultaneous measurement of all observables, including those not yet invented at the time of the analysis.Comment: 8 pages, 3 figures, 1 table, 1 poem; v2: updated to approximate PRL versio

arXiv.org e-Print Archive

DSpace@MIT

eScholarship - University of California

A rescaled method for RBF approximation

Author: GE Fasshauer
GE Fasshauer
L Bos
M Buhmann
Nachman Aronszajn
Simone Deparis
Stefano Marchi De
Z Wu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/11/2016
Field of study

In the recent paper [8], a new method to compute stable kernel-based interpolants has been presented. This \textit{rescaled interpolation} method combines the standard kernel interpolation with a properly defined rescaling operation, which smooths the oscillations of the interpolant. Although promising, this procedure lacks a systematic theoretical investigation. Through our analysis, this novel method can be understood as standard kernel interpolation by means of a properly rescaled kernel. This point of view allow us to consider its error and stability properties

arXiv.org e-Print Archive

Crossref

A rescaled method for RBF approximation

Author: GE Fasshauer
GE Fasshauer
L Bos
M Buhmann
Nachman Aronszajn
Simone Deparis
Stefano Marchi De
Z Wu
Publication venue
Publication date: 01/01/2016
Field of study

A new method to compute stable kernel-based interpolants has been presented by the second and third authors. This rescaled interpolation method combines the standard kernel interpolation with a properly defined rescaling operation, which smooths the oscillations of the interpolant. Although promising, this procedure lacks a systematic theoretical investigation. Through our analysis, this novel method can be understood as standard kernel interpolation by means of a properly rescaled kernel. This point of view allow us to consider its error and stability properties. First, we prove that the method is an instance of the Shepard\u2019s method, when certain weight functions are used. In particular, the method can reproduce constant functions. Second, it is possible to define a modified set of cardinal functions strictly related to the ones of the not-rescaled kernel. Through these functions, we define a Lebesgue function for the rescaled interpolation process, and study its maximum - the Lebesgue constant - in different settings. Also, a preliminary theoretical result on the estimation of the interpolation error is presented. As an application, we couple our method with a partition of unity algorithm. This setting seems to be the most promising, and we illustrate its behavior with some experiments

arXiv.org e-Print Archive

Crossref

Archivio della ricerca - Fondazione Bruno Kessler

Archivio istituzionale della ricerca - Università di Padova

Climate change, adaptation, and phenotypic plasticity: the problem and the evidence

Author: Bell M. A.
Endler J. A.
Lynch M.
Nachman M. W.
West‐Eberhard M. J.
Publication venue
Publication date: 01/01/2014
Field of study

Peer reviewe

Crossref

PubMed Central

Helsingin yliopiston digitaalinen arkisto

A window into a public programme for prevention of mother-to-child transmission of HIV: evidence from a prospective clinical trial

Author: Coetzee J
Cotton M
Kim S
Nachman S
Rabie H
Publication venue: Southern African Journal of HIV Medicine
Publication date: 03/02/2010
Field of study

Objectives. To evaluate efficacy of the antenatal, intrapartum and postnatal antiretroviral components of a public service prevention of mother-to-child (PMTCT) programme in infants.Design. Analysis of prospectively collected screening data of demographic and MTCT-related interventions and HIV infection status of infants identified through HIV-specific DNA polymerase chain reaction.Setting. Tygerberg Children’s Hospital, Western Cape, South Africa.Subjects. HIV-infected women and their infants identified through participation in a public service PMTCT programme were referred for possible participation in a prospective study of isoniazid prophylaxis.Interventions. Key components of the programme include voluntary counselling and testing, administration of zidovudine to the mother from between 28 and 34 weeks’ gestation and to the newborn infant for the firstweek, single-dose nevirapine to the mother in labour and to the newborn shortly after birth, and free formula for 6 months.Main outcome measures. Number and percentage of HIV-infected infants and extent of exposure to antenatal, intrapartum and postnatal antiretrovirals.Results. Of 656 infants with a median age of 12.6 weeks, screened between 1 April 2005 through May 2006, 39 were HIV-infected, giving a transmission rate of 5.9% (95% confidence interval (CI) 4.4 - 8.0%). Antenatal prophylaxis was significantly associated with reduced transmission (odds ratio (OR) 0.43 (95% CI 0.21 - 0.94)) as opposed to intrapartum and postpartum components (p=0.85 and p=0.84, respectively). In multivariable analysis the antenatal component remained significant (OR=0.40 (95% CI 0.19 - 0.90)).Conclusions. The antenatal phase is the most important antiretroviral component of the PMTCT programme, allowing most opportunity for intervention

AJOL - African Journals Online