4,214 research outputs found
Classification without labels: Learning from mixed samples in high energy physics
Modern machine learning techniques can be used to construct powerful models
for difficult collider physics problems. In many applications, however, these
models are trained on imperfect simulations due to a lack of truth-level
information in the data, which risks the model learning artifacts of the
simulation. In this paper, we introduce the paradigm of classification without
labels (CWoLa) in which a classifier is trained to distinguish statistical
mixtures of classes, which are common in collider physics. Crucially, neither
individual labels nor class proportions are required, yet we prove that the
optimal classifier in the CWoLa paradigm is also the optimal classifier in the
traditional fully-supervised case where all label information is available.
After demonstrating the power of this method in an analytical toy example, we
consider a realistic benchmark for collider physics: distinguishing quark-
versus gluon-initiated jets using mixed quark/gluon training samples. More
generally, CWoLa can be applied to any classification problem where labels or
class proportions are unknown or simulations are unreliable, but statistical
mixtures of the classes are available.Comment: 18 pages, 5 figures; v2: intro extended and references added; v3:
additional discussion to match JHEP versio
On similarity solutions of a boundary layer problem with an upstream moving wall
The problem of a boundary layer on a flat plate which has a constant velocity opposite in direction to that of the uniform mainstream is examined. It was previously shown that the solution of this boundary value problem is crucially dependent on the parameter which is the ratio of the velocity of the plate to the velocity of the free stream. In particular, it was proved that a solution exists only if this parameter does not exceed a certain critical value, and numerical evidence was adduced to show that this solution is nonunique. Using Crocco formulation the present work proves this nonuniqueness. Also considered are the analyticity of solutions and the derivation of upper bounds on the critical value of wall velocity parameter
Recommended from our members
Exploring the Relationship Between School Organizational Health, Advice Seeking Networks, and Student Behavior
The purpose of this study was to explore the relationship between organizational health and advice seeking behavior of school staff around students exhibiting social, emotional, or behavioral concerns. School staff are front line responders to mental/behavioral health issues and it would benefit schools to better understand the organizational factors that influence advice seeking behavior and the affect that school climate amongst teachers has on student behavior. This study investigated the climates and communication patterns of two urban elementary schools. Social network analysis was used to visualize and analyze both schools’ respective networks. School staff completed the Organizational Health Inventory as well as provided data regarding which staff members they have sought out and received helpful advice in regards to students social, emotional, and behavioral functioning. Findings suggest that high levels of organizational health were associated with frequent advice seeking behavior. Individuals were more likely to reach out to staff who had longer tenure, held administrative positions, and those working closely together (e.g., grade level). Implications include creating system norms and overcoming other barriers to enhance advice seeking behavior among staff
Learning to Classify from Impure Samples with High-Dimensional Data
A persistent challenge in practical classification tasks is that labeled
training sets are not always available. In particle physics, this challenge is
surmounted by the use of simulations. These simulations accurately reproduce
most features of data, but cannot be trusted to capture all of the complex
correlations exploitable by modern machine learning methods. Recent work in
weakly supervised learning has shown that simple, low-dimensional classifiers
can be trained using only the impure mixtures present in data. Here, we
demonstrate that complex, high-dimensional classifiers can also be trained on
impure mixtures using weak supervision techniques, with performance comparable
to what could be achieved with pure samples. Using weak supervision will
therefore allow us to avoid relying exclusively on simulations for
high-dimensional classification. This work opens the door to a new regime
whereby complex models are trained directly on data, providing direct access to
probe the underlying physics.Comment: 6 pages, 2 tables, 2 figures. v2: updated to match PRD versio
Pileup Mitigation with Machine Learning (PUMML)
Pileup involves the contamination of the energy distribution arising from the
primary collision of interest (leading vertex) by radiation from soft
collisions (pileup). We develop a new technique for removing this contamination
using machine learning and convolutional neural networks. The network takes as
input the energy distribution of charged leading vertex particles, charged
pileup particles, and all neutral particles and outputs the energy distribution
of particles coming from leading vertex alone. The PUMML algorithm performs
remarkably well at eliminating pileup distortion on a wide range of simple and
complex jet observables. We test the robustness of the algorithm in a number of
ways and discuss how the network can be trained directly on data.Comment: 20 pages, 8 figures, 2 tables. Updated to JHEP versio
OmniFold: A Method to Simultaneously Unfold All Observables
Collider data must be corrected for detector effects ("unfolded") to be
compared with many theoretical calculations and measurements from other
experiments. Unfolding is traditionally done for individual, binned observables
without including all information relevant for characterizing the detector
response. We introduce OmniFold, an unfolding method that iteratively reweights
a simulated dataset, using machine learning to capitalize on all available
information. Our approach is unbinned, works for arbitrarily high-dimensional
data, and naturally incorporates information from the full phase space. We
illustrate this technique on a realistic jet substructure example from the
Large Hadron Collider and compare it to standard binned unfolding methods. This
new paradigm enables the simultaneous measurement of all observables, including
those not yet invented at the time of the analysis.Comment: 8 pages, 3 figures, 1 table, 1 poem; v2: updated to approximate PRL
versio
A rescaled method for RBF approximation
In the recent paper [8], a new method to compute stable kernel-based
interpolants has been presented. This \textit{rescaled interpolation} method
combines the standard kernel interpolation with a properly defined rescaling
operation, which smooths the oscillations of the interpolant. Although
promising, this procedure lacks a systematic theoretical investigation. Through
our analysis, this novel method can be understood as standard kernel
interpolation by means of a properly rescaled kernel. This point of view allow
us to consider its error and stability properties
A rescaled method for RBF approximation
A new method to compute stable kernel-based interpolants
has been presented by the second and third authors. This rescaled interpolation method combines the
standard kernel interpolation with a properly defined rescaling operation, which
smooths the oscillations of the interpolant. Although promising, this procedure
lacks a systematic theoretical investigation.
Through our analysis, this novel method can be understood as standard
kernel interpolation by means of a properly rescaled kernel. This point of view
allow us to consider its error and stability properties.
First, we prove that the method is an instance of the Shepard\u2019s method,
when certain weight functions are used. In particular, the method can reproduce
constant functions.
Second, it is possible to define a modified set of cardinal functions strictly
related to the ones of the not-rescaled kernel. Through these functions, we
define a Lebesgue function for the rescaled interpolation process, and study its
maximum - the Lebesgue constant - in different settings.
Also, a preliminary theoretical result on the estimation of the interpolation
error is presented.
As an application, we couple our method with a partition of unity algorithm.
This setting seems to be the most promising, and we illustrate its behavior with
some experiments
A window into a public programme for prevention of mother-to-child transmission of HIV: evidence from a prospective clinical trial
Objectives. To evaluate efficacy of the antenatal, intrapartum and postnatal antiretroviral components of a public service prevention of mother-to-child (PMTCT) programme in infants.Design. Analysis of prospectively collected screening data of demographic and MTCT-related interventions and HIV infection status of infants identified through HIV-specific DNA polymerase chain reaction.Setting. Tygerberg Children’s Hospital, Western Cape, South Africa.Subjects. HIV-infected women and their infants identified through participation in a public service PMTCT programme were referred for possible participation in a prospective study of isoniazid prophylaxis.Interventions. Key components of the programme include voluntary counselling and testing, administration of zidovudine to the mother from between 28 and 34 weeks’ gestation and to the newborn infant for the firstweek, single-dose nevirapine to the mother in labour and to the newborn shortly after birth, and free formula for 6 months.Main outcome measures. Number and percentage of HIV-infected infants and extent of exposure to antenatal, intrapartum and postnatal antiretrovirals.Results. Of 656 infants with a median age of 12.6 weeks, screened between 1 April 2005 through May 2006, 39 were HIV-infected, giving a transmission rate of 5.9% (95% confidence interval (CI) 4.4 - 8.0%). Antenatal prophylaxis was significantly associated with reduced transmission (odds ratio (OR) 0.43 (95% CI 0.21 - 0.94)) as opposed to intrapartum and postpartum components (p=0.85 and p=0.84, respectively). In multivariable analysis the antenatal component remained significant (OR=0.40 (95% CI 0.19 - 0.90)).Conclusions. The antenatal phase is the most important antiretroviral component of the PMTCT programme, allowing most opportunity for intervention
- …