39 research outputs found
Signal mixture estimation for degenerate heavy Higgses using a deep neural network
If a new signal is established in future LHC data, a next question will be to
determine the signal composition, in particular whether the signal is due to
multiple near-degenerate states. We investigate the performance of a deep
learning approach to signal mixture estimation for the challenging scenario of
a ditau signal coming from a pair of degenerate Higgs bosons of opposite CP
charge. This constitutes a parameter estimation problem for a mixture model
with highly overlapping features. We use an unbinned maximum likelihood fit to
a neural network output, and compare the results to mixture estimation via a
fit to a single kinematic variable. For our benchmark scenarios we find a ~20%
improvement in the estimate uncertainty.Comment: v2, 12 pages, 7 figures, published in EPJ
Concept backpropagation: An Explainable AI approach for visualising learned concepts in neural network models
Neural network models are widely used in a variety of domains, often as
black-box solutions, since they are not directly interpretable for humans. The
field of explainable artificial intelligence aims at developing explanation
methods to address this challenge, and several approaches have been developed
over the recent years, including methods for investigating what type of
knowledge these models internalise during the training process. Among these,
the method of concept detection, investigates which \emph{concepts} neural
network models learn to represent in order to complete their tasks. In this
work, we present an extension to the method of concept detection, named
\emph{concept backpropagation}, which provides a way of analysing how the
information representing a given concept is internalised in a given neural
network model. In this approach, the model input is perturbed in a manner
guided by a trained concept probe for the described model, such that the
concept of interest is maximised. This allows for the visualisation of the
detected concept directly in the input space of the model, which in turn makes
it possible to see what information the model depends on for representing the
described concept. We present results for this method applied to a various set
of input modalities, and discuss how our proposed method can be used to
visualise what information trained concept probes use, and the degree as to
which the representation of the probed concept is entangled within the neural
network model itself
Vacuum free energy, quark condensate shifts and magnetization in three-flavor chiral perturbation theory to in a uniform magnetic field
We study three-flavor QCD in a uniform magnetic field using chiral
perturbation theory (PT). We construct the vacuum free energy density,
quark condensate shifts induced by the magnetic field and the renormalized
magnetization to in the chiral expansion. We find that the
calculation of the free energy is greatly simplified by cancellations among
two-loop diagrams involving charged mesons. In comparing our results with
recent -flavor lattice QCD data, we find that the light quark condensate
shift at is in better agreement than the shift at
. We also find that the renormalized magnetization, due to
its smallness, possesses large uncertainties at due to the
uncertainties in the low-energy constants.Comment: 23 pages, 3 sets of figures, matches published versio
Trilinear-Augmented Gaugino Mediation
We consider a gaugino-mediated supersymmetry breaking scenario where in
addition to the gauginos the Higgs fields couple directly to the field that
breaks supersymmetry. This yields non-vanishing trilinear scalar couplings in
general, which can lead to large mixing in the stop sector providing a
sufficiently large Higgs mass. Using the most recent release of FeynHiggs, we
show the implications on the parameter space. Assuming a gravitino LSP, we find
allowed points with a neutralino, sneutrino or stau NLSP. We test these points
against the results of Run 1 of the LHC, considering in particular searches for
heavy stable charged particles.Comment: 13 pages + references, 4 figures, v4: corrected plot labels in figs.
1-
AutoGCN -- Towards Generic Human Activity Recognition with Neural Architecture Search
This paper introduces AutoGCN, a generic Neural Architecture Search (NAS)
algorithm for Human Activity Recognition (HAR) using Graph Convolution Networks
(GCNs). HAR has gained attention due to advances in deep learning, increased
data availability, and enhanced computational capabilities. At the same time,
GCNs have shown promising results in modeling relationships between body key
points in a skeletal graph. While domain experts often craft dataset-specific
GCN-based methods, their applicability beyond this specific context is severely
limited. AutoGCN seeks to address this limitation by simultaneously searching
for the ideal hyperparameters and architecture combination within a versatile
search space using a reinforcement controller while balancing optimal
exploration and exploitation behavior with a knowledge reservoir during the
search process. We conduct extensive experiments on two large-scale datasets
focused on skeleton-based action recognition to assess the proposed algorithm's
performance. Our experimental results underscore the effectiveness of AutoGCN
in constructing optimal GCN architectures for HAR, outperforming conventional
NAS and GCN methods, as well as random search. These findings highlight the
significance of a diverse search space and an expressive input representation
to enhance the network performance and generalizability
From Movements to Metrics: Evaluating Explainable AI Methods in Skeleton-Based Human Activity Recognition
The advancement of deep learning in human activity recognition (HAR) using 3D
skeleton data is critical for applications in healthcare, security, sports, and
human-computer interaction. This paper tackles a well-known gap in the field,
which is the lack of testing in the applicability and reliability of XAI
evaluation metrics in the skeleton-based HAR domain. We have tested established
XAI metrics namely faithfulness and stability on Class Activation Mapping (CAM)
and Gradient-weighted Class Activation Mapping (Grad-CAM) to address this
problem. The study also introduces a perturbation method that respects human
biomechanical constraints to ensure realistic variations in human movement. Our
findings indicate that \textit{faithfulness} may not be a reliable metric in
certain contexts, such as with the EfficientGCN model. Conversely, stability
emerges as a more dependable metric when there is slight input data
perturbations. CAM and Grad-CAM are also found to produce almost identical
explanations, leading to very similar XAI metric performance. This calls for
the need for more diversified metrics and new XAI methods applied in
skeleton-based HAR
Trilinear-augmented gaugino mediation
We consider a gaugino-mediated supersymmetry breaking scenario where in addition to the gauginos the Higgs fields couple directly to the field that breaks supersymmetry. This yields non-vanishing trilinear scalar couplings in general, which can lead to large mixing in the stop sector providing a sufficiently large Higgs mass. Using the most recent release of FeynHiggs, we show the implications on the parameter space. Assuming a gravitino LSP, we find allowed points with a neutralino, sneutrino or stau NLSP. We test these points against the results of Run 1 of the LHC, considering in particular searches for heavy stable charged particles.publishedVersio