327 research outputs found
A Likelihood Approach to Incorporating Self-Report Data in HIV Recency Classification
Estimating new HIV infections is significant yet challenging due to the
difficulty in distinguishing between recent and long-term infections. We
demonstrate that HIV recency status (recent v.s. long-term) could be determined
from the combination of self-report testing history and biomarkers, which are
increasingly available in bio-behavioral surveys. HIV recency status is
partially observed, given the self-report testing history. For example, people
who tested positive for HIV over one year ago should have a long-term
infection. Based on the nationally representative samples collected by the
Population-based HIV Impact Assessment (PHIA) Project, we propose a
likelihood-based probabilistic model for HIV recency classification. The model
incorporates both labeled and unlabeled data and integrates the mechanism of
how HIV recency status depends on biomarkers and the mechanism of how HIV
recency status, together with the self-report time of the most recent HIV test,
impacts the test results, via a set of logistic regression models. We compare
our method to logistic regression and the binary classification tree (current
practice) on Malawi, Zimbabwe, and Zambia PHIA data, as well as on simulated
data. Our model obtains more efficient and less biased parameter estimates and
is relatively robust to potential reporting error and model misspecification
Towards Visually Explaining Variational Autoencoders
Recent advances in Convolutional Neural Network (CNN) model interpretability
have led to impressive progress in visualizing and understanding model
predictions. In particular, gradient-based visual attention methods have driven
much recent effort in using visual attention maps as a means for visual
explanations. A key problem, however, is these methods are designed for
classification and categorization tasks, and their extension to explaining
generative models, e.g. variational autoencoders (VAE) is not trivial. In this
work, we take a step towards bridging this crucial gap, proposing the first
technique to visually explain VAEs by means of gradient-based attention. We
present methods to generate visual attention from the learned latent space, and
also demonstrate such attention explanations serve more than just explaining
VAE predictions. We show how these attention maps can be used to localize
anomalies in images, demonstrating state-of-the-art performance on the MVTec-AD
dataset. We also show how they can be infused into model training, helping
bootstrap the VAE into learning improved latent space disentanglement,
demonstrated on the Dsprites dataset
2D excitation information by MPS method on infinite helixes
Understanding the excitation spectrum in two-dimensional quantum many-body
systems has long been a challenging task. We present an approach by introducing
an excitation ansatz based on an infinite matrix product state (MPS) on a helix
structure. With the canonical form of MPS states, we can accurately extract key
properties such as energy, degeneracy, spectrum weight, and scaling behavior of
low-energy excited states simultaneously. To validate the effectiveness of this
method, we begin by applying it to the critical point of the transverse-field
Ising model. The extracted scaling exponent of the energy gap closely aligns
with the conformal bootstrap results. Subsequently, we apply this approach to
the - Heisenberg model on a square lattice. We discover that the
degeneracy of lowest-energy excitations serves as a reliable metric for
distinguishing different phases. The phase boundary identified by our method is
consistent with some of the previous findings. The present method provides a
promising avenue for studying the excitation spectrum of two-dimensional
quantum many-body systems
Scale jump-aware pose graph relaxation for monocular SLAM with re-initializations
Pose graph relaxation has become an indispensable addition to SLAM enabling
efficient global registration of sensor reference frames under the objective of
satisfying pair-wise relative transformation constraints. The latter may be
given by incremental motion estimation or global place recognition. While the
latter case enables loop closures and drift compensation, care has to be taken
in the monocular case in which local estimates of structure and displacements
can differ from reality not just in terms of noise, but also in terms of a
scale factor. Owing to the accumulation of scale propagation errors, this scale
factor is drifting over time, hence scale-drift aware pose graph relaxation has
been introduced. We extend this idea to cases in which the relative scale
between subsequent sensor frames is unknown, a situation that can easily occur
if monocular SLAM enters re-initialization and no reliable overlap between
successive local maps can be identified. The approach is realized by a hybrid
pose graph formulation that combines the regular similarity consistency terms
with novel, scale-blind constraints. We apply the technique to the practically
relevant case of small indoor service robots capable of effectuating purely
rotational displacements, a condition that can easily cause tracking failures.
We demonstrate that globally consistent trajectories can be recovered even if
multiple re-initializations occur along the loop, and present an in-depth study
of success and failure cases.Comment: 8 pages, 23 figures, International Conference on Intelligent Robots
and Systems 202
Zero-shot Preference Learning for Offline RL via Optimal Transport
Preference-based Reinforcement Learning (PbRL) has demonstrated remarkable
efficacy in aligning rewards with human intentions. However, a significant
challenge lies in the need of substantial human labels, which is costly and
time-consuming. Additionally, the expensive preference data obtained from prior
tasks is not typically reusable for subsequent task learning, leading to
extensive labeling for each new task. In this paper, we propose a novel
zero-shot preference-based RL algorithm that leverages labeled preference data
from source tasks to infer labels for target tasks, eliminating the requirement
for human queries. Our approach utilizes Gromov-Wasserstein distance to align
trajectory distributions between source and target tasks. The solved optimal
transport matrix serves as a correspondence between trajectories of two tasks,
making it possible to identify corresponding trajectory pairs between tasks and
transfer the preference labels. However, learning directly from inferred labels
that contains a fraction of noisy labels will result in an inaccurate reward
function, subsequently affecting policy performance. To this end, we introduce
Robust Preference Transformer, which models the rewards as Gaussian
distributions and incorporates reward uncertainty in addition to reward mean.
The empirical results on robotic manipulation tasks of Meta-World and Robomimic
show that our method has strong capabilities of transferring preferences
between tasks and learns reward functions from noisy labels robustly.
Furthermore, we reveal that our method attains near-oracle performance with a
small proportion of scripted labels
Model and Algorithm for Linkage Disequilibrium Analysis in a Non-Equilibrium Population
The multilocus analysis of polymorphisms has emerged as a vital ingredient of population genetics and evolutionary biology. A fundamental assumption used for existing multilocus analysis approaches is Hardy–Weinberg equilibrium at which maternally- and paternally-derived gametes unite randomly during fertilization. Given the fact that natural populations are rarely panmictic, these approaches will have a significant limitation for practical use. We present a robust model for multilocus linkage disequilibrium analysis which does not rely on the assumption of random mating. This new disequilibrium model capitalizes on Weir’s definition of zygotic disequilibria and is based on an open-pollinated design in which multiple maternal individuals and their half-sib families are sampled from a natural population. This design captures two levels of associations: one is at the upper level that describes the pattern of cosegregation between different loci in the parental population and the other is at the lower level that specifies the extent of co-transmission of non-alleles at different loci from parents to their offspring. An MCMC method was implemented to estimate genetic parameters that define these associations. Simulation studies were used to validate the statistical behavior of the new model
- …