317 research outputs found
Stochastic Primal-Dual Coordinate Method for Regularized Empirical Risk Minimization
We consider a generic convex optimization problem associated with regularized
empirical risk minimization of linear predictors. The problem structure allows
us to reformulate it as a convex-concave saddle point problem. We propose a
stochastic primal-dual coordinate (SPDC) method, which alternates between
maximizing over a randomly chosen dual variable and minimizing over the primal
variable. An extrapolation step on the primal variable is performed to obtain
accelerated convergence rate. We also develop a mini-batch version of the SPDC
method which facilitates parallel computing, and an extension with weighted
sampling probabilities on the dual variables, which has a better complexity
than uniform sampling on unnormalized data. Both theoretically and empirically,
we show that the SPDC method has comparable or better performance than several
state-of-the-art optimization methods
Effect of interaction with neutrons in matter on flavor conversion of super-light sterile neutrino with active neutrino
A super-light sterile neutrino was proposed to explain the absence of the
expected upturn of the survival probability of low energy solar boron
neutrinos. This is because this super-light sterile neutrino can oscillate
efficiently with electron neutrino through a MSW resonance happened in Sun. One
may naturally expect that a similar resonance should happen for neutrinos
propagating in Earth matter. We study the flavor conversion of this super-light
sterile neutrino with active neutrinos in Earth matter. We find that the
scenario of the super-light sterile neutrino can easily pass through possible
constraints from experiments which can test the Earth matter effect in
oscillation of neutrinos. Interestinlgy, we find that this is because the
naively expected resonant conversion disappears or is significantly suppressed
due to the presence of a potential which arises from neutral current
interaction of neutrino with neutrons in matter. In contrast, the neutron
number density in the Sun is negligible and the effect of is effectively
switched off. This enables the MSW resonance in Sun needed in oscillation of
the super-light sterile neutrino with solar electron neutrinos. It's
interesting to note that it is the different situation in the Sun and in the
Earth that makes effectively turned off and turned on respectively. This
observation makes the scenario of the super-light sterile neutrino quite
interesting.Comment: 22 pages, 10 figure
Near-Optimal Adversarial Policy Switching for Decentralized Asynchronous Multi-Agent Systems
A key challenge in multi-robot and multi-agent systems is generating
solutions that are robust to other self-interested or even adversarial parties
who actively try to prevent the agents from achieving their goals. The
practicality of existing works addressing this challenge is limited to only
small-scale synchronous decision-making scenarios or a single agent planning
its best response against a single adversary with fixed, procedurally
characterized strategies. In contrast this paper considers a more realistic
class of problems where a team of asynchronous agents with limited observation
and communication capabilities need to compete against multiple strategic
adversaries with changing strategies. This problem necessitates agents that can
coordinate to detect changes in adversary strategies and plan the best response
accordingly. Our approach first optimizes a set of stratagems that represent
these best responses. These optimized stratagems are then integrated into a
unified policy that can detect and respond when the adversaries change their
strategies. The near-optimality of the proposed framework is established
theoretically as well as demonstrated empirically in simulation and hardware
Modality Adaption or Regularization? A Case Study on End-to-End Speech Translation
Pre-training and fine-tuning is a paradigm for alleviating the data scarcity
problem in end-to-end speech translation (E2E ST). The commonplace "modality
gap" between speech and text data often leads to inconsistent inputs between
pre-training and fine-tuning. However, we observe that this gap occurs in the
early stages of fine-tuning, but does not have a major impact on the final
performance. On the other hand, we find that there has another gap, which we
call the "capacity gap": high resource tasks (such as ASR and MT) always
require a large model to fit, when the model is reused for a low resource task
(E2E ST), it will get a sub-optimal performance due to the over-fitting. In a
case study, we find that the regularization plays a more important role than
the well-designed modality adaption method, which achieves 29.0 for en-de and
40.3 for en-fr on the MuST-C dataset. Code and models are available at
https://github.com/hannlp/TAB.Comment: ACL 2023 Main Conferenc
On-Robot Bayesian Reinforcement Learning for POMDPs
Robot learning is often difficult due to the expense of gathering data. The
need for large amounts of data can, and should, be tackled with effective
algorithms and leveraging expert information on robot dynamics. Bayesian
reinforcement learning (BRL), thanks to its sample efficiency and ability to
exploit prior knowledge, is uniquely positioned as such a solution method.
Unfortunately, the application of BRL has been limited due to the difficulties
of representing expert knowledge as well as solving the subsequent inference
problem. This paper advances BRL for robotics by proposing a specialized
framework for physical systems. In particular, we capture this knowledge in a
factored representation, then demonstrate the posterior factorizes in a similar
shape, and ultimately formalize the model in a Bayesian framework. We then
introduce a sample-based online solution method, based on Monte-Carlo tree
search and particle filtering, specialized to solve the resulting model. This
approach can, for example, utilize typical low-level robot simulators and
handle uncertainty over unknown dynamics of the environment. We empirically
demonstrate its efficiency by performing on-robot learning in two human-robot
interaction tasks with uncertainty about human behavior, achieving near-optimal
performance after only a handful of real-world episodes. A video of learned
policies is at https://youtu.be/H9xp60ngOes.Comment: Accepted at IROS-2023 (Detroit, USA
Web News Timeline Generation with Extended Task Prompting
The creation of news timeline is essential for a comprehensive and contextual
understanding of events as they unfold over time. This approach aids in
discerning patterns and trends that might be obscured when news is viewed in
isolation. By organizing news in a chronological sequence, it becomes easier to
track the development of stories, understand the interrelation of events, and
grasp the broader implications of news items. This is particularly helpful in
sectors like finance and insurance, where timely understanding of the event
development-ranging from extreme weather to political upheavals and health
crises-is indispensable for effective risk management. While traditional
natural language processing (NLP) techniques have had some success, they often
fail to capture the news with nuanced relevance that are readily apparent to
domain experts, hindering broader industry integration. The advance of Large
Language Models (LLMs) offers a renewed opportunity to tackle this challenge.
However, direct prompting LLMs for this task is often ineffective. Our study
investigates the application of an extended task prompting technique to assess
past news relevance. We demonstrate that enhancing conventional prompts with
additional tasks boosts their effectiveness on various news dataset, rendering
news timeline generation practical for professional use. This work has been
deployed as a publicly accessible browser extension which is adopted within our
network.Comment: 4 page
- …