254 research outputs found
Group Learning and Opinion Diffusion in a Broadcast Network
We analyze the following group learning problem in the context of opinion
diffusion: Consider a network with users, each facing options. In a
discrete time setting, at each time step, each user chooses out of the
options, and receive randomly generated rewards, whose statistics depend on the
options chosen as well as the user itself, and are unknown to the users. Each
user aims to maximize their expected total rewards over a certain time horizon
through an online learning process, i.e., a sequence of exploration (sampling
the return of each option) and exploitation (selecting empirically good
options) steps.
Within this context we consider two group learning scenarios, (1) users with
uniform preferences and (2) users with diverse preferences, and examine how a
user should construct its learning process to best extract information from
other's decisions and experiences so as to maximize its own reward. Performance
is measured in {\em weak regret}, the difference between the user's total
reward and the reward from a user-specific best single-action policy (i.e.,
always selecting the set of options generating the highest mean rewards for
this user). Within each scenario we also consider two cases: (i) when users
exchange full information, meaning they share the actual rewards they obtained
from their choices, and (ii) when users exchange limited information, e.g.,
only their choices but not rewards obtained from these choices
To Stay Or To Switch: Multiuser Dynamic Channel Access
In this paper we study opportunistic spectrum access (OSA) policies in a
multiuser multichannel random access cognitive radio network, where users
perform channel probing and switching in order to obtain better channel
condition or higher instantaneous transmission quality. However, unlikely many
prior works in this area, including those on channel probing and switching
policies for a single user to exploit spectral diversity, and on probing and
access policies for multiple users over a single channel to exploit temporal
and multiuser diversity, in this study we consider the collective switching of
multiple users over multiple channels. In addition, we consider finite
arrivals, i.e., users are not assumed to always have data to send and demand
for channel follow a certain arrival process. Under such a scenario, the users'
ability to opportunistically exploit temporal diversity (the temporal variation
in channel quality over a single channel) and spectral diversity (quality
variation across multiple channels at a given time) is greatly affected by the
level of congestion in the system. We investigate the optimal decision process
in this case, and evaluate the extent to which congestion affects potential
gains from opportunistic dynamic channel switching
Optimal Relay Selection with Non-negligible Probing Time
In this paper an optimal relay selection algorithm with non-negligible
probing time is proposed and analyzed for cooperative wireless networks. Relay
selection has been introduced to solve the degraded bandwidth efficiency
problem in cooperative communication. Yet complete information of relay
channels often remain unavailable for complex networks which renders the
optimal selection strategies impossible for transmission source without probing
the relay channels. Particularly when the number of relay candidate is large,
even though probing all relay channels guarantees the finding of the best
relays at any time instant, the degradation of bandwidth efficiency due to
non-negligible probing times, which was often neglected in past literature, is
also significant. In this work, a stopping rule based relay selection strategy
is determined for the source node to decide when to stop the probing process
and choose one of the probed relays to cooperate with under wireless channels'
stochastic uncertainties. This relay selection strategy is further shown to
have a simple threshold structure. At the meantime, full diversity order and
high bandwidth efficiency can be achieved simultaneously. Both analytical and
simulation results are provided to verify the claims.Comment: 8 pages. ICC 201
MeshAdv: Adversarial Meshes for Visual Recognition
Highly expressive models such as deep neural networks (DNNs) have been widely
applied to various applications. However, recent studies show that DNNs are
vulnerable to adversarial examples, which are carefully crafted inputs aiming
to mislead the predictions. Currently, the majority of these studies have
focused on perturbation added to image pixels, while such manipulation is not
physically realistic. Some works have tried to overcome this limitation by
attaching printable 2D patches or painting patterns onto surfaces, but can be
potentially defended because 3D shape features are intact. In this paper, we
propose meshAdv to generate "adversarial 3D meshes" from objects that have rich
shape features but minimal textural variation. To manipulate the shape or
texture of the objects, we make use of a differentiable renderer to compute
accurate shading on the shape and propagate the gradient. Extensive experiments
show that the generated 3D meshes are effective in attacking both classifiers
and object detectors. We evaluate the attack under different viewpoints. In
addition, we design a pipeline to perform black-box attack on a photorealistic
renderer with unknown rendering parameters.Comment: Published in IEEE CVPR201
Long-Term Fairness with Unknown Dynamics
While machine learning can myopically reinforce social inequalities, it may
also be used to dynamically seek equitable outcomes. In this paper, we
formalize long-term fairness in the context of online reinforcement learning.
This formulation can accommodate dynamical control objectives, such as driving
equity inherent in the state of a population, that cannot be incorporated into
static formulations of fairness. We demonstrate that this framing allows an
algorithm to adapt to unknown dynamics by sacrificing short-term incentives to
drive a classifier-population system towards more desirable equilibria. For the
proposed setting, we develop an algorithm that adapts recent work in online
learning. We prove that this algorithm achieves simultaneous probabilistic
bounds on cumulative loss and cumulative violations of fairness (as statistical
regularities between demographic groups). We compare our proposed algorithm to
the repeated retraining of myopic classifiers, as a baseline, and to a deep
reinforcement learning algorithm that lacks safety guarantees. Our experiments
model human populations according to evolutionary game theory and integrate
real-world datasets
- …