7,302 research outputs found
Accelerating Message Passing for MAP with Benders Decomposition
We introduce a novel mechanism to tighten the local polytope relaxation for
MAP inference in Markov random fields with low state space variables. We
consider a surjection of the variables to a set of hyper-variables and apply
the local polytope relaxation over these hyper-variables. The state space of
each individual hyper-variable is constructed to be enumerable while the vector
product of pairs is not easily enumerable making message passing inference
intractable.
To circumvent the difficulty of enumerating the vector product of state
spaces of hyper-variables we introduce a novel Benders decomposition approach.
This produces an upper envelope describing the message constructed from affine
functions of the individual variables that compose the hyper-variable receiving
the message. The envelope is tight at the minimizers which are shared by the
true message. Benders rows are constructed to be Pareto optimal and are
generated using an efficient procedure targeted for binary problems
Secrecy Beamforming for SWIPT MISO Heterogeneous Cellular Networks
In this paper, we consider the secure transmission design for a
multiple-input single-output Femtocell overlaid with a Macrocell in co-channel
deployment. The Femtocell base station sends confidential messages to
information receiving Femtocell users (FUs) and energy signals to energy
receiving (ER) FUs while limiting the interference to Macrocell users (MUs).
The ER FUs have the potential to wiretap the confidential messages. By taking
fairness into account, we propose a sum logarithmic secrecy rate maximization
beamforming design problem under the interference constraints for MUs and
energy harvesting (EH) constraints for ER FUs. The formulated design problem is
nontrivial to solve due to the nonconvexity which lies in the objective and the
constraints. To tackle the design problem, a semidefinite relaxation and
successive convex approximation based algorithm is proposed. Simulation results
demonstrate the effectiveness of the proposed beamforming design.Comment: 5 pages, 2 figures, Conference paper, Presented in BSC 201
Distributed Join-the-Idle-Queue for Low Latency Cloud Services
Low latency is highly desirable for cloud services. To achieve low response
time, stringent timing requirements are needed for task scheduling in a
large-scale server farm spanning thousands of servers. In this paper, we
conduct an in-depth analysis for distributed Join-the-Idle-Queue (JIQ), a
promising new approximation of an idealized task-scheduling algorithm. In
particular, we derive semi-closed form expressions for the delay performance of
distributed JIQ, and we propose a new variant of distributed JIQ that offers
clear advantages over alternative algorithms for large systems.Comment: 10 pages, 10 figures. This work is inspired by Michael Mitzenmacher's
recent work on distributed Join-Idle-Queue (arXiv:1606.01833) as well as
Alexander Stolyar's recent work on centralized Join-Idle-Queue
(arXiv:1512.07873 and arXiv:1407.6343
Exploiting skeletal structure in computer vision annotation with Benders decomposition
Many annotation problems in computer vision can be phrased as integer linear
programs (ILPs). The use of standard industrial solvers does not to exploit the
underlying structure of such problems eg, the skeleton in pose estimation. The
leveraging of the underlying structure in conjunction with industrial solvers
promises increases in both speed and accuracy. Such structure can be exploited
using Bender's decomposition, a technique from operations research, that solves
complex ILPs or mixed integer linear programs by decomposing them into
sub-problems that communicate via a master problem. The intuition is that
conditioned on a small subset of the variables the solution to the remaining
variables can be computed easily by taking advantage of properties of the ILP
constraint matrix such as block structure. In this paper we apply Benders
decomposition to a typical problem in computer vision where we have many
sub-ILPs (eg, partitioning of detections, body-parts) coupled to a master ILP
(eg, constructing skeletons). Dividing inference problems into a master problem
and sub-problems motivates the development of a plethora of novel models, and
inference approaches for the field of computer vision
Visually-Aware Fashion Recommendation and Design with Generative Image Models
Building effective recommender systems for domains like fashion is
challenging due to the high level of subjectivity and the semantic complexity
of the features involved (i.e., fashion styles). Recent work has shown that
approaches to `visual' recommendation (e.g.~clothing, art, etc.) can be made
more accurate by incorporating visual signals directly into the recommendation
objective, using `off-the-shelf' feature representations derived from deep
networks. Here, we seek to extend this contribution by showing that
recommendation performance can be significantly improved by learning `fashion
aware' image representations directly, i.e., by training the image
representation (from the pixel level) and the recommender system jointly; this
contribution is related to recent work using Siamese CNNs, though we are able
to show improvements over state-of-the-art recommendation techniques such as
BPR and variants that make use of pre-trained visual features. Furthermore, we
show that our model can be used \emph{generatively}, i.e., given a user and a
product category, we can generate new images (i.e., clothing items) that are
most consistent with their personal taste. This represents a first step towards
building systems that go beyond recommending existing items from a product
corpus, but which can be used to suggest styles and aid the design of new
products.Comment: 10 pages, 6 figures. Accepted by ICDM'17 as a long pape
Efficient Multi-Person Pose Estimation with Provable Guarantees
Multi-person pose estimation (MPPE) in natural images is key to the
meaningful use of visual data in many fields including movement science,
security, and rehabilitation. In this paper we tackle MPPE with a bottom-up
approach, starting with candidate detections of body parts from a convolutional
neural network (CNN) and grouping them into people. We formulate the grouping
of body part detections into people as a minimum-weight set packing (MWSP)
problem where the set of potential people is the power set of body part
detections. We model the quality of a hypothesis of a person which is a set in
the MWSP by an augmented tree-structured Markov random field where variables
correspond to body-parts and their state-spaces correspond to the power set of
the detections for that part.
We describe a novel algorithm that combines efficiency with provable bounds
on this MWSP problem. We employ an implicit column generation strategy where
the pricing problem is formulated as a dynamic program. To efficiently solve
this dynamic program we exploit the problem structure utilizing a nested
Bender's decomposition (NBD) exact inference strategy which we speed up by
recycling Bender's rows between calls to the pricing problem.
We test our approach on the MPII-Multiperson dataset, showing that our
approach obtains comparable results with the state-of-the-art algorithm for
joint node labeling and grouping problems, and that NBD achieves considerable
speed-ups relative to a naive dynamic programming approach. Typical algorithms
that solve joint node labeling and grouping problems use heuristics and thus
can not obtain proofs of optimality. Our approach, in contrast, proves that for
over 99 percent of problem instances we find the globally optimal solution and
otherwise provide upper/lower bounds
Sherlock: Sparse Hierarchical Embeddings for Visually-aware One-class Collaborative Filtering
Building successful recommender systems requires uncovering the underlying
dimensions that describe the properties of items as well as users' preferences
toward them. In domains like clothing recommendation, explaining users'
preferences requires modeling the visual appearance of the items in question.
This makes recommendation especially challenging, due to both the complexity
and subtlety of people's 'visual preferences,' as well as the scale and
dimensionality of the data and features involved. Ultimately, a successful
model should be capable of capturing considerable variance across different
categories and styles, while still modeling the commonalities explained by
`global' structures in order to combat the sparsity (e.g. cold-start),
variability, and scale of real-world datasets. Here, we address these
challenges by building such structures to model the visual dimensions across
different product categories. With a novel hierarchical embedding architecture,
our method accounts for both high-level (colorfulness, darkness, etc.) and
subtle (e.g. casualness) visual characteristics simultaneously.Comment: 7 pages, 3 figure
Vista: A Visually, Socially, and Temporally-aware Model for Artistic Recommendation
Understanding users' interactions with highly subjective content---like
artistic images---is challenging due to the complex semantics that guide our
preferences. On the one hand one has to overcome `standard' recommender systems
challenges, such as dealing with large, sparse, and long-tailed datasets. On
the other, several new challenges present themselves, such as the need to model
content in terms of its visual appearance, or even social dynamics, such as a
preference toward a particular artist that is independent of the art they
create.
In this paper we build large-scale recommender systems to model the dynamics
of a vibrant digital art community, Behance, consisting of tens of millions of
interactions (clicks and `appreciates') of users toward digital art.
Methodologically, our main contributions are to model (a) rich content,
especially in terms of its visual appearance; (b) temporal dynamics, in terms
of how users prefer `visually consistent' content within and across sessions;
and (c) social dynamics, in terms of how users exhibit preferences both towards
certain art styles, as well as the artists themselves.Comment: 8 pages, 3 figure
Tracking Objects with Higher Order Interactions using Delayed Column Generation
We study the problem of multi-target tracking and data association in video.
We formulate this in terms of selecting a subset of high-quality tracks subject
to the constraint that no pair of selected tracks is associated with a common
detection (of an object). This objective is equivalent to the classic NP-hard
problem of finding a maximum-weight set packing (MWSP) where tracks correspond
to sets and is made further difficult since the number of candidate tracks
grows exponentially in the number of detections. We present a relaxation of
this combinatorial problem that uses a column generation formulation where the
pricing problem is solved via dynamic programming to efficiently explore the
space of tracks. We employ row generation to tighten the bound in such a way as
to preserve efficient inference in the pricing problem. We show the practical
utility of this algorithm for tracking problems in natural and biological video
datasets
A New Asymptotic Analysis Technique for Diversity Receptions Over Correlated Lognormal Fading Channels
Prior asymptotic performance analyses are based on the series expansion of
the moment-generating function (MGF) or the probability density function (PDF)
of channel coefficients. However, these techniques fail for lognormal fading
channels because the Taylor series of the PDF of a lognormal random variable is
zero at the origin and the MGF does not have an explicit form. Although
lognormal fading model has been widely applied in wireless communications and
free-space optical communications, few analytical tools are available to
provide elegant performance expressions for correlated lognormal channels. In
this work, we propose a novel framework to analyze the asymptotic outage
probabilities of selection combining (SC), equal-gain combining (EGC) and
maximum-ratio combining (MRC) over equally correlated lognormal fading
channels. Based on these closed-form results, we reveal the followings: i) the
outage probability of EGC or MRC becomes an infinitely small quantity compared
to that of SC at large signal-to-noise ratio (SNR); ii) channel correlation can
result in an infinite performance loss at large SNR. More importantly, the
analyses reveal insights into the long-standing problem of performance analyses
over correlated lognormal channels at high SNR, and circumvent the
time-consuming Monte Carlo simulation and numerical integration
- …