653 research outputs found

    Computational Difficulty of Global Variations in the Density Matrix Renormalization Group

    The density matrix renormalization group (DMRG) approach is arguably the most successful method to numerically find ground states of quantum spin chains. It amounts to iteratively locally optimizing matrix-product states, aiming to approximate the true ground state ever more closely. To date, both a proof of convergence to the globally best approximation and an assessment of its complexity are lacking. Here we establish a result on the computational complexity of an approximation with matrix-product states: The surprising result is that when one globally optimizes over several sites of local Hamiltonians, avoiding local optima, one encounters in the worst case a computationally difficult NP-hard problem (hard even in approximation). The proof exploits a novel way of relating it to binary quadratic programming. We discuss intriguing ramifications on the difficulty of describing quantum many-body systems. Comment: 5 pages, 1 figure, RevTeX, final version
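
    For reference (standard background, not specific to this paper's construction): a matrix-product state on an n-site chain with bond dimension D has the form

    \[ |\psi\rangle \;=\; \sum_{s_1,\dots,s_n} A^{[1]}_{s_1} A^{[2]}_{s_2} \cdots A^{[n]}_{s_n} \, |s_1 s_2 \dots s_n\rangle , \]

    where each A^{[k]}_{s_k} is a D x D matrix (vectors at the two boundaries). A DMRG sweep optimizes one or a few of these tensors at a time with the rest held fixed; the hardness result above concerns the variant that optimizes over several sites jointly.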

    Improving Gaussian mixture latent variable model convergence with Optimal Transport

    Generative models with both discrete and continuous latent variables are highly motivated by the structure of many real-world data sets. They present, however, subtleties in training, often manifesting in the discrete latent variable not being leveraged. In this paper, we show why such models struggle to train using traditional log-likelihood maximization, and that they are amenable to training using the Optimal Transport framework of Wasserstein Autoencoders. We find our discrete latent variable to be fully leveraged by the model when trained, without any modifications to the objective function or significant fine-tuning. Our model generates comparable samples to other approaches while using relatively simple neural networks, since the discrete latent variable carries much of the descriptive burden. Furthermore, the discrete latent variable provides significant control over generation.
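
    As background, the Optimal Transport framework referred to above is the Wasserstein Autoencoder (WAE) objective, which replaces the per-sample KL term of a VAE with a divergence between the aggregate posterior and the prior:

    \[ \min_{Q(Z \mid X)} \; \mathbb{E}_{P_X} \, \mathbb{E}_{Q(Z \mid X)} \big[ c\big(X, G(Z)\big) \big] \;+\; \lambda \, \mathcal{D}_Z\big(Q_Z, P_Z\big) , \]

    where c is a reconstruction cost, G is the decoder, Q_Z is the aggregate posterior, P_Z is the prior (here a mixture with a discrete component and Gaussian continuous components), and D_Z is, e.g., an MMD or adversarial penalty. The claim above is that under this objective the discrete component is actually used.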

    Learning Disentangled Representations with the Wasserstein Autoencoder

    Disentangled representation learning has undoubtedly benefited from objective function surgery. However, a delicate balancing act of tuning is still required in order to trade off reconstruction fidelity against disentanglement. Building on previous successes of penalizing the total correlation in the latent variables, we propose TCWAE (Total Correlation Wasserstein Autoencoder). Working in the WAE paradigm naturally enables the separation of the total-correlation term, thus providing disentanglement control over the learned representation, while offering more flexibility in the choice of reconstruction cost. We propose two variants using different KL estimators and analyse in turn the impact of having different ground cost functions and latent regularization terms. Extensive quantitative comparisons on data sets with known generative factors show that our methods present competitive results relative to state-of-the-art techniques. We further study the trade-off between disentanglement and reconstruction on more difficult data sets with unknown generative factors, where the flexibility of the WAE paradigm leads to improved reconstructions.
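
    The total-correlation term separated out above is the standard quantity (notation ours): for an aggregate posterior q(z) over latent dimensions z_1, ..., z_d,

    \[ \mathrm{TC}(z) \;=\; \mathrm{KL}\Big( q(z) \;\Big\|\; \prod_{j=1}^{d} q(z_j) \Big) , \]

    which vanishes exactly when the latent dimensions are independent. Weighting this term trades reconstruction fidelity against disentanglement, and the two TCWAE variants mentioned above differ in how this KL is estimated.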

    A Characterization of Visibility Graphs for Pseudo-Polygons

    In this paper, we give a characterization of the visibility graphs of pseudo-polygons. We first identify some key combinatorial properties of pseudo-polygons, and we then give a set of five necessary conditions based on these properties. We then prove that these necessary conditions are also sufficient via a reduction to a characterization of vertex-edge visibility graphs given by O'Rourke and Streinu.

    Combinatorial Assortment Optimization

    Assortment optimization refers to the problem of designing a slate of products to offer potential customers, such as stocking the shelves in a convenience store. The price of each product is fixed in advance, and a probabilistic choice function describes which product a customer will choose from any given subset. We introduce the combinatorial assortment problem, where each customer may select a bundle of products. We consider a model of consumer choice where the relative value of different bundles is described by a valuation function, while individual customers may differ in their absolute willingness to pay, and study the complexity of the resulting optimization problem. We show that any sub-polynomial approximation to the problem requires exponentially many demand queries when the valuation function is XOS, and that no FPTAS exists even for succinctly-representable submodular valuations. On the positive side, we show how to obtain constant approximations under a "well-priced" condition, where each product's price is sufficiently high. We also provide an exact algorithm for k-additive valuations, and show how to extend our results to a learning setting where the seller must infer the customers' preferences from their purchasing behavior.
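
    A minimal brute-force sketch of the resulting optimization problem, under an assumed quasi-linear choice rule in which a customer with willingness-to-pay theta buys the bundle S maximizing theta * v(S) minus the total price of S (the paper's exact choice model may differ; all names below are illustrative):

    from itertools import chain, combinations

    def subsets(items):
        """All subsets of a collection, including the empty set."""
        items = list(items)
        return chain.from_iterable(combinations(items, r) for r in range(len(items) + 1))

    def chosen_bundle(assortment, valuation, prices, theta):
        """Bundle picked by a customer with willingness-to-pay theta (assumed quasi-linear rule)."""
        return max(subsets(assortment),
                   key=lambda S: theta * valuation(frozenset(S)) - sum(prices[i] for i in S))

    def expected_revenue(assortment, valuation, prices, thetas):
        """Average revenue over sampled customers for one offered assortment."""
        revenue = 0.0
        for theta in thetas:
            bundle = chosen_bundle(assortment, valuation, prices, theta)
            revenue += sum(prices[i] for i in bundle)
        return revenue / len(thetas)

    def best_assortment(products, valuation, prices, thetas):
        """Exhaustive search over assortments -- exponential, for illustration only."""
        return max(subsets(products),
                   key=lambda A: expected_revenue(A, valuation, prices, thetas))

    # e.g. best_assortment(range(3), lambda S: len(S) ** 0.5, {0: 0.6, 1: 0.7, 2: 0.8}, [0.5, 1.0, 2.0])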

    Budget-restricted utility games with ordered strategic decisions

    We introduce the concept of budget games. Players choose a set of tasks, and each task has a certain demand on every resource in the game. Each resource has a budget. If the budget is not enough to satisfy the sum of all demands, it has to be shared between the tasks. We study strategic budget games, where the budget is shared proportionally. We also consider a variant in which the order of the strategic decisions influences the distribution of the budgets. The complexity of the optimal solution as well as the existence, complexity and quality of equilibria are analyzed. Finally, we show that the time an ordered budget game needs to converge to an equilibrium may be exponential.
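
    A small sketch of the proportional sharing rule described above (the data layout is ours): each resource pays demands in full while its budget suffices, and otherwise splits the budget in proportion to the demands placed on it.

    def proportional_shares(budget, demands):
        """Split one resource's budget among tasks; demands maps task -> demand."""
        total = sum(demands.values())
        if total <= budget:
            return dict(demands)  # budget suffices: every task is fully served
        return {task: budget * d / total for task, d in demands.items()}

    def task_utilities(budgets, demand):
        """budgets maps resource -> budget; demand maps (task, resource) -> demand.
        Returns each task's total utility, summed over all resources."""
        utilities = {}
        for resource, budget in budgets.items():
            on_resource = {t: d for (t, r), d in demand.items() if r == resource}
            for task, share in proportional_shares(budget, on_resource).items():
                utilities[task] = utilities.get(task, 0.0) + share
        return utilities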

    Near-optimal asymmetric binary matrix partitions

    We study the asymmetric binary matrix partition problem that was recently introduced by Alon et al. (WINE 2013) to model the impact of asymmetric information on the revenue of the seller in take-it-or-leave-it sales. Instances of the problem consist of an n × m binary matrix A and a probability distribution over its columns. A partition scheme B = (B_1, ..., B_n) consists of a partition B_i for each row i of A. The partition B_i acts as a smoothing operator on row i that distributes the expected value of each partition subset proportionally to all its entries. Given a scheme B that induces a smooth matrix A^B, the partition value is the expected maximum column entry of A^B. The objective is to find a partition scheme such that the resulting partition value is maximized. We present a 9/10-approximation algorithm for the case where the probability distribution is uniform and a (1 - 1/e)-approximation algorithm for non-uniform distributions, significantly improving results of Alon et al. Although our first algorithm is combinatorial (and very simple), the analysis is based on linear programming and duality arguments. In our second result we exploit a nice relation of the problem to submodular welfare maximization. Comment: 17 pages
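
    A direct sketch of these quantities as we read them from the abstract (the smoothing replaces each entry of a part by that part's expected value conditioned on the part; the partition value then takes the expected best row entry of a randomly drawn column):

    def smooth_matrix(A, p, scheme):
        """A: list of rows; p: column probabilities; scheme[i]: list of parts (iterables of column indices) for row i."""
        n, m = len(A), len(A[0])
        AB = [[0.0] * m for _ in range(n)]
        for i, parts in enumerate(scheme):
            for part in parts:
                mass = sum(p[j] for j in part)
                value = sum(p[j] * A[i][j] for j in part)
                for j in part:
                    AB[i][j] = value / mass if mass > 0 else 0.0
        return AB

    def partition_value(A, p, scheme):
        """Expected maximum column entry of the smoothed matrix A^B."""
        AB = smooth_matrix(A, p, scheme)
        n, m = len(A), len(A[0])
        return sum(p[j] * max(AB[i][j] for i in range(n)) for j in range(m))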

    Strong inapproximability of the shortest reset word

    The Černý conjecture states that every n-state synchronizing automaton has a reset word of length at most (n - 1)^2. We study the hardness of finding short reset words. It is known that the exact version of the problem, i.e., finding the shortest reset word, is NP-hard and coNP-hard, and complete for the DP class, and that approximating the length of the shortest reset word within a factor of O(log n) is NP-hard [Gerbush and Heeringa, CIAA'10], even for the binary alphabet [Berlinkov, DLT'13]. We significantly improve on these results by showing that, for every ε > 0, it is NP-hard to approximate the length of the shortest reset word within a factor of n^(1-ε). This is essentially tight since a simple O(n)-approximation algorithm exists. Comment: extended abstract to appear in MFCS 201
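
    For context, a reset (synchronizing) word is one that sends every state of the automaton to the same state; checking that property directly, with the automaton given as a transition table, looks as follows (a sketch of the definition, not the paper's algorithm):

    def is_reset_word(delta, states, word):
        """delta maps (state, letter) -> state; True iff `word` sends every start state to one common state."""
        endpoints = set()
        for q in states:
            for letter in word:
                q = delta[(q, letter)]
            endpoints.add(q)
        return len(endpoints) == 1

    # Example: a Cerny-style 4-state automaton over {a, b}; the length-9 word
    # below, matching the (n - 1)^2 bound for n = 4, synchronizes it.
    states = range(4)
    delta = {(q, 'a'): (q + 1) % 4 for q in states}
    delta.update({(q, 'b'): (1 if q == 0 else q) for q in states})
    print(is_reset_word(delta, states, "baaabaaab"))  # True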

    Near-Optimal Asymmetric Binary Matrix Partitions

    We study the asymmetric binary matrix partition problem that was recently introduced by Alon et al. (Proceedings of the 9th Conference on Web and Internet Economics (WINE), pp 1–14, 2013). Instances of the problem consist of an n × m binary matrix A and a probability distribution over its columns. A partition scheme B = (B_1, …, B_n) consists of a partition B_i for each row i of A. The partition B_i acts as a smoothing operator on row i that distributes the expected value of each partition subset proportionally to all its entries. Given a scheme B that induces a smooth matrix A^B, the partition value is the expected maximum column entry of A^B. The objective is to find a partition scheme such that the resulting partition value is maximized. We present a 9/10-approximation algorithm for the case where the probability distribution is uniform and a (1 - 1/e)-approximation algorithm for non-uniform distributions, significantly improving results of Alon et al. Although our first algorithm is combinatorial (and very simple), the analysis is based on linear programming and duality arguments. In our second result we exploit a nice relation of the problem to submodular welfare maximization.

    Improving latent variable descriptiveness by modelling rather than ad-hoc factors

    Powerful generative models, particularly in natural language modelling, are commonly trained by maximizing a variational lower bound on the data log-likelihood. These models often suffer from poor use of their latent variable, with ad-hoc annealing factors used to encourage retention of information in the latent variable. We discuss an alternative and general approach to latent variable modelling, based on an objective that encourages a perfect reconstruction by tying a stochastic autoencoder with a variational autoencoder (VAE). This ensures by design that the latent variable captures information about the observations, whilst retaining the ability to generate well. Interestingly, although our model is fundamentally different to a VAE, the lower bound attained is identical to the standard VAE bound but with the addition of a simple pre-factor, thus providing a formal interpretation of the commonly used, ad-hoc pre-factors in training VAEs.
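
    For orientation, the ad-hoc pre-factors mentioned above usually enter the standard VAE bound as a scalar weight beta on the KL term (notation ours):

    \[ \mathcal{L}_\beta(x) \;=\; \mathbb{E}_{q_\phi(z \mid x)}\big[ \log p_\theta(x \mid z) \big] \;-\; \beta \, \mathrm{KL}\big( q_\phi(z \mid x) \,\|\, p(z) \big) , \]

    with beta = 1 recovering the usual lower bound on log p(x) and beta otherwise annealed by hand. The point above is that tying a stochastic autoencoder to the VAE yields a bound of this same general form whose pre-factor comes out of the construction rather than being tuned.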