Anderson's orthogonality catastrophe
The topic of this thesis is a mathematical treatment of Anderson's orthogonality catastrophe. Named after P.W. Anderson, who studied the phenomenon in the late 1960s, the catastrophe is an intrinsic effect in Fermi gases. In his first work on the topic in [Phys. Rev. Lett. 18:1049--1051], Anderson studied a system of noninteracting fermions in three space dimensions and found the ground state to be asymptotically orthogonal to the ground state of the same system perturbed by a finite-range scattering potential.
More precisely, let $\Phi_L^N$ be the $N$-body ground state of the fermionic system in a $d$-dimensional box of length $L$, and let $\Psi_L^N$ be the ground state of the corresponding system in the presence of the additional finite-range potential. Then the catastrophe brings about the asymptotic vanishing of the overlap $S_L^N := \langle \Phi_L^N, \Psi_L^N \rangle$ of the $N$-body ground states $\Phi_L^N$ and $\Psi_L^N$. The asymptotics is in the thermodynamic limit $L \to \infty$, $N \to \infty$ with fixed density $\rho := N/L^d$.
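The vanishing of the overlap can be illustrated numerically: for noninteracting fermions the $N$-body ground states are Slater determinants, so their overlap equals the determinant of the matrix of one-particle orbital overlaps. The following minimal sketch uses a 1D tight-binding lattice with a single on-site impurity as an illustrative stand-in for the continuum setting of the thesis; the impurity strength and system sizes are arbitrary choices, not values from the source.

```python
import numpy as np

def hopping(L):
    # 1D tight-binding Laplacian with Dirichlet boundary conditions
    H = np.zeros((L, L))
    for i in range(L - 1):
        H[i, i + 1] = H[i + 1, i] = -1.0
    return H

def overlap_sq(L, N, strength=5.0):
    H0 = hopping(L)
    H1 = H0.copy()
    H1[L // 2, L // 2] += strength   # finite-range (here: rank-one) impurity
    _, U = np.linalg.eigh(H0)        # columns: one-particle orbitals, ascending energy
    _, V = np.linalg.eigh(H1)
    # The N-body ground states are Slater determinants of the N lowest orbitals;
    # their overlap is the determinant of the occupied-orbital Gram matrix.
    M = U[:, :N].T @ V[:, :N]
    return np.linalg.det(M) ** 2

# fixed density N/L = 1/4, growing box: the squared overlap shrinks with L
vals = [overlap_sq(L, L // 4) for L in (40, 80, 160)]
```

Doubling the box at fixed density visibly suppresses the squared overlap, which is the finite-size shadow of the algebraic decay $|S_L^N|^2 \lesssim L^{-\tilde{\gamma}}$ discussed below.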
In [Commun. Math. Phys. 329:979--998], the overlap has been bounded from above with an asymptotic bound of the form $|S_L^N|^2 \lesssim L^{-\tilde{\gamma}}$. The decay exponent $\tilde{\gamma}$ there corresponds to the one of Anderson in [Phys. Rev. Lett. 18:1049--1051]. Another publication by Anderson from the same year, [Phys. Rev. 164:352--359], contains the exact asymptotics with a bigger exponent $\gamma$.
This thesis features a step towards the exact asymptotics. We prove a bound with an exponent that corresponds in a certain sense to the one in [Phys. Rev. 164:352--359] and improves upon the one in [Commun. Math. Phys. 329:979--998]. We use the methods from [Commun. Math. Phys. 329:979--998], but treat every term in a series expansion of $S_L^N$ instead of only the first one. Treating the higher-order terms requires additional arguments, since the trace expressions that occur are no longer necessarily nonnegative, which complicates some of the estimates.
The main contents of this thesis will also be published in a forthcoming article co-authored with Martin Gebert, Peter Müller, and Peter Otte.
The exponent in the orthogonality catastrophe for Fermi gases
We quantify the asymptotic vanishing of the ground-state overlap of two non-interacting Fermi gases in $d$-dimensional Euclidean space in the thermodynamic limit. Given two one-particle Schr\"odinger operators in finite volume which differ by a compactly supported bounded potential, we prove a power-law upper bound on the ground-state overlap of the corresponding non-interacting $N$-particle systems. We interpret the decay exponent $\gamma$ in terms of scattering theory and find $\gamma = \pi^{-2} \|\arcsin|T_E/2|\|_{\mathrm{HS}}^2$, where $T_E$ is the transition matrix at the Fermi energy $E$. This exponent reduces to the one predicted by Anderson [Phys. Rev. 164, 352--359 (1967)] for the exact asymptotics in the special case of a repulsive point-like perturbation.
Comment: version to appear in J. Spectr. Theory; references updated
Learning with AMIGo: Adversarially Motivated Intrinsic Goals
A key challenge for reinforcement learning (RL) consists of learning in
environments with sparse extrinsic rewards. In contrast to current RL methods,
humans are able to learn new skills with little or no reward by using various
forms of intrinsic motivation. We propose AMIGo, a novel agent incorporating --
as a form of meta-learning -- a goal-generating teacher that proposes
Adversarially Motivated Intrinsic Goals to train a goal-conditioned "student"
policy in the absence of (or alongside) environment reward. Specifically,
through a simple but effective "constructively adversarial" objective, the
teacher learns to propose increasingly challenging -- yet achievable -- goals
that allow the student to learn general skills for acting in a new environment,
independent of the task to be solved. We show that our method generates a
natural curriculum of self-proposed goals which ultimately allows the agent to
solve challenging procedurally-generated tasks where other forms of intrinsic
motivation and state-of-the-art RL methods fail.
Comment: 18 pages, 6 figures, published at The Ninth International Conference on Learning Representations (2021)
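The "constructively adversarial" objective described above can be sketched as a simple reward rule: the teacher is rewarded only when the student reaches the proposed goal, and only when doing so takes at least a threshold number of steps, so goals stay challenging yet achievable. The reward constants and the threshold-update rule below are illustrative assumptions, not the paper's exact hyperparameters.

```python
def teacher_reward(reached, steps, t_star, alpha=1.0, beta=1.0):
    """Constructively adversarial signal for the goal-generating teacher:
    positive only if the student reached the goal after at least t_star steps
    (hard enough, yet achievable); negative if the goal was too easy or
    unreachable."""
    if reached and steps >= t_star:
        return alpha
    return -beta

def update_threshold(t_star, reached, increment=1):
    # Illustrative curriculum rule: raise the difficulty threshold
    # whenever the student succeeds, producing increasingly hard goals.
    return t_star + increment if reached else t_star
```

Iterating these two rules yields the natural curriculum of self-proposed goals that the abstract describes: as the student improves, only goals requiring longer solutions remain rewarding for the teacher.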
PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them
Open-domain Question Answering models which directly leverage question-answer
(QA) pairs, such as closed-book QA (CBQA) models and QA-pair retrievers, show
promise in terms of speed and memory compared to conventional models which
retrieve and read from text corpora. QA-pair retrievers also offer
interpretable answers, a high degree of control, and are trivial to update at
test time with new knowledge. However, these models lack the accuracy of
retrieve-and-read systems, as substantially less knowledge is covered by the
available QA-pairs relative to text corpora like Wikipedia. To facilitate
improved QA-pair models, we introduce Probably Asked Questions (PAQ), a very
large resource of 65M automatically-generated QA-pairs. We introduce a new
QA-pair retriever, RePAQ, to complement PAQ. We find that PAQ preempts and
caches test questions, enabling RePAQ to match the accuracy of recent
retrieve-and-read models, whilst being significantly faster. Using PAQ, we
train CBQA models which outperform comparable baselines by 5%, but trail RePAQ
by over 15%, indicating the effectiveness of explicit retrieval. RePAQ can be
configured for size (under 500MB) or speed (over 1K questions per second)
whilst retaining high accuracy. Lastly, we demonstrate RePAQ's strength at
selective QA, abstaining from answering when it is likely to be incorrect. This
enables RePAQ to "back-off" to a more expensive state-of-the-art model,
leading to a combined system which is both more accurate and 2x faster than the
state-of-the-art model alone.
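The selective-QA back-off described above amounts to a confidence-gated dispatch: answer with the cheap QA-pair retriever when it is confident, otherwise fall back to the expensive reader. A minimal sketch, with toy stand-ins for the RePAQ-style and retrieve-and-read-style models (the function names and threshold are illustrative, not from the paper):

```python
def answer_with_backoff(question, fast_model, slow_model, threshold=0.5):
    """Selective QA: return the fast model's answer if its confidence
    clears the threshold, else back off to the slow, more accurate model."""
    answer, confidence = fast_model(question)
    if confidence >= threshold:
        return answer, "fast"
    return slow_model(question), "slow"

# toy stand-ins: the fast model is confident only on questions it has cached
fast = lambda q: (("Paris", 0.9) if "France" in q else ("unknown", 0.1))
slow = lambda q: "42"

a1 = answer_with_backoff("What is the capital of France?", fast, slow)
a2 = answer_with_backoff("What is the meaning of life?", fast, slow)
```

Tuning the threshold trades speed for accuracy: a higher threshold routes more questions to the expensive model, which is how the combined system in the abstract gains accuracy while staying faster overall.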
Grounding Aleatoric Uncertainty in Unsupervised Environment Design
Adaptive curricula in reinforcement learning (RL) have proven effective for
producing policies robust to discrepancies between the train and test
environment. Recently, the Unsupervised Environment Design (UED) framework
generalized RL curricula to generating sequences of entire environments,
leading to new methods with robust minimax regret properties. Problematically,
in partially-observable or stochastic settings, optimal policies may depend on
the ground-truth distribution over aleatoric parameters of the environment in
the intended deployment setting, while curriculum learning necessarily shifts
the training distribution. We formalize this phenomenon as curriculum-induced
covariate shift (CICS), and describe how its occurrence in aleatoric parameters
can lead to suboptimal policies. Directly sampling these parameters from the
ground-truth distribution avoids the issue, but thwarts curriculum learning. We
propose SAMPLR, a minimax regret UED method that optimizes the ground-truth
utility function, even when the underlying training data is biased due to CICS.
We prove, and validate on challenging domains, that our approach preserves
optimality under the ground-truth distribution, while promoting robustness
across the full range of environment settings.
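Curriculum-induced covariate shift over an aleatoric parameter can be made concrete with a toy one-step decision problem (this example is an illustration of CICS, not of the SAMPLR algorithm itself): a policy that best-responds to the curriculum's shifted coin bias is suboptimal under the ground-truth bias at deployment.

```python
def expected_return(action, p_heads):
    # one-step game: betting "heads" pays 1 on heads, "tails" pays 1 on tails
    return p_heads if action == "heads" else 1.0 - p_heads

def greedy_policy(p_train):
    # best response to the training distribution over the aleatoric coin
    return max(("heads", "tails"), key=lambda a: expected_return(a, p_train))

p_true = 0.8   # ground-truth aleatoric parameter at deployment
p_curr = 0.2   # curriculum-shifted training distribution (the covariate shift)

pi_biased = greedy_policy(p_curr)    # optimal for the curriculum only
pi_correct = greedy_policy(p_true)   # optimal under the ground truth

# regret of the curriculum-trained policy, evaluated on the ground truth
regret = expected_return(pi_correct, p_true) - expected_return(pi_biased, p_true)
```

The nonzero regret is exactly the failure mode the abstract formalizes: the curriculum changed which action looks best, even though only the aleatoric parameter's distribution was shifted.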
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Large pre-trained language models have been shown to store factual knowledge
in their parameters, and achieve state-of-the-art results when fine-tuned on
downstream NLP tasks. However, their ability to access and precisely manipulate
knowledge is still limited, and hence on knowledge-intensive tasks, their
performance lags behind task-specific architectures. Additionally, providing
provenance for their decisions and updating their world knowledge remain open
research problems. Pre-trained models with a differentiable access mechanism to
explicit non-parametric memory can overcome this issue, but have so far been
only investigated for extractive downstream tasks. We explore a general-purpose
fine-tuning recipe for retrieval-augmented generation (RAG) -- models which
combine pre-trained parametric and non-parametric memory for language
generation. We introduce RAG models where the parametric memory is a
pre-trained seq2seq model and the non-parametric memory is a dense vector index
of Wikipedia, accessed with a pre-trained neural retriever. We compare two RAG
formulations, one which conditions on the same retrieved passages across the
whole generated sequence, the other can use different passages per token. We
fine-tune and evaluate our models on a wide range of knowledge-intensive NLP
tasks and set the state-of-the-art on three open domain QA tasks, outperforming
parametric seq2seq models and task-specific retrieve-and-extract architectures.
For language generation tasks, we find that RAG models generate more specific,
diverse and factual language than a state-of-the-art parametric-only seq2seq
baseline.
Comment: Accepted at NeurIPS 2020
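The two RAG formulations mentioned above differ only in where the marginalization over retrieved passages happens: conditioning the whole sequence on one passage sums over passages outside the product over tokens, while per-token conditioning sums inside it. A toy numerical sketch with made-up probabilities (two passages, two tokens) shows the two marginalizations genuinely differ:

```python
import math

# toy numbers: retrieval prior over two passages, per-token likelihoods
p_z = [0.6, 0.4]      # p(z | x) for passages z = 0, 1
p_tok = [             # p(y_i | x, z, y_<i) for two generated tokens
    [0.9, 0.8],       # under passage z = 0
    [0.2, 0.7],       # under passage z = 1
]

# "same passage for the whole sequence": sum over z of the full-sequence product
rag_sequence = sum(p_z[z] * p_tok[z][0] * p_tok[z][1] for z in range(2))

# "different passage per token": product over tokens of per-token mixtures
rag_token = math.prod(
    sum(p_z[z] * p_tok[z][i] for z in range(2)) for i in range(2)
)
```

Because the sum and the product do not commute, the two formulations assign different sequence likelihoods, which is why the paper evaluates them separately.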