Search CORE

33 research outputs found

Paq: 65 million probably-asked questions and what you can do with them

Author: Küttler H
Lewis P
Liu L
Minervini P
Piktus A
Riedel S
Stenetorp P
Wu Y
Publication venue
Publication date: 07/10/2021
Field of study

Open-domain Question Answering models that directly leverage question-answer (QA) pairs, such as closed-book QA (CBQA) models and QA-pair retrievers, show promise in terms of speed and memory compared with conventional models which retrieve and read from text corpora. QA-pair retrievers also offer interpretable answers, a high degree of control, and are trivial to update at test time with new knowledge. However, these models fall short of the accuracy of retrieve-and-read systems, as substantially less knowledge is covered by the available QA-pairs relative to text corpora like Wikipedia. To facilitate improved QA-pair models, we introduce Probably Asked Questions (PAQ), a very large resource of 65M automatically generated QA-pairs. We introduce a new QA-pair retriever, RePAQ, to complement PAQ. We find that PAQ preempts and caches test questions, enabling RePAQ to match the accuracy of recent retrieve-and-read models, whilst being significantly faster. Using PAQ, we train CBQA models which outperform comparable baselines by 5%, but trail RePAQ by over 15%, indicating the effectiveness of explicit retrieval. RePAQ can be configured for size (under 500MB) or speed (over 1K questions per second) while retaining high accuracy. Lastly, we demonstrate RePAQ’s strength at selective QA, abstaining from answering when it is likely to be incorrect. This enables RePAQ to ‘‘back-off’’ to a more expensive state-of-the-art model, leading to a combined system which is both more accurate and 2x faster than the state-of-the-art model alone

UCL Discovery

MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions

Author: Küttler H
Miller AH
Nardelli N
Pineau J
Rabbat M
Riedel S
Rocktäschel T
Sivakumar V
Publication venue: ML for Systems
Publication date
Field of study

Effective network congestion control strategies are key to keeping the Internet (or any large computer network) operational. Network congestion control has been dominated by hand-crafted heuristics for decades. Recently, ReinforcementLearning (RL) has emerged as an alternative to automatically optimize such control strategies. Research so far has primarily considered RL interfaces which block the sender while an agent considers its next action. This is largely an artifact of building on top of frameworks designed for RL in games (e.g. OpenAI Gym). However, this does not translate to real-world networking environments, where a network sender waiting on a policy without sending data leads to under-utilization of bandwidth. We instead propose to formulate congestion control with an asynchronous RL agent that handles delayed actions. We present MVFST-RL, a scalable framework for congestion control in the QUIC transport protocol that leverages state-of-the-art in asynchronous RL training with off-policy correction. We analyze modeling improvements to mitigate the deviation from Markovian dynamics, and evaluate our method on emulated networks from the Pantheon benchmark platform. The source code is publicly available at https://github.com/facebookresearch/mvfst-rl

UCL Discovery

CFD investigation of a complete floating offshore wind turbine

Author: BJ Leimkuhler
C Farhat
E Marino
F Dehaeze
F Dehaeze
F Rieper
G Bilotta
J Degroote
J Morison
J Vierendeels
L Roald
M Carrión
M Carrión
M Carrión
M Carrión
M Gomez-Gesteira
M Jarkowski
M Karimirad
M Karimirad
MA Fernández
O Axelsson
O Zienkiewicz
PE Nikravesh
R Steijl
S Gómez-Iradi
S Osher
SC Eisenstat
T Belytschko
TJ Larsen
U Küttler
V Leble
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

This chapter presents numerical computations for floating offshore wind turbines for a machine of 10-MW rated power. The rotors were computed using the Helicopter Multi-Block flow solver of the University of Glasgow that solves the Navier-Stokes equations in integral form using the arbitrary Lagrangian-Eulerian formulation for time-dependent domains with moving boundaries. Hydrodynamic loads on the support platform were computed using the Smoothed Particle Hydrodynamics method. This method is mesh-free, and represents the fluid by a set of discrete particles. The motion of the floating offshore wind turbine is computed using a Multi-Body Dynamic Model of rigid bodies and frictionless joints. Mooring cables are modelled as a set of springs and dampers. All solvers were validated separately before coupling, and the loosely coupled algorithm used is described in detail alongside the obtained results

Crossref

Springer - Publisher Connector

Enlighten

Insights from the NeurIPS 2021 NetHack Challenge

Author: Babaev D
Byeon M
Chakraborty D
Grefenstette E
Hambro E
Jiang M
Jo D
Kanervisto A
Kim J
Kim S
Kirk R
Kurin V
Kwon T
Küttler H
Lee D
Mella V
Mohanty S
Nardelli N
Nazarov I
Ovsov N
Parker-Holder J
Raileanu R
Ramanauskas K
Rocktäschel T
Rothermel D
Samvelyan M
Sorokin D
Sypetkowski M
Sypetkowski M
Publication venue: Proceedings of Machine Learning Research (PMLR)
Publication date: 01/01/2022
Field of study

In this report, we summarize the takeaways from the first NeurIPS 2021 NetHack Challenge. Participants were tasked with developing a program or agent that can win (i.e., ‘ascend’ in) the popular dungeon-crawler game of NetHack by interacting with the NetHack Learning Environment (NLE), a scalable, procedurally generated, and challenging Gym environment for reinforcement learning (RL). The challenge showcased community-driven progress in AI with many diverse approaches significantly beating the previously best results on NetHack. Furthermore, it served as a direct comparison between neural (e.g., deep RL) and symbolic AI, as well as hybrid systems, demonstrating that on NetHack symbolic bots currently outperform deep RL by a large margin. Lastly, no agent got close to winning the game, illustrating NetHack’s suitability as a long-term benchmark for AI research

arXiv.org e-Print Archive

UCL Discovery