Search CORE

3,549 research outputs found

Multi-Task Policy Search for Robotics

Author: Deisenroth MP
Englert P
Fox D
Peters J
Publication venue
Publication date: 01/01/2014
Field of study

© 2014 IEEE.Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for continuous task variations, requiring more principled approaches to share and transfer knowledge among similar tasks. We present a novel approach for learning a nonlinear feedback policy that generalizes across multiple tasks. The key idea is to define a parametrized policy as a function of both the state and the task, which allows learning a single policy that generalizes across multiple known and unknown tasks. Applications of our novel approach to reinforcement and imitation learning in realrobot experiments are shown

TUbiblio

Crossref

Spiral - Imperial College Digital Repository

MPG.PuRe

Multiple-Target Reinforcement Learning with a Single Policy

Author: Deisenroth MP
Fox D
Publication venue
Publication date: 31/07/2011
Field of study

Spiral - Imperial College Digital Repository

Learning to Control a Low-Cost Manipulator using Data-Efficient Reinforcement Learning

Author: Deisenroth MP
Fox D
Rasmussen CE
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2011
Field of study

Over the last years, there has been substantial progress in robust manipulation in unstructured environments. The long-term goal of our work is to get away from precise, but very expensive robotic systems and to develop affordable, potentially imprecise, self-adaptive manipulator systems that can interactively perform tasks such as playing with children. In this paper, we demonstrate how a low-cost off-the-shelf robotic system can learn closed-loop policies for a stacking task in only a handful of trials-from scratch. Our manipulator is inaccurate and provides no pose feedback. For learning a controller in the work space of a Kinect-style depth camera, we use a model-based reinforcement learning technique. Our learning method is data efficient, reduces model bias, and deals with several noise sources in a principled way during long-term planning. We present a way of incorporating state-space constraints into the learning process and analyze the learning gain by exploiting the sequential structure of the stacking task

CiteSeerX

Spiral - Imperial College Digital Repository

Multi-Task Policy Search

Author: Deisenroth MP
Englert P
Fox D
Peters J
Publication venue
Publication date: 31/12/2013
Field of study

Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for continuous task variations, requiring more principled approaches to share and transfer knowledge among similar tasks. We present a novel approach for learning a nonlinear feedback policy that generalizes across multiple tasks. The key idea is to define a parametrized policy as a function of both the state and the task, which allows learning a single policy that generalizes across multiple known and unknown tasks. Applications of our novel approach to reinforcement and imitation learning in real-robot experiments are shown

arXiv.org e-Print Archive

Spiral - Imperial College Digital Repository

Do HIV treatment eligibility expansions crowd out the sickest? Evidence from rural South Africa

Author: Bor J
Bärnighausen T
Fox MP
Kluberg SA
LaValley M
Pillay D
Publication venue
Publication date: 01/09/2018
Field of study

OBJECTIVE: The 2015 WHO recommendation to initiate all HIV patients on antiretroviral therapy (ART) at diagnosis could potentially overextend health systems and crowd out sicker patients, mitigating the policy's impact. We evaluate whether South Africa's prior eligibility expansion from CD4 ≤200 to CD4 ≤350 cells/μL reduced ART uptake in the sickest patients. METHODS: Using data on all patients presenting to the Hlabisa HIV Treatment and Care Program in KwaZulu-Natal from April 2010 - June 2012 (n=13,809), we assessed the impact of the August 2011 eligibility expansion on the number of patients seeking care, number initiating ART, and time from HIV diagnosis to ART initiation among patients always eligible (CD4 0-200), newly eligible (CD4 201-350), and not yet eligible by CD4 count (>350). We used interrupted time series methods to control for long-run trends and isolate the effect of the policy. RESULTS: Expanding ART eligibility led to an increased number of patients initiating ART per month [+95.5; 95% CI (-1.3; 192.3)]. Newly eligible patients (CD4 201-350) initiated treatment 47% faster than before (95% CI 19%; 82%), while the sickest patients (CD4 ≤200) saw no decline in the monthly number of patients initiating treatment or the rate of treatment uptake. CONCLUSION: The Hlabisa program successfully extended ART to patients with CD4 ≤350 cells/μL, while ensuring that the sickest patients did not experience delays in ART initiation. Treatment programs must be vigilant to maintain quality of care for the sickest as countries move to treat all patients irrespective of CD4 count. This article is protected by copyright. All rights reserved

UCL Discovery

Building governance and energy efficiency: Mapping the interdisciplinary challenge

Author: AB Jaffe
David Weatherall
E Fox
E Hirst
FF Sniehotta
H Tajfel
I Ajzen
JM Darley
K Raworth
KB Janda
L Lutzenhiser
M Crozier
MP Lefeuvre
RJ Sampson
S Bright
S Sorrell
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Improving the energy efficiency of multi-owned properties (MoPs)—commonly known as apartment or condominium buildings—is central to the achievement of European energy targets. However, little work to date has focused on how to facilitate retrofit in this context. Drawing on interdisciplinary Social Sciences and Humanities expertise in academia, policy and practice, this chapter posits that decision-making processes within MoPs might provide a key to the retrofit challenge. Existing theories or models of decision-making, applied in the MoP context, might help to explain how collective retrofit decisions are taken—or overlooked. Insights from case studies and practitioners are also key. Theories of change might then be employed to develop strategies to facilitate positive retrofit decisions. The chapter maps the issues and sets an agenda for further interdisciplinary research in this novel area

Crossref

Enlighten

Imputing HIV treatment start dates from routine laboratory data in South Africa: a validation study

Author: C Handford
CF Houlihan
Cheryl Hendrickson
Deenan Pillay
F Fritz
Ian Sanne
J Bor
JA Blaya
Jacob Bor
LA Green
M Forster
Matthew P Fox
Mhairi Maskew
MP Fox
MP Fox
N Menachemi
Sergio Carmona
Till Bärnighausen
UNAIDS
Wendy Stevens
William MacLeod
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Diagnosing Dementia in the Clinical Setting: Can Amyloid PET Provide Additional Value Over Cerebrospinal Fluid?

Author: Barnes A
Bomanji JB
Dickson J
Fox NC
Kayani I
Lunn MP
Mummery CJ
Paterson RW
Rossor MN
Schott JM
Warren JD
Weston PS
Zetterberg H
Publication venue
Publication date: 27/08/2016
Field of study

Cerebrospinal fluid (CSF) measures of amyloid and tau are the first-line Alzheimer's disease biomarkers in many clinical centers. We assessed if and when the addition of amyloid PET following CSF measurements provides added diagnostic value. Twenty patients from a cognitive clinic, who had undergone detailed assessment including CSF measures, went on to have amyloid PET. The treating neurologist's working diagnosis, and degree of diagnostic certainty, was assessed both before and after the PET. Amyloid PET changed the diagnosis in 7/20 cases. Amyloid PET can provide added diagnostic value, particularly in young-onset, atypical dementias, where CSF results are borderline and diagnostic uncertainty remains

UCL Discovery

PubMed Central

Migrations and habitat use of the smooth hammerhead shark (Sphyrna zygaena) in the Atlantic Ocean

Author: AP Klimley
Bass AJ
BFJ Manly
Catarina C. Santos
CH Lam
D Rosa
DA Teets
DB Olson
DM Knip
E Cortés
E Cortés
ER Hoffmayer
F Mas
F Poisson
FG Carey
FJ Abascal
FJ Abascal
GC Hays
George Tserpes
H Dewar
H Levene
H Wickham
HW Lilliefors
J Fox
JA Musick
JD Stevens
JD Stevens
KM Diemer
LA Howey-Jordan
MD Camhi
MK Musyl
MK Musyl
MK Musyl
ML Domeier
MP Fay
MP Francis
MR Heithaus
MW Beck
N Hammerschlag
N Queiroz
N Queiroz
R Coelho
R Coelho
R Hilborn
RA Myers
Rui Coelho
S Bessudo
SC Clarke
SG Wilson
SJ Jorgensen
SL Petersen
V Buencuerpo
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/06/2018
Field of study

The smooth hammerhead shark, Sphyrna zygaena, is a cosmopolitan semipelagic shark captured as bycatch in pelagic oceanic fisheries, especially pelagic longlines targeting swordfish and/or tunas. From 2012 to 2016, eight smooth hammerheads were tagged with Pop-up Satellite Archival Tags in the inter-tropical region of the Northeast Atlantic Ocean, with successful transmissions received from seven tags (total of 319 tracking days). Results confirmed the smooth hammerhead is a highly mobile species, as the longest migration ever documented for this species (> 6600 km) was recorded. An absence of a diel vertical movement behavior was noted, with the sharks spending most of their time at surface waters (0-50 m) above 23 degrees C. The operating depth of the pelagic long-line gear was measured with Minilog Temperature and Depth Recorders, and the overlap with the species vertical distribution was calculated. The overlap is taking place mainly during the night and is higher for juveniles (similar to 40% of overlap time). The novel information presented can now be used to contribute to the provision of sustainable management tools and serve as input for Ecological Risk Assessments for smooth hammerheads caught in Atlantic pelagic longline fisheries.Oceanario de Lisboa through Project "SHARK-TAG: Migrations and habitat use of the smooth hammerhead shark in the Atlantic Ocean"; Investigador-FCT from the Portuguese Foundation for Science and Technology (FCT, Fundacao para a Ciencia e Tecnologia) [Ref: IF/00253/2014]; EU European Social Fund; Programa Operacional Potencial Human

Crossref

Directory of Open Access Journals

Sapientia

Cerebrospinal fluid metallomics in cerebral amyloid angiopathy: an exploratory analysis.

Author: Ambler G
Banerjee G
Foiani MS
Forsgard N
Fox NC
Heslegrave A
Keshavan A
Lunn MP
Paterson RW
Schott JM
Thompson EJ
Toombs J
Werring DJ
Zetterberg H
Publication venue
Publication date: 22/07/2021
Field of study

INTRODUCTION: Cerebral amyloid angiopathy (CAA) is associated with symptomatic intracerebral haemorrhage. Biomarkers of clinically silent bleeding events, such as cerebrospinal fluid (CSF) ferritin and iron, might provide novel measures of disease presence and severity. METHODS: We performed an exploratory study comparing CSF iron, ferritin, and other metal levels in patients with CAA, control subjects (CS) and patients with Alzheimer's disease (AD). Ferritin was measured using a latex fixation test; metal analyses were performed using inductively coupled plasma mass spectrometry. RESULTS: CAA patients (n = 10) had higher levels of CSF iron than the AD (n = 20) and CS (n = 10) groups (medians 23.42, 15.48 and 17.71 μg/L, respectively, p = 0.0015); the difference between CAA and AD groups was significant in unadjusted and age-adjusted analyses. We observed a difference in CSF ferritin (medians 10.10, 7.77 and 8.01 ng/ml, for CAA, AD and CS groups, respectively, p = 0.01); the difference between the CAA and AD groups was significant in unadjusted, but not age-adjusted, analyses. We also observed differences between the CAA and AD groups in CSF nickel and cobalt (unadjusted analyses). CONCLUSIONS: In this exploratory study, we provide preliminary evidence for a distinct CSF metallomic profile in patients with CAA. Replication and validation of these results in larger cohorts is needed

UCL Discovery