Search CORE

arXiv.org e-Print Archive

Optimal control as a graphical model inference problem

Author: B. Broek van den
B. Broek van den
B. Skyrms
C. A. Albers
C. Boutilier
D. Koller
D. P. Bertsekas
E. A. Theodorou
E. A. Theodorou
E. A. Theodorou
E. Todorov
E. Todorov
E. Todorov
G. Cooper
H. J. Kappen
H. J. Kappen
Hilbert J. Kappen
J. A. Bagnell
J. Kober
J. M. Mooij
J. Peters
J. Tatman
J. Yedidia
J. Yedidia
K. J. Friston
K. Murphy
M. Silva da
M. Toussaint
Manfred Opper
P. Dayan
R. D. Shachter
R. F. Stengel
S. J. Russell
S. L. Lauritzen
T. Heskes
Vicenç Gómez
W. Wiegerinck
W. Wiegerinck
W. Yoshida
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

We reformulate a class of non-linear stochastic optimal control problems introduced by Todorov (2007) as a Kullback-Leibler (KL) minimization problem. As a result, the optimal control computation reduces to an inference computation and approximate inference methods can be applied to efficiently compute approximate optimal controls. We show how this KL control theory contains the path integral control method as a special case. We provide an example of a block stacking task and a multi-agent cooperative game where we demonstrate how approximate inference can be successfully applied to instances that are too complex for exact computation. We discuss the relation of the KL control approach to other inference approaches to control.Comment: 26 pages, 12 Figures; Machine Learning Journal (2012

Association for the Advancement of Artificial Intelligence: AAAI Publications

Radboud Repository

UPF Digital Repository

Model-based contextual policy search for data-efficient generalization of robot skills

Author: Abbeel
Atkeson
Bagnell
Bagnell
Baxter
Boyd
da Silva
Daniel
Deisenroth
Deisenroth
Deisenroth
Deisenroth
Deisenroth
Englert
Gams
Grollman
Ijspeert
Ko
Kober
Kober
Kober
Kohl
Kormushev
Kupcsik
Lens
Muelling
Neumann
Neumann
Ng
Peters
Peters
Rasmussen
Rückstieß
Schneider
Sehnke
Snelson
Sutton
Theodorou
Titsias
Ude
Wierstra
Williams
Yi
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014
Field of study

In robotics, lower-level controllers are typically used to make the robot solve a specific task in a fixed context. For example, the lower-level controller can encode a hitting movement while the context defines the target coordinates to hit. However, in many learning problems the context may change between task executions. To adapt the policy to a new context, we utilize a hierarchical approach by learning an upper-level policy that generalizes the lower-level controllers to new contexts. A common approach to learn such upper-level policies is to use policy search. However, the majority of current contextual policy search approaches are model-free and require a high number of interactions with the robot and its environment. Model-based approaches are known to significantly reduce the amount of robot experiments, however, current model-based techniques cannot be applied straightforwardly to the problem of learning contextual upper-level policies. They rely on specific parametrizations of the policy and the reward function, which are often unrealistic in the contextual policy search formulation. In this paper, we propose a novel model-based contextual policy search algorithm that is able to generalize lower-level controllers, and is data-efficient. Our approach is based on learned probabilistic forward models and information theoretic policy search. Unlike current algorithms, our method does not require any assumption on the parametrization of the policy or the reward function. We show on complex simulated robotic tasks and in a real robot experiment that the proposed learning framework speeds up the learning process by up to two orders of magnitude in comparison to existing methods, while learning high quality policies

University of Lincoln Institutional Repository

TUbiblio

Spiral - Imperial College Digital Repository

UCL Discovery

MPG.PuRe

Complete abdominal aortic aneurysm thrombosis and obstruction of both common iliac arteries with intrathrombotic pressures demonstrating a continuing risk of rupture: a case report and review of the literature

Author: Aikaterini Kotzadimitriou
Andreas Manouras
DA Vorp
DH Wang
Dimitrios Theodorou
E Di Martino
Emmanuel E Lagoudianakis
GW Schurink
H Takagi
Haridimos Markogiannakis
J Satta
J Stenbaek
JH Kwaan
JJ Ricotta
Konstantinos A Filis
Konstantinos Bramis
Konstantinos Xiromeritis
M Kazi
MA Leke
Nikolaos Koronakis
S Dalal
SS Hans
WC Pevec
YG Wolf
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

University of Lincoln Institutional Repository

Learning modular policies for robotics

Author: Alexandros Paraschos
Andras Kupcsik
Butterfield
Calinon
Chiappa
Christian Daniel
da Silva
Daniel
Daniel
Daniel
dAvella
Dempster
Gerhard Neumann
Ghavamzadeh
Ijspeert
Jan Peters
Khansari-Zadeh
Kober
Kober
Kober
Kormushev
Kulic
Kupcsik
Lazaric
Meier
Morimoto
Neal
Neumann
Neumann
Niekum
Paraschos
Peters
Peters
Peters
Rasmussen
Rozo
Schaal
Snelson
Stulp
Stulp
Theodorou
Todorov
Toussaint
Ude
Vlassis
Williams
Williams
Ãlvarez
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2014
Field of study

A promising idea for scaling robot learning to more complex tasks is to use elemental behaviors as building blocks to compose more complex behavior. Ideally, such building blocks are used in combination with a learning algorithm that is able to learn to select, adapt, sequence and co-activate the building blocks. While there has been a lot of work on approaches that support one of these requirements, no learning algorithm exists that unifies all these properties in one framework. In this paper we present our work on a unified approach for learning such a modular control architecture. We introduce new policy search algorithms that are based on information-theoretic principles and are able to learn to select, adapt and sequence the building blocks. Furthermore, we developed a new representation for the individual building block that supports co-activation and principled ways for adapting the movement. Finally, we summarize our experiments for learning modular control architectures in simulation and with real robots

TUbiblio

Directory of Open Access Journals

Frontiers - Publisher Connector

Systematic review and meta-analysis of the diagnostic accuracy of ultrasonography for deep vein thrombosis

Author: A Cogo
A Cogo
A Elias
A Elias
A Gongolo
A Markel
A Miselli
AF Aburahma
AG Aitken
AJ Comerota
AJ Comerota
AJ Sutton
Alex Sutton
AT Irvine
AW Lensing
B Kassai
B van Ramshorst
B Wolf
BD Lewis
BG Birdwell
BG Birdwell
BN Raghavendra
BS Hwang
C Kearon
C Savy-Stortz
D Becker
D Cavaye
D Gaitini
DA Kristo
DC Mitchell
DH O'Leary
DL Rollins
DP Flanigan
DR Anderson
E Bernardi
E Kalodiki
Edwin van Beek
EK Yucel
F Dany
F Pasquariello
FC Sampson
Fiona Sampson
FJ Song
G Guazzaloca
GJ Zhang
GM Baxter
GM Baxter
GP Shields
GP Size
GR Simons
GV Belcaro
H Bounameaux
H Heijboer
H Heijboer
H Rosier
HJ Aronen
HO Leven
I Baumgartner
J De Laveaucoupet
J Howe
J Yao
J Zamora
JC De Valois
JE George
JE George
JF Chance
JG Lijmer
JJ Cranley
JJ Cronan
JJ Deeks
JM Schindler
JP Carpenter
JP Fletcher
JP Laissy
JPJ Archie
JS Ginsberg
K Forbes
K Ouriel
K Walsh
LA Killewich
LW Tick
M Atri
M Dauzat
M Grobety
M Hay
M Mantoni
M Monreal
M Sluzewski
MA Amin
MA Mattos
MCG Holmes
ME McCandless
MJ Bradley
MK Eskandari
MK Zhou
N Labropoulos
N Miller
NH Rosner
OM Pedersen
P De Faucal
P Vogel
PJ Bendick
PL Robertson
PL Robertson
PR Biondetti
PS Wells
PS Wells
PS Wells
PS Wells
PS Wells
PT Appelman
PT Kennedy
R Lindqvist
R Mani
R Poti
R Puls
R Quintavalla
RA Bucek
RA Kraaijenhagen
RB Patterson
SC Rose
SG Thompson
SJ Theodorou
SM Bates
SM Schellong
SM Stevens
Steve Goodacre
Steve Thomas
SW Heim
TE Gudmundsen
TL Turnbull
W Habscheid
W Wysokinski
WD Foley
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

Background Ultrasound (US) has largely replaced contrast venography as the definitive diagnostic test for deep vein thrombosis (DVT). We aimed to derive a definitive estimate of the diagnostic accuracy of US for clinically suspected DVT and identify study-level factors that might predict accuracy. Methods We undertook a systematic review, meta-analysis and meta-regression of diagnostic cohort studies that compared US to contrast venography in patients with suspected DVT. We searched Medline, EMBASE, CINAHL, Web of Science, Cochrane Database of Systematic Reviews, Cochrane Controlled Trials Register, Database of Reviews of Effectiveness, the ACP Journal Club, and citation lists (1966 to April 2004). Random effects meta-analysis was used to derive pooled estimates of sensitivity and specificity. Random effects meta-regression was used to identify study-level covariates that predicted diagnostic performance. Results We identified 100 cohorts comparing US to venography in patients with suspected DVT. Overall sensitivity for proximal DVT (95% confidence interval) was 94.2% (93.2 to 95.0), for distal DVT was 63.5% (59.8 to 67.0), and specificity was 93.8% (93.1 to 94.4). Duplex US had pooled sensitivity of 96.5% (95.1 to 97.6) for proximal DVT, 71.2% (64.6 to 77.2) for distal DVT and specificity of 94.0% (92.8 to 95.1). Triplex US had pooled sensitivity of 96.4% (94.4 to 97.1%) for proximal DVT, 75.2% (67.7 to 81.6) for distal DVT and specificity of 94.3% (92.5 to 95.8). Compression US alone had pooled sensitivity of 93.8 % (92.0 to 95.3%) for proximal DVT, 56.8% (49.0 to 66.4) for distal DVT and specificity of 97.8% (97.0 to 98.4). Sensitivity was higher in more recently published studies and in cohorts with higher prevalence of DVT and more proximal DVT, and was lower in cohorts that reported interpretation by a radiologist. Specificity was higher in cohorts that excluded patients with previous DVT. No studies were identified that compared repeat US to venography in all patients. Repeat US appears to have a positive yield of 1.3%, with 89% of these being confirmed by venography. Conclusion Combined colour-doppler US techniques have optimal sensitivity, while compression US has optimal specificity for DVT. However, all estimates are subject to substantial unexplained heterogeneity. The role of repeat scanning is very uncertain and based upon limited data

Directory of Open Access Journals

Edinburgh Research Explorer

White Rose Research Online

Leicester Research Archive

Overcoming Ostrea edulis seed production limitations to meet ecosystem restoration demands in the UN decade on restoration

Author: Bakker Nienke
Blanco Ainhoa
Bonačić Kruno
Boudry Pierre
Brundu Gianni
Cameron Tom C.
Colsoul Bérenger
Connellan Iarfhlaith
Da costa Fiz
Debney Alison
Ermgassen Philine S.e. Zu
Fabra Monica
Frankic Anamarija
Gamble Celine
Gray Mathew W.
Helmer Luke
Holbrook Zoë
Hugh-Jones Tristan
Kamermans Pauline
Magnesen Thorolf
Nielsen Pernille
Preston Joanne
Ranger Christopher J.
Saurel Camille
Smyth David
Stechele Brecht
Strand Åsa
Theodorou John A.
Publication venue: 'EDP Sciences'
Publication date: 01/01/2023
Field of study

The European flat oyster, Ostrea edulis, is a habitat-forming bivalve which was historically widespread throughout Europe. Following its decline due to overfishing, pollution, sedimentation, invasive species, and disease, O. edulis and its beds are now listed as a threatened and/or declining species and habitat by OSPAR. Increasing recognition of the plight of the oyster, alongside rapidly developing restoration techniques and growing interest in marine restoration, has resulted in a recent and rapid growth in habitat restoration efforts. O. edulis seed supply is currently a major bottleneck in scaling up habitat restoration efforts in Europe. O. edulis has been cultured for centuries, however, research into its culture declined following the introduction of the Pacific oyster, Crassostrea gigas to Europe in the early 1970 s. Recent efforts to renew both hatchery and pond production of O. edulis seed for habitat restoration purposes are hampered by restoration project timelines and funding typically being short, or projects not planning appropriately for the timescales required for investment, research-and-development and delivery of oyster seed by commercial producers. Furthermore, funding for restoration is intermittent, making long-term commitments between producers and restoration practitioners difficult. Long-term, strategic investment in research and production are needed to overcome these bottlenecks and meet current ambitious restoration targets across Europe

EDP Sciences OAI-PMH repository (1.2.0)

ArchiMer - Institutional Archive of Ifremer

Edinburgh Research Explorer

Electronic Publication Information Center

Digital.CSIC

Online Research Database In Technology

Recent translational research: Oncogene discovery by insertional mutagenesis gets a new boost

Author: A Marchetti
AC Spradling
AH Lund
AJ Dupuy
CA MacArthur
D Gallahan
DA Gray
FK Johansson
FK Johansson
FS Lee
G Peters
G Peters
GM Shackleford
H Mikkers
H Mikkers
H Roelink
J Jonkers
J Li
John Hilkens
KM Erny
L Girard
LS Collier
M Garcia
MS Shin
R Nusse
RR Zwaal
RS Mitchell
S Ding
T Suzuki
V Theodorou
W Lowther
WS Hayward
Publication venue: BioMed Central
Publication date: 16/01/2006
Field of study

Knowledge of the genes and genetic pathways involved in onco-genesis is essential if we are to identify novel targets for cancer therapy. Insertional mutagenesis in mouse models is among the most efficient tools to detect novel cancer genes. Retrovirus-mediated insertional mutagenesis received a tremendous boost by the availability of the mouse genome sequence and new PCR methods. Application of such advances were limited to lympho-magenesis but are now also being applied to mammary tumourigenesis. Novel transposons that allow insertional muta-genesis studies to be conducted in tumors of any mouse tissue may give cancer gene discovery a further boost

CiteSeerX

GPs' opinions of public and industrial information regarding drugs: a cross-sectional study

Author: A Fretheim
A Mason
AJ Fugh-Berman
B Wettermark
BD Montgomery
Cecilia Björkelund
D Korenstein
D Lea
D Sackett
DA Henry
DL Roter
DL Sackett
Evidence-Based Medicine Working Group
F Caamano
F Sjöqvist
G Eliasson
GK Spurling
H Prosser
I Skoglund
II ABLA
Ingmarie Skoglund
J Avorn
J Lexchin
J Schramm
JP Kassirer
JS Ross
Kirsten Mehlig
L Tobin
M Angell
M Jägestedt
M Theodorou
Margareta Möller
MR Layton
P McGettigan
P Örn
Ronny Gunnarsson
RP Sequeira
S Vancelik
SBU
Snijders
TS Caudill
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Background: General Practitioners {GP} in Sweden prescribe more than 50% of all prescriptions. Scientific knowledge on the opinions of GPs regarding drug information has been sparse. Such knowledge could be valuable when designing evidence-based drug information to GPs. GPs' opinions on public- and industry-provided drug information are presented in this article. Methods: A cross-sectional study using a questionnaire was answered by 368 GPs at 97 primary-health care centres {PHCC}. The centres were invited to participate by eight out of 29 drug and therapeutic committees {DTCs}. A multilevel model was used to analyse associations between opinions of GPs regarding drug information and whether the GPs worked in public sector or in a private enterprise, their age, sex, and work experience. PHCC and geographical area were included as random effects. Results: About 85% of the GPs perceived they received too much information from the industry, that the quality of public information was high and useful, and that the main task of public authorities was to increase the GPs' knowledge of drugs. Female GPs valued information from public authorities to a much greater extent than male GPs. Out of the GPs, 93% considered the main task of the industry was to promote sales. Differences between the GPs' opinions between PHCCs were generally more visible than differences between areas. Conclusions: Some kind of incentives could be considered for PHCCs that actively reduce drug promotion from the industry. That female GPs valued information from public authorities to a much greater extent than male GPs should be taken into consideration when designing evidence-based drug information from public authorities to make implementation easier

ResearchOnline@JCU

ResearchOnline at James Cook University

arXiv.org e-Print Archive

Thermodynamics as a theory of decision-making with information processing costs

Author: Başar T
Bellman RE
Bishop CM
Braun DA
Callen HB
Camerer C
Daw ND
de Finetti B
Feynman RP
Gigerenzer G
Gigerenzer G
Gladwell M
Gumbel EJ
Jaynes ET
Kahnemann D
Kolmogorov A
Luce RD
Luce RD
MacKay DJC
McFadden D
Meginnis JR
Ortega PA
Ortega PA
Ortega PA
Ortega PA
Peters J
Rawlik K
Rubinstein A
Russell SJ
Savage LJ
Simon H
Simon H
Stone LD
Sutton RS
Theodorou E
Tishby N
Todorov E
van den Broek JL
Vitanyi PMB
Von Neumann J
Whittle P
Wolpert D
Wolpert DH
Publication venue: 'The Royal Society'
Publication date: 30/07/2012
Field of study

Perfectly rational decision-makers maximize expected utility, but crucially ignore the resource costs incurred when determining optimal actions. Here we propose an information-theoretic formalization of bounded rational decision-making where decision-makers trade off expected utility and information processing costs. Such bounded rational decision-makers can be thought of as thermodynamic machines that undergo physical state changes when they compute. Their behavior is governed by a free energy functional that trades off changes in internal energy-as a proxy for utility-and entropic changes representing computational costs induced by changing states. As a result, the bounded rational decision-making problem can be rephrased in terms of well-known concepts from statistical physics. In the limit when computational costs are ignored, the maximum expected utility principle is recovered. We discuss the relation to satisficing decision-making procedures as well as links to existing theoretical frameworks and human decision-making experiments that describe deviations from expected utility theory. Since most of the mathematical machinery can be borrowed from statistical physics, the main contribution is to axiomatically derive and interpret the thermodynamic free energy as a model of bounded rational decision-making.Comment: 26 pages, 5 figures, (under revision since February 2012