4,989 research outputs found

The Nightingale Prize 2010 for best MBEC paper in 2009 awarded

Author: A Avolio
A Rémond
AD Hughes
AG Cutti
CA Leguy
D Blana
D Farina
E Stride
G Pages
JA Spaan
Jos A. E. Spaan
KH Hong
KH Parker
L Guo
LP Li
M Ferrario
M Potse
M Zolgharni
R Alcaraz
U Richter
Publication venue: Springer-Verlag
Publication date: 01/01/2010
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

UvA-DARE

International Migration, Integration and Social Cohesion online publications

Reinforcement Learning by Guided Safe Exploration

Author: Jansen Nils
Simão Thiago D.
Spaan Matthijs T. J.
Tindemans Simon H.
Yang Qisong
Publication venue
Publication date: 26/07/2023
Field of study

Safety is critical to broadening the application of reinforcement learning (RL). Often, we train RL agents in a controlled environment, such as a laboratory, before deploying them in the real world. However, the real-world target task might be unknown prior to deployment. Reward-free RL trains an agent without the reward to adapt quickly once the reward is revealed. We consider the constrained reward-free setting, where an agent (the guide) learns to explore safely without the reward signal. This agent is trained in a controlled environment, which allows unsafe interactions and still provides the safety signal. After the target task is revealed, safety violations are not allowed anymore. Thus, the guide is leveraged to compose a safe behaviour policy. Drawing from transfer learning, we also regularize a target policy (the student) towards the guide while the student is unreliable and gradually eliminate the influence of the guide as training progresses. The empirical analysis shows that this method can achieve safe transfer learning and helps the student solve the target task faster. Comment: Accepted at ECAI 202

arXiv.org e-Print Archive

Large-scale trade in legally protected marine mollusc shells from Java and Bali, Indonesia

Author: Nekaris K
Nijman V
Spaan D
Publication venue
Publication date: 01/01/2015
Field of study

Background: Tropical marine molluscs are traded globally. Larger species with slow life histories are under threat from over-exploitation. We report on the trade in protected marine mollusc shells in and from Java and Bali, Indonesia. Since 1987 twelve species of marine molluscs are protected under Indonesian law to shield them from overexploitation. Despite this protection they are traded openly in large volumes. Methodology/Principal Findings: We collected data on species composition, origins, volumes and prices at two large open markets (2013), collected data from wholesale traders (2013), and compiled seizure data by the Indonesian authorities (2008–2013). All twelve protected species were observed in trade. Smaller species were traded for 32,000 shells valued at USD500,000), chambered nautilus (Nautilus pompilius) (>3,000 shells, USD60,000) and giant clams (Tridacna spp.) (>2,000 shells, USD45,000) were traded in largest volumes. Two-thirds of this trade was destined for international markets, including in the USA and Asia-Pacific region. Conclusions/Significance: We demonstrated that the trade in protected marine mollusc shells in Indonesia is not controlled nor monitored, that it involves large volumes, and that networks of shell collectors, traders, middlemen and exporters span the globe. This impedes protection of these species on the ground and calls into question the effectiveness of protected species management in Indonesia; solutions are unlikely to be found only in Indonesia and must involve the cooperation of importing countries.

Crossref

Directory of Open Access Journals

PubMed Central

Oxford Brookes University: RADAR

FigShare

Scalable Safe Policy Improvement via Monte Carlo Tree Search

Author: A. Castellini
A. Farinelli
E. Zorzi
F. Bianchi
M. T. J. Spaan
T. D. Simao
Publication venue: PMLR
Publication date: 01/01/2023
Field of study

Algorithms for safely improving policies are important to deploy reinforcement learning approaches in real-world scenarios. In this work, we propose an algorithm, called MCTS-SPIBB, that computes safe policy improvement online using a Monte Carlo Tree Search based strategy. We theoretically prove that the policy generated by MCTS-SPIBB converges, as the number of simulations grows, to the optimal safely improved policy generated by Safe Policy Improvement with Baseline Bootstrapping (SPIBB), a popular algorithm based on policy iteration. Moreover, our empirical analysis performed on three standard benchmark domains shows that MCTS-SPIBB scales to significantly larger problems than SPIBB because it computes the policy online and locally, i.e., only in the states actually visited by the agent.
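The baseline-bootstrapping idea behind SPIBB can be illustrated for a single state. The sketch below is not taken from the paper: it assumes the commonly described rule in which actions observed fewer than a threshold number of times keep their baseline probability, and the remaining probability mass moves to the best well-estimated action. The function name `spibb_greedy_step` and the parameter `n_wedge` are hypothetical labels chosen for illustration.

```python
def spibb_greedy_step(q_values, baseline_probs, counts, n_wedge):
    """Illustrative single-state improvement step in the style of SPIBB.

    q_values:       estimated action values for this state
    baseline_probs: baseline policy probabilities for this state
    counts:         number of times each action was observed in the data
    n_wedge:        assumed count threshold below which estimates are not trusted
    """
    # Actions with too few observations are "bootstrapped": copy the baseline.
    bootstrapped = [n < n_wedge for n in counts]
    new_probs = [p if b else 0.0 for p, b in zip(baseline_probs, bootstrapped)]

    # The mass the baseline put on trusted actions goes to the best trusted action.
    trusted = [a for a, b in enumerate(bootstrapped) if not b]
    if trusted:
        free_mass = sum(baseline_probs[a] for a in trusted)
        best = max(trusted, key=lambda a: q_values[a])
        new_probs[best] += free_mass
    return new_probs


example = spibb_greedy_step(
    q_values=[1.0, 5.0, 2.0],
    baseline_probs=[0.5, 0.2, 0.3],
    counts=[100, 3, 50],
    n_wedge=10,
)
print(example)
```

In this toy state the second action has the highest estimated value but too few observations, so it keeps its baseline probability while the rest of the mass shifts to the third action. An online method such as the one in the abstract would only need a step like this in states the agent actually visits.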

Catalogo dei prodotti della ricerca

Context-dependent costs and benefits of tuberculosis resistance traits in a wild mammalian host

Author: Beechler Brianna R.
Buss Peter E.
Ezenwa Vanessa O.
Gorsich Erin E.
Hoal Eileen G.
Jolles Anna E.
le Roex Nikki
Spaan Johannie M.
Spaan Robert S.
Tavalire Hannah F.
van Helden Paul D.
Publication venue: Wiley-Blackwell Publishing, Inc
Publication date: 01/01/2018
Field of study

Disease acts as a powerful driver of evolution in natural host populations, yet individuals in a population often vary in their susceptibility to infection. Energetic trade-offs between immune and reproductive investment lead to the evolution of distinct life history strategies, driven by the relative fitness costs and benefits of resisting infection. However, examples quantifying the cost of resistance outside of the laboratory are rare. Here, we observe two distinct forms of resistance to bovine tuberculosis (bTB), an important zoonotic pathogen, in a free-ranging African buffalo (Syncerus caffer) population. We characterize these phenotypes as “infection resistance,” in which hosts delay or prevent infection, and “proliferation resistance,” in which the host limits the spread of lesions caused by the pathogen after infection has occurred. We found weak evidence that infection resistance to bTB may be heritable in this buffalo population (h² = 0.10) and comes at the cost of reduced body condition and marginally reduced survival once infected, but also associates with an overall higher reproductive rate. Infection-resistant animals thus appear to follow a “fast” pace-of-life syndrome, in that they reproduce more quickly but die upon infection. In contrast, proliferation resistance had no apparent costs and was associated with measures of positive host health, such as having a higher body condition and reproductive rate. This study quantifies striking phenotypic variation in pathogen resistance and provides evidence for a link between life history variation and a disease resistance trait in a wild mammalian host population.

Dryad Digital Repository (Duke University)

Warwick Research Archives Portal Repository

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Electronic Archiving System

Stellenbosch University SUNScholar Repository

Parameter-Independent Strategies for pMDPs via POMDPs

Author: A Lukina
C Baier
C Daws
C Dehnert
D Beyer
E Bartocci
E Polgreen
EM Hahn
J Aspnes
K Chatterjee
LI Sennott
M Baldi
M Cubuktepe
M Kwiatkowska
MTJ Spaan
N Jansen
O Madani
PR Halmos
R Lanotte
S Pathak
S Russell
T Quatmann
V Kreinovich
Publication venue
Publication date: 01/01/2018
Field of study

Markov Decision Processes (MDPs) are a popular class of models suitable for solving control decision problems in probabilistic reactive systems. We consider parametric MDPs (pMDPs) that include parameters in some of the transition probabilities to account for stochastic uncertainties of the environment such as noise or input disturbances. We study pMDPs with reachability objectives where the parameter values are unknown and impossible to measure directly during execution, but there is a probability distribution known over the parameter values. We study for the first time computing parameter-independent strategies that are expectation optimal, i.e., optimize the expected reachability probability under the probability distribution over the parameters. We present an encoding of our problem to partially observable MDPs (POMDPs), i.e., a reduction of our problem to computing optimal strategies in POMDPs. We evaluate our method experimentally on several benchmarks: a motivating (repeated) learner model; a series of benchmarks of varying configurations of a robot moving on a grid; and a consensus protocol. Comment: Extended version of a QEST 2018 paper

arXiv.org e-Print Archive

Crossref

Publikationsserver der RWTH Aachen University

IST Austria: PubRep (Institute of Science and Technology)

Improved performance of the LHCb Outer Tracker in LHC Run 2

Author: Aaij R.
Archilli F.
Bachmann S.
Berninghoff D.
Birnkraut A.
Blouw J.
Ciezarek G.
d'Argent Ph.
de Cian M.
de Vries J. A.
Demmer M.
Dettori F.
Dufour L.
Färber Ch.
Gersabeck E.
Grabowski J.
Grillo L.
Hulsbergen W. D.
Khanji B.
Kolpin M.
Kucharczyk M.
Malecki B. P.
Merk M.
Mueller V.
Mulder M.
Müller J.
Pellegrino A.
Pikies M.
Rachwal B.
Schmelzer T.
Spaan B.
Szczekowski M.
Tolk S.
Tuning N.
Ukleja A.
Uwer U.
van Tilburg J.
Wishahi J.
Witek M.
Publication venue: IOP Publishing
Publication date: 01/01/2017
Field of study

The LHCb Outer Tracker is a gaseous detector covering an area of $5\times 6$ m$^2$ with 12 double layers of straw tubes. The performance of the detector is presented based on data of the LHC Run 2 running period from 2015 and 2016. Occupancies and operational experience for data collected in $pp$, pPb and PbPb collisions are described. An updated study of the ageing effects is presented showing no signs of gain deterioration or other radiation damage effects. In addition several improvements with respect to LHC Run 1 data taking are introduced. A novel real-time calibration of the time-alignment of the detector and the alignment of the single monolayers composing detector modules are presented, improving the drift-time and position resolution of the detector by 20%. Finally, a potential use of the improved resolution for the timing of charged tracks is described, showing the possibility to identify low-momentum hadrons with their time-of-flight. Comment: 29 pages, 20 figures, minor changes to match the published version
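To see why time-of-flight separation is only practical at low momentum, the standard relation $t = L/(\beta c)$ with $\beta = p/\sqrt{p^2 + m^2}$ can be evaluated for two hadron species. This is a rough sketch and not the analysis from the paper: the flight path `PATH_M` is an assumed illustrative value, the masses are approximate, and the helper names are hypothetical.

```python
import math

C_M_PER_NS = 0.299792458   # speed of light in metres per nanosecond
PION_MASS_GEV = 0.13957    # approximate charged pion mass
PROTON_MASS_GEV = 0.93827  # approximate proton mass
PATH_M = 8.0               # ASSUMED flight path; not stated in the abstract


def time_of_flight_ns(momentum_gev, mass_gev, path_m):
    """Flight time in ns for a particle of given momentum and mass (natural units)."""
    energy_gev = math.hypot(momentum_gev, mass_gev)  # E = sqrt(p^2 + m^2)
    beta = momentum_gev / energy_gev
    return path_m / (beta * C_M_PER_NS)


def proton_pion_delay_ns(momentum_gev, path_m=PATH_M):
    """How much later a proton arrives than a pion of the same momentum."""
    return (time_of_flight_ns(momentum_gev, PROTON_MASS_GEV, path_m)
            - time_of_flight_ns(momentum_gev, PION_MASS_GEV, path_m))


for p in (2.0, 5.0, 10.0):
    print(f"p = {p:4.1f} GeV/c: proton arrives {proton_pion_delay_ns(p):.3f} ns after pion")
```

The delay shrinks quickly as momentum grows, since both species approach $\beta = 1$, which is consistent with the abstract restricting the idea to low-momentum hadrons.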

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Università di Cagliari

CERN Document Server

Determination of the Michel Parameters rho, xi, and delta in tau-Lepton Decays with tau --> rho nu Tags

Author: Albrecht H.
ARGUS Collaboration
Balagura V.
Barsuk S.
Belyaev I.
Bracko M.
Chistov R.
Danilov M.
Eckmann R.
Ehret K.
Eiges V.
Frankl C.
Gershtein L.
Gershtein Yu.
Golutvin A.
Graf J.
Hamacher T.
Hast C.
Hofmann R. P.
Hofmann W.
Hupper A.
Igonkina O.
Kapitza H.
Kernel G.
Kirchhoff T.
Knopfle K. T.
Kolanoski H.
Korolko I.
Kosche A.
Kostina G.
Krieger P.
Krizan P.
Kriznic E.
Kuipers H.
Lange Arnd
Lindner A.
Litvintsev D.
MacFarlane D. B.
Mai O.
Mankel R.
Medin G.
Mundt R.
Nau A.
Nowak S.
Oest T.
Pakhlov P.
Podobnik T.
Prentice J. D.
Reim K.
Reiner R.
Ressing D.
Rohde A.
Saull P. R. B.
Schieber M.
Schmidt-Parzefall W.
Schmidtler M.
Schneider M.
Schramm M.
Schroder H.
Schubert Klaus R.
Schulz H. D.
Schwierz R.
Semenov S.
Siegmund T.
Snizhko A.
Spaan B.
Spengler J.
Stiewe J.
Thurn H.
Tichomirov I.
Topfer D.
Tzamariudaki K.
Van de Water Richard George
Waldi R.
Walter M.
Wegener D.
Wegener H.
Werner S.
Weseler S.
Wurth R.
Yoon T. S.
Zaitsev Yu.
Zivko T.
Publication venue: Elsevier BV
Publication date: 27/11/1997
Field of study

Using the ARGUS detector at the $e^+ e^-$ storage ring DORIS II, we have measured the Michel parameters $\rho$, $\xi$, and $\xi\delta$ for $\tau^{\pm}\to l^{\pm} \nu\bar\nu$ decays in $\tau$-pair events produced at center of mass energies in the region of the $\Upsilon$ resonances. Using $\tau^\mp \to \rho^\mp \nu$ as spin analyzing tags, we find $\rho_{e}=0.68\pm 0.04 \pm 0.08$, $\xi_{e}= 1.12 \pm 0.20 \pm 0.09$, $\xi\delta_{e}= 0.57 \pm 0.14 \pm 0.07$, $\rho_{\mu}= 0.69 \pm 0.06 \pm 0.08$, $\xi_{\mu}= 1.25 \pm 0.27 \pm 0.14$, and $\xi\delta_{\mu}= 0.72 \pm 0.18 \pm 0.10$. In addition, we report the combined ARGUS results on $\rho$, $\xi$, and $\xi\delta$ using this work and previous measurements. Comment: 10 pages, well formatted postscript can be found at http://pktw06.phy.tu-dresden.de/iktp/pub/desy97-194.p

arXiv.org e-Print Archive

DESY

Semileptonic Branching Fraction of Charged and Neutral B Mesons

Author: Alam M.
Alexander J.
Ammar R.
Artuso M.
Athanas M.
Avery P.
Balest R.
Baringer P.
Barish B.
Bartelt J.
Battle M.
Bean A.
Bebek C.
Bellerive A.
Bergfeld T.
Berkelman K.
Besson D.
Bishai M.
Bloom K.
Britton D.
Browder T.
Brower W.
Butler F.
Cassel D.
Chadha M.
Chan S.
Cho H.
Cho K.
Cinabro D.
Coffman D.
Coppage D.
Copty N.
Cowen D.
Crawford G.
Crowcroft D.
Csorna S.
Daubenmier C.
Davis R.
Dominick J.
Drell P.
Dumas D.
Edwards K.
Egyed Z.
Ehrlich R.
Eigen G.
Eisenstein B.
Ernst J.
Fast J.
Ford W.
Freyberger A.
Fu X.
Fujino D.
Fulton R.
Gaidarev P.
Gan K.
Gao M.
Garcia-Sciveres M.
Geiser B.
Gibaut D.
Gibbons L.
Gittelman B.
Goldberg M.
Gollin G.
Gray S.
Grendt E.
Gronberg J.
Hancock N.
Hartill D.
He D.
Heltsley B.
Henderson S.
Honscheid K.
Horwitz N.
Hyatt E.
Jain V.
Johnson D.
Jones C.
Jones S.
Kagan H.
Kandaswamy J.
Kass R.
Katayama N.
Kelly M.
Kennett R.
Kim I.
Kim P.
Kinoshita K.
Kotov S.
Kravchenko I.
Kreinick D.
Kubota Y.
Kutschke R.
Kwak N.
Kwon Y.
Lam H.
Lambrecht M.
Lattery M.
Lee J.
Ling Z.
Lingel K.
Liu T.
Lohner M.
Ludwig G.
MacFarlane D.
Mahmood A.
Malchow R.
Masek G.
Masui J.
McIlwain R.
Menary S.
Mevissen J.
Miao T.
Miller D.
Miller J.
Mistry N.
Modesitt M.
Momayezi M.
Moneti G.
Morrison R.
Mountain R.
Muheim F.
Mukhin Y.
Nakanishi S.
Nelson H.
Nelson J.
Nelson T.
Nemati B.
Ng C.
Nordberg E.
O'Grady C.
O'Neill J.
Ogg M.
Ong B.
Paar H.
Palmer M.
Patel P.
Patterson J.
Patton S.
Payne D.
Peterson D.
Playfer S.
Poling R.
Pomianowski P.
Qiao C.
Rankin P.
Richman J.
Riley D.
Roberts S.
Rodriguez J.
Ross W.
Rozen Y.
Ryd A.
Sadoff A.
Salman S.
Sanghera S.
Sapper M.
Saulnier M.
Savinov V.
Schrenk S.
Selen M.
Severini H.
Shelkov V.
Shibata E.
Shipsey I.
Skovpen Y.
Skubic P.
Skwarnicki T.
Smith J.
Spaan B.
Sperka D.
Stone S.
Stroynowski R.
Sun C.
Sung M.
Tajima H.
Thaler J.
Thorndike E.
Urheim J.
Vasseur G.
Volobouev I.
Wang C.
Wang P.
Wang R.
Wappler F.
Wei G.
Weinstein A.
White C.
Wilson R.
Witherell M.
Wood M.
Würthwein F.
Xing X.
Yamamoto H.
Yang S.
Yelton D.
Zadorozhny P.
Zhu G.
Zoeller M.
Publication venue: American Physical Society (APS)
Publication date: 22/06/1994
Field of study

An examination of leptons in $\Upsilon(4S)$ events tagged by reconstructed $B$ decays yields semileptonic branching fractions of $b_-=(10.1 \pm 1.8\pm 1.4)\%$ for charged and $b_0=(10.9 \pm 0.7\pm 1.1)\%$ for neutral $B$ mesons. This is the first measurement for charged $B$. Assuming equality of the charged and neutral semileptonic widths, the ratio $b_-/b_0=0.93 \pm 0.18 \pm 0.12$ is equivalent to the ratio of lifetimes. A postscript version is available through World-Wide-Web in http://w4.lns.cornell.edu/public/CLNS/1994. Comment: 9 pages (in REVTEX format) Preprint CLNS94-1286, CLEO 94-1
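The quoted ratio can be checked with a short calculation. The sketch below assumes the two statistical errors are uncorrelated and combines the relative errors in quadrature; that assumption is ours, not a statement of how the authors computed their uncertainties, and the function name is hypothetical.

```python
import math


def ratio_with_uncorrelated_error(a, sigma_a, b, sigma_b):
    """Return a/b and its error, ASSUMING a and b are uncorrelated."""
    ratio = a / b
    relative_error = math.hypot(sigma_a / a, sigma_b / b)
    return ratio, ratio * relative_error


# Central values and statistical errors (in percent) from the abstract.
ratio, stat = ratio_with_uncorrelated_error(10.1, 1.8, 10.9, 0.7)
print(f"b_-/b_0 = {ratio:.2f} +/- {stat:.2f} (stat)")  # prints: b_-/b_0 = 0.93 +/- 0.18 (stat)
```

The central value and statistical error reproduce the abstract's $0.93 \pm 0.18$. Applying the same naive quadrature to the systematic errors gives roughly 0.16 rather than the quoted 0.12, presumably because part of the systematic uncertainty is common to both measurements and cancels in the ratio.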

arXiv.org e-Print Archive

Crossref

Enlighten

CERN Document Server