Search CORE

4,770 research outputs found

An Enhanced Features Extractor for a Portfolio of Constraint Solvers

Author: Hutter F.
Morara M.
Nethercote N.
O'Mahony E.
Xu L.
Publication venue
Publication date: 01/01/2014
Field of study

Recent research has shown that a single arbitrarily efficient solver can be significantly outperformed by a portfolio of possibly slower on-average solvers. The solver selection is usually done by means of (un)supervised learning techniques which exploit features extracted from the problem specification. In this paper we present an useful and flexible framework that is able to extract an extensive set of features from a Constraint (Satisfaction/Optimization) Problem defined in possibly different modeling languages: MiniZinc, FlatZinc or XCSP. We also report some empirical results showing that the performances that can be obtained using these features are effective and competitive with state of the art CSP portfolio techniques

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

MDL Convergence Speed for Bernoulli Sequences

Author: A. K. Zvonkin
A. R. Barron
A. R. Barron
B. S. Clarke
J. J. Rissanen
J. J. Rissanen
Jan Poland
L. A. Levin
M. Hutter
M. Hutter
M. Hutter
Marcus Hutter
P. Gács
P. M. Vitányi
R. J. Solomonoff
V. G. Vovk
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

The Minimum Description Length principle for online sequence estimation/prediction in a proper learning setup is studied. If the underlying model class is discrete, then the total expected square loss is a particularly interesting performance measure: (a) this quantity is finitely bounded, implying convergence with probability one, and (b) it additionally specifies the convergence speed. For MDL, in general one can only have loss bounds which are finite but exponentially larger than those for Bayes mixtures. We show that this is even the case if the model class contains only Bernoulli distributions. We derive a new upper bound on the prediction error for countable Bernoulli classes. This implies a small bound (comparable to the one for Bayes mixtures) for certain important model classes. We discuss the application to Machine Learning tasks such as classification and hypothesis testing, and generalization to countable classes of i.i.d. models.Comment: 28 page

arXiv.org e-Print Archive

CiteSeerX

Crossref

The Australian National University

Hokkaido University Collection of Scholarly and Academic Papers

Self-Modification of Policy and Utility Function in Rational Agents

Author: B Hibbard
D Dewey
D Silver
J Schmidhuber
L Orseau
L Orseau
L Orseau
LP Kaelbling
M Hutter
M Hutter
M Ring
N Bostrom
R Sutton
RV Yampolskiy
S Legg
V Mnih
Publication venue
Publication date: 10/05/2016
Field of study

Any agent that is part of the environment it interacts with and has versatile actuators (such as arms and fingers), will in principle have the ability to self-modify -- for example by changing its own source code. As we continue to create more and more intelligent agents, chances increase that they will learn about this ability. The question is: will they want to use it? For example, highly intelligent systems may find ways to change their goals to something more easily achievable, thereby `escaping' the control of their designers. In an important paper, Omohundro (2008) argued that goal preservation is a fundamental drive of any intelligent system, since a goal is more likely to be achieved if future versions of the agent strive towards the same goal. In this paper, we formalise this argument in general reinforcement learning, and explore situations where it fails. Our conclusion is that the self-modification possibility is harmless if and only if the value function of the agent anticipates the consequences of self-modifications and use the current utility function when evaluating the future.Comment: Artificial General Intelligence (AGI) 201

arXiv.org e-Print Archive

Crossref

The Australian National University

On the Computability of Solomonoff Induction and Knowledge-Seeking

Author: I Wood
L Orseau
L Orseau
L Orseau
L Orseau
L Orseau
M Hutter
P Gács
R Solomonoff
S Rathmanner
T Lattimore
T Lattimore
Publication venue
Publication date: 15/07/2015
Field of study

Solomonoff induction is held as a gold standard for learning, but it is known to be incomputable. We quantify its incomputability by placing various flavors of Solomonoff's prior M in the arithmetical hierarchy. We also derive computability bounds for knowledge-seeking agents, and give a limit-computable weakly asymptotically optimal reinforcement learning agent.Comment: ALT 201

arXiv.org e-Print Archive

Crossref

The Australian National University

Optimistic Agents are Asymptotically Optimal

Author: D. Blackwell
D. Ryabko
J. Doob
L. Orseau
M. Hutter
S.J. Russell
T. Lattimore
T. Lattimore
T. Lattimore
Publication venue
Publication date: 01/01/2012
Field of study

We use optimism to introduce generic asymptotically optimal reinforcement learning agents. They achieve, with an arbitrary finite or compact class of environments, asymptotically optimal behavior. Furthermore, in the finite deterministic case we provide finite error bounds.Comment: 13 LaTeX page

arXiv.org e-Print Archive

CiteSeerX

Crossref

The Australian National University

Bayesian reinforcement learning with exploration

Author: E. Even-Dar
I. Szita
K. Dyagilev
L. Orseau
M. Hutter
M. Hutter
M. Hutter
M. Kearns
M.G. Azar
P. Auer
P. Sunehag
S. Mannor
T. Lattimore
T. Lattimore
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

We consider a general reinforcement learning problem and show that carefully combining the Bayesian optimal policy and an exploring policy leads to minimax sample-complexity bounds in a very general class of (history-based) environments. We also prove lower bounds and show that the new algorithm displays adaptive behaviour when the environment is easier than worst-case

Crossref

The Australian National University

Electron correlation in C_(4N+2) carbon rings: aromatic vs. dimerized structures

Author: C. W. Greeff
C.-H. Kiang
E. J. Bylaska
J. Ashkenazi
J. C. Grossman
J. D. Watts
J. Hunter
J. Hutter
J. Hutter
J. M. L. Martin
J. M. L. Martin
K. Raghavachari
L. Mitas
L. Mitas
L. Mitas
L. Salem
Lubos Mitas
M. Saito
M. W. Schmidt
R. B. Murphy
R. E. Peierls
R. G. Pearson
R. O. Jones
T. Torelli
T. Wakabayashi
Tommaso Torelli
Y. Shlyakhter
Publication venue: 'American Physical Society (APS)'
Publication date: 27/03/2000
Field of study

The electronic structure of C_(4N+2) carbon rings exhibits competing many-body effects of Huckel aromaticity, second-order Jahn-Teller and Peierls instability at large sizes. This leads to possible ground state structures with aromatic, bond angle or bond length alternated geometry. Highly accurate quantum Monte Carlo results indicate the existence of a crossover between C_10 and C_14 from bond angle to bond length alternation. The aromatic isomer is always a transition state. The driving mechanism is the second-order Jahn-Teller effect which keeps the gap open at all sizes.Comment: Submitted for publication: 4 pages, 3 figures. Corrected figure

arXiv.org e-Print Archive

Crossref

Concurrent bariatric operations and association with perioperative outcomes: Registry based cohort study

Author: Ban Kristen A
Berian Julia R
Hall Bruce L
Hoyt David B
Huffman Kristopher M
Hutter Matthew M
Ko Clifford Y
Liu Jason B
Liu Yaoming
Publication venue: Digital Commons@Becker
Publication date: 01/01/2017
Field of study

Crossref

Digital Commons@Becker

Time consistent discounting

Author: B. Peleg
L. Green
M. Hutter
M. Hutter
M.J. Osborne
P.A. Samuelson
R. Thaler
R.A. Pollak
R.H. Strotz
S. Legg
S.M. Goldman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

A possibly immortal agent tries to maximise its summed discounted rewards over time, where discounting is used to avoid infinite utilities and encourage the agent to value current rewards more than future ones. Some commonly used discount functions lead to time-inconsistent behavior where the agent changes its plan over time. These inconsistencies can lead to very poor behavior. We generalise the usual discounted utility model to one where the discount function changes with the age of the agent. We then give a simple characterisation of time-(in)consistent discount functions and show the existence of a rational policy for an agent that knows its discount function is time-inconsistent

Crossref

The Australian National University

Comparison of single-nucleotide polymorphisms and microsatellites in detecting quantitative trait loci for alcoholism: The Collaborative Study on the Genetics of Alcoholism

Author: Edwards Karen L
Hutter Carolyn M
Kim Helen
Monks Stephanie A
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: The feasibility of effectively analyzing high-density single nucleotide polymorphism (SNP) maps in whole genome scans of complex traits is not known. The purpose of this study was to compare variance components linkage results using different density marker maps in data from the Collaborative Study on the Genetics of Alcoholism (COGA). Marker maps having an average spacing of 10 cM (microsatellite), 0.78 cM (SNP1), and 0.31 cM (SNP2) were used to identify quantitative trait loci (QTLs) affecting maximum number of alcoholic drinks consumed in a 24-hour period (lnmaxalc). RESULTS: Heritability of lnmaxalc was estimated to be 15%. Multipoint variance components linkage analysis revealed similar linkage patterns among the three marker panels, with the SNP maps consistently yielding higher LOD scores. Robust LOD scores > 1.0 were observed on chromosomes 1 and 13 for all three marker maps. Additional LODs > 1.0 were observed on chromosome 4 with both SNP maps and on chromosomes 18 and 21 with the SNP2 map. Peak LOD scores for lnmaxalc were observed on chromosome 1, although none reached genome-wide statistical significance. Quantile-quantile plots revealed that the multipoint distribution of SNP results appeared to fit the asymptotic null distribution better than the twopoint results. CONCLUSION: In conclusion, variance-components linkage analysis using high-density SNP maps provided higher LOD scores compared with the standard microsatellite map, similar to studies using nonparametric linkage methods. Widespread application of SNP maps will depend on further improvements in the computational methods implemented in current software packages

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

SHAREOK repository