Search CORE

5,811 research outputs found

Understanding the Role of Adaptivity in Machine Teaching: The Case of Version Space Learners

Author: Chen Yuxin
Mac Aodha Oisin
Perona Pietro
Singla Adish
Yue Yisong
Publication venue
Publication date: 01/01/2018
Field of study

In real-world applications of education, an effective teacher adaptively chooses the next example to teach based on the learner's current state. However, most existing work in algorithmic machine teaching focuses on the batch setting, where adaptivity plays no role. In this paper, we study the case of teaching consistent, version space learners in an interactive setting. At any time step, the teacher provides an example, the learner performs an update, and the teacher observes the learner's new state. We highlight that adaptivity does not speed up the teaching process when considering existing models of version space learners, such as "worst-case" (the learner picks the next hypothesis randomly from the version space) and "preference-based" (the learner picks hypothesis according to some global preference). Inspired by human teaching, we propose a new model where the learner picks hypotheses according to some local preference defined by the current hypothesis. We show that our model exhibits several desirable properties, e.g., adaptivity plays a key role, and the learner's transitions over hypotheses are smooth/interpretable. We develop efficient teaching algorithms and demonstrate our results via simulation and user studies.Comment: NeurIPS 2018 (extended version

arXiv.org e-Print Archive

Caltech Authors

MPG.PuRe

Selecting Near-Optimal Learners via Incremental Data Allocation

Author: Sabharwal Ashish
Samulowitz Horst
Tesauro Gerald
Publication venue
Publication date: 31/12/2015
Field of study

We study a novel machine learning (ML) problem setting of sequentially allocating small subsets of training data amongst a large set of classifiers. The goal is to select a classifier that will give near-optimal accuracy when trained on all data, while also minimizing the cost of misallocated samples. This is motivated by large modern datasets and ML toolkits with many combinations of learning algorithms and hyper-parameters. Inspired by the principle of "optimism under uncertainty," we propose an innovative strategy, Data Allocation using Upper Bounds (DAUB), which robustly achieves these objectives across a variety of real-world datasets. We further develop substantial theoretical support for DAUB in an idealized setting where the expected accuracy of a classifier trained on

n

samples can be known exactly. Under these conditions we establish a rigorous sub-linear bound on the regret of the approach (in terms of misallocated data), as well as a rigorous bound on suboptimality of the selected classifier. Our accuracy estimates using real-world datasets only entail mild violations of the theoretical scenario, suggesting that the practical behavior of DAUB is likely to approach the idealized behavior.Comment: AAAI-2016: The Thirtieth AAAI Conference on Artificial Intelligenc

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Universal Learning of Repeated Matrix Games

Author: Hutter Marcus
Poland Jan
Publication venue
Publication date: 01/01/2005
Field of study

We study and compare the learning dynamics of two universal learning algorithms, one based on Bayesian learning and the other on prediction with expert advice. Both approaches have strong asymptotic performance guarantees. When confronted with the task of finding good long-term strategies in repeated 2x2 matrix games, they behave quite differently.Comment: 16 LaTeX pages, 8 eps figure

arXiv.org e-Print Archive

CiteSeerX

The Australian National University

Reinforcement Learning: A Survey

Author: Kaelbling L. P.
Littman M. L.
Moore A. W.
Publication venue
Publication date: 01/01/1996
Field of study

This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of current work are summarized. Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment. The work described here has a resemblance to work in psychology, but differs considerably in the details and in the use of the word ``reinforcement.'' The paper discusses central issues of reinforcement learning, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state. It concludes with a survey of some implemented systems and an assessment of the practical utility of current methods for reinforcement learning.Comment: See http://www.jair.org/ for any accompanying file

arXiv.org e-Print Archive

CiteSeerX

Theoretical Interpretations and Applications of Radial Basis Function Networks

Author: Blanzieri Enrico
Publication venue
Publication date: 01/05/2003
Field of study

Medical applications usually used Radial Basis Function Networks just as Artificial Neural Networks. However, RBFNs are Knowledge-Based Networks that can be interpreted in several way: Artificial Neural Networks, Regularization Networks, Support Vector Machines, Wavelet Networks, Fuzzy Controllers, Kernel Estimators, Instanced-Based Learners. A survey of their interpretations and of their corresponding learning algorithms is provided as well as a brief survey on dynamic learning algorithms. RBFNs' interpretations can suggest applications that are particularly interesting in medical domains

Unitn-eprints Research

Skills for growth: the national skills strategy analytical paper

Author
Publication venue: Great Britain. Department for Business, Innovation and Skills
Publication date: 01/01/2009
Field of study

Digital Education Resource Archive

Improving adaptive bagging methods for evolving data streams

Author: A. Bifet
F. Chu
F. Gustafsson
G. Hulten
J. Campo-Ávila del
J. Gama
L. Breiman
M. Basseville
N. Oza
P. Zhang
T. Mitchell
W.N. Street
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

We propose two new improvements for bagging methods on evolving data streams. Recently, two new variants of Bagging were proposed: ADWIN Bagging and Adaptive-Size Hoeffding Tree (ASHT) Bagging. ASHT Bagging uses trees of different sizes, and ADWIN Bagging uses ADWIN as a change detector to decide when to discard underperforming ensemble members. We improve ADWIN Bagging using Hoeffding Adaptive Trees, trees that can adaptively learn from data streams that change over time. To speed up the time for adapting to change of Adaptive-Size Hoeffding Tree (ASHT) Bagging, we add an error change detector for each classifier. We test our improvements by performing an evaluation study on synthetic and real-world datasets comprising up to ten million examples

Crossref

Research Commons@Waikato

Frontier Capital: Early Stage Investing for Financial Returns and Social Impact in Emerging Markets

Author: Matt Bannick
Michael Kubzansky
Paula Goldman
Publication venue: Omidyar Network
Publication date: 10/10/2015
Field of study

This report outlines the importance and promise of serving low- and lower-middle-income (LMI) populations -- essentially, the groups situated between the very bottom of the pyramid and the existing middle class. As we detail herein, LMI populations have huge unmet needs and face quite a bit of instability -- challenges that can be addressed by innovative business models. We believe companies serving this demographic represent an under-tapped opportunity, both for financial returns and for outsized impact. The LMI segment represents a major market opportunity. For example, in Latin America and the Caribbean, the purchasing power of the LMI population is estimated at

405B. In South Asia, it is estimated at

483B

IssueLab