
    Bandits with heavy tail

    The stochastic multi-armed bandit problem is well understood when the reward distributions are sub-Gaussian. In this paper we examine the bandit problem under the weaker assumption that the distributions have moments of order $1+\epsilon$, for some $\epsilon \in (0,1]$. Surprisingly, moments of order 2 (i.e., finite variance) are sufficient to obtain regret bounds of the same order as under sub-Gaussian reward distributions. In order to achieve such regret, we define sampling strategies based on refined estimators of the mean such as the truncated empirical mean, Catoni's M-estimator, and the median-of-means estimator. We also derive matching lower bounds showing that the best achievable regret deteriorates when $\epsilon < 1$.
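    The median-of-means estimator mentioned above is easy to state concretely. The sketch below is a minimal Python illustration, not code from the paper; the Pareto reward distribution and the block count are illustrative assumptions.

```python
import numpy as np

def median_of_means(samples, num_blocks):
    """Split the sample into blocks, average each block, and return the
    median of the block means. Unlike the raw empirical mean, this
    concentrates at a sub-Gaussian rate whenever the variance is finite."""
    samples = np.asarray(samples, dtype=float)
    blocks = np.array_split(samples, num_blocks)
    return float(np.median([b.mean() for b in blocks]))

# Toy usage on a heavy-tailed (Pareto, finite-variance) reward sample.
rng = np.random.default_rng(0)
rewards = rng.pareto(2.5, size=1000) + 1.0
print(median_of_means(rewards, num_blocks=9))
```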

    Functional Bandits

    We introduce the functional bandit problem, where the objective is to find an arm that optimises a known functional of the unknown arm-reward distributions. These problems arise in many settings, such as maximum entropy methods in natural language processing and risk-averse decision-making, but current best-arm identification techniques fail in these domains. We propose a new approach that combines functional estimation and arm elimination to tackle this problem. This method achieves provably efficient performance guarantees. In addition, we illustrate this method on a number of important functionals in risk management and information theory, and refine our generic theoretical results in those cases.
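    The abstract does not spell the algorithm out, so the following Python sketch only shows the general shape of a functional-estimation-plus-elimination loop. The function names, the heuristic confidence radius, and the use of negative variance as a risk-averse functional are all assumptions for illustration, not the paper's method.

```python
import numpy as np

def eliminate_arms(pull, num_arms, functional, rounds=20, batch=100, rng=None):
    """Generic sketch: sample every surviving arm, score each arm by a
    plug-in functional of its empirical sample, and drop arms whose
    score trails the current leader by more than a confidence gap."""
    rng = rng or np.random.default_rng()
    samples = {a: [] for a in range(num_arms)}
    active = set(range(num_arms))
    for t in range(1, rounds + 1):
        for a in active:
            samples[a].extend(pull(a, batch))
        scores = {a: functional(np.asarray(samples[a])) for a in active}
        best = max(scores.values())
        gap = np.sqrt(np.log(4 * num_arms * t) / (t * batch))  # heuristic radius
        active = {a for a in active if scores[a] >= best - 2 * gap}
        if len(active) == 1:
            break
    return active

# Toy usage: a risk-averse objective scored by negative variance.
arms = [(0.50, 0.10), (0.60, 0.50), (0.55, 0.05)]  # (mean, std) per arm
rng = np.random.default_rng(1)
pull = lambda a, n: rng.normal(*arms[a], size=n).tolist()
print(eliminate_arms(pull, num_arms=3, functional=lambda x: -x.var(), rng=rng))
```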

    Scalable Optimization-Based Feature Selection Using Random Sampling

    We analyze an optimization-based approach called the NP-Filter for feature selection and show how the scalability of this method can be improved using random sampling of instances from the training data. The NP-Filter has attractive theoretical properties, as the final solution quality can be quantified, and it is flexible in terms of incorporating various feature evaluation methods. We show how the NP-Filter can automatically adjust to the randomness that occurs when a sample of training instances is used, and present numerical results that illustrate both this key result and the scalability improvements that are obtained.
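    The nested-partitions search behind the NP-Filter is not described in the abstract, so the sketch below illustrates only the scalability idea: scoring a candidate feature subset on a random sample of instances instead of the full training data. The function name, classifier, and sample fraction are illustrative assumptions.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

def sampled_subset_score(X, y, features, sample_frac=0.2, rng=None):
    """Score a feature subset on a random sample of training instances,
    trading a little evaluation noise for a large reduction in cost."""
    rng = rng or np.random.default_rng()
    idx = rng.choice(len(y), size=max(1, int(sample_frac * len(y))), replace=False)
    return cross_val_score(DecisionTreeClassifier(), X[idx][:, features], y[idx], cv=3).mean()

# Toy usage on synthetic data.
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
print(sampled_subset_score(X, y, features=[0, 1, 2, 3]))
```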

    Meta-learning for data summarization based on instance selection method

    The purpose of instance selection is to identify which instances (examples, patterns) in a large dataset should be selected as representatives of the entire dataset, without significant loss of information. When a machine learning method is applied to the reduced dataset, the accuracy of the model should not be significantly worse than if the same method were applied to the entire dataset. The reducibility of any dataset, and hence the success of instance selection methods, surely depends on the characteristics of the dataset, as well as the machine learning method. This paper adopts a meta-learning approach, via an empirical study of 112 classification datasets from the UCI Repository [1], to explore the relationship between data characteristics, machine learning methods, and the success of instance selection methods.
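    As a concrete (if naive) baseline for the trade-off the study measures, the short Python experiment below, not from the paper, compares a learner trained on the full training set against one trained on a 10% stratified random sample.

```python
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_digits(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

# Full training set versus a 10% stratified random sample of it.
full_acc = KNeighborsClassifier().fit(X_tr, y_tr).score(X_te, y_te)
X_sub, _, y_sub, _ = train_test_split(
    X_tr, y_tr, train_size=0.1, stratify=y_tr, random_state=0)
sub_acc = KNeighborsClassifier().fit(X_sub, y_sub).score(X_te, y_te)

print(f"full: {full_acc:.3f}  10% sample: {sub_acc:.3f}")
```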

    TSEC: a framework for online experimentation under experimental constraints

    Thompson sampling is a popular algorithm for solving multi-armed bandit problems, and has been applied in a wide range of applications, from website design to portfolio optimization. In such applications, however, the number of choices (or arms) $N$ can be large, and the data needed to make adaptive decisions require expensive experimentation. One is then faced with the constraint of experimenting on only a small subset of $K \ll N$ arms within each time period, which poses a problem for traditional Thompson sampling. We propose a new Thompson Sampling under Experimental Constraints (TSEC) method, which addresses this so-called "arm budget constraint". TSEC makes use of a Bayesian interaction model with effect hierarchy priors to model correlations between rewards on different arms. This fitted model is then integrated within Thompson sampling to jointly identify a good subset of arms for experimentation and to allocate resources over these arms. We demonstrate the effectiveness of TSEC in two problems with arm budget constraints. The first is a simulated website optimization study, where TSEC shows noticeable improvements over industry benchmarks. The second is a portfolio optimization application on industry-based exchange-traded funds, where TSEC provides more consistent and greater wealth accumulation than standard investment strategies.
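    TSEC's hierarchical interaction model is not reproduced here. The Python sketch below shows only the basic arm-budget mechanic under a deliberate simplification, independent Beta posteriors per arm: each period, just the K arms with the highest posterior draws are experimented on. The function name and the toy reward probabilities are illustrative.

```python
import numpy as np

def budgeted_thompson(true_probs, K, periods=500, rng=None):
    """Bernoulli Thompson sampling under an arm budget: draw one
    posterior sample per arm, pull only the K highest draws."""
    rng = rng or np.random.default_rng()
    N = len(true_probs)
    wins, losses = np.ones(N), np.ones(N)  # Beta(1, 1) priors
    for _ in range(periods):
        draws = rng.beta(wins, losses)
        chosen = np.argsort(draws)[-K:]  # arm budget: experiment on K << N arms
        rewards = rng.random(K) < np.asarray(true_probs)[chosen]
        wins[chosen] += rewards
        losses[chosen] += ~rewards
    return wins / (wins + losses)  # posterior mean reward per arm

print(np.round(budgeted_thompson([0.10, 0.12, 0.30, 0.15, 0.05, 0.28], K=2), 2))
```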

    Surfing the modeling of PoS taggers in low-resource scenarios

    The recent trend toward the application of deep structured techniques has revealed the limits of huge models in natural language processing. This has reawakened interest in traditional machine learning algorithms, which have proved to remain competitive in certain contexts, particularly in low-resource settings. In parallel, model selection has become an essential task for boosting performance at reasonable cost, all the more so in domains where training and/or computational resources are scarce. Against this backdrop, we evaluate the early estimation of learning curves as a practical mechanism for selecting the most appropriate model in scenarios characterized by the use of non-deep learners in resource-lean settings. On the basis of a formal approximation model previously evaluated under conditions of wide availability of training and validation resources, we study the reliability of such an approach in a different and much more demanding operational environment. Using as a case study the generation of PoS taggers for Galician, a language belonging to the Western Ibero-Romance group, the experimental results are consistent with our expectations.

    Funding: Ministerio de Ciencia e Innovación | Ref. PID2020-113230RB-C21; Ministerio de Ciencia e Innovación | Ref. PID2020-113230RB-C22; Xunta de Galicia | Ref. ED431C 2020/1
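    The formal approximation model the paper builds on is not given in the abstract. As an assumed stand-in, the sketch below fits a common inverse-power-law learning-curve model to accuracies observed at small training sizes and extrapolates to a larger size; all numbers are illustrative.

```python
import numpy as np
from scipy.optimize import curve_fit

def power_law(n, a, b, c):
    """Inverse power law for learning curves: accuracy tends to a as n grows."""
    return a - b * np.power(n, -c)

# Accuracies measured at small training-set sizes (illustrative values).
sizes = np.array([100, 200, 400, 800, 1600], dtype=float)
accs = np.array([0.71, 0.76, 0.80, 0.83, 0.85])

params, _ = curve_fit(power_law, sizes, accs, p0=[0.9, 1.0, 0.5], maxfev=10000)
print(f"predicted accuracy at n=50000: {power_law(50000, *params):.3f}")
```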

    A Comprehensive Survey of Data Mining-based Fraud Detection Research

    This survey paper categorises, compares, and summarises almost all published technical and review articles in automated fraud detection within the last 10 years. It defines the professional fraudster, formalises the main types and subtypes of known fraud, and presents the nature of data evidence collected within affected industries. Within the business context of mining the data to achieve higher cost savings, this research presents methods and techniques together with their problems. Compared to all related reviews on fraud detection, this survey covers many more technical articles and is the only one, to the best of our knowledge, which proposes alternative data and solutions from related domains.