6 research outputs found

Subjectively Interesting Subgroup Discovery on Real-valued Targets

Author: De Bie Tijl
Duivesteijn Wouter
Kang Bo
Lijffijt Jefrey
Oikarinen Emilia
Puolamäki Kai
Publication date: 01/01/2017

Deriving insights from high-dimensional data is one of the core problems in data mining. The difficulty mainly stems from the fact that there are exponentially many variable combinations to potentially consider, and infinitely many if we consider weighted combinations, even when restricted to linear ones. Hence, an obvious question is whether we can automate the search for interesting patterns and visualizations. In this paper, we consider the setting where a user wants to learn as efficiently as possible about real-valued attributes, for example, to understand the distribution of crime rates in different geographic areas in terms of other (numerical, ordinal and/or categorical) variables that describe the areas. We introduce a method to find subgroups in the data that are maximally informative (in the formal information-theoretic sense) with respect to a single real-valued target attribute or a set of them. The subgroup descriptions are in terms of a succinct set of arbitrarily-typed other attributes. The approach is based on the Subjective Interestingness framework FORSIED to enable the use of prior knowledge when finding the most informative non-redundant patterns, and hence the method also supports iterative data mining.

Comment: 12 pages, 10 figures, 2 tables, conference submission

arXiv.org e-Print Archive

Repository TU/e

Crossref

Pure OAI Repository

Ghent University Academic Bibliography

Aaltodoc Publication Archive

Identificação De Padrões Em Infraestrutura Ferroviária - Via Uma Abordagem Data Mining [Pattern Identification in Railway Infrastructure via a Data Mining Approach]

Author: Nuno Pinto Barriga de Carvalho Tavares
Publication date: 01/12/2014

Repositório Aberto da Universidade do Porto

Sports analytics for professional speed skating

Author: Arno Knobbe
Benjamin van der Burgh
D Stranneby
D Vandewalle
DW Hosmer
F Lemmerich
H Grosskreutz
J Friedman
Jac Orie
Nico Hofman
P-O Åstrand
PK Novak
R Caruana
R Tibshirani
Ricardo Cachucho
TW Calvert
W Klösgen
WD McArdle
WL Kenney
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Learning subjectively interesting data representations

Author: Kang Bo
Publication venue: Ghent University. Faculty of Engineering and Architecture
Publication date: 01/01/2019

Ghent University Academic Bibliography

Distribution rules with numeric attributes of interest

Author: A. Silberschatz
B. Kavsek
B. Liu
G.I. Webb
H. Zhang
M. Kearns
R. Agrawal
R. Srikant
R.J. Bayardo
T. Fukuda
W. Klösgen
W.J. Conover
Publication date: 01/01/2006

In this paper we introduce distribution rules, a kind of association rule with a distribution on the consequent. Distribution rules are related to quantitative association rules but can be seen as a more fundamental concept, useful for learning distributions. We formalize the main concepts and indicate applications to tasks such as frequent pattern discovery, subgroup discovery and forecasting. An efficient algorithm for the generation of distribution rules is described. We also provide interest measures, visualization techniques and evaluation

CiteSeerX

Crossref