Search CORE

1,603 research outputs found

Types of cost in inductive concept learning

Author: Turney Peter
Publication venue
Publication date: 01/01/2000
Field of study

Inductive concept learning is the task of learning to assign cases to a discrete set of classes. In real-world applications of concept learning, there are many different types of cost involved. The majority of the machine learning literature ignores all types of cost (unless accuracy is interpreted as a type of cost measure). A few papers have investigated the cost of misclassification errors. Very few papers have examined the many other types of cost. In this paper, we attempt to create a taxonomy of the different types of cost that are involved in inductive concept learning. This taxonomy may help to organize the literature on cost-sensitive learning. We hope that it will inspire researchers to investigate all types of cost in inductive concept learning in more depth

CiteSeerX

NRC Publications Archive

CogPrints Cognitive Sciences Eprint Archive

Post-processing of association rules.

Author: Baesens Bart
Vanthienen Jan
Viaene Stijn
Publication venue
Publication date
Field of study

In this paper, we situate and motivate the need for a post-processing phase to the association rule mining algorithm when plugged into the knowledge discovery in databases process. Major research effort has already been devoted to optimising the initially proposed mining algorithms. When it comes to effectively extrapolating the most interesting knowledge nuggets from the standard output of these algorithms, one is faced with an extreme challenge, since it is not uncommon to be confronted with a vast amount of association rules after running the algorithms. The sheer multitude of generated rules often clouds the perception of the interpreters. Rightful assessment of the usefulness of the generated output introduces the need to effectively deal with different forms of data redundancy and data being plainly uninteresting. In order to do so, we will give a tentative overview of some of the main post-processing tasks, taking into account the efforts that have already been reported in the literature.

Research Papers in Economics

Efficient chain structure for high-utility sequential pattern mining

Author: Djenouri Youcef
Fournier-Viger Philippe
Li Yuanfa
Lin Jerry Chun-Wei
Zhang Ji
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

High-utility sequential pattern mining (HUSPM) is an emerging topic in data mining, which considers both utility and sequence factors to derive the set of high-utility sequential patterns (HUSPs) from the quantitative databases. Several works have been presented to reduce the computational cost by variants of pruning strategies. In this paper, we present an efficient sequence-utility (SU)-chain structure, which can be used to store more relevant information to improve mining performance. Based on the SU-Chain structure, the existing pruning strategies can also be utilized here to early prune the unpromising candidates and obtain the satisfied HUSPs. Experiments are then compared with the state-of-the-art HUSPM algorithms and the results showed that the SU-Chain-based model can efficiently improve the efficiency performance than the existing HUSPM algorithms in terms of runtime and number of the determined candidates

SINTEF Open

NORA - Norwegian Open Research Archives

University of Southern Queensland ePrints

CURIO: A fast outlier and outlier cluster detection algorithm for large datasets

Author: Ceglar Aaron John
Powers David Martin
Roddick John Francis
Publication venue: Australian Computer Society
Publication date: 01/01/2007
Field of study

Sydney, NS

Flinders Academic Commons