Search CORE

5,409 research outputs found

A taxonomy framework for unsupervised outlier detection techniques for multi-type data sets

Author: Havinga P.J.M.
Meratnia N.
Zhang Yang
Publication venue: Centre for Telematics and Information Technology, University of Twente
Publication date: 01/01/2007
Field of study

The term "outlier" can generally be defined as an observation that is significantly different from the other values in a data set. The outliers may be instances of error or indicate events. The task of outlier detection aims at identifying such outliers in order to improve the analysis of data and further discover interesting and useful knowledge about unusual events within numerous applications domains. In this paper, we report on contemporary unsupervised outlier detection techniques for multiple types of data sets and provide a comprehensive taxonomy framework and two decision trees to select the most suitable technique based on data set. Furthermore, we highlight the advantages, disadvantages and performance issues of each class of outlier detection techniques under this taxonomy framework

University of Twente Research Information

Spectral Embedding Norm: Looking Deep into the Spectrum of the Graph Laplacian

Author: Cheng Xiuyuan
Mishne Gal
Publication venue
Publication date: 22/08/2019
Field of study

The extraction of clusters from a dataset which includes multiple clusters and a significant background component is a non-trivial task of practical importance. In image analysis this manifests for example in anomaly detection and target detection. The traditional spectral clustering algorithm, which relies on the leading

K

eigenvectors to detect

K

clusters, fails in such cases. In this paper we propose the {\it spectral embedding norm} which sums the squared values of the first

I

normalized eigenvectors, where

I

can be significantly larger than

K

. We prove that this quantity can be used to separate clusters from the background in unbalanced settings, including extreme cases such as outlier detection. The performance of the algorithm is not sensitive to the choice of

I

, and we demonstrate its application on synthetic and real-world remote sensing and neuroimaging datasets

arXiv.org e-Print Archive

PubMed Central

eScholarship - University of California

Enhance density peak clustering algorithm for anomaly intrusion detection system

Author: Alkafagi Salam Saad
Almuttairi Rafah M.
Publication venue: 'International University of Sarajevo'
Publication date: 06/06/2021
Field of study

In this paper proposed new model of Density Peak Clustering algorithm to enhance clustering of intrusion attacks. The Anomaly Intrusion Detection System (AIDS) by using original density peak clustering algorithm shows the stable in result to be applied to data-mining module of the intrusion detection system. The proposed system depends on two objectives; the first objective is to analyzing the disadvantage of DPC; however, we propose a novel improvement of DPC algorithm by modifying the calculation of local density method based on cosine similarity instead of the cat off distance parameter to improve the operation of selecting the peak points. The second objective is using the Gaussian kernel measure as a distance metric instead of Euclidean distance to improve clustering of high-dimensional complex nonlinear inseparable network traffic data and reduce the noise. The experimentations evaluated with NSL-KDD dataset

Periodicals of Engineering and Natural Sciences (PEN - International University of Sarajevo)