Search CORE

3,091 research outputs found

Outlier Detection from Network Data with Subnetwork Interpretation

Author: Basu Prithwish
Dang Xuan-Hong
Silva Arlei
Singh Ambuj
Swami Ananthram
Publication venue
Publication date: 30/09/2016
Field of study

Detecting a small number of outliers from a set of data observations is always challenging. This problem is more difficult in the setting of multiple network samples, where computing the anomalous degree of a network sample is generally not sufficient. In fact, explaining why the network is exceptional, expressed in the form of subnetwork, is also equally important. In this paper, we develop a novel algorithm to address these two key problems. We treat each network sample as a potential outlier and identify subnetworks that mostly discriminate it from nearby regular samples. The algorithm is developed in the framework of network regression combined with the constraints on both network topology and L1-norm shrinkage to perform subnetwork discovery. Our method thus goes beyond subspace/subgraph discovery and we show that it converges to a global optimum. Evaluation on various real-world network datasets demonstrates that our algorithm not only outperforms baselines in both network and high dimensional setting, but also discovers highly relevant and interpretable local subnetworks, further enhancing our understanding of anomalous networks

arXiv.org e-Print Archive

Crossref

Relational Data Mining Through Extraction of Representative Exemplars

Author: Blanchard Frédéric
Herbin Michel
Publication venue
Publication date: 01/01/2012
Field of study

With the growing interest on Network Analysis, Relational Data Mining is becoming an emphasized domain of Data Mining. This paper addresses the problem of extracting representative elements from a relational dataset. After defining the notion of degree of representativeness, computed using the Borda aggregation procedure, we present the extraction of exemplars which are the representative elements of the dataset. We use these concepts to build a network on the dataset. We expose the main properties of these notions and we propose two typical applications of our framework. The first application consists in resuming and structuring a set of binary images and the second in mining co-authoring relation in a research team

arXiv.org e-Print Archive

CiteSeerX

HAL Descartes

Market Structure, Inflation, and Price Dispersion

Author: Alpay Filiztekin
Michael T. Rauh
Mustafa Caglayan
Publication venue
Publication date
Field of study

In this paper, we investigate the impact of market structure on the relationship between inflation and price dispersion. We first propose a new empirical model of the relationship between inflation and dispersion with firmer theoretical foundations, and then extend the basic model to incorporate the potential effects of market structure. We estimate the basic and market structure specifications using a unique micro-level data set from Istanbul, which consists of monthly price observations from three different store types: convenience stores, open-air markets, and supermarkets. Our empirical findings support almost all of the basic and market structure predictions.inflation, market structure, menu cost models, micro panel data, price dispersion

Research Papers in Economics

A Local Density-Based Approach for Local Outlier Detection

Author: He Haibo
Tang Bo
Publication venue
Publication date: 27/06/2016
Field of study

This paper presents a simple but effective density-based outlier detection approach with the local kernel density estimation (KDE). A Relative Density-based Outlier Score (RDOS) is introduced to measure the local outlierness of objects, in which the density distribution at the location of an object is estimated with a local KDE method based on extended nearest neighbors of the object. Instead of using only

k

nearest neighbors, we further consider reverse nearest neighbors and shared nearest neighbors of an object for density distribution estimation. Some theoretical properties of the proposed RDOS including its expected value and false alarm probability are derived. A comprehensive experimental study on both synthetic and real-life data sets demonstrates that our approach is more effective than state-of-the-art outlier detection methods.Comment: 22 pages, 14 figures, submitted to Pattern Recognition Letter

arXiv.org e-Print Archive

Crossref

DigitalCommons@URI