A New Class of Weighted Similarity Indices Using Polytomous Variables

I. MORLINI; S. ZANI

Publication date: 1 January 2012

Abstract

In this paper we introduce new similarity indices for variables with multiple categories. The proposed measures are conceptually simple and straightforward to compute. In contrast to traditionally used similarity indices, they also take into account the frequency of the modalities of each attribute in the sample. This feature is useful when dealing with rare categories, since it makes sense to evaluate the pairwise presence of a rare category differently from the pairwise presence of a widespread one. Moreover, this feature helps to find under-represented groups in cluster analysis. There are two versions of the weighted index: one for independent categorical variables and one for dependent variables. The suitability of the proposed indices is shown using both simulated and real-world data sets.

Available Versions

Archivio istituzionale della Ricerca - Università degli Studi di Parma

oai:air.unipr.it:11381/2434843

Last time updated on 09/07/2019