Search CORE

13,983 research outputs found

Relational visual cluster validity

Author: Ding Y.
Harrison R.F.
Publication venue: 'Elsevier BV'
Publication date: 01/11/2007
Field of study

The assessment of cluster validity plays a very important role in cluster analysis. Most commonly used cluster validity methods are based on statistical hypothesis testing or finding the best clustering scheme by computing a number of different cluster validity indices. A number of visual methods of cluster validity have been produced to display directly the validity of clusters by mapping data into two- or three-dimensional space. However, these methods may lose too much information to correctly estimate the results of clustering algorithms. Although the visual cluster validity (VCV) method of Hathaway and Bezdek can successfully solve this problem, it can only be applied for object data, i.e. feature measurements. There are very few validity methods that can be used to analyze the validity of data where only a similarity or dissimilarity relation exists – relational data. To tackle this problem, this paper presents a relational visual cluster validity (RVCV) method to assess the validity of clustering relational data. This is done by combining the results of the non-Euclidean relational fuzzy c-means (NERFCM) algorithm with a modification of the VCV method to produce a visual representation of cluster validity. RVCV can cluster complete and incomplete relational data and adds to the visual cluster validity theory. Numeric examples using synthetic and real data are presente

White Rose Research Online

Democracy and Economic Development: a Fuzzy Classification Approach

Author: Ana Margarida Oliveira Brochado
Francisco Vitorino Martins
Publication venue
Publication date
Field of study

The aim of this work is to (1) analyse whether countries differ on political indicators (democracy, rule of law, government effectiveness and corruption) and (2) study whether countries with different political profiles are associated with different levels of economic, human development and gender-related development indicators. Using a fuzzy classification approach (fuzzy k-means algorithm), we propose a typology of 124 countries based on 10 political variables. Six segments are identified; these political groups implicate the access to different levels of economic and human development. In this study evidence of a positive but not perfect relationship between democracy and economic and human development is observed, thus presenting new insights for the understanding of the heterogeneity of behaviors relatively to political indicators.Democracy, Economic Development, Fuzzy k-means

Research Papers in Economics

Partitioning Relational Matrices of Similarities or Dissimilarities using the Value of Information

Author: Principe Jose C.
Sledge Isaac J.
Publication venue
Publication date: 27/10/2017
Field of study

In this paper, we provide an approach to clustering relational matrices whose entries correspond to either similarities or dissimilarities between objects. Our approach is based on the value of information, a parameterized, information-theoretic criterion that measures the change in costs associated with changes in information. Optimizing the value of information yields a deterministic annealing style of clustering with many benefits. For instance, investigators avoid needing to a priori specify the number of clusters, as the partitions naturally undergo phase changes, during the annealing process, whereby the number of clusters changes in a data-driven fashion. The global-best partition can also often be identified.Comment: Submitted to the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP

arXiv.org e-Print Archive

Crossref

Examining the segment retention problem for the “Group Satellite” case

Author: Ana Oliveira-Brochado
F. Vitorino Martins
Publication venue
Publication date
Field of study

The purpose of this work is to determine how well, criteria designed to help the selection of the adequate number of market segments, perform in recovering small niche segments, in mixture regressions of normal data, with experimental data. The simulation experiment compares several segment retention criteria, including information criteria and classification-based criteria. We also address the impact of distributional misspecification on segment retention criteria success rates. This study shows that Akaike’s Information criterion with penalty factors of 3 and 4, rather than the traditional value of 2, are the best segment retention criteria to use in recovering small niche segments. Although these criteria were designed for the specific context of mixture models, they are rarely applied in the marketing literature.Information criteria; Latent Class Segmentation.

Research Papers in Economics

Possibilistic and fuzzy clustering methods for robust analysis of non-precise data

Author: Ferraro MARIA BRIGIDA
Giordani Paolo
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

This work focuses on robust clustering of data affected by imprecision. The imprecision is managed in terms of fuzzy sets. The clustering process is based on the fuzzy and possibilistic approaches. In both approaches the observations are assigned to the clusters by means of membership degrees. In fuzzy clustering the membership degrees express the degrees of sharing of the observations to the clusters. In contrast, in possibilistic clustering the membership degrees are degrees of typicality. These two sources of information are complementary because the former helps to discover the best fuzzy partition of the observations while the latter reflects how well the observations are described by the centroids and, therefore, is helpful to identify outliers. First, a fully possibilistic k-means clustering procedure is suggested. Then, in order to exploit the benefits of both the approaches, a joint possibilistic and fuzzy clustering method for fuzzy data is proposed. A selection procedure for choosing the parameters of the new clustering method is introduced. The effectiveness of the proposal is investigated by means of simulated and real-life data

Archivio della ricerca- Università di Roma La Sapienza

Recommended from our members

The role of human factors in stereotyping behavior and perception of digital library users: A robust clustering approach

Author: Chen SY
Frias-Martinez E
Liu X
Macredie RD
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 03/04/2007
Field of study

To deliver effective personalization for digital library users, it is necessary to identify which human factors are most relevant in determining the behavior and perception of these users. This paper examines three key human factors: cognitive styles, levels of expertise and gender differences, and utilizes three individual clustering techniques: k-means, hierarchical clustering and fuzzy clustering to understand user behavior and perception. Moreover, robust clustering, capable of correcting the bias of individual clustering techniques, is used to obtain a deeper understanding. The robust clustering approach produced results that highlighted the relevance of cognitive style for user behavior, i.e., cognitive style dominates and justifies each of the robust clusters created. We also found that perception was mainly determined by the level of expertise of a user. We conclude that robust clustering is an effective technique to analyze user behavior and perception

Brunel University Research Archive

Cluster validity in clustering methods

Author: Zhao Qinpei
Publication venue: University of Eastern Finland
Publication date
Field of study

UEF Electronic Publications