
    Relational visual cluster validity

    The assessment of cluster validity plays a very important role in cluster analysis. The most commonly used cluster validity methods are based on statistical hypothesis testing or on finding the best clustering scheme by computing a number of different cluster validity indices. A number of visual methods have been produced to display the validity of clusters directly by mapping data into two- or three-dimensional space. However, these methods may lose too much information to correctly estimate the results of clustering algorithms. Although the visual cluster validity (VCV) method of Hathaway and Bezdek can successfully solve this problem, it can only be applied to object data, i.e. feature measurements. There are very few validity methods that can be used to analyze the validity of data where only a similarity or dissimilarity relation exists, i.e. relational data. To tackle this problem, this paper presents a relational visual cluster validity (RVCV) method to assess the validity of clustering relational data. This is done by combining the results of the non-Euclidean relational fuzzy c-means (NERFCM) algorithm with a modification of the VCV method to produce a visual representation of cluster validity. RVCV can cluster complete and incomplete relational data and adds to the visual cluster validity theory. Numeric examples using synthetic and real data are presented.

    Democracy and Economic Development: a Fuzzy Classification Approach

    The aim of this work is to (1) analyse whether countries differ on political indicators (democracy, rule of law, government effectiveness and corruption) and (2) study whether countries with different political profiles are associated with different levels of economic, human development and gender-related development indicators. Using a fuzzy classification approach (fuzzy k-means algorithm), we propose a typology of 124 countries based on 10 political variables. Six segments are identified; these political groups imply access to different levels of economic and human development. The study finds evidence of a positive but not perfect relationship between democracy and economic and human development, thus offering new insights into the heterogeneity of behaviors with respect to political indicators.
    Keywords: Democracy, Economic Development, Fuzzy k-means

    Partitioning Relational Matrices of Similarities or Dissimilarities using the Value of Information

    In this paper, we provide an approach to clustering relational matrices whose entries correspond to either similarities or dissimilarities between objects. Our approach is based on the value of information, a parameterized, information-theoretic criterion that measures the change in costs associated with changes in information. Optimizing the value of information yields a deterministic annealing style of clustering with many benefits. For instance, investigators do not need to specify the number of clusters a priori, because the partitions naturally undergo phase changes during the annealing process, whereby the number of clusters changes in a data-driven fashion. The global-best partition can also often be identified.
    Comment: Submitted to the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP

    Examining the segment retention problem for the “Group Satellite” case

    The purpose of this work is to determine how well criteria designed to help select the adequate number of market segments perform in recovering small niche segments in mixture regressions of normal data, using experimental data. The simulation experiment compares several segment retention criteria, including information criteria and classification-based criteria. We also address the impact of distributional misspecification on the success rates of segment retention criteria. This study shows that versions of Akaike's Information Criterion with penalty factors of 3 and 4, rather than the traditional value of 2, are the best segment retention criteria to use in recovering small niche segments. Although these criteria were designed for the specific context of mixture models, they are rarely applied in the marketing literature.
    Keywords: Information criteria; Latent Class Segmentation
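The role of the penalty factor mentioned in the abstract above can be sketched as follows. This is only an illustration of the general form of such criteria, -2 log L + penalty x (number of free parameters), and not the study's simulation design. The function names and the toy log-likelihoods in `toy_fits` are hypothetical and made up for the example.

```python
def information_criterion(log_likelihood, n_params, penalty=2.0):
    """Penalized fit: -2 * logL + penalty * (number of free parameters).

    penalty=2.0 gives the traditional AIC; 3.0 and 4.0 give the
    heavier-penalty variants referred to in the abstract.
    """
    return -2.0 * log_likelihood + penalty * n_params


def select_segments(fits, penalty=2.0):
    """Return the segment count whose fit minimizes the criterion.

    `fits` maps a candidate number of segments to a tuple
    (maximized log-likelihood, number of free parameters).
    """
    return min(
        fits,
        key=lambda s: information_criterion(fits[s][0], fits[s][1], penalty),
    )


# Hypothetical fitted models (values invented for illustration only).
toy_fits = {
    1: (-1520.0, 3),
    2: (-1480.0, 7),
    3: (-1474.5, 11),
}

choice_aic = select_segments(toy_fits, penalty=2.0)
choice_aic3 = select_segments(toy_fits, penalty=3.0)
```

With these invented numbers the two penalties disagree on the retained number of segments, which is the kind of sensitivity a segment retention study has to evaluate.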

    Possibilistic and fuzzy clustering methods for robust analysis of non-precise data

    This work focuses on robust clustering of data affected by imprecision. The imprecision is managed in terms of fuzzy sets. The clustering process is based on the fuzzy and possibilistic approaches. In both approaches the observations are assigned to the clusters by means of membership degrees. In fuzzy clustering the membership degrees express the degree to which each observation is shared among the clusters. In contrast, in possibilistic clustering the membership degrees are degrees of typicality. These two sources of information are complementary because the former helps to discover the best fuzzy partition of the observations, while the latter reflects how well the observations are described by the centroids and is therefore helpful for identifying outliers. First, a fully possibilistic k-means clustering procedure is suggested. Then, in order to exploit the benefits of both approaches, a joint possibilistic and fuzzy clustering method for fuzzy data is proposed. A selection procedure for choosing the parameters of the new clustering method is introduced. The effectiveness of the proposal is investigated by means of simulated and real-life data.
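The contrast between shared memberships and typicalities described above can be sketched with the textbook update formulas for crisp (non-fuzzy) data: the fuzzy c-means membership and the possibilistic c-means typicality of Krishnapuram and Keller. This is an assumption-laden illustration, not the method proposed in the paper, which handles fuzzy data. The function names, the fuzzifier `m`, and the scale parameters `etas` are chosen here for the example.

```python
def fuzzy_memberships(sq_dists, m=2.0):
    """Fuzzy c-means memberships of one observation.

    `sq_dists` holds its squared distances to each centroid (all > 0).
    The memberships are relative, so they always sum to 1.
    """
    exponent = 1.0 / (m - 1.0)
    return [
        1.0 / sum((d_i / d_j) ** exponent for d_j in sq_dists)
        for d_i in sq_dists
    ]


def typicalities(sq_dists, etas, m=2.0):
    """Possibilistic typicalities of one observation.

    Each value depends only on the distance to its own centroid and on
    that cluster's scale `eta`, so the values need not sum to 1.
    """
    exponent = 1.0 / (m - 1.0)
    return [
        1.0 / (1.0 + (d / eta) ** exponent)
        for d, eta in zip(sq_dists, etas)
    ]


# An observation close to the first of two centroids.
near_fuzzy = fuzzy_memberships([1.0, 9.0])
near_typical = typicalities([1.0, 9.0], etas=[1.0, 1.0])

# An outlier equally far from both centroids.
outlier_fuzzy = fuzzy_memberships([100.0, 100.0])
outlier_typical = typicalities([100.0, 100.0], etas=[1.0, 1.0])
```

The outlier still receives memberships that sum to one under the fuzzy formula, while its typicalities are close to zero for both clusters, which is why typicalities are useful for flagging outliers.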

    Cluster validity in clustering methods
