1,961 research outputs found
A Generalized Framework for Agglomerative Clustering of Signed Graphs applied to Instance Segmentation
We propose a novel theoretical framework that generalizes algorithms for
hierarchical agglomerative clustering to weighted graphs with both attractive
and repulsive interactions between the nodes. This framework defines GASP, a
Generalized Algorithm for Signed graph Partitioning, and allows us to explore
many combinations of different linkage criteria and cannot-link constraints. We
prove the equivalence of existing clustering methods to some of those
combinations, and introduce new algorithms for combinations which have not been
studied. An extensive comparison is performed to evaluate properties of the
clustering algorithms in the context of instance segmentation in images,
including robustness to noise and efficiency. We show how one of the new
algorithms proposed in our framework outperforms all previously known
agglomerative methods for signed graphs, both on the competitive CREMI 2016 EM
segmentation benchmark and on the CityScapes dataset.Comment: 19 pages, 8 figures, 6 table
Spatial agglomeration and business groups: new evidence from Italian industrial districts
This paper is a first attempt to analyse the relationship between spatial agglomeration and firms’ organizational structures. It takes advantage of a large data set on Italian business groups that allowed us to analyse the differences in the presence and characteristics of business groups between districts and non district areas. Overall the result confirms the hypothesis that spatial agglomeration of business activities influences firms’ organization. Groups are more widespread in industrial districts than in non-district areas; moreover groups in industrial districts are less diversified and more spatially concentrated than groups outside industrial districts. Key words: business groups, industrial districts, spatial agglomeration, firm’s organizational forms JEL Classification: L22, R12
Color image segmentation using a spatial k-means clustering algorithm
This paper details the implementation of a new adaptive technique for color-texture segmentation that is a generalization of the standard K-Means algorithm. The standard K-Means algorithm produces accurate segmentation results only when applied to images defined by homogenous regions with respect to texture and color since no local constraints are applied to impose spatial continuity. In addition, the initialization of the K-Means algorithm is problematic and usually the initial cluster centers are randomly picked. In this paper we detail the implementation of a novel technique to select the dominant colors from the input image using the information from the color histograms. The main contribution of this work is the generalization of the K-Means algorithm that includes the primary features that describe the color smoothness and texture complexity in the process of pixel assignment. The resulting color segmentation scheme has been applied to a large number of natural images and the experimental data indicates the robustness of the new developed segmentation algorithm
Multi-level algorithms for modularity clustering
Modularity is one of the most widely used quality measures for graph
clusterings. Maximizing modularity is NP-hard, and the runtime of exact
algorithms is prohibitive for large graphs. A simple and effective class of
heuristics coarsens the graph by iteratively merging clusters (starting from
singletons), and optionally refines the resulting clustering by iteratively
moving individual vertices between clusters. Several heuristics of this type
have been proposed in the literature, but little is known about their relative
performance.
This paper experimentally compares existing and new coarsening- and
refinement-based heuristics with respect to their effectiveness (achieved
modularity) and efficiency (runtime). Concerning coarsening, it turns out that
the most widely used criterion for merging clusters (modularity increase) is
outperformed by other simple criteria, and that a recent algorithm by Schuetz
and Caflisch is no improvement over simple greedy coarsening for these
criteria. Concerning refinement, a new multi-level algorithm is shown to
produce significantly better clusterings than conventional single-level
algorithms. A comparison with published benchmark results and algorithm
implementations shows that combinations of coarsening and multi-level
refinement are competitive with the best algorithms in the literature.Comment: 12 pages, 10 figures, see
http://www.informatik.tu-cottbus.de/~rrotta/ for downloading the graph
clustering softwar
Multiorder neurons for evolutionary higher-order clustering and growth
This letter proposes to use multiorder neurons for clustering irregularly shaped data arrangements. Multiorder neurons are an evolutionary extension of the use of higher-order neurons in clustering. Higher-order neurons parametrically model complex neuron shapes by replacing the classic synaptic weight by higher-order tensors. The multiorder neuron goes one step further and eliminates two problems associated with higher-order neurons. First, it uses evolutionary algorithms to select the best neuron order for a given problem. Second, it obtains more information about the underlying data distribution by identifying the correct order for a given cluster of patterns. Empirically we observed that when the correlation of clusters found with ground truth information is used in measuring clustering accuracy, the proposed evolutionary multiorder neurons method can be shown to outperform other related clustering methods. The simulation results from the Iris, Wine, and Glass data sets show significant improvement when compared to the results obtained using self-organizing maps and higher-order neurons. The letter also proposes an intuitive model by which multiorder neurons can be grown, thereby determining the number of clusters in data
Image annotation and retrieval based on multi-modal feature clustering and similarity propagation.
The performance of content-based image retrieval systems has proved to be inherently constrained by the used low level features, and cannot give satisfactory results when the user\u27s high level concepts cannot be expressed by low level features. In an attempt to bridge this semantic gap, recent approaches started integrating both low level-visual features and high-level textual keywords. Unfortunately, manual image annotation is a tedious process and may not be possible for large image databases. In this thesis we propose a system for image retrieval that has three mains components. The first component of our system consists of a novel possibilistic clustering and feature weighting algorithm based on robust modeling of the Generalized Dirichlet (GD) finite mixture. Robust estimation of the mixture model parameters is achieved by incorporating two complementary types of membership degrees. The first one is a posterior probability that indicates the degree to which a point fits the estimated distribution. The second membership represents the degree of typicality and is used to indentify and discard noise points. Robustness to noisy and irrelevant features is achieved by transforming the data to make the features independent and follow Beta distribution, and learning optimal relevance weight for each feature subset within each cluster. We extend our algorithm to find the optimal number of clusters in an unsupervised and efficient way by exploiting some properties of the possibilistic membership function. We also outline a semi-supervised version of the proposed algorithm. In the second component of our system consists of a novel approach to unsupervised image annotation. Our approach is based on: (i) the proposed semi-supervised possibilistic clustering; (ii) a greedy selection and joining algorithm (GSJ); (iii) Bayes rule; and (iv) a probabilistic model that is based on possibilistic memebership degrees to annotate an image. The third component of the proposed system consists of an image retrieval framework based on multi-modal similarity propagation. The proposed framework is designed to deal with two data modalities: low-level visual features and high-level textual keywords generated by our proposed image annotation algorithm. The multi-modal similarity propagation system exploits the mutual reinforcement of relational data and results in a nonlinear combination of the different modalities. Specifically, it is used to learn the semantic similarities between images by leveraging the relationships between features from the different modalities. The proposed image annotation and retrieval approaches are implemented and tested with a standard benchmark dataset. We show the effectiveness of our clustering algorithm to handle high dimensional and noisy data. We compare our proposed image annotation approach to three state-of-the-art methods and demonstrate the effectiveness of the proposed image retrieval system
Differentiation of Modernization Processes in EU Countries and in Poland in Time and Space - Non Classical View
The aim of this paper is a typology of EU-countries from the point of view of their modernization measures, a diagnostics and an evaluation of changes in a sectoral structure of the employment in EU-countries. Against this background there are exposed changes in Polish economy in the period 1997-2008. There were used the multidimensional comparative analysis of the sequence separation of homogenous objects subsets (cluster analysis and quick cluster) and the little known in Poland, but popular in the US, measurement technique of sectoral changes in time - the shift share-technique to illustrate and verify empirically the theory of three sectors by A. Fisher, C. Clark i J. Fourastiè, orientated to the evaluation of the modernization process of EU-economies
- …