5,192 research outputs found
Bipartite graph partitioning and data clustering
Many data types arising from data mining applications can be modeled as
bipartite graphs, examples include terms and documents in a text corpus,
customers and purchasing items in market basket analysis and reviewers and
movies in a movie recommender system. In this paper, we propose a new data
clustering method based on partitioning the underlying bipartite graph. The
partition is constructed by minimizing a normalized sum of edge weights between
unmatched pairs of vertices of the bipartite graph. We show that an approximate
solution to the minimization problem can be obtained by computing a partial
singular value decomposition (SVD) of the associated edge weight matrix of the
bipartite graph. We point out the connection of our clustering algorithm to
correspondence analysis used in multivariate analysis. We also briefly discuss
the issue of assigning data objects to multiple clusters. In the experimental
results, we apply our clustering algorithm to the problem of document
clustering to illustrate its effectiveness and efficiency.Comment: Proceedings of ACM CIKM 2001, the Tenth International Conference on
Information and Knowledge Management, 200
Adaptive Window Selection for Non-uniform Lighting Image Thresholding
Selection of appropriate size of windows or subimages is the most important step for thresholding images with non-uniform lighting. In this paper, a novel criteria function is developed to partition images into different size of sub images appropriate for thresholding. After the partitioning, each subimage is segmented by Otsu's thresholding approaches. The performance of the proposed method is validated on benchmark test images with different degree of uneven lighting. Based on the qualitative and quantitative measures, the proposed method is fully automatic, fast and efficient in comparison to many landmark approaches
- …