171 research outputs found

    Document clustering for knowledge discovery using nature-inspired algorithm

    Get PDF
    As the internet is overload with information, various knowledge based systems are now equipped with data analytics features that facilitate knowledge discovery.This includes the utilization of optimization algorithms that mimics the behavior of insects or animals.This paper presents an experiment on document clustering utilizing the Gravitation Firefly algorithm (GFA).The advantage of GFA is that clustering can be performed without a pre-defined value of k clusters.GFA determines the center of clusters by identifying documents with high force.Upon identification of the centers, clusters are created based on cosine similarity measurement.Experimental results demonstrated that GFA utilizing a random positioning of documents outperforms existing clustering algorithm such as Particles Swarm Optimization (PSO) and K-means

    Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach

    Get PDF
    Defining the correct number of clusters is one of the most fundamental tasks in graph clustering. When it comes to large graphs, this task becomes more challenging because of the lack of prior information. This paper presents an approach to solve this problem based on the Bat Algorithm, one of the most promising swarm intelligence based algorithms. We chose to call our solution, “Bat-Cluster (BC).” This approach allows an automation of graph clustering based on a balance between global and local search processes. The simulation of four benchmark graphs of different sizes shows that our proposed algorithm is efficient and can provide higher precision and exceed some best-known values

    Consensus clustering with differential evolution

    Get PDF
    summary:Consensus clustering algorithms are used to improve properties of traditional clustering methods, especially their accuracy and robustness. In this article, we introduce our approach that is based on a refinement of the set of initial partitions and uses differential evolution algorithm in order to find the most valid solution. Properties of the algorithm are demonstrated on four benchmark datasets

    Multilevel thresholding hyperspectral image segmentation based on independent component analysis and swarm optimization methods

    Get PDF
    High dimensional problems are often encountered in studies related to hyperspectral data. One of the challenges that arise is how to find representations that are accurate so that important structures can be clearly easily. This study aims to process segmentation of hyperspectral image by using swarm optimization techniques. This experiments use Aviris Indian Pines hyperspectral image dataset that consist of 103 bands. The method used for segmentation image is particle swarm optimization (PSO), Darwinian particle swarm optimization (DPSO) and fractional order Darwinian particle swarm optimization (FODPSO). Before process segmentation image, the dimension of the hyperspectral image data set are first reduced by using independent component analysis (ICA) technique to get first independent component. The experimental show that FODPSO method is better than PSO and DPSO, in terms of the average CPU processing time and best fitness value. The PSNR and SSIM values when using FODPSO are better than the other two swarm optimization method. It can be concluded that FODPSO method is better in order to obtain better segmentation results compared to the previous method

    A review of clustering techniques and developments

    Full text link
    © 2017 Elsevier B.V. This paper presents a comprehensive study on clustering: exiting methods and developments made at various times. Clustering is defined as an unsupervised learning where the objects are grouped on the basis of some similarity inherent among them. There are different methods for clustering the objects such as hierarchical, partitional, grid, density based and model based. The approaches used in these methods are discussed with their respective states of art and applicability. The measures of similarity as well as the evaluation criteria, which are the central components of clustering, are also presented in the paper. The applications of clustering in some fields like image segmentation, object and character recognition and data mining are highlighted
    corecore