4 research outputs found
Perbandingan Rapid Centroid Estimation (RCE) — K Nearest Neighbor (K-NN) Dengan K Means — K Nearest Neighbor (K-NN)
Teknik Clustering terbukti dapat meningkatkan akurasi dalam melakukan klasifikasi, terutama pada algoritma K-Nearest Neighbor (K-NN). Setiap data dari setiap kelas akan membentuk K cluster yang kemudian nilai centroid akhir dari setiap cluster pada setiap kelas data tersebut akan dijadikan data acuan untuk melakukan proses klasifikasi menggunakan algoritma K-NN. Namun kendala dari banyaknya teknik clustering adalah biaya komputasi yang mahal, Rapid Centroid Estimation (RCE) dan K-Means termasuk kedalam teknik clustering dengan biaya komputasi yang murah. Untuk melihat manakah dari kedua algoritma ini (RCE dan K-Means) yang lebih baik memberikan peningkatan akurasi pada algoritma K-NN maka, pada penelitian ini akan mencoba untuk membandingkan kedua algoritma tersebut. Hasil dari penelitian ini adalah gabungan RCE—K-NN memberikan hasil akurasi yang lebih baik dari K-Means—K-NN pada data set iris dan wine. Namun dalam perubahan nilai akurasi RCE—K-NN lebih stabil hanya pada data set iris. Sedangkan pada data set wine, K-Means—K-NN terlihat mendapati perubahan akurasi yang lebih stabil dibandingkan RCE—K-NN
Recommended from our members
An Evaluation of Performance Enhancements to Particle Swarm Optimisation on Real-World Data
Swarm Computation is a relatively new optimisation paradigm. The basic premise is to model the collective behaviour of self-organised natural phenomena such as swarms, flocks and shoals, in order to solve optimisation problems. Particle Swarm Optimisation (PSO) is a type of swarm computation inspired by bird flocks or swarms of bees by modelling their collective social influence as they search for optimal solutions.
In many real-world applications of PSO, the algorithm is used as a data pre-processor for a neural network or similar post processing system, and is often extensively modified to suit the application. The thesis introduces techniques that allow unmodified PSO to be applied successfully to a range of problems, specifically three extensions to the basic PSO algorithm: solving optimisation problems by training a hyperspatial matrix, using a hierarchy of swarms to coordinate optimisation on several data sets simultaneously, and dynamic neighbourhood selection in swarms.
Rather than working directly with candidate solutions to an optimisation problem, the PSO algorithm is adapted to train a matrix of weights, to produce a solution to the problem from the inputs. The search space is abstracted from the problem data.
A single PSO swarm optimises a single data set and has difficulties where the data set comprises disjoint parts (such as time series data for different days). To address this problem, we introduce a hierarchy of swarms, where each child swarm optimises one section of the data set whose gbest particle is a member of the swarm above in the hierarchy. The parent swarm(s) coordinate their children and encourage more exploration of the solution space. We show that hierarchical swarms of this type perform better than single swarm PSO optimisers on the disjoint data sets used.
PSO relies on interaction between particles within a neighbourhood to find good solutions. In many PSO variants, possible interactions are arbitrary and fixed on initialisation. Our third contribution is a dynamic neighbourhood selection: particles can modify their neighbourhood, based on the success of the candidate neighbour particle. As PSO is intended to reflect the social interaction of agents, this change significantly increases the ability of the swarm to find optimal solutions. Applied to real-world medical and cosmological data, this modification is and shows improvements over standard PSO approaches with fixed neighbourhoods
Classification of Medical Data Based On Sparse Representation Using Dictionary Learning
Due to the increase in the sources of image acquisition and storage capacity, the
search for relevant information in large medical image databases has become more
challenging. Classification of medical data into different categories is an important
task, and enables efficient cataloging and retrieval with large image collections. The
medical image classification systems available today classify medical images based on
modality, body part, disease or orientation. Recent work in this direction seek to use
the semantics of medical data to achieve better classification. However, representation
of semantics is a challenging task and sparse representation has been explored in this
thesis for this task
Optimization strategies for rapid centroid estimation
Particle swarm algorithm has been extensively utilized as a tool to solve optimization problems. Recently proposed particle swarm±based clustering algorithm called the Rapid Centroid Estimation (RCE) is a lightweight alteration to Particle Swarm Clustering (PSC). The RCE in its standard form is shown to be superior to conventional PSC algorithm. We have observed some limitations in RCE including the possibility to stagnate at a local minimum combination and the restriction in swarm size. We propose strategies to optimize RCE further by introducing RCE+ and swarm RCE+. Five benchmark datasets from UCI machine learning database are used to test the performance of these new strategies. In Glass dataset swarm RCE+ is able to achieve highest purity centroid combinations with less iteration (90.3%±1.1% in 9±5 iterations) followed by RCE+ (89%±3.5% in 65±62 iterations) and RCE (87%±5.9% in 54±44). Similar quality is also reflected in other benchmark datasets including Iris, Wine, Breast Cancer, and Diabetes. © 2012 IEEE