CORE
CO
nnecting
RE
positories
Services
Services overview
Explore all CORE services
Access to raw data
API
Dataset
FastSync
Content discovery
Recommender
Discovery
OAI identifiers
OAI Resolver
Managing content
Dashboard
Bespoke contracts
Consultancy services
Support us
Support us
Membership
Sponsorship
Research partnership
About
About
About us
Our mission
Team
Blog
FAQs
Contact us
Community governance
Governance
Advisory Board
Board of supporters
Research network
Innovations
Our research
Labs
research
A survey on feature weighting based K-Means algorithms
Authors
A GODER
A STURN
+37 more
AK JAIN
AL BLUM
AP DEMPSTER
AP GASCH
B Mirkin
CY TSAI
D ALOISE
D Steinley
D STEINLEY
D STEINLEY
D WETTSCHERECK
DS MODHA
E Polak
F Murtagh
G Soete de
G Soete de
GH BALL
H Steinhaus
I GUYON
JC BEZDEK
L HUBERT
LA ZADEH
P DRINEAS
P MITRA
PE GREEN
R Bellman
R GNANADESIKAN
R KOHAVI
RC AMORIM DE
RC AMORIM DE
Renato Cordeiro de Amorim
SP CHATZIS
V MAKARENKOV
WS DESARBO
WS DESARBO
WS DESARBO
Z Huang
Publication date
25 August 2016
Publisher
'Springer Science and Business Media LLC'
Doi
Abstract
This is a pre-copyedited, author-produced PDF of an article accepted for publication in Journal of Classification [de Amorim, R. C., 'A survey on feature weighting based K-Means algorithms', Journal of Classification, Vol. 33(2): 210-242, August 25, 2016]. Subject to embargo. Embargo end date: 25 August 2017. The final publication is available at Springer via http://dx.doi.org/10.1007/s00357-016-9208-4 © Classification Society of North America 2016In a real-world data set there is always the possibility, rather high in our opinion, that different features may have different degrees of relevance. Most machine learning algorithms deal with this fact by either selecting or deselecting features in the data preprocessing phase. However, we maintain that even among relevant features there may be different degrees of relevance, and this should be taken into account during the clustering process. With over 50 years of history, K-Means is arguably the most popular partitional clustering algorithm there is. The first K-Means based clustering algorithm to compute feature weights was designed just over 30 years ago. Various such algorithms have been designed since but there has not been, to our knowledge, a survey integrating empirical evidence of cluster recovery ability, common flaws, and possible directions for future research. This paper elaborates on the concept of feature weighting and addresses these issues by critically analysing some of the most popular, or innovative, feature weighting mechanisms based in K-Means.Peer reviewedFinal Accepted Versio
Similar works
Full text
Open in the Core reader
Download PDF
Available Versions
University of Essex Research Repository
See this paper in CORE
Go to the repository landing page
Download from data provider
oai:repository.essex.ac.uk:203...
Last time updated on 20/10/2017
University of Hertfordshire Research Archive
See this paper in CORE
Go to the repository landing page
Download from data provider
oai:uhra.herts.ac.uk:5259
Last time updated on 02/07/2025
Crossref
See this paper in CORE
Go to the repository landing page
Download from data provider
info:doi/10.1007%2Fs00357-016-...
Last time updated on 01/04/2019