
Clustering files of chemical structures using the Székely–Rizzo generalization of Ward's method

By T. Varin, R. Bureau, C. Mueller and P. Willett

Abstract

Ward's method is extensively used for clustering chemical structures represented by 2D fingerprints. This paper compares Ward clusterings of 14 datasets (containing between 278 and 4332 molecules) with those obtained using the Székely–Rizzo clustering method, a generalization of Ward's method. The clusters resulting from these two methods were evaluated by the extent to which the various classifications were able to group active molecules together, using a novel criterion of clustering effectiveness. Analysis of a total of 1400 classifications (Ward and Székely–Rizzo clustering methods, 14 different datasets, 5 different fingerprints and 10 different distance coefficients) demonstrated the general superiority of the Székely–Rizzo method. The distance coefficient first described by Soergel performed extremely well in these experiments, and this was also the case when it was used in simulated virtual screening experiments.
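
For readers unfamiliar with the two quantities named in the abstract, the following is a minimal Python sketch, not the authors' implementation. It assumes the usual textbook definitions: the Soergel distance as the sum of absolute differences divided by the sum of element-wise maxima (for binary fingerprints this equals one minus the Tanimoto coefficient), and the Székely–Rizzo between-cluster e-distance with an exponent alpha applied to Euclidean distances, which for alpha = 2 is twice the Ward merge cost. The function names `soergel_distance` and `e_distance` are hypothetical, and the abstract does not say how the paper combines the e-distance with its ten distance coefficients.

```python
import math


def soergel_distance(x, y):
    """Soergel distance between two non-negative vectors of equal length.

    Assumed definition: sum(|x_i - y_i|) / sum(max(x_i, y_i)).
    Two all-zero vectors are treated as identical (distance 0.0).
    """
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    numerator = sum(abs(a - b) for a, b in zip(x, y))
    denominator = sum(max(a, b) for a, b in zip(x, y))
    return numerator / denominator if denominator else 0.0


def e_distance(cluster_a, cluster_b, alpha=1.0):
    """Szekely-Rizzo e-distance between two clusters of points.

    Assumed definition, with d(p, q) = ||p - q|| ** alpha:
      (n1*n2 / (n1+n2)) * (2*mean_between - mean_within_a - mean_within_b)
    where the within-cluster means run over all ordered pairs,
    including each point paired with itself.
    """
    def d(p, q):
        return math.dist(p, q) ** alpha

    n1, n2 = len(cluster_a), len(cluster_b)
    between = sum(d(a, b) for a in cluster_a for b in cluster_b)
    within_a = sum(d(p, q) for p in cluster_a for q in cluster_a)
    within_b = sum(d(p, q) for p in cluster_b for q in cluster_b)
    weight = n1 * n2 / (n1 + n2)
    return weight * (
        2.0 * between / (n1 * n2) - within_a / n1**2 - within_b / n2**2
    )


if __name__ == "__main__":
    # Two toy binary fingerprints sharing two of four set bits.
    print(soergel_distance([1, 1, 0, 1], [1, 0, 1, 1]))
    # With alpha = 2 the e-distance is twice the Ward merge cost.
    print(e_distance([(0, 0), (2, 0)], [(4, 0)], alpha=2))
```

In an agglomerative procedure of this kind, the pair of clusters with the smallest e-distance would be merged at each step; setting alpha to a value other than 2 is what distinguishes the generalization from Ward's method in this sketch.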

Publisher: Elsevier
Year: 2009
OAI identifier: oai:eprints.whiterose.ac.uk:10328


