The Metric Nearness Problems with Applications

Inderjit S. Dhillon; Suvrit Sra


Abstract

Many practical applications in machine learning require pairwise distances among a set of objects. It is often desirable that these distance measurements satisfy the properties of a metric, especially the triangle inequality. Applications that could benefit from the metric property include data clustering and metric-based indexing of databases. In this paper, we present the metric nearness problem: Given a dissimilarity matrix, find the “nearest” matrix of distances that satisfy the triangle inequalities. A weight matrix in the formulation captures the confidence in individual dissimilarity measures, including the case of altogether missing distances. For an important class of nearness measures, the problem can be attacked with convex optimization techniques. A pleasing aspect of this formulation is that we can compute globally optimal solutions. Experiments on some sample dissimilarity matrices are presented, including some from biology.
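To make the problem statement concrete, the following is a minimal Python sketch of what it means for a dissimilarity matrix to violate the triangle inequalities, and of one simple way to obtain a matrix that satisfies them. The function names and the 3x3 example matrix are illustrative assumptions and do not come from the paper. The repair step is a shortest-path closure (Floyd-Warshall), a simple baseline that only ever decreases entries. It is not the weighted convex optimization approach described in the abstract, and it does not use a weight matrix.

```python
import itertools


def triangle_violations(d, tol=1e-12):
    """Return all index triples (i, j, k) with d[i][j] > d[i][k] + d[k][j]."""
    n = len(d)
    return [
        (i, j, k)
        for i, j, k in itertools.permutations(range(n), 3)
        if d[i][j] > d[i][k] + d[k][j] + tol
    ]


def shortest_path_repair(d):
    """Replace each entry by the shortest-path distance between its endpoints.

    This is a baseline, not the paper's method: the result satisfies every
    triangle inequality, but entries can only shrink, never grow.
    """
    n = len(d)
    x = [row[:] for row in d]  # copy so the input is left untouched
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if x[i][k] + x[k][j] < x[i][j]:
                    x[i][j] = x[i][k] + x[k][j]
    return x


# Hypothetical symmetric dissimilarity matrix: d(0,2) = 5 exceeds
# d(0,1) + d(1,2) = 3, so the triangle inequality fails.
D = [
    [0, 1, 5],
    [1, 0, 2],
    [5, 2, 0],
]

M = shortest_path_repair(D)
print(triangle_violations(D))  # violated triples in the input
print(M)                       # repaired matrix
print(triangle_violations(M))  # no violations remain
```

In this sketch the only change is that the entry between objects 0 and 2 drops from 5 to 3. A formulation such as the one in the abstract could instead spread the correction across several entries, raising some and lowering others, according to the chosen nearness measure and the confidence weights.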
