Dependency detection with similarity constraints

Kaski, Samuel; Knuutila, Sakari; Lahti, Leo; Myllykangas, Samuel

slides

Dependency detection with similarity constraints

Authors: Samuel Kaski
Sakari Knuutila
Leo Lahti
Samuel Myllykangas
Publication date: 1 January 2009
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'
Doi

Abstract

Unsupervised two-view learning, or detection of dependencies between two paired data sets, is typically done by some variant of canonical correlation analysis (CCA). CCA searches for a linear projection for each view, such that the correlations between the projections are maximized. The solution is invariant to any linear transformation of either or both of the views; for tasks with small sample size such flexibility implies overfitting, which is even worse for more flexible nonparametric or kernel-based dependency discovery methods. We develop variants which reduce the degrees of freedom by assuming constraints on similarity of the projections in the two views. A particular example is provided by a cancer gene discovery application where chromosomal distance affects the dependencies between gene copy number and activity levels. Similarity constraints are shown to improve detection performance of known cancer genes.Comment: 9 pages, 3 figures. Appeared in proceedings of the 2009 IEEE International Workshop on Machine Learning for Signal Processing XIX (MLSP'09). Implementation of the method available at http://bioconductor.org/packages/devel/bioc/html/pint.htm

Similar works

Full text

Available Versions

Crossref

Last time updated on 05/06/2019

CiteSeerX

oai:CiteSeerX.psu:10.1.1.745.8...

Last time updated on 30/10/2017