Search CORE

47,522 research outputs found

Positive Definite Kernels in Machine Learning

Author: Cuturi Marco
Publication venue
Publication date: 01/01/2009
Field of study

This survey is an introduction to positive definite kernels and the set of methods they have inspired in the machine learning literature, namely kernel methods. We first discuss some properties of positive definite kernels as well as reproducing kernel Hibert spaces, the natural extension of the set of functions

\{k(x,\cdot),x\in\mathcal{X}\}

associated with a kernel

k

defined on a space

\mathcal{X}

. We discuss at length the construction of kernel functions that take advantage of well-known statistical models. We provide an overview of numerous data-analysis methods which take advantage of reproducing kernel Hilbert spaces and discuss the idea of combining several kernels to improve the performance on certain tasks. We also provide a short cookbook of different kernels which are particularly useful for certain data-types such as images, graphs or speech segments.Comment: draft. corrected a typo in figure

arXiv.org e-Print Archive

CiteSeerX

Edit distance Kernelization of NP theorem proving for polynomial-time machine learning of proof heuristics

Author: A Urquhart
Andreas Fischer
C Cortes
G Gonthier
J. A. Robinson
Lawrence C. Paulson
Michel Neuhaus
Stephen A. Cook
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

We outline a general strategy for the application of edit- distance based kernels to NP Theorem Proving in order to allow for polynomial-time machine learning of proof heuristics without the loss of sequential structural information associated with conventional feature- based machine learning. We provide a general short introduction to logic and proof considering a few important complexity results to set the scene and highlight the relevance of our findings

Crossref

Middlesex University Research Repository

Edit distance Kernelization of NP theorem proving for polynomial-time machine learning of proof heuristics

Author: Kammueller F.
Kammueller F.
Windridge D.
Windridge D.
Publication venue: Springer
Publication date: 01/01/2020
Field of study

Middlesex University Research Repository

Semi-supervised prediction of protein interaction sentences exploiting semantically encoded metrics

Author: D.D. Lewis
E.M. Marcotte
J.D. Kim
K. Lund
L. Azzopardi
M. Girolami
M.N. Jones
M.N. Jones
R. Bunescu
S. Padó
S. Pyysalo
S. Rogers
T. Joachims
T.K. Landauer
Z. Minier
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Protein-protein interaction (PPI) identification is an integral component of many biomedical research and database curation tools. Automation of this task through classification is one of the key goals of text mining (TM). However, labelled PPI corpora required to train classifiers are generally small. In order to overcome this sparsity in the training data, we propose a novel method of integrating corpora that do not contain relevance judgements. Our approach uses a semantic language model to gather word similarity from a large unlabelled corpus. This additional information is integrated into the sentence classification process using kernel transformations and has a re-weighting effect on the training features that leads to an 8% improvement in F-score over the baseline results. Furthermore, we discover that some words which are generally considered indicative of interactions are actually neutralised by this process