Search CORE

106,123 research outputs found

Firearms and Tigers are Dangerous, Kitchen Knives and Zebras are Not: Testing whether Word Embeddings Can Tell

Author: Fokkens Antske
Sommerauer Pia
Publication venue
Publication date: 01/01/2018
Field of study

This paper presents an approach for investigating the nature of semantic information captured by word embeddings. We propose a method that extends an existing human-elicited semantic property dataset with gold negative examples using crowd judgments. Our experimental approach tests the ability of supervised classifiers to identify semantic features in word embedding vectors and com- pares this to a feature-identification method based on full vector cosine similarity. The idea behind this method is that properties identified by classifiers, but not through full vector comparison are captured by embeddings. Properties that cannot be identified by either method are not. Our results provide an initial indication that semantic properties relevant for the way entities interact (e.g. dangerous) are captured, while perceptual information (e.g. colors) is not represented. We conclude that, though preliminary, these results show that our method is suitable for identifying which properties are captured by embeddings.Comment: Accepted to the EMNLP workshop "Analyzing and interpreting neural networks for NLP

arXiv.org e-Print Archive

VU Research Portal

Crossref

Recommended from our members

Measuring category intuitiveness in unconstrained categorization tasks

Author: Akaike
Amotz Perlman
Anderson
Ashby
Ashby
Ashby
Barrett
Billman
Brown
Chapman
Chater
Colreavy
Compton
Compton
Corter
Darren J. Edwards
Demetras
Elman
Emmanuel M. Pothos
Estes
Feldman
Feldman
Fiser
Gopnik
Gosselin
Gureckis
Hahn
Hampton
Handel
Handel
Handel
Heller
Hines
John V. McDonnell
Johnson
Jones
Ken Kurtz
Kurtz
Love
Malt
Malt
Mareschal
Medin
Medin
Medin
Medin
Mervis
Milton
Milton
Minda
Morgan
Murphy
Murphy
Murphy
Nelson
Nelson
Nosofsky
Nosofsky
Peter Hines
Pitt
Pothos
Pothos
Pothos
Quinn
Rand
Reber
Regehr
Rips
Rosch
Sanborn
Schyns
Smith
Stewart
Todd M. Bailey
Vanpaemel
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011
Field of study

What makes a category seem natural or intuitive? In this paper, an unsupervised categorization task was employed to examine observer agreement concerning the categorization of nine different stimulus sets. The stimulus sets were designed to capture different intuitions about classification structure. The main empirical index of category intuitiveness was the frequency of the preferred classification, for different stimulus sets. With 169 participants, and a within participants design, with some stimulus sets the most frequent classification was produced over 50 times and with others not more than two or three times. The main empirical finding was that cluster tightness was more important in determining category intuitiveness, than cluster separation. The results were considered in relation to the following models of unsupervised categorization: DIVA, the rational model, the simplicity model, SUSTAIN, an Unsupervised version of the Generalized Context Model (UGCM), and a simple geometric model based on similarity. DIVA, the geometric approach, SUSTAIN, and the UGCM provided good, though not perfect, fits. Overall, the present work highlights several theoretical and practical issues regarding unsupervised categorization and reveals weaknesses in some of the corresponding formal models

City Research Online

Crossref

Online Research @ Cardiff

Cronfa at Swansea University

Differential geometric regularization for supervised learning of classifiers

Author: Bai Qinxun
Rosenberg Steven
Sclaroff Stan
Wu Zheng
Publication venue
Publication date: 01/01/2016
Field of study

We study the problem of supervised learning for both binary and multiclass classification from a unified geometric perspective. In particular, we propose a geometric regularization technique to find the submanifold corresponding to an estimator of the class probability P(y|\vec x). The regularization term measures the volume of this submanifold, based on the intuition that overfitting produces rapid local oscillations and hence large volume of the estimator. This technique can be applied to regularize any classification function that satisfies two requirements: firstly, an estimator of the class probability can be obtained; secondly, first and second derivatives of the class probability estimator can be calculated. In experiments, we apply our regularization technique to standard loss functions for classification, our RBF-based implementation compares favorably to widely used regularization methods for both binary and multiclass classification.http://proceedings.mlr.press/v48/baia16.pdfPublished versio

Boston University Institutional Repository (OpenBU)

Out-of-sample generalizations for supervised manifold learning for classification

Author: Guillemot Christine
Vural Elif
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 09/02/2015
Field of study

Supervised manifold learning methods for data classification map data samples residing in a high-dimensional ambient space to a lower-dimensional domain in a structure-preserving way, while enhancing the separation between different classes in the learned embedding. Most nonlinear supervised manifold learning methods compute the embedding of the manifolds only at the initially available training points, while the generalization of the embedding to novel points, known as the out-of-sample extension problem in manifold learning, becomes especially important in classification applications. In this work, we propose a semi-supervised method for building an interpolation function that provides an out-of-sample extension for general supervised manifold learning algorithms studied in the context of classification. The proposed algorithm computes a radial basis function (RBF) interpolator that minimizes an objective function consisting of the total embedding error of unlabeled test samples, defined as their distance to the embeddings of the manifolds of their own class, as well as a regularization term that controls the smoothness of the interpolation function in a direction-dependent way. The class labels of test data and the interpolation function parameters are estimated jointly with a progressive procedure. Experimental results on face and object images demonstrate the potential of the proposed out-of-sample extension algorithm for the classification of manifold-modeled data sets

arXiv.org e-Print Archive

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

OpenMETU (Middle East Technical University)

HAL-Rennes 1