Search CORE

78 research outputs found

Specialising Word Vectors for Lexical Entailment

Author: Mrkšić N
Vulic I
Publication venue: https://aclweb.org/anthology/volumes/proceedings-of-the-2018-conference-of-the-north-american-chapter-of-the-association-for-computational-linguistics-human-language-technologies-volume-1-long-papers/
Publication date: 01/01/2018
Field of study

We present LEAR (Lexical Entailment Attract-Repel), a novel post-processing method that transforms any input word vector space to emphasise the asymmetric relation of lexical entailment (LE), also known as the IS-A or hyponymy-hypernymy relation. By injecting external linguistic constraints (e.g., WordNet links) into the initial vector space, the LE specialisation procedure brings true hyponymy-hypernymy pairs closer together in the transformed Euclidean space. The proposed asymmetric distance measure adjusts the norms of word vectors to reflect the actual WordNet-style hierarchy of concepts. Simultaneously, a joint objective enforces semantic similarity using the symmetric cosine distance, yielding a vector space specialised for both lexical relations at once. LEAR specialisation achieves state-of-the-art performance in the tasks of hypernymy directionality, hypernymy detection, and graded lexical entailment, demonstrating the effectiveness and robustness of the proposed asymmetric specialisation model

arXiv.org e-Print Archive

Crossref

Apollo (Cambridge)

Don't Blame Distributional Semantics if it can't do Entailment

Author: Boleda Gemma
Westera Matthijs
Publication venue
Publication date: 01/01/2019
Field of study

Distributional semantics has had enormous empirical success in Computational Linguistics and Cognitive Science in modeling various semantic phenomena, such as semantic similarity, and distributional models are widely used in state-of-the-art Natural Language Processing systems. However, the theoretical status of distributional semantics within a broader theory of language and cognition is still unclear: What does distributional semantics model? Can it be, on its own, a fully adequate model of the meanings of linguistic expressions? The standard answer is that distributional semantics is not fully adequate in this regard, because it falls short on some of the central aspects of formal semantic approaches: truth conditions, entailment, reference, and certain aspects of compositionality. We argue that this standard answer rests on a misconception: These aspects do not belong in a theory of expression meaning, they are instead aspects of speaker meaning, i.e., communicative intentions in a particular context. In a slogan: words do not refer, speakers do. Clearing this up enables us to argue that distributional semantics on its own is an adequate model of expression meaning. Our proposal sheds light on the role of distributional semantics in a broader theory of language and cognition, its relationship to formal semantics, and its place in computational models.Comment: To appear in Proceedings of the 13th International Conference on Computational Semantics (IWCS 2019), Gothenburg, Swede

arXiv.org e-Print Archive

Crossref

Leiden University Scholary Publications

Recommended from our members

Scoring lexical entailment with a supervised directional similarity network

Author: Gerz Daniela
Rei Marek
Vulić I
Publication venue: 'Organisation for Economic Co-Operation and Development (OECD)'
Publication date: 01/01/2018
Field of study

Scoring Lexical Entailment with a Supervised Directional Similarity NetworkERC Nvidi

Apollo (Cambridge)

Distributional Inclusion Vector Embedding for Unsupervised Hypernymy Detection

Author: Chang Haw-Shiuan
McCallum Andrew
Vilnis Luke
Wang ZiYun
Publication venue
Publication date: 01/01/2018
Field of study

Modeling hypernymy, such as poodle is-a dog, is an important generalization aid to many NLP tasks, such as entailment, coreference, relation extraction, and question answering. Supervised learning from labeled hypernym sources, such as WordNet, limits the coverage of these models, which can be addressed by learning hypernyms from unlabeled text. Existing unsupervised methods either do not scale to large vocabularies or yield unacceptably poor accuracy. This paper introduces distributional inclusion vector embedding (DIVE), a simple-to-implement unsupervised method of hypernym discovery via per-word non-negative vector embeddings which preserve the inclusion property of word contexts in a low-dimensional and interpretable space. In experimental evaluations more comprehensive than any previous literature of which we are aware-evaluating on 11 datasets using multiple existing as well as newly proposed scoring functions-we find that our method provides up to double the precision of previous unsupervised embeddings, and the highest average performance, using a much more compact word representation, and yielding many new state-of-the-art results.Comment: NAACL 201

arXiv.org e-Print Archive

Crossref

Scoring lexical entailment with a supervised directional similarity network

Author: Gerz D
Rei M
Vulić I
Publication venue: ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers)
Publication date: 01/01/2018
Field of study

Scoring Lexical Entailment with a Supervised Directional Similarity NetworkERC Nvidi

arXiv.org e-Print Archive

Crossref

Apollo (Cambridge)

Learning semantic sentence representations from visually grounded language without lexical knowledge

Author: Frank Stefan
Merkx Danny
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/01/2019
Field of study

Current approaches to learning semantic representations of sentences often use prior word-level knowledge. The current study aims to leverage visual information in order to capture sentence level semantics without the need for word embeddings. We use a multimodal sentence encoder trained on a corpus of images with matching text captions to produce visually grounded sentence embeddings. Deep Neural Networks are trained to map the two modalities to a common embedding space such that for an image the corresponding caption can be retrieved and vice versa. We show that our model achieves results comparable to the current state-of-the-art on two popular image-caption retrieval benchmark data sets: MSCOCO and Flickr8k. We evaluate the semantic content of the resulting sentence embeddings using the data from the Semantic Textual Similarity benchmark task and show that the multimodal embeddings correlate well with human semantic similarity judgements. The system achieves state-of-the-art results on several of these benchmarks, which shows that a system trained solely on multimodal data, without assuming any word representations, is able to capture sentence level semantics. Importantly, this result shows that we do not need prior knowledge of lexical level semantics in order to model sentence level semantics. These findings demonstrate the importance of visual information in semantics

arXiv.org e-Print Archive

Radboud Repository

MPG.PuRe