44,574 research outputs found
From Word to Sense Embeddings: A Survey on Vector Representations of Meaning
Over the past years, distributed semantic representations have proved to be
effective and flexible keepers of prior knowledge to be integrated into
downstream applications. This survey focuses on the representation of meaning.
We start from the theoretical background behind word vector space models and
highlight one of their major limitations: the meaning conflation deficiency,
which arises from representing a word with all its possible meanings as a
single vector. Then, we explain how this deficiency can be addressed through a
transition from the word level to the more fine-grained level of word senses
(in its broader acceptation) as a method for modelling unambiguous lexical
meaning. We present a comprehensive overview of the wide range of techniques in
the two main branches of sense representation, i.e., unsupervised and
knowledge-based. Finally, this survey covers the main evaluation procedures and
applications for this type of representation, and provides an analysis of four
of its important aspects: interpretability, sense granularity, adaptability to
different domains and compositionality.Comment: 46 pages, 8 figures. Published in Journal of Artificial Intelligence
Researc
Specificity distinction
This paper is concerned with semantic noun phrase typology, focusing on the question of how to draw fine-grained distinctions necessary for an accurate account of natural language phenomena. In the extensive literature on this topic, the most commonly encountered parameters of classification concern the semantic type of the denotation of the noun phrase, the familiarity or novelty of its referent, the quantificational/nonquantificational distinction (connected to the weak/strong dichotomy), as well as, more recently, the question of whether the noun phrase is choice-functional or not (see Reinhart 1997, Winter 1997, Kratzer 1998, Matthewson 1999). In the discussion that follows I will attempt to make the following general points: (i) phenomena involving the behavior of noun phrases both within and across languages point to the need of establishing further distinctions that are too fine-grained to be caught in the net of these typologies; (ii) some of the relevant distinctions can be captured in terms of conditions on assignment functions; (iii) distribution and scopal peculiarities of noun phrases may result from constraints they impose on the way variables they introduce are to be assigned values.
Section 2 reviews the typology of definite noun phrases introduced in Farkas 2000 and the way it provides support for the general points above. Section 3 examines some of the problems raised by recognizing the rich variety of 'indefinite' noun phrases found in natural language and by attempting to capture their distribution and interpretation. Common to the typologies discussed in the two sections is the issue of marking different types of variation in the interpretation of a noun phrase. In the light of this discussion, specificity turns out to be an epiphenomenon connected to a family of distinctions that are marked differently in different languages
Phobos: A front-end approach to extensible compilers (long version)
This paper describes a practical approach for implementing certain types of domain-specific languages with extensible compilers. Given a compiler with one or more front-end languages, we introduce the idea of a "generic" front-end that allows the syntactic and semantic specification of domain-specific languages. Phobos, our generic front-end, offers modular language specification, allowing the programmer to define new syntax and semantics incrementally
Intuitive querying of e-Health data repositories
At the centre of the Clinical e-Science Framework (CLEF) project is a repository of well organised, detailed clinical histories, encoded as data that will be available for use in clinical care and in-silico medical experiments. An integral part of the CLEF workbench is a tool to allow biomedical researchers and clinicians to query – in an intuitive way – the repository of patient data. This paper describes the CLEF query editing interface, which makes use of natural language generation techniques in order to alleviate some of the problems generally faced by natural language and graphical query interfaces. The query interface also incorporates an answer renderer that dynamically generates responses in both natural language text and graphics
NLSC: Unrestricted Natural Language-based Service Composition through Sentence Embeddings
Current approaches for service composition (assemblies of atomic services)
require developers to use: (a) domain-specific semantics to formalize services
that restrict the vocabulary for their descriptions, and (b) translation
mechanisms for service retrieval to convert unstructured user requests to
strongly-typed semantic representations. In our work, we argue that effort to
developing service descriptions, request translations, and matching mechanisms
could be reduced using unrestricted natural language; allowing both: (1)
end-users to intuitively express their needs using natural language, and (2)
service developers to develop services without relying on syntactic/semantic
description languages. Although there are some natural language-based service
composition approaches, they restrict service retrieval to syntactic/semantic
matching. With recent developments in Machine learning and Natural Language
Processing, we motivate the use of Sentence Embeddings by leveraging richer
semantic representations of sentences for service description, matching and
retrieval. Experimental results show that service composition development
effort may be reduced by more than 44\% while keeping a high precision/recall
when matching high-level user requests with low-level service method
invocations.Comment: This paper will appear on SCC'19 (IEEE International Conference on
Services Computing) on July 1
- …