44,574 research outputs found

    From Word to Sense Embeddings: A Survey on Vector Representations of Meaning

    Get PDF
    Over the past years, distributed semantic representations have proved to be effective and flexible keepers of prior knowledge to be integrated into downstream applications. This survey focuses on the representation of meaning. We start from the theoretical background behind word vector space models and highlight one of their major limitations: the meaning conflation deficiency, which arises from representing a word with all its possible meanings as a single vector. Then, we explain how this deficiency can be addressed through a transition from the word level to the more fine-grained level of word senses (in its broader acceptation) as a method for modelling unambiguous lexical meaning. We present a comprehensive overview of the wide range of techniques in the two main branches of sense representation, i.e., unsupervised and knowledge-based. Finally, this survey covers the main evaluation procedures and applications for this type of representation, and provides an analysis of four of its important aspects: interpretability, sense granularity, adaptability to different domains and compositionality.Comment: 46 pages, 8 figures. Published in Journal of Artificial Intelligence Researc

    Specificity distinction

    Get PDF
    This paper is concerned with semantic noun phrase typology, focusing on the question of how to draw fine-grained distinctions necessary for an accurate account of natural language phenomena. In the extensive literature on this topic, the most commonly encountered parameters of classification concern the semantic type of the denotation of the noun phrase, the familiarity or novelty of its referent, the quantificational/nonquantificational distinction (connected to the weak/strong dichotomy), as well as, more recently, the question of whether the noun phrase is choice-functional or not (see Reinhart 1997, Winter 1997, Kratzer 1998, Matthewson 1999). In the discussion that follows I will attempt to make the following general points: (i) phenomena involving the behavior of noun phrases both within and across languages point to the need of establishing further distinctions that are too fine-grained to be caught in the net of these typologies; (ii) some of the relevant distinctions can be captured in terms of conditions on assignment functions; (iii) distribution and scopal peculiarities of noun phrases may result from constraints they impose on the way variables they introduce are to be assigned values. Section 2 reviews the typology of definite noun phrases introduced in Farkas 2000 and the way it provides support for the general points above. Section 3 examines some of the problems raised by recognizing the rich variety of 'indefinite' noun phrases found in natural language and by attempting to capture their distribution and interpretation. Common to the typologies discussed in the two sections is the issue of marking different types of variation in the interpretation of a noun phrase. In the light of this discussion, specificity turns out to be an epiphenomenon connected to a family of distinctions that are marked differently in different languages

    Phobos: A front-end approach to extensible compilers (long version)

    Get PDF
    This paper describes a practical approach for implementing certain types of domain-specific languages with extensible compilers. Given a compiler with one or more front-end languages, we introduce the idea of a "generic" front-end that allows the syntactic and semantic specification of domain-specific languages. Phobos, our generic front-end, offers modular language specification, allowing the programmer to define new syntax and semantics incrementally

    Intuitive querying of e-Health data repositories

    Get PDF
    At the centre of the Clinical e-Science Framework (CLEF) project is a repository of well organised, detailed clinical histories, encoded as data that will be available for use in clinical care and in-silico medical experiments. An integral part of the CLEF workbench is a tool to allow biomedical researchers and clinicians to query – in an intuitive way – the repository of patient data. This paper describes the CLEF query editing interface, which makes use of natural language generation techniques in order to alleviate some of the problems generally faced by natural language and graphical query interfaces. The query interface also incorporates an answer renderer that dynamically generates responses in both natural language text and graphics

    NLSC: Unrestricted Natural Language-based Service Composition through Sentence Embeddings

    Full text link
    Current approaches for service composition (assemblies of atomic services) require developers to use: (a) domain-specific semantics to formalize services that restrict the vocabulary for their descriptions, and (b) translation mechanisms for service retrieval to convert unstructured user requests to strongly-typed semantic representations. In our work, we argue that effort to developing service descriptions, request translations, and matching mechanisms could be reduced using unrestricted natural language; allowing both: (1) end-users to intuitively express their needs using natural language, and (2) service developers to develop services without relying on syntactic/semantic description languages. Although there are some natural language-based service composition approaches, they restrict service retrieval to syntactic/semantic matching. With recent developments in Machine learning and Natural Language Processing, we motivate the use of Sentence Embeddings by leveraging richer semantic representations of sentences for service description, matching and retrieval. Experimental results show that service composition development effort may be reduced by more than 44\% while keeping a high precision/recall when matching high-level user requests with low-level service method invocations.Comment: This paper will appear on SCC'19 (IEEE International Conference on Services Computing) on July 1
    • …
    corecore