Search CORE

48,595 research outputs found

Deduction over Mixed-Level Logic Representations for Text Passage Retrieval

Author: Hess Michael
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/1996
Field of study

A system is described that uses a mixed-level representation of (part of) meaning of natural language documents (based on standard Horn Clause Logic) and a variable-depth search strategy that distinguishes between the different levels of abstraction in the knowledge representation to locate specific passages in the documents. Mixed-level representations as well as variable-depth search strategies are applicable in fields outside that of NLP.Comment: 8 pages, Proceedings of the Eighth International Conference on Tools with Artificial Intelligence (TAI'96), Los Alamitos C

arXiv.org e-Print Archive

CiteSeerX

Crossref

ZORA

Multilingual Language Processing From Bytes

Author: Brunk Cliff
Gillick Dan
Subramanya Amarnag
Vinyals Oriol
Publication venue
Publication date: 01/01/2016
Field of study

We describe an LSTM-based model which we call Byte-to-Span (BTS) that reads text as bytes and outputs span annotations of the form [start, length, label] where start positions, lengths, and labels are separate entries in our vocabulary. Because we operate directly on unicode bytes rather than language-specific words or characters, we can analyze text in many languages with a single model. Due to the small vocabulary size, these multilingual models are very compact, but produce results similar to or better than the state-of- the-art in Part-of-Speech tagging and Named Entity Recognition that use only the provided training datasets (no external data sources). Our models are learning "from scratch" in that they do not rely on any elements of the standard pipeline in Natural Language Processing (including tokenization), and thus can run in standalone fashion on raw text

arXiv.org e-Print Archive

Crossref

Unsupervised Terminological Ontology Learning based on Hierarchical Topic Modeling

Author: Bless Patrick
Klabjan Diego
Zhu Xiaofeng
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 29/08/2017
Field of study

In this paper, we present hierarchical relationbased latent Dirichlet allocation (hrLDA), a data-driven hierarchical topic model for extracting terminological ontologies from a large number of heterogeneous documents. In contrast to traditional topic models, hrLDA relies on noun phrases instead of unigrams, considers syntax and document structures, and enriches topic hierarchies with topic relations. Through a series of experiments, we demonstrate the superiority of hrLDA over existing topic models, especially for building hierarchies. Furthermore, we illustrate the robustness of hrLDA in the settings of noisy data sets, which are likely to occur in many practical scenarios. Our ontology evaluation results show that ontologies extracted from hrLDA are very competitive with the ontologies created by domain experts

arXiv.org e-Print Archive

Crossref

A Relation-Centric Query Engine for the Foundational Model of Anatomy

Author: Ann Li
Augusto V. Agoncillo
Cornelius Rosse
Emily Chung
James F. Brinkley
Jose L.V. Mejino Jr.
José L. V
Landon T. Detwiler
Linda G. Shapiro
On T. Detwiler
Publication venue
Publication date: 01/01/2004
Field of study

The Foundational Model of Anatomy (FMA), a detailed representation of the structural organization of the human body, was constructed to support the development of software applications requiring knowledge of anatomy. The FMA's focus on the structural relationships between anatomical entities distinguishes it from other current anatomical knowledge sources. We developed Emily, a query engine for the FMA, to enable users to explore the richness and depth of these relationships. Preliminary analysis suggests that Emily is capable of correctly processing real world anatomical queries provided they have been translated into a constrained form suitable for processing by the query engine

CiteSeerX

University of Washington Structural Informatics Group Publications

Adversarial Connective-exploiting Networks for Implicit Discourse Relation Classification

Author: Hu Zhiting
Qin Lianhui
Xing Eric P.
Zhang Zhisong
Zhao Hai
Publication venue
Publication date: 01/01/2017
Field of study

Implicit discourse relation classification is of great challenge due to the lack of connectives as strong linguistic cues, which motivates the use of annotated implicit connectives to improve the recognition. We propose a feature imitation framework in which an implicit relation network is driven to learn from another neural network with access to connectives, and thus encouraged to extract similarly salient features for accurate classification. We develop an adversarial model to enable an adaptive imitation scheme through competition between the implicit network and a rival feature discriminator. Our method effectively transfers discriminability of connectives to the implicit features, and achieves state-of-the-art performance on the PDTB benchmark.Comment: To appear in ACL201

arXiv.org e-Print Archive

Crossref