38,949 research outputs found

Integrating Semantic Knowledge to Tackle Zero-shot Text Classification

Author: Guo Yike
Lertvittayakumjorn Piyawat
Zhang Jingqing
Publication date: 01/01/2019

Insufficient or even unavailable training data for emerging classes is a major challenge in many classification tasks, including text classification. Recognising text documents of classes that have never been seen in the learning stage, so-called zero-shot text classification, is therefore difficult, and only a limited number of previous works have tackled this problem. In this paper, we propose a two-phase framework together with data augmentation and feature augmentation to solve this problem. Four kinds of semantic knowledge (word embeddings, class descriptions, class hierarchy, and a general knowledge graph) are incorporated into the proposed framework to deal with instances of unseen classes effectively. Experimental results show that each of the two phases, as well as their combination, achieves the best overall accuracy compared with baselines and recent approaches in classifying real-world texts under the zero-shot scenario. Comment: Accepted NAACL-HLT 201

arXiv.org e-Print Archive

Crossref

Spiral - Imperial College Digital Repository

Paper-based Mixed Reality Sketch Augmentation as a Conceptual Design Support Tool

Author: Dijk E.M.A.G. van
Santos G.J.D. dos
Vyas D.M.
Publication venue: ACM
Publication date: 01/01/2009

This undergraduate student paper explores usage of mixed reality techniques as support tools for conceptual design. A proof-of-concept was developed to illustrate this principle. Using this as an example, a small group of designers was interviewed to determine their views on the use of this technology. These interviews are the main contribution of this paper. Several interesting applications were determined, suggesting possible usage in a wide range of domains. Paper-based sketching, mixed reality and sketch augmentation techniques complement each other, and the combination results in a highly intuitive interface

Queensland University of Technology ePrints Archive

University of Twente Research Information

Combination of Domain Knowledge and Deep Learning for Sentiment Analysis of Short and Informal Messages on Social Media

Author: Mai Trung
Nguyen Mao
Nguyen Tri
Pham Dang
Quan Tho
Truong Minh
Vo Khuong
Publication venue: 'Inderscience Publishers'
Publication date: 01/01/2019

Sentiment analysis has recently emerged as one of the major natural language processing (NLP) tasks in many applications. In particular, as social media channels (e.g. social networks or forums) have become significant sources for brands to observe user opinions about their products, this task is increasingly crucial. However, when working with real data obtained from social media, we notice a high volume of short and informal messages posted by users on those channels. Existing approaches, especially those using deep learning, have difficulty handling this kind of data. In this paper, we propose an approach to handle this problem. This work extends our previous work, in which we proposed combining the typical deep learning technique of Convolutional Neural Networks with domain knowledge. The combination is used to obtain additional training data through augmentation and a more reasonable loss function. In this work, we further improve our architecture with several substantial enhancements, including negation-based data augmentation, transfer learning for word embeddings, the combination of word-level and character-level embeddings, and a multitask learning technique for attaching domain knowledge rules to the learning process. Those enhancements, which specifically aim to handle short and informal messages, yield a significant improvement in performance in experiments on real datasets. Comment: A Preprint of an article accepted for publication by Inderscience in IJCVR on September 201

arXiv.org e-Print Archive

Crossref

Metadata Augmentation for Semantic- and Context- Based Retrieval of Digital Cultural Objects

Author: Pham Binh
Smith Robert
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2007

Cultural objects are increasingly stored and generated in digital form, yet effective methods for their indexing and retrieval still remain an open area of research. The main problem arises from the disconnection between the content-based indexing approach used by computer scientists and the description-based approach used by information scientists. There is also a lack of representational schemes that allow the alignment of the semantics and context with keywords and low-level features that can be automatically extracted from the content of these cultural objects. This paper presents an integrated approach to address these problems, taking advantage of both computer science and information science approaches. The focus is on the rationale and conceptual design of the system and its various components. In particular, we discuss techniques for augmenting commonly used metadata with visual features and domain knowledge to generate high-level abstract metadata which in turn can be used for semantic and context-based indexing and retrieval. We use a sample collection of Vietnamese traditional woodcuts to demonstrate the usefulness of this approach

Crossref

Queensland University of Technology ePrints Archive

V/STOL maneuverability and control

Author: Anderson S. B.
Franklin J. A.

Maneuverability and control of V/STOL aircraft in powered-lift flight are studied with specific consideration of maneuvering in forward flight. A review of maneuverability for representative operational mission tasks is presented and covers takeoff, transition, hover, and landing flight phases. Maneuverability is described in terms of the ability to rotate and translate the aircraft and is specified in terms of angular and translational accelerations imposed on the aircraft. Characteristics of representative configurations are reviewed, including experience from past programs and expectations for future designs. The review of control covers the characteristics inherent in the basic airframe and propulsion system and the behavior associated with control augmentation systems. Demands for augmented stability and control response to meet certain mission operational requirements are discussed. Experience from ground-based simulation and flight experiments that illustrates the impact of augmented stability and control on aircraft design is related by example.

NASA Technical Reports Server

Natural Language Query in the Biochemistry and Molecular Biology Domains Based on Cognition Search™

Author: Elizabeth J. Goldsmith
Kathleen Dahlgren
Radha Akella
Saurabh Mendiratta
Publication date: 19/09/2008

Motivation: With the tremendous growth in scientific literature, it is necessary to improve upon the standard pattern matching style of the available search engines. Semantic NLP may be the solution to this problem. Cognition Search (CSIR) is a natural language technology. It is best used by asking a simple question that might be answered in textual data being queried, such as MEDLINE. CSIR has a large English dictionary and semantic database. Cognition’s semantic map enables the search process to be based on meaning rather than statistical word pattern matching and, therefore, returns more complete and relevant results. The Cognition Search engine uses downward reasoning and synonymy which also improves recall. It improves precision through phrase parsing and word sense disambiguation.
Result: Here we have carried out several projects to "teach" the CSIR lexicon medical, biochemical and molecular biological language and acronyms from curated web-based free sources. Vocabulary from the Alliance for Cell Signaling (AfCS), the Human Genome Nomenclature Consortium (HGNC), the Unified Medical Language System (UMLS) Meta-thesaurus, and the International Union of Pure and Applied Chemistry (IUPAC) was introduced into the CSIR dictionary and curated. The resulting system was used to interpret MEDLINE abstracts. Meaning-based search of MEDLINE abstracts yields high precision (estimated at >90%) and high recall (estimated at >90%) where synonym information has been encoded. The present implementation can be found at http://MEDLINE.cognition.com.

PubMed Central

Nature Precedings