Search CORE

5 research outputs found

Selective Sampling for Example-based Word Sense Disambiguation

Author: Fujii Atsushi
Inui Kentaro
Tanaka Hozumi
Tokunaga Takenobu
Publication venue
Publication date: 01/01/1998
Field of study

This paper proposes an efficient example sampling method for example-based word sense disambiguation systems. To construct a database of practical size, a considerable overhead for manual sense disambiguation (overhead for supervision) is required. In addition, the time complexity of searching a large-sized database poses a considerable problem (overhead for search). To counter these problems, our method selectively samples a smaller-sized effective subset from a given example set for use in word sense disambiguation. Our method is characterized by the reliance on the notion of training utility: the degree to which each example is informative for future example sampling when used for the training of the system. The system progressively collects examples by selecting those with greatest utility. The paper reports the effectiveness of our method through experiments on about one thousand sentences. Compared to experiments with other example sampling methods, our method reduced both the overhead for supervision and the overhead for search, without the degeneration of the performance of the system.Comment: 25 pages, 14 Postscript figure

arXiv.org e-Print Archive

CiteSeerX

Conceptual dependency and its descendants

Author: Bain
Birnbaum
Bobrow
Bower
Charniak
Dyer
Fikes
Fillmore
Hammond
Kass
Katz
Laird
Lebowitz
Lytinen
Lytinen
Martin
Minsky
Newell
Pollard
Riesbeck
Rissland
Rumelhart
Sacerdote
Schank
Schank
Schank
Schank
Schank
Schank
Seifert
Shapiro
Simpson
Small
Steven L. Lytinen
Sussman
Sycara
Wilensky
Winograd
Woods
Publication venue: 'Elsevier BV'
Publication date: 01/01/1992
Field of study

This paper surveys representation and processing theories arising out of conceptual dependency theory. One of the primary characteristics of conceptual dependency was the notion of a canonical form, built out of a small number of primitive representations. Although the notion of primitives has largely been lost in subsequent work, many other of the basic notions of CD have remained. In particular, the idea of building representations around inferential capabilities has prevailed in this family of research. The result is a set of representational structures, all of which are highly knowledge-intensive. The use of these structures in various processing theories has led to knowledge-based theories of language understanding, planning, reasoning and other tasks, which have contrasted sharply with the traditional search-oriented approaches used in other systems.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/30278/1/0000679.pd

Elsevier - Publisher Connector

Crossref

Deep Blue Documents at the University of Michigan

Syntactic and Semantic Analysis and Visualization of Unstructured English Texts

Author: Karmakar Saurav
Publication venue: ScholarWorks @ Georgia State University
Publication date: 14/12/2011
Field of study

People have complex thoughts, and they often express their thoughts with complex sentences using natural languages. This complexity may facilitate efficient communications among the audience with the same knowledge base. But on the other hand, for a different or new audience this composition becomes cumbersome to understand and analyze. Analysis of such compositions using syntactic or semantic measures is a challenging job and defines the base step for natural language processing. In this dissertation I explore and propose a number of new techniques to analyze and visualize the syntactic and semantic patterns of unstructured English texts. The syntactic analysis is done through a proposed visualization technique which categorizes and compares different English compositions based on their different reading complexity metrics. For the semantic analysis I use Latent Semantic Analysis (LSA) to analyze the hidden patterns in complex compositions. I have used this technique to analyze comments from a social visualization web site for detecting the irrelevant ones (e.g., spam). The patterns of collaborations are also studied through statistical analysis. Word sense disambiguation is used to figure out the correct sense of a word in a sentence or composition. Using textual similarity measure, based on the different word similarity measures and word sense disambiguation on collaborative text snippets from social collaborative environment, reveals a direction to untie the knots of complex hidden patterns of collaboration

ScholarWorks @ Georgia State University

Integrated conceptual parser

Author: Taterway Binod Krishna
Publication venue: Lehigh Preserve
Publication date
Field of study

Lehigh University: Lehigh Preserve