Search CORE

1,204 research outputs found

AnaPro, Tool for Identification and Resolution of Direct Anaphora in Spanish

Author: CUEVAS RASGADO ALMA DELIA
CUEVAS RASGADO ALMA DELIA
Cuevas-Rasgado Alma Delia
GUZMAN ARENAS ADOLFO
GUZMAN ARENAS ADOLFO
Guzmán-Arenas Adolfo
MENDEZ SEGUNDO LAURA
MENDEZ SEGUNDO LAURA
Méndez-Segundo Laura
Toledo Gómez Israel
Toledo Gómez Israel
Toledo-Gómez Israel
Valtierra Romero Erick
Valtierra Romero Erick
Valtierra-Romero Erick
Publication venue: 'International Journal of Applied Research and Technology'
Publication date: 01/02/2014
Field of study

Introduction Anaphora is a relation of coreference between linguistic terms. According to Webster’s dictionary: “It is the use of a grammatical substitute (as a pronoun or a pro-verb) to refer to the denotation of a preceding word or group of words; also : the relation between a grammatical substitute and its antecedent.” Therefore, anaphora is a discourse relation. Anaphora resolution is very important in Natural Language Processing (NLP). This work is part of Project OM* (Ontology Merging), which seeks to build a large ontology by fusing smaller ontologies extracted from textual documents. An important part of the project is to analyze the sentences in a document with the goal to transform that text into an ontology that comprises its contents. A brief description of Project OM* follows.AnaPro is software that solves direct anaphora in Spanish, specifically pronouns: it finds the noun or group of words to which the pronoun refers. It locates in the previous sentenc es the referent or antecedent which the pronoun replaces. An example of a direct anaphora solved is the pronoun “ he” in the sentence “He is sad.” Much of the work on anaphora has been done for texts in English; thus , we specifically focus on Spanish documents. AnaPro directly supports text analys is (to understand what a document says ), a non trivial task since there are different writing styles, references, idiomatic expressions, etc. The problem grows if t he analyzer is a computer, because they lack “common sense” (which persons possess) . Hence, before text analysis, its preprocessing is required, in order to assign tags (noun, verb,...) to each word, find the stems, disambiguate nouns, verbs, prepositions, identify colloquial expressions, i dentify and resolve anaphor a, among other chores. AnaPro works for Spanish sentences. It is a novel procedure, since it is automatic (no user intervenes during the resolution) and it does not need dictionaries. It employs heu ristics procedures to discover the semantics and help in the decisions; they are rather easy to implement and use li mited knowledge. Nevertheless, its results are good (81% of correct answers, at least). However, more tests will give a better idea of its goodness.Authors I.T. and E.V. would like to acknowledge ESCOM-IPN, where they defended their thesis, #20110083 , which gives a more detailed description of AnaPro. Work herein reported was partially sponsored by CONACYT Grant #128163 (Project OM*), by IPN and by SNI and UAEM

Repositorio Institucional de la Universidad Autónoma del Estado de México

Deriving Verb Predicates By Clustering Verbs with Arguments

Author: Rouhizadeh Masoud
Schwartz Andy
Sedoc Joao
Ungar Lyle
Wijaya Derry
Publication venue
Publication date: 01/01/2017
Field of study

Hand-built verb clusters such as the widely used Levin classes (Levin, 1993) have proved useful, but have limited coverage. Verb classes automatically induced from corpus data such as those from VerbKB (Wijaya, 2016), on the other hand, can give clusters with much larger coverage, and can be adapted to specific corpora such as Twitter. We present a method for clustering the outputs of VerbKB: verbs with their multiple argument types, e.g. "marry(person, person)", "feel(person, emotion)." We make use of a novel low-dimensional embedding of verbs and their arguments to produce high quality clusters in which the same verb can be in different clusters depending on its argument type. The resulting verb clusters do a better job than hand-built clusters of predicting sarcasm, sentiment, and locus of control in tweets

arXiv.org e-Print Archive

Boston University Institutional Repository (OpenBU)

ETRANS: A English-Thai translator

Author: Warote Nuntaporn
Publication venue: RIT Scholar Works
Publication date: 01/01/1991
Field of study

ETRANS is an experimental English-Thai machine translation (MT) system that translates a simple English sentence into a grammatically correct Thai sentence. The entire system is written in C-Prolog, and runs on UNIX systems. The MT strategy taken by ETRANS is an interlingual strategy with a parser for English and a generator for Thai. The parser creates a semantic representation equivalent to the meaning of the English sentence. A generator then interprets the semantic representation into Thai. ETRANS employs frames as a means for representing knowledge, and an augmented transition network (ATN) as the linguistic framework for analyzing and generating sentences

RIT Scholar Works

Investigations into the Use of Preposition Sense in Semantic Argument Classification

Author: Dahlmeier Daniel Hermann Richard
Publication venue
Publication date: 04/08/2008
Field of study

KITopen