26 research outputs found

Paradigms for abstracting systems

A New, Fully Automatic Version of Mitkov's Knowledge-Poor Pronoun Resolution Method

Author: B. J. Grosz
C.D. Paice
I. Dagan
J. H. Holland
J. R. Hobbs
R. Evans
R. Mitkov
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2002

This paper describes a new, advanced and completely revamped version of Mitkov's knowledge-poor approach to pronoun resolution. In contrast to most anaphora resolution approaches, the new system, referred to as MARS, operates in fully automatic mode. It benefits from purpose-built programs for identifying occurrences of non-nominal anaphora (including pleonastic pronouns) and for recognition of animacy, and employs genetic algorithms to achieve optimal performance. The paper features extensive evaluation and discusses important evaluation issues in anaphora resolution.

Crossref

Wolverhampton Intellectual Repository and E-theses

Pronominal Anaphora Generation in an English-Spanish MT Approach

Author: A. Ferrández
C.D. Paice
J. Chandioux
S. Landes
S. Lappin
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Putting Successor Variety Stemming to Work

Author: C.D. Paice
D. Harman
D.R. Morrison
J.B. Lovins
M. Braschler
M.F. Porter
W.B. Frakes
Publication date: 01/01/2007

Abstract. Stemming algorithms find canonical forms for inflected words, e.g. for declined nouns or conjugated verbs. Since such a unification of words with respect to gender, number, tense, and case is a language-specific issue, stemming algorithms operationalize a set of linguistically motivated rules for the language in question. The best-known rule-based algorithm for the English language is from Porter [14]. The paper presents a statistical stemming approach which is based on the analysis of the distribution of word prefixes in a document collection, and which is thus largely language-independent. In particular, our approach addresses the problem of index construction for multi-lingual documents. Related work on statistical stemming focuses either on stemming quality [2,3] or on runtime performance [11], but neither provides a reasonable tradeoff between the two. For selected retrieval tasks under vector-based document models we report on new results related to stemming quality and collection size dependency. Interestingly, successor variety stemming has neither been investigated under similarity concerns for index construction nor been applied as a technology in current retrieval applications. As our results will show, this disregard is not justified.
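
The prefix-distribution idea in this abstract can be illustrated with a minimal sketch of successor variety counting. This is not the paper's implementation: the function names `build_successors` and `successor_variety_stem`, the toy vocabulary, and the simple "first local peak" cut rule are all assumptions chosen for illustration.

```python
from collections import defaultdict


def build_successors(vocabulary):
    """Map each proper prefix to the set of letters that follow it in the vocabulary."""
    successors = defaultdict(set)
    for word in vocabulary:
        for i in range(1, len(word)):
            successors[word[:i]].add(word[i])
    return successors


def successor_variety_stem(word, successors, min_len=2):
    """Cut the word at the first prefix whose successor variety is a local peak.

    Hypothetical cut rule for illustration only; returns the word unchanged
    when no peak is found.
    """
    def variety(n):
        # Number of distinct letters observed after the first n characters.
        return len(successors.get(word[:n], ()))

    for n in range(min_len, len(word)):
        if variety(n) > variety(n - 1) and variety(n) > variety(n + 1):
            return word[:n]
    return word


# Toy collection (an assumption, not data from the paper).
vocabulary = ["read", "reads", "reading", "reader", "readable", "red", "rope", "ripe"]
successors = build_successors(vocabulary)

for w in ["reading", "readable", "red"]:
    print(w, "->", successor_variety_stem(w, successors))
```

In this toy collection the prefix "read" is followed by four distinct letters while its neighbouring prefixes are followed by one, so the peak rule cuts there. Because the counts come only from the collection itself, the same code applies to any language, which is the property the abstract emphasizes.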

CiteSeerX

Crossref

Distribution Based Stemmer Refinement

Author: A. Wald
C.D. Paice
G. Salton
J. Xu
M.F. Porter
N.L. Johnson
V.N. Vapnik
W.B. Frakes
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005

Crossref

Classifying with Co-stems

Author: B. Stein
C.D. Paice
E. Blanzieri
E. Stamatatos
J.B. Lovins
M.F. Porter
T. Gottron
U. Hanani
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011

Crossref

A case-based approach for developing writing tools aimed at non-native English users

Author: C.D. Paice
E. Hovy
G. Born
G. Crookes
G. Taylor
H. Kitano
N. Fontana
R.A. Buchanan
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Automatic language-specific stemming in information retrieval

Author: C. Jacquemin
C. Manning
C.D. Paice
D. Harman
D. Hull
D. Jurafsky
G. Adamson
G. Kowalski
J. Goldsmith
J. Rissanen
J. Xu
J.B. Lovins
K. Sparck Jones
M. F. Porter
M. Hafer
M. Lennon
M. Popovič
T. Strzalkowski
W.B. Frakes
Publication venue: Springer Verlag
Publication date: 01/01/2001

Abstract. We employ Automorphology, an MDL-based algorithm that determines the suffixes present in a language sample with no prior knowledge of the language in question, and describe our experiments on the usefulness of this approach for Information Retrieval, employing this stemmer in a SMART-based IR engine.

CiteSeerX

Crossref

The algorithms for preliminary text processing: Decomposition, annotation, morphological analysis

Author: C.D. Paice
E. V. Larchenko
K. Börjars
L.J. Brinton
M. S. Starikov
T. N. Vishnyakov
V. A. Yatsko
Publication venue: 'Allerton Press'

Crossref

Évaluation d’un Système pour le Résumé Automatique de Documents Électroniques

Author: C.D. Paice
H.P. Edmundson
H.P. Luhn
I. Manit
K. Barker
K. Ono
L.L. Earl
P. D. Turney
W.C. Mann
Publication venue: 'Springer Science and Business Media LLC'

Crossref