6,827 research outputs found
Using Decision Trees for Coreference Resolution
This paper describes RESOLVE, a system that uses decision trees to learn how
to classify coreferent phrases in the domain of business joint ventures. An
experiment is presented in which the performance of RESOLVE is compared to the
performance of a manually engineered set of rules for the same task. The
results show that decision trees achieve higher performance than the rules in
two of three evaluation metrics developed for the coreference task. In addition
to achieving better performance than the rules, RESOLVE provides a framework
that facilitates the exploration of the types of knowledge that are useful for
solving the coreference problem.Comment: 6 pages; LaTeX source; 1 uuencoded compressed EPS file (separate);
uses ijcai95.sty, named.bst, epsf.tex; to appear in Proc. IJCAI '9
Nowcasting Thunderstorms for Munich Airport
The successful demonstration and assessment of the DLR thunderstorm nowcasting algorithms at Munich Airport during two campaigns in the summers of 2010 and 2011 are described. The algorithms Cb-TRAM and Rad-TRAM, that detect, monitor, and forecast up to one hour (nowcast) thunderstorm cells from satellite and radar data, run in real time and provided new thunderstorm products for users at the airport. The products were presented on displays the users were already familiar with as well as on webpages designed by DLR. On the webpages, also additional information like measurements with DLR’s polarimetric radar and model forecasts was shown. Moreover, thunderstorm warnings were is-sued and sent via email to the users whenever a thunderstorm was detected in the terminal manoeu-vring area of the airport of Munich. The nowcasting skills of Rad-TRAM and Cb-TRAM are encouraging, especially for lead times up to 30 minutes, and the user feedback on the DLR thunderstorm products was very positive. The Rad-TRAM and Cb-TRAM products provide a good overview on the situation and its future development, and the thunderstorm warnings were very helpful for the collaborative decision making at the airport. However, some suggestions for improvements were made like the demand for nowcasts beyond one hour. This will be considered within the integrated weather forecast system, WxFUSION, which has been further developed during the campaigns
New Methods, Current Trends and Software Infrastructure for NLP
The increasing use of `new methods' in NLP, which the NeMLaP conference
series exemplifies, occurs in the context of a wider shift in the nature and
concerns of the discipline. This paper begins with a short review of this
context and significant trends in the field. The review motivates and leads to
a set of requirements for support software of general utility for NLP research
and development workers. A freely-available system designed to meet these
requirements is described (called GATE - a General Architecture for Text
Engineering). Information Extraction (IE), in the sense defined by the Message
Understanding Conferences (ARPA \cite{Arp95}), is an NLP application in which
many of the new methods have found a home (Hobbs \cite{Hob93}; Jacobs ed.
\cite{Jac92}). An IE system based on GATE is also available for research
purposes, and this is described. Lastly we review related work.Comment: 12 pages, LaTeX, uses nemlap.sty (included
Comparing knowledge sources for nominal anaphora resolution
We compare two ways of obtaining lexical knowledge for antecedent selection in other-anaphora
and definite noun phrase coreference. Specifically, we compare an algorithm that relies on links
encoded in the manually created lexical hierarchy WordNet and an algorithm that mines corpora
by means of shallow lexico-semantic patterns. As corpora we use the British National
Corpus (BNC), as well as the Web, which has not been previously used for this task. Our
results show that (a) the knowledge encoded in WordNet is often insufficient, especially for
anaphor-antecedent relations that exploit subjective or context-dependent knowledge; (b) for
other-anaphora, the Web-based method outperforms the WordNet-based method; (c) for definite
NP coreference, the Web-based method yields results comparable to those obtained using
WordNet over the whole dataset and outperforms the WordNet-based method on subsets of the
dataset; (d) in both case studies, the BNC-based method is worse than the other methods because
of data sparseness. Thus, in our studies, the Web-based method alleviated the lexical knowledge
gap often encountered in anaphora resolution, and handled examples with context-dependent relations
between anaphor and antecedent. Because it is inexpensive and needs no hand-modelling
of lexical knowledge, it is a promising knowledge source to integrate in anaphora resolution systems
- …