58 research outputs found
Proceedings
Proceedings of the Ninth International Workshop
on Treebanks and Linguistic Theories.
Editors: Markus Dickinson, Kaili Müürisep and Marco Passarotti.
NEALT Proceedings Series, Vol. 9 (2010), 268 pages.
© 2010 The editors and contributors.
Published by
Northern European Association for Language
Technology (NEALT)
http://omilia.uio.no/nealt .
Electronically published at
Tartu University Library (Estonia)
http://hdl.handle.net/10062/15891
Proceedings of the Conference on Natural Language Processing 2010
This book contains state-of-the-art contributions to the 10th
conference on Natural Language Processing, KONVENS 2010
(Konferenz zur Verarbeitung natürlicher Sprache), with a focus
on semantic processing.
The KONVENS in general aims at offering a broad perspective
on current research and developments within the interdisciplinary
field of natural language processing. The central theme
draws specific attention towards addressing linguistic aspects
ofmeaning, covering deep as well as shallow approaches to semantic
processing. The contributions address both knowledgebased
and data-driven methods for modelling and acquiring
semantic information, and discuss the role of semantic information
in applications of language technology.
The articles demonstrate the importance of semantic processing,
and present novel and creative approaches to natural
language processing in general. Some contributions put their
focus on developing and improving NLP systems for tasks like
Named Entity Recognition or Word Sense Disambiguation, or
focus on semantic knowledge acquisition and exploitation with
respect to collaboratively built ressources, or harvesting semantic
information in virtual games. Others are set within the
context of real-world applications, such as Authoring Aids, Text
Summarisation and Information Retrieval. The collection highlights
the importance of semantic processing for different areas
and applications in Natural Language Processing, and provides
the reader with an overview of current research in this field
Formalizovaný kontrastivní popis lexikálních jednotek: deskriptivní rámec pro dvojjazyčné slovníky
Institute of the Czech National CorpusÚstav českého národního korpusuFilozofická fakultaFaculty of Art
''Why were the former days better than these?'': An examination of temporal horizons in Ecclesiastes.
A number of studies explore temporal vocabulary in the Old Testament
generally and Ecclesiastes particularly, yet few attempt a holistic approach of reading
Ecclesiastes through its presentation of time. Scholars have long recognized the
work’s tensions, but the link that holds the tensions together in a unified reading has
received less attention. This unifying idea is the presentation of time. Time is not a
singular concept, however, and this project undertakes a sustained engagement with
the broad presentation of time both to examine Ecclesiastes’ inquiry after what is
good for human beings and its often-identified tensions. As such, this study fills a
considerable gap in current Ecclesiastes scholarship.
Part One, consisting of chapters two and three, examines terms for time,
including ʽEt, yom, dor, ʽolam, shanah, zekher/zikhron, through a close examination of
these words in their contexts. It becomes clear that time in Ecclesiastes is a mixture
of reflections on the main character’s present, the past, and the passing of time over
the course of generations. The project argues in Part Two that approaching time with
an awareness of how Ecclesiastes creates, compares, and contrasts time horizons aids
the reader to comprehend the contradictions and tensions. Chapter four demonstrates
the presence of identifiable and quantifiable horizons in what is widely regarded as
the introduction of Ecclesiastes, 1:1-2:26. These horizons, identified as nature’s
time, generation time, lifespan time and event time, are juxtaposed in order to point
toward the benefit of short-duration thinking for life under the sun. Chapter five
examines Ecclesiastes 3:1-12:14 according to the categories of nature’s time,
generation time, and lifespan time to ascertain characteristics common to these
horizons. Consistently, Ecclesiastes presents these horizons of time as impenetrable
and inaccessible to human endeavour. Chapter six examines the same material but
from the perspective of what occurs in defined situations, which are designated event
time. Ecclesiastes presents event time as partially controllable thereby suggesting
proper and improper uses within this horizon. The chapter concludes with a
discussion of wisdom and event time, demonstrating that wisdom in Ecclesiastes is
not focused on success over one’s whole life (lifespan time), but focuses upon
capturing the potential of the present moment to provide rest, companionship, and
enjoyment in the short-term. The exploration of time as temporal horizons suggests
an opportunity to observe similar phenomena in other works associated with wisdom
and in other non-narrative works within the Hebrew Bible
Modeling information structure in a cross-linguistic perspective
This study makes substantial contributions to both the theoretical and computational treatment of information structure, with a specific focus on creating natural language processing applications such as multilingual machine translation systems. The present study first provides cross-linguistic findings in regards to information structure meanings and markings. Building upon such findings, the current model represents information structure within the HPSG/MRS framework using Individual Constraints. The primary goal of the present study is to create a multilingual grammar model of information structure for the LinGO Grammar Matrix system. The present study explores the construction of a grammar library for creating customized grammar incorporating information structure and illustrates how the information structure-based model improves performance of transfer-based machine translation
Constructing a poor man’s wordnet in a resource-rich world
International audienceIn this paper we present a language-independent, fully modular and automatic approach to bootstrap a wordnet for a new language by recycling different types of already existing language resources, such as machine-readable dictionaries, parallel corpora, and Wikipedia. The approach, which we apply here to Slovene, takes into account monosemous and polysemous words, general and specialised vocabulary as well as simple and multi-word lexemes. The extracted words are then assigned one or several synset ids, based on a classifier that relies on several features including distributional similarity. Finally, we identify and remove highly dubious (literal, synset) pairs, based on simple distributional information extracted from a large corpus in an unsupervised way. Automatic, manual and task-based evaluations show that the resulting resource, the latest version of the Slovene wordnet, is already a valuable source of lexico-semantic information
- …