22,744 research outputs found
LocLinkVis: A Geographic Information Retrieval-Based System for Large-Scale Exploratory Search
In this paper we present LocLinkVis (Locate-Link-Visualize); a system which
supports exploratory information access to a document collection based on
geo-referencing and visualization. It uses a gazetteer which contains
representations of places ranging from countries to buildings, and that is used
to recognize toponyms, disambiguate them into places, and to visualize the
resulting spatial footprints.Comment: SEM'1
Inclusion, Contrast and Polysemy in Dictionaries: The Relationship between Theory, Language Use and Lexicographic Practice
This paper explores the lexicographic representation of a type of polysemy that arises when the meaning of one lexical item can either include or contrast with the meaning of another, as in the case of dog/bitch, shoe/boot, finger/thumb and animal/bird. A survey of how such pairs are represented in monolingual English dictionaries showed that dictionaries mostly represent as explicitly polysemous those lexical items whose broader and narrower readings are more distinctive and clearly separable in definitional terms. They commonly only represented the broader readings for terms that are in fact frequently used in the narrower reading, as shown by data from the British National Corpus
Recommended from our members
Beyond definition: Organising semantic information in bilingual dictionaries
This paper considers the process of organising semantic information in bilingual dictionaries with diachronic coverage, from selecting the textual source-material to designing the entries. The discussion centres on practical aspects of ancient Greek lexicography. First, the traditional semantic frameworks are described. Then, more recent approaches are noted, notably those of Adrados and of Chadwick, both of which aim to integrate contextual data within a semantic framework. Since the relevance of contextual information varies with lemma part of speech, different configurations are required for entries describing nouns, adjectives, and verbs. These are illustrated by three entries from a Greek-English dictionary currently being written at Cambridge. In order to organise data to this level of specificity, stylistic templates are indispensable, and digital software provides a means of providing them. However, systems designed for writing new dictionaries require different features from those designed for encoding pre-existing texts. A description is given of how the lexicographic requirements of the Cambridge dictionary were met by a user-designed system
Negotiating cultures in cyberspace
In this paper we report findings of a multidisciplinary study of online participation by culturally diverse participants in a distance adult education course offered in Canada and examine in detail three of the study's findings. First, we explore both the historical and cultural origins of "cyberculture values" as manifested in our findings, using the notions of explicit and implicit enforcement of those values and challenging the assumption that cyberspace is a culture free zone. Second, we examine the notion of cultural gaps between participants in the course and the
potential consequences for online communication successes and difficulties. Third, the analysis describes variations in participation frequency as a function of broad cultural groupings in our data. We identify the need for additional research, primarily in the form of larger scale comparisons across cultural groups of patterns of participation and interaction, but also in the form of case studies that can be submitted to microanalyses of the form as well as the content of communicator's participation and interaction online
Bethlehem in the Bronze and Iron Ages in the light of recent discoveries by the Palestinian MOTA-DACH
The discovery of the necropolis of Khalet al-Jam’a, around 2.2 Km south-east of Bethlehem (Nigro et al. in this volume), provides new data on the Bronze and Iron Age town which controlled the main route connecting Jerusalem to Hebron, and the access to the wadiat crossing the southern Judean desert and leading to the coastal plain. Intermediate Bronze Age/Early Bronze IV, Middle Bronze shaft tombs, and at least two major Iron II burial caves (Tomb A7 and the Barmil’s Tomb) excavated by the Palestinian MOTA-DACH in an Iron Age cemetery allow to draw up a renewed picture of Bethlehem and its environs and give the opportunity to re-appraise its long history
Updating the Dolch Basic Sight Vocabulary
Word lists for reading instruction have long been of interest to educators. It has been noted by Johnson and Barrett (16) that over 125 word lists have been constructed during the past seventy years. Of these many word lists, there is little doubt that the Dolch list has received widest publication and use. Authors of texbooks on the teaching of reading (1, 8, 18, 23) have made reference to the Dolch list with suggestions for teaching the words. In addition, many reading materials have been developed to help teach these words in isolation and in context. Books have been written with the Dolch words and a small number of nouns to give children practice in using these words in a natural reading situation. Johnson (15) is probably correct in observing that hundreds of thousands of children have been asked to learn these 220 basic words
L1 transfer in the acquisition of manner and path in Spanish by native speakers of English
In this article the authors argue that L1 transfer from English is not only important in the early stages of L2 acquisition of Spanish, but remains influential in later stages if there is not enough positive evidence for the learners to progress in their development (Lefebvre, White, & Jourdan, 2006). The findings are based on analyses of path and manner of movement in stories told by British students of Spanish (N = 68) of three different proficiency levels. Verbs that conflate motion and path, on the one hand, are mastered early, possibly because the existence of Latinate path verbs, such as enter and ascend in English, facilitate their early acquisition by British learners of Spanish. Contrary to the findings of Cadierno (2004) and Cadierno and Ruiz (2006), the encoding of manner, in particular in boundary crossing contexts, seems to pose enormous difficulties, even among students who had been abroad on a placement in a Spanish-speaking country prior to the data collection. An analysis of the frequency of manner verbs in Spanish corpora shows that one of the key reasons why students struggle with manner is that manner verbs are so infrequent in Spanish. The authors claim that scarce positive evidence in the language exposed to and little or no negative evidence are responsible for the long-lasting effect of transfer on the expression of manner
STARC: Structured Annotations for Reading Comprehension
We present STARC (Structured Annotations for Reading Comprehension), a new
annotation framework for assessing reading comprehension with multiple choice
questions. Our framework introduces a principled structure for the answer
choices and ties them to textual span annotations. The framework is implemented
in OneStopQA, a new high-quality dataset for evaluation and analysis of reading
comprehension in English. We use this dataset to demonstrate that STARC can be
leveraged for a key new application for the development of SAT-like reading
comprehension materials: automatic annotation quality probing via span ablation
experiments. We further show that it enables in-depth analyses and comparisons
between machine and human reading comprehension behavior, including error
distributions and guessing ability. Our experiments also reveal that the
standard multiple choice dataset in NLP, RACE, is limited in its ability to
measure reading comprehension. 47% of its questions can be guessed by machines
without accessing the passage, and 18% are unanimously judged by humans as not
having a unique correct answer. OneStopQA provides an alternative test set for
reading comprehension which alleviates these shortcomings and has a
substantially higher human ceiling performance.Comment: ACL 2020. OneStopQA dataset, STARC guidelines and human experiments
data are available at https://github.com/berzak/onestop-q
- …