9,034 research outputs found
Common vocabularies for collective intelligence - work in progress
Web based applications and tools offer a great potential to increase the efficiency of information flow and communication among different agents during emergencies. Among the different factors, technical and non technical, that hinder the integration of an information model in emergency management sector, is a lack of a common, shared vocabulary. This paper furthers previous work in the area of ontology development, and presents a summary and overview of the goal, process and methodology to construct a shared set of metadata that can be used to map existing vocabulary. This paper is a work in progress report
Collaborative tagging as a knowledge organisation and resource discovery tool
The purpose of the paper is to provide an overview of the collaborative tagging phenomenon and explore some of the reasons for its emergence. Design/methodology/approach - The paper reviews the related literature and discusses some of the problems associated with, and the potential of, collaborative tagging approaches for knowledge organisation and general resource discovery. A definition of controlled vocabularies is proposed and used to assess the efficacy of collaborative tagging. An exposition of the collaborative tagging model is provided and a review of the major contributions to the tagging literature is presented. Findings - There are numerous difficulties with collaborative tagging systems (e.g. low precision, lack of collocation, etc.) that originate from the absence of properties that characterise controlled vocabularies. However, such systems can not be dismissed. Librarians and information professionals have lessons to learn from the interactive and social aspects exemplified by collaborative tagging systems, as well as their success in engaging users with information management. The future co-existence of controlled vocabularies and collaborative tagging is predicted, with each appropriate for use within distinct information contexts: formal and informal. Research limitations/implications - Librarians and information professional researchers should be playing a leading role in research aimed at assessing the efficacy of collaborative tagging in relation to information storage, organisation, and retrieval, and to influence the future development of collaborative tagging systems. Practical implications - The paper indicates clear areas where digital libraries and repositories could innovate in order to better engage users with information. Originality/value - At time of writing there were no literature reviews summarising the main contributions to the collaborative tagging research or debate
Towards an automated query modification assistant
Users who need several queries before finding what they need can benefit from
an automatic search assistant that provides feedback on their query
modification strategies. We present a method to learn from a search log which
types of query modifications have and have not been effective in the past. The
method analyses query modifications along two dimensions: a traditional
term-based dimension and a semantic dimension, for which queries are enriches
with linked data entities. Applying the method to the search logs of two search
engines, we identify six opportunities for a query modification assistant to
improve search: modification strategies that are commonly used, but that often
do not lead to satisfactory results.Comment: 1st International Workshop on Usage Analysis and the Web of Data
(USEWOD2011) in the 20th International World Wide Web Conference (WWW2011),
Hyderabad, India, March 28th, 201
Exploring the Mental Lexicon of the Multilingual: Vocabulary Size, Cognate Recognition and Lexical Access in the L1, L2 and L3
Recent empirical findings in the field of Multilingualism have shown that the mental lexicon of a language learner does not consist of separate entities, but rather of an intertwined system where languages can interact with each other (e.g. Cenoz, 2013; Szubko-Sitarek, 2015). Accordingly, multilingual language learners have been considered differently to second language learners in a growing number of studies, however studies on the variation in learners’ vocabulary size both in the L2 and L3 and the effect of cognates on the target languages have been relatively scarce. This paper, therefore, investigates the impact of prior lexical knowledge on additional language learning in the case of Hungarian native speakers, who use Romanian (a Romance language) as a second language (L2) and learn English as an L3. The study employs an adapted version of the widely used Vocabulary Size Test (Nation & Beglar, 2007), the Romanian Vocabulary Size Test (based on the Romanian Frequency List; Szabo, 2015) and a Hungarian test (based on a Hungarian frequency list; Varadi, 2002) in order to measure vocabulary sizes, cognate knowledge and response times in these languages. The findings, complemented by a self-rating language background questionnaire, indicate a strong link between Romanian and English lexical proficiency
Bilingual vocabulary acquisition A corpus-based case study
A bilingual learning two languages from birth seems to follow the same developmental trajectory as that of a monolingual. However, within vocabulary acquisition persistent and significant differences have been found between bilingual and monolinguals when the languages of bilinguals are compared separately to monolinguals’ level. The domain specific nature of a bilingual’s vocabularies is an important characteristic in understanding this difference, but also when investigating a bilingual’s vocabulary acquisition. The role of context in which a bilingual acquires his or her languages is crucial to the aspect of domain specificity. An important theory here is the Complementarity Principle, emphasizing how different domains in life require different languages and that the vocabularies and proficiencies of a bilingual’s languages will develop thereafter. This theory has mostly been investigated in adult bilingualism.
This study examines the English and Norwegian vocabularies of a 2-year-old bilingual girl, Emma. Emma is growing up in northern Norway and has acquired Norwegian and English from birth. The thesis uses corpus data collected between the ages of 2;7-2;11 to examine the expressive lexical characteristics of Emma’s vocabularies. Emma is a relatively balanced bilingual, with an asymmetrical code-switching pattern. Her vocabulary levels do not match those of monolingual peers when her languages are compared separately, but when her total and conceptual vocabularies are compared to monolingual vocabulary levels, her results are more comparable. Based on this, Emma’s results are discussed in relation to the implications of the complementarity principle and current research on monolingual and bilingual comparisons
Thesaurus-assisted search term selection and query expansion: a review of user-centred studies
This paper provides a review of the literature related to the application of domain-specific thesauri in the search and retrieval process. Focusing on studies which adopt a user-centred approach, the review presents a survey of the methodologies and results from empirical studies undertaken on the use of thesauri as sources of term selection for query formulation and expansion during the search process. It summaries the ways in which domain-specific thesauri from different disciplines have been used by various types of users and how these tools aid users in the selection of search terms. The review consists of two main sections covering, firstly studies on thesaurus-aided search term selection and secondly those dealing with query expansion using thesauri. Both sections are illustrated with case studies that have adopted a user-centred approach
Recommended from our members
Introducing a Romanian Frequency List and the Romanian Vocabulary Levels Test
Vocabulary is considered essential to language learning, thus English word lists and tests based on frequency information have become the centre of attention for researchers, teachers and learners alike. As a result, it is argued hereby that frequency based word lists and tests should be adapted and regarded as key elements for teaching and learning Romanian as an additional language as well.
Since there are currently no reliable frequency lists and lexical tests in Romanian, this paper aims to bridge this gap by introducing the first Romanian Word List and the Romanian Vocabulary Levels Test. The list contains the 10,000 most frequent Romanian words and is based on the Romanian Balanced Annotated Corpus (ROMBAC, Ion, Irimia, Ștefănescu, Tufiș 2012).
The primary objective of the paper is to elaborate on the compilation criteria, the challenges involved and the benefits of such a list in the case of teaching, learning and curriculum design for Romanian as an additional language. The secondary objective is to present a practical application of the word list by introducing an exemplary Romanian lexical test, the Romanian Vocabulary Levels Test and examine its reliability and validity
Recommended from our members
Lexical organization in deaf children who use British Sign Language: Evidence from a semantic fluency task
We adapted the semantic fluency task into British Sign Language (BSL). In Study 1, we present data from twenty-two deaf signers aged four to fifteen. We show that the same ‘cognitive signatures’ that characterize this task in spoken languages are also present in deaf children, for example, the semantic clustering of responses. In Study 2, we present data from thirteen deaf children with Specific Language Impairment (SLI) in BSL, in comparison to a subset of children from Study 1 matched for age and BSL exposure. The two groups' results were comparable in most respects. However, the group with SLI made occasional word-finding errors and gave fewer responses in the first 15 seconds. We conclude that deaf children with SLI do not differ from their controls in terms of the semantic organization of the BSL lexicon, but that they access signs less efficiently
- …