1,071 research outputs found
Corpus-driven vs. corpus-based approach to the study of relational patterns
Contextual lexical relations, such as sense relations, have traditionally played an essential role in disambiguating word senses in lexicography, as they offer insights into the meaning and use of a word. However, the description of paradigmatic relations in particular is often restricted to a few types such as synonymy and antonymy. The limited description of various types of relations and the method of presenting these relations in existing German dictionaries are often problematic.
Elexiko, the first German hypertext dictionary compiled exclusively on the basis of an electronic corpus, offers a new way of presenting sense relations, using a variety of approaches to extract the necessary data. In this paper, I will show how elexiko presents a differentiated system of paradigmatic relations including synonymy, various subtypes of incompatibility (such as antonymy, complementarity, converseness, reversiveness, etc.), and vertical structures (such as hyponymy and meronymy). Primary attention, however, will focus on the question of how data for a paradigmatic description is retrieved from the corpus. Whereas a corpus-driven approach is mainly used for various semantic information and a corpus-based method plays an important part in obtaining data for the grammatical description in elexiko, it will be argued that both the corpus-driven and the corpus-based approach can be complementary methods in gaining insights into sense relations. I will demonstrate which results can be obtained by each approach, and advantages and disadvantages of both procedures will be explored in more detail.
As sense relations are context-dependent, it will also be demonstrated how a sense-bound presentation can be realised in an electronic reference work including a system of cross-referencing that illustrates lexical structures and the interrelatedness of words within the lexicon. Finally, I will show how accompanying examples from the corpus and additional lexicographic information help the user to understand contextual restrictions, so that s/he is able to use dictionary information more effectively.
Selecting artificially-generated sentences for fine-tuning neural machine translation
Neural Machine Translation (NMT) models tend to achieve best performance when larger sets of parallel sentences are provided for training. For this reason, augmenting the training set with artificially-generated sentence pairs can boost performance. Nonetheless, the performance can also be improved with a small number of sentences if they are in the same domain as the test set. Accordingly, we want to explore the use of artificially-generated sentences along with data-selection algorithms to improve German-to-English NMT models trained solely with authentic data. In this work, we show how artificially-generated sentences can be more beneficial than authentic pairs, and demonstrate their advantages when used in combination with data-selection algorithms.
Communication in today's network
Technology has given us many ways of communicating. The fastest-emerging and most rapidly evolving network for exchanging information today is social media. Communicating in Today’s Network is a thesis that examines and explores different trends in social media. The body of this work is the result of data collected by surveying active users of social media in society. The results are visually communicated through information graphics. The thesis is intended to inform designers of the importance of learning and identifying ways to communicate with our newly developing medium, social media.
Social Deprivation and Exclusion of Immigrants in Germany
This paper aims at providing empirical evidence on social exclusion of immigrants in Germany. We demonstrate that when using a conventional definition of the social inclusion index typically applied in the literature, immigrants appear to experience a significant degree of social deprivation and exclusion, confirming much of the economic literature examining the economic assimilation of immigrants in Germany. We propose a weighting scheme that weights components of social inclusion by their subjective contribution to an overall measure of life satisfaction. Using this weighting scheme to calculate an index of social inclusion, we find that immigrants are in fact as "included" as Germans. This result is driven strongly by the disproportionately positive socio-demographic characteristics that immigrants possess as measured by the contribution to their life satisfaction. Keywords: social exclusion, international migration, integration
The Problems with Problem Solving: Reflections on the Rise, Current Status, and Possible Future of a Cognitive Research Paradigm
The research paradigm invented by Allen Newell and Herbert A. Simon in the late 1950s dominated the study of problem solving for more than three decades. But in the early 1990s, problem solving ceased to drive research on complex cognition. As part of this decline, Newell and Simon’s most innovative research practices – especially their method for inducing subjects’ strategies from verbal protocols – were abandoned. In this essay, I summarize Newell and Simon’s theoretical and methodological innovations and explain why their strategy identification method did not become a standard research tool. I argue that the method lacked a systematic way to aggregate data, and that Newell and Simon’s search for general problem solving strategies failed. Paradoxically, the theoretical vision that led them to search elsewhere for general principles led researchers away from studies of complex problem solving. Newell and Simon’s main enduring contribution is the theory that people solve problems via heuristic search through a problem space. This theory remains the centerpiece of our understanding of how people solve unfamiliar problems, but it is seriously incomplete. In the early 1970s, Newell and Simon suggested that the field should focus on the question of where problem spaces and search strategies come from. I propose a breakdown of this overarching question into five specific research questions. Principled answers to those questions would expand the theory of heuristic search into a more complete theory of human problem solving.
Approaching religion through linguistics: methodological thoughts on a linguistic analysis of 'religion' in political communication
The constructions of ‘religion’ in general language are seldom themselves the focus of empirical research. Aiming to retrieve the inherent knowledge that lies within these constructions, this article suggests a term-based textual analysis to focus on the linguistic use of ‘religion’. This method invites us to question the unity of texts through an analysis of textual semantics. It offers the chance to ask about the formation of the concept. The article initially shows how this approach differs from comparative and policy-oriented studies by differentiating between criticism and critique. It then develops the idea of a term-based textual analysis. Using examples from the policy field of foreign aid, the text illustrates how much inherent knowledge there is in the usage of ‘religion’ in political communication and calls for a general reconsideration of the way ‘religion’ is approached in empirical research.