4,091 research outputs found

    Korean to English Translation Using Synchronous TAGs

    Full text link
    It is often argued that accurate machine translation requires reference to contextual knowledge for the correct treatment of linguistic phenomena such as dropped arguments and accurate lexical selection. One of the historical arguments in favor of the interlingua approach has been that, since it revolves around a deep semantic representation, it is better able to handle the types of linguistic phenomena that are seen as requiring a knowledge-based approach. In this paper we present an alternative approach, exemplified by a prototype system for machine translation of English and Korean which is implemented in Synchronous TAGs. This approach is essentially transfer based, and uses semantic feature unification for accurate lexical selection of polysemous verbs. The same semantic features, when combined with a discourse model which stores previously mentioned entities, can also be used for the recovery of topicalized arguments. In this paper we concentrate on the translation of Korean to English.Comment: ps file. 8 page

    Does orthographic overlap influence lexical selection?

    Get PDF
    Understanding whether and how various processes interact in language production can help us both understand aphasic errors and develop theoretically motivated treatment approaches. We examined semantic errors produced in writing-to-dictation by an individual with acquired dysgraphia to determine whether letter-level information – particularly overlap between the target and the semantic error – can affect lexical selection processes in these errors. Our results indicated that the particular semantic errors that were produced were significantly more likely to share orthographic structure than would be expected by chance alone, indicating interaction in the form of feedback from letter-level processes to lexical selection

    A syntactic skeleton for statistical machine translation

    Get PDF
    We present a method for improving statistical machine translation performance by using linguistically motivated syntactic information. Our algorithm recursively decomposes source language sentences into syntactically simpler and shorter chunks, and recomposes their translation to form target language sentences. This improves both the word order and lexical selection of the translation. We report statistically significant relative improvementsof 3.3% BLEU score in an experiment (English!Spanish) carried out on an 800-sentence test set extracted from the Europarl corpus

    Lexical Selection in Multi-Word Production

    Get PDF
    In multi-word utterances, target words need to be selected in the context of other target words. In the present study, three hypotheses were tested that differed in their assumptions about whether the lexical selection mechanism considers the activation levels of the other target lexical representations, and whether it takes into account their grammatical class properties. Participants produced adjective + noun and noun + noun utterances in response to colored word and picture + word stimulus displays. In both types of utterances, the frequency of the first and second response was manipulated. The results revealed an effect of the frequency of the second response that did not depend on the utterance type, and additive effects for the frequency of the first and the second response in both utterance types. These results are interpreted in terms of a model of lexical selection that assumes that selection is non-competitive

    Bilingual lexical selection as a dynamic process:evidence from Arabic-French bilinguals

    Get PDF
    The nature of the lexical selection process in bilingual spoken word production is one of the pending questions of research on bilingualism. According to one view this competitive process is language-specific, while another holds that it is language-nonspecific (i.e., lexical competition is cross-linguistic). In recent years, research on bilingual language production has seen the rise of a third view that postulates that lexical selection is in fact dynamic and may function as language-specific or nonspecific depending on a number of factors. The aim of the present study was to investigate the lexical selection process among moderately proficient bilinguals whose two languages are typologically distant: Tunisian Arabic and French. The picture-word interference task was used in two experiments where moderately proficient Tunisian Arabic (L1)-French (L2) bilinguals were asked to name pictures in their L2 while ignoring auditory distractors (semantic, phono-translation, phonological, or unrelated) in their L2 (Experiment 1) or their L1 (Experiment 2). Thus, the language context was entirely monolingual in Experiment 1 and bilingual in Experiment 2. In Experiment 1, only a phonological facilitation effect was observed. In Experiment 2, interference was found in the phono-translation, semantic, and phonological conditions. Taken together, these results indicate that cross-language competition occurs among moderately proficient Tunisian Arabic-French bilinguals only in a bilingual context (Experiment 2) as indexed by the phono-translation interference effect observed. Our findings are in line with the recent hypothesis that lexical selection is a dynamic process modulated by factors like language similarity, language proficiency, and the experimental language context

    Lexical selection and the evolution of language units

    Get PDF
    In this paper I discuss similarities and differences between a potential new model of language development - lexical selection, and its biological equivalent - natural selection. Based on Dawkins' (1976) concept of the meme I discuss two units of language and explore their potential to be seen as linguistic replicators. The central discussion revolves around two key parts - the units that could potentially play the role of replicators in a lexical selection system and a visual representation of the model proposed. draw on work by Hoey (2005), Wray (2008) and Sinclair (1996, 1998) for the theoretical basis; Croft (2000) is highlighted as a similar framework. Finally brief examples are taken from the free online corpora provided by the corpus analysis tool Sketch Engine (Kilgarriff, Rychly, Smrz and Tugwell 2004) to ground the discussion in real world communicative situations. The examples highlight the point that different situational contexts will allow for different units to flourish based on the local social and linguistic environment. The paper also shows how a close look at the specific context and strings available to a language user at any given moment has potential to illuminate different aspects of language when compared with a more abstract approach
    corecore