484 research outputs found

    Investigating the Relationship between the Morphological Processing of Regular and Irregular Words and L2 Vocabulary Acquisition

    Get PDF
    Word formation in Arabic is rather different from English and relies more heavily on derivation rather than word creation. This study tests whether this difference may impact on the learning of words in English. Results of the study suggest that words that are irregularly derived in English are subject to a frequency effect in learning while regularly derived words are not. Results suggest that the predisposition of English for these irregular constructions may be a barrier to learning for learners with an aarabic speaking L1 background

    Proceedings of the 17th Annual Conference of the European Association for Machine Translation

    Get PDF
    Proceedings of the 17th Annual Conference of the European Association for Machine Translation (EAMT

    Tackling Sequence to Sequence Mapping Problems with Neural Networks

    Full text link
    In Natural Language Processing (NLP), it is important to detect the relationship between two sequences or to generate a sequence of tokens given another observed sequence. We call the type of problems on modelling sequence pairs as sequence to sequence (seq2seq) mapping problems. A lot of research has been devoted to finding ways of tackling these problems, with traditional approaches relying on a combination of hand-crafted features, alignment models, segmentation heuristics, and external linguistic resources. Although great progress has been made, these traditional approaches suffer from various drawbacks, such as complicated pipeline, laborious feature engineering, and the difficulty for domain adaptation. Recently, neural networks emerged as a promising solution to many problems in NLP, speech recognition, and computer vision. Neural models are powerful because they can be trained end to end, generalise well to unseen examples, and the same framework can be easily adapted to a new domain. The aim of this thesis is to advance the state-of-the-art in seq2seq mapping problems with neural networks. We explore solutions from three major aspects: investigating neural models for representing sequences, modelling interactions between sequences, and using unpaired data to boost the performance of neural models. For each aspect, we propose novel models and evaluate their efficacy on various tasks of seq2seq mapping.Comment: PhD thesi

    How many words do you need to speak Arabic? An Arabic vocabulary size test

    Get PDF
    This study describes a vocabulary size test in Arabic used with 339 nativespeaking learners at school and university in Saudi Arabia. Native speakervocabulary size scores should provide targets for attainment for learners ofArabic, should inform the writers of course books and teaching materials,and the test itself should allow learners to monitor their progress towardsthe goal of fluency. Educated native speakers of Arabic possess arecognition vocabulary about 25,000 words, a total which is largecompared with equivalent test scores of native speakers of English. Theresults also suggest that acquisition increases in speed with age and thisis tentatively explained by the highly regular system of morphologicalderivation which Arabic uses and which, it is thought, is acquired inadolescence. This again appears different from English where the rate ofacquisition appears to decline with age. While the test appears reliableand valid, there are issues surrounding the definition of a word in Arabicand further research into how words are stored, retrieved and processedin Arabic is needed to inform the construction of further tests whichmight, it is thought, profitably use a more encompassing definition ofthe lemma as the basis for testing
    • …
    corecore