Search CORE

63 research outputs found

Proceedings of the Sixteenth Australasian International Conference on Speech Science and Technology

Author
Publication venue: ASSTA
Publication date: 31/12/2016
Field of study

Chinese elements : a bridge of the integration between Chinese -English translation and linguaculture transnational mobility

Author: Yang Jinbao
Publication venue: The American Scholars Press, Inc.
Publication date: 01/01/2016
Field of study

[Abstract] As the popularity of Chinese elements in the innovation of the translation part in Chinese CET, we realized that Chinese elements have become a bridge between linguaculture transnational mobility and Chinese-English translation.So, Chinese students translation skills should be critically improved; for example, on their understanding about Chinese culture, especially the meaning of Chinese culture. Five important secrets of skillful translation are introduced to improve students’ translation skills

Ghent University Academic Bibliography

Word Morphology and Written Language Acquisition:Insights from Typical and Atypical Development in Different Orthographies

Author
Publication venue: 'Frontiers Media SA'
Publication date: 05/06/2019
Field of study

University of Dundee Online Publications

An automatic morphological analysis system for Indonesian

Author: Prihantoro Prihantoro
Publication venue: Lancaster University
Publication date: 10/11/2021
Field of study

This thesis reports the creation of SANTI-morf (Sistem Analisis Teks Indonesia – morfologi), a rule-based system that performs morphological annotation for Indonesian. The system has been built across three stages, namely preliminaries, annotation scheme creation (the linguistic aspect of the project), and system implementation (the computational aspect of the project). The preliminary matters covered include the necessary key concepts in morphology and Natural Language Processing (NLP), as well as a concise description of Indonesian morphology (largely based on the two primary reference grammars of Indonesian, Alwi et al. 1998 and Sneddon et al. 2010, together with work in the linguistic literature on Indonesian morphology (e.g. Kridalaksana 1989; Chaer 2008). As part of this preliminary stage, I created a testbed corpus for evaluation purposes. The design of the testbed is justified by considering the design of existing evaluation corpora, such as the testbed used by the English Constraint Grammar or EngCG system (Voutilanen 1992), the British National Corpus (BNC) 1994 evaluation data , and the training data used by MorphInd (Larasati et al. 2011), a morphological analyser (MA) for Indonesian. The dataset for this testbed was created by narrowing down an existing very large bit unbalanced collection of texts (drawn from the Leipzig corpora; see Goldhahn et al. 2012). The initial collection was reduced to a corpus composed of nine domains following the domain categorisation of the BNC) . A set of texts from each domain, proportional in size, was extracted and combined to form a testbed that complies with the design cited informed by the prior literature. The second stage, scheme creation, involved the creation of a new Morphological Annotation Scheme (MAS) for Indonesian, for use in the SANTI-morf system. First, a review of MASs in different languages (Finnish, Turkish, Arabic, Indonesian) as well as the Universal Dependencies MAS identifies the best practices in the field. From these, 15 design principles for the novel MAS were devised. This MAS consists of a morphological tagset, together with comprehensive justification of the morphological analyses used in the system. It achieves full morpheme-level annotation, presenting each morpheme’s orthographic and citation forms in the defined output, accompanied by robust morphological analyses, both formal and functional; to my knowledge, this is the first MAS of its kind for Indonesian. The MAS’s design is based not only on reference grammars of Indonesian and other linguistic sources, but also on the anticipated needs of researchers and other users of texts and corpora annotated using this scheme of analysis. The new MAS aims at The third stage of the project, implementation, consisted of three parts: a benchmarking evaluation exercise, a survey of frameworks and tools, leading ultimately to the actual implementation and evaluation of SANTI-morf. MorphInd (Larasati et al. 2012) is the prior state-of-the-art MA for Indonesian. That being the case, I evaluated MorphInd’s performance against the aforementioned testbed, both as just5ification of the need for an improved system, and to serve as a benchmark for SANTI-morf. MorphInd scored 93% on lexical coverage and 89% on tagging accuracy. Next, I surveyed existing MAs frameworks and tools. This survey justifies my choice for the rule-based approach (inspired by Koskenniemi’s 1983 Two Level Morphology, and NooJ (Silberztein 2S003) as respectively the framework and the software tool for SANTI-morf. After selection of this approach and tool, the language resources that constitute the SANTI-morf system were created. These are, primarily, a number of lexicons and sets of analysis rules, as well as necessary NooJ system configuration files. SANTI-morf’s 3 lexicon files (in total 86,590 entries) and 15 rule files (in total 659 rules) are organised into four modules, namely the Annotator, the Guesser, the Improver and the Disambiguator. These modules are applied one after another in a pipeline. The Annotator provides initial morpheme-level annotation for Indonesian words by identifying their having been built according to various morphological processes (affixation, reduplication, compounding, and cliticisation). The Guesser ensures that words not covered by the Annotator, because they are not covered by its lexicons, receive best guesses as to the correct analysis from the application of a set of probable but not exceptionless rules. The Improver improves the existing annotation, by adding probable analyses that the Annotator might have missed. Finally, the Disambiguator resolves ambiguities, that is, words for which the earlier elements of the pipeline have generated two or more possible analyses in terms of the morphemes identified or their annotation. NooJ annotations are saved in a binary file, but for evaluation purposes, plain-text output is required. I thus developed a system for data export using an in-NooJ mapping to and from a modified, exportable expression of the MAS, and wrote a small program to enable re-conversion of the output in plain-text format. For purposes of the evaluation, I created a 10,000 -word gold-standard SANTI-morf manually-annotated dataset. The outcome of the evaluation is that SANTI-morf has 100% coverage (because a best-guess analysis is always provided for unrecognised word forms), and 99% precision and recall for the morphological annotations, with a 1% rate of remaining ambiguity in the final output. SANTI-morf is thus shown to present a number of advancements over MorphInd, the state-of-the-art MA for Indonesian, exhibiting more robust annotation and better coverage. Other performance indicators, namely the high precision and recall, make SANTI-morf a concrete advance in the field of automated morphological annotation for Indonesian, and in consequence a substantive contribution to the field of Indonesian linguistics overall

Lancaster E-Prints

European Approaches to Japanese Language and Linguistics

Author: Heinrich Patrick
Pappalardo Giuseppe
Publication venue: 'Edizioni Ca Foscari'
Publication date: 01/01/2020
Field of study

In this volume European specialists of Japanese language present new and original research into Japanese over a wide spectrum of topics which include descriptive, sociolinguistic, pragmatic and didactic accounts. The articles share a focus on contemporary issues and adopt new approaches to the study of Japanese that often are specific to European traditions of language study. The articles address an audience that includes both Japanese Studies and Linguistics. They are representative of the wide range of topics that are currently studied in European universities, and they address scholars and students alike

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari

Word Morphology and Written Language Acquisition:Insights from Typical and Atypical Development in Different Orthographies

Author
Publication venue: 'Frontiers Media SA'
Publication date: 05/06/2019
Field of study

University of Dundee Online Publications

A Sound Approach to Language Matters: In Honor of Ocke-Schwen Bohn

Author: Avesani Cinzia
Baker Brett Joseph
Balling Laura Winther
Behne Dawn M.
Best Catherine
Bundgaard-Nielsen Rikke
Carlet Angélica
Cebrian Juli
Christensen Ken Ramshøj
Cooper Angela
Flege James Emil
Hejná Michaela
Hejná Mísa
Horslund Camilla Søballe
Hua Congehao
Højen Anders
Højen Anders
Jespersen Anna
Jespersen Anna Bothe
Jongman Allard
Jørgensen Henrik
Karmeli Sophia
Kizach Johannes
Kluge Denise Cristina
Lee Goun
Li Bin
Li Yingjie
Masapollo Matthew
Mooshammer Christine
Mora Joan C.
Mora-Plaza Ingrid
Niebuhr Oliver
Nyvad Anne Mette
Nyvad Anne Mette
Piske Thorsten
Polka Linda
Rasmussen Sidsel
Ruan Yufang
Sereno Joan A.
Steinlen Anja
Sørensen Mette Hjortshøj
Sørensen Mette Hjortshøj
Tyler Michael
Vayra Mario
Vikner Sten
Wang Yue
Wayland Ratree
Whalen D. H.
Wood Johanna
Yan Mengzhu
Publication venue: 'Aarhus University Library'
Publication date: 16/05/2019
Field of study

The contributions in this Festschrift were written by Ocke’s current and former PhD-students, colleagues and research collaborators. The Festschrift is divided into six sections, moving from the smallest building blocks of language, through gradually expanding objects of linguistic inquiry to the highest levels of description - all of which have formed a part of Ocke’s career, in connection with his teaching and/or his academic productions: “Segments”, “Perception of Accent”, “Between Sounds and Graphemes”, “Prosody”, “Morphology and Syntax” and “Second Language Acquisition”. Each one of these illustrates a sound approach to language matters

AU Library Scholarly Publishing Services: E-books (Aarhus University)

Dramaturgies for contemporary kabuki:Towards an understanding of the kokera otoshi celebrations at the Ginza Kabuki-za in 2013-14

Author: Parker Helen
Publication venue
Publication date: 26/08/2014
Field of study

Edinburgh Research Explorer

The Arabic (Re)dubbing of Wordplay in Disney Animated Films

Author: Aljuied Fatimah Mohammed J
Publication venue: UCL (University College London)
Publication date: 28/09/2021
Field of study

Although audiovisual translation (AVT) has received considerable attention in recent years, evidence suggests that there is a paucity of empirical research carried out on the dubbing of wordplay in the Arabophone countries. This piece of research sets to identify, describe and assess the most common translation techniques adopted by translators when dubbing English-language animated films into Arabic. The focus is on the special case of dubbing Disney animated films into Egyptian Arabic (EA) and their subsequent redubbing into Modern Standard Arabic (MSA), during the 1975-2015 period. The ultimate goal is to ascertain the similarities as well as the differences that set the two versions apart, particularly when it comes to the transfer of wordplay. To reach this objective, the methodological approach adopted for this study is a corpus of instances of wordplay that combines a quantitative phase, which has the advantage of identifying correlations between the types of wordplay and particular translation techniques and results and is then followed by a qualitative analysis that further probes the results and determines the different factors that contribute to the way wordplay is translated. The analysis reveals that, in their attempt to render this type of punning humour, in both Arabic dubbed versions, Arabic translators resort to a variety of translation techniques, namely, loan, direct translation, explication, paraphrase, substitution and omission. The examination of the data shows that achieving a humorous effect in the target dialogue is the top priority and driving factor influencing most of the strategies activated in the process of dubbing wordplay into EA. Dissimilarly, there is a noticeable lower amount of puns crossing over from the original films to the MSA dubbed versions, highlighting the fact that the approach generally taken by the dubbing teams seems to give priority to the denotative, informative dimension rather than the socio-pragmatic one. By shedding light on the intricacies of dubbing, it is hoped that this study would contribute to the advancement of knowledge in the translation of wordplay in the Arabophone countries and, more specifically, in the field of dubbing children’s programmes

UCL Discovery

Negative vaccine voices in Swedish social media

Author: Dimitrios Kokkinakis
Hammarlin Mia-Marie
Publication venue: 'International Society of Experimental Linguistics'
Publication date: 17/10/2022
Field of study

Vaccinations are one of the most significant interventions to public health, but vaccine hesitancy creates concerns for a portion of the population in many countries, including Sweden. Since discussions on vaccine hesitancy are often taken on social networking sites, data from Swedish social media are used to study and quantify the sentiment among the discussants on the vaccination-or-not topic during phases of the COVID-19 pandemic. Out of all the posts analyzed a majority showed a stronger negative sentiment, prevailing throughout the whole of the examined period, with some spikes or jumps due to the occurrence of certain vaccine-related events distinguishable in the results. Sentiment analysis can be a valuable tool to track public opinions regarding the use, efficacy, safety, and importance of vaccination

Lund University Publications