Rank-frequency relation for Chinese characters
We show that Zipf's law for Chinese characters holds perfectly for
sufficiently short texts (a few thousand different characters). The scenario of
its validity is similar to that of Zipf's law for words in short English texts. For
long Chinese texts (or for mixtures of short Chinese texts), rank-frequency
relations for Chinese characters display a two-layer, hierarchic structure that
combines a Zipfian power-law regime for frequent characters (first layer) with
an exponential-like regime for less frequent characters (second layer). For
these two layers we provide different (though related) theoretical descriptions
that include the range of low-frequency characters (hapax legomena). The
comparative analysis of rank-frequency relations for Chinese characters versus
English words illustrates the extent to which the characters play for Chinese
writers the same role as the words for those writing within alphabetical
systems.
Comment: To appear in European Physical Journal B (EPJ B), 2014 (22 pages, 7 figures).
Literary machine translation under the magnifying glass : assessing the quality of an NMT-translated detective novel on document level
Several studies (covering many language pairs and translation tasks) have demonstrated that translation quality has improved enormously since the emergence of neural machine translation systems. This raises the question of whether such systems are able to produce high-quality translations for more creative text types such as literature and whether they are able to generate coherent translations at the document level. Our study aimed to investigate these two questions by carrying out a document-level evaluation of the raw NMT output of an entire novel. We translated Agatha Christie's novel The Mysterious Affair at Styles with Google's NMT system from English into Dutch and annotated it in two steps: first all fluency errors, then all accuracy errors. We report on the overall quality, determine the remaining issues, compare the most frequent error types to those in general-domain MT, and investigate whether any accuracy and fluency errors co-occur regularly. Additionally, we assess the inter-annotator agreement on the first chapter of the novel.
Selecting ELL Textbooks: A Content Analysis of Ethnicity Depicted in Illustrations and Writing
In an effort to respond to the need for culturally appropriate English Language Learning (ELL) resources for adolescent immigrants, the researchers gathered 64 textbooks actually in use in eight Milwaukee middle schools to analyze their content for the range of ethnic diversity depicted in illustrations and written text. The eight school settings selected provided a broad range of materials to analyze. In addition, these materials reflect both public and Catholic teachers’ resource selection in predominantly Latino and Southeast Asian American classroom contexts. The settings were chosen with the advice of administrators and teachers as schools they perceived to be in greatest need of ELL curriculum and instruction development. Based upon their findings, the researchers draw some initial conclusions and recommendations for the selection of culturally appropriate textbooks that fit the cultural contexts of the learners. Finally, the study provides as appendices the bibliography of textbooks under analysis and sample coding instruments used to analyze the content of these textbooks.
The emergence of prosody in linguistic theory
Prosody is a unique character in the production of sounds. Human speech is particularly marked by prosody for various functions in the different aspects of linguistics (e.g. phonology, morphology, sociolinguistics). The importance of prosody in human language had been known since very early periods of modern civilisation. Both Western and Eastern traditions had put a lot of emphasis on the proper practice of prosodic rhymes and rhythms in the use of language whether it was for analysing grammar or for praying to God or any other superior spirit. Subsequent developments in linguistics have revealed the central role played by prosody in determining the innate grammar of human language. This paper attempts to discuss in brief the evolution of the thought on prosody and its current standing in the field of linguistics.
Peer-reviewed.
News discourses on distant suffering: A critical discourse analysis of the 2003 SARS outbreak
News carries a unique signifying power, a power to represent events in particular ways (Fairclough, 1995). Applying Critical Discourse Analysis and Chouliaraki's theory on the mediation of suffering (2006), this article explores the news representation of the 2003 global SARS outbreak. Following a case-based methodology, we investigate how two Belgian television stations covered the international outbreak of SARS. By looking into the mediation of four selected discursive moments, underlying discourses of power, hierarchy and compassion were unraveled. The analysis further identified the key role of proximity in international news reporting and supports the claim that Western news media mainly reproduce a Euro-American-centered world order. This article argues that news coverage of international crises such as SARS constructs and maintains the socio-cultural difference between 'us' and 'them' as well as articulating global power hierarchies and a division of the world into zones of poverty and prosperity, danger and safety.