Search CORE

8,393 research outputs found

Examining Scientific Writing Styles from the Perspective of Linguistic Complexity

Author: Bu Yi
Ding Ying
Lu Chao
Schnaars Matthew
Torvik Vetle
Wang Jie
Zhang Chengzhi
Publication venue
Publication date: 12/09/2018
Field of study

Publishing articles in high-impact English journals is difficult for scholars around the world, especially for non-native English-speaking scholars (NNESs), most of whom struggle with proficiency in English. In order to uncover the differences in English scientific writing between native English-speaking scholars (NESs) and NNESs, we collected a large-scale data set containing more than 150,000 full-text articles published in PLoS between 2006 and 2015. We divided these articles into three groups according to the ethnic backgrounds of the first and corresponding authors, obtained by Ethnea, and examined the scientific writing styles in English from a two-fold perspective of linguistic complexity: (1) syntactic complexity, including measurements of sentence length and sentence complexity; and (2) lexical complexity, including measurements of lexical diversity, lexical density, and lexical sophistication. The observations suggest marginal differences between groups in syntactical and lexical complexity.Comment: 6 figure

arXiv.org e-Print Archive

IUScholarWorks Open

Adolescent Literacy and Textbooks: An Annotated Bibliography

Author: Michael Kamil
Publication venue: Carnegie Corporation of New York
Publication date: 09/09/2009
Field of study

A companion report to Carnegie's Time to Act, provides an annotated bibliography of research on textbook design and reading comprehension for fourth through twelfth grade, arranged by topic. Calls for a dialogue between publishers and researchers

IssueLab

Ensuring Readability and Data-fidelity using Head-modifier Templates in Deep Type Description Generation

Author: Chen Jiangjie
Feng Suo
Jiang Haiyun
Li Chenguang
Wang Ao
Xiao Yanghua
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2019
Field of study

A type description is a succinct noun compound which helps human and machines to quickly grasp the informative and distinctive information of an entity. Entities in most knowledge graphs (KGs) still lack such descriptions, thus calling for automatic methods to supplement such information. However, existing generative methods either overlook the grammatical structure or make factual mistakes in generated texts. To solve these problems, we propose a head-modifier template-based method to ensure the readability and data fidelity of generated type descriptions. We also propose a new dataset and two automatic metrics for this task. Experiments show that our method improves substantially compared with baselines and achieves state-of-the-art performance on both datasets.Comment: ACL 201

arXiv.org e-Print Archive

Crossref

Text readability and intuitive simplification: A comparison of readability formulas

Author: Allen David B.
Crossley Scott A.
McNamara Danielle S.
Publication venue: Center for Language & Technology
Publication date: 01/04/2011
Field of study

Texts are routinely simplified for language learners with authors relying on a variety of approaches and materials to assist them in making the texts more comprehensible. Readability measures are one such tool that authors can use when evaluating text comprehensibility. This study compares the Coh-Metrix Second Language (L2) Reading Index, a readability formula based on psycholinguistic and cognitive models of reading, to traditional readability formulas on a large corpus of texts intuitively simplified for language learners. The goal of this study is to determine which formula best classifies text level (advanced, intermediate, beginner) with the prediction that text classification relates to the formulas’ capacity to measure text comprehensibility. The results demonstrate that the Coh-Metrix L2 Reading Index performs significantly better than traditional readability formulas, suggesting that the variables used in this index are more closely aligned to the intuitive text processing employed by authors when simplifying texts

ScholarSpace at University of Hawai'i at Manoa

Sentence Complexity Estimation for Chinese-speaking Learners of Japanese

Author: Liu Jun
Matsumoto Yuji
Publication venue: the National University (Philippines)
Publication date: 01/01/2017
Field of study

Waseda University Repository

Web Mediators for Accessible Browsing

Author: Waber Benjamin N.
Magee John J.
Betke Margrit
Publication venue: Boston University Computer Science Department
Publication date: 01/01/1860
Field of study

We present a highly accurate method for classifying web pages based on link percentage, which is the percentage of text characters that are parts of links normalized by the number of all text characters on a web page. K-means clustering is used to create unique thresholds to differentiate index pages and article pages on individual web sites. Index pages contain mostly links to articles and other indices, while article pages contain mostly text. We also present a novel link grouping algorithm using agglomerative hierarchical clustering that groups links in the same spatial neighborhood together while preserving link structure. Grouping allows users with severe disabilities to use a scan-based mechanism to tab through a web page and select items. In experiments, we saw up to a 40-fold reduction in the number of commands needed to click on a link with a scan-based interface, which shows that we can vastly improve the rate of communication for users with disabilities. We used web page classification and link grouping to alter web page display on an accessible web browser that we developed to make a usable browsing interface for users with disabilities. Our classification method consistently outperformed a baseline classifier even when using minimal data to generate article and index clusters, and achieved classification accuracy of 94.0% on web sites with well-formed or slightly malformed HTML, compared with 80.1% accuracy for the baseline classifier.National Science Foundation (IIS-0308213, IIS-039009, IIS-0093367, P200A01031, EIA-0202067

Boston University Institutional Repository (OpenBU)

Evaluation of Reading Support Tools by Reading Comprehension Tests and Reading Speed Tests

Author: 九津見毅
井佐原均
佐田いち子
吉見毅彦
小谷克則
Publication venue: IWLeL 2004 Program Committee
Publication date: 31/03/2005
Field of study

This paper introduces our reading process monitoring systems and also presents the experimental results that show the adequacy of our reading data. Our system divides a text into reading areas and records reading time for each area. We conducted two experiments using this tool to verify the adequacy of our reading process data. In the first experiment, we examined whether the reading process can distinguish easy text reading and difficult text reading, and confirmed the adequacy of our reading process data. In the second experiment, we tried to evaluate efficiency of reading support tools such as (i) chunker, (ii) glosser, and (iii) machine translation system, assuming that efficiency of these systems co-relates with text readability. The experimental results show that only the machine translation system effectively supports reading

Waseda University Repository