2,195 research outputs found
A discourse-based approach for Arabic question answering
The treatment of complex questions with explanatory answers involves searching for arguments in texts. Because of the prominent role that discourse relations play in reflecting text-producersâ intentions, capturing the underlying structure of text constitutes a good instructor in this issue. From our extensive review, a system for automatic discourse analysis that creates full rhetorical structures in large scale Arabic texts is currently unavailable. This is due to the high computational complexity involved in processing a large number of hypothesized relations associated with large texts. Therefore, more practical approaches should be investigated. This paper presents a new Arabic Text Parser oriented for question answering systems dealing with ÙÙ
ۧ۰ۧ âwhyâ and ÙÙÙ âhow toâ questions. The Text Parser presented here considers the sentence as the basic unit of text and incorporates a set of heuristics to avoid computational explosion. With this approach, the developed question answering system reached a significant improvement over the baseline with a Recall of 68% and MRR of 0.62
Aspects of sentence analysis in the Arabic linguistic tradition, with particular reference to ellipsis.
SIGLEAvailable from British Library Document Supply Centre-DSC:DXN034735 / BLDSC - British Library Document Supply CentreGBUnited Kingdo
Discourse Structure in Machine Translation Evaluation
In this article, we explore the potential of using sentence-level discourse
structure for machine translation evaluation. We first design discourse-aware
similarity measures, which use all-subtree kernels to compare discourse parse
trees in accordance with the Rhetorical Structure Theory (RST). Then, we show
that a simple linear combination with these measures can help improve various
existing machine translation evaluation metrics regarding correlation with
human judgments both at the segment- and at the system-level. This suggests
that discourse information is complementary to the information used by many of
the existing evaluation metrics, and thus it could be taken into account when
developing richer evaluation metrics, such as the WMT-14 winning combined
metric DiscoTKparty. We also provide a detailed analysis of the relevance of
various discourse elements and relations from the RST parse trees for machine
translation evaluation. In particular we show that: (i) all aspects of the RST
tree are relevant, (ii) nuclearity is more useful than relation type, and (iii)
the similarity of the translation RST tree to the reference tree is positively
correlated with translation quality.Comment: machine translation, machine translation evaluation, discourse
analysis. Computational Linguistics, 201
Splitting Arabic Texts into Elementary Discourse Units
International audienceIn this article, we propose the first work that investigates the feasibility of Arabic discourse segmentation into elementary discourse units within the segmented discourse representation theory framework. We first describe our annotation scheme that defines a set of principles to guide the segmentation process. Two corpora have been annotated according to this scheme: elementary school textbooks and newspaper documents extracted from the syntactically annotated Arabic Treebank. Then, we propose a multiclass supervised learning approach that predicts nested units. Our approach uses a combination of punctuation, morphological, lexical, and shallow syntactic features. We investigate how each feature contributes to the learning process. We show that an extensive morphological analysis is crucial to achieve good results in both corpora. In addition, we show that adding chunks does not boost the performance of our system
Unique Challenges Saudi EFL Learners Face
Learning English as a foreign language (EFL) is both a promising endeavor and a challenging undertaking. All language learners encounter unique challenges in the process of learning English, and Saudi EFL learners are no exception. This article identifies the unique and multifarious challenges Saudi EFL learners face, and explores the multidimensional causal factors in the progression of the challenges they face most commonly. The analysis first tackles the considerable challenge of accurate spelling, followed by a discussion of the challenges Saudi EFL learners encounter when learning to read and write in English. This discussion addresses challenges in sociolinguistic competence and English pronunciation arising from multivariate factors, and concludes by offering measures to help Saudi EFL learners overcome these characteristic challenges and promote their trajectory toward successful acquisition of EFL
- âŠ