206 research outputs found
The order of Cardinal, ordinal and multiplicative Adjectives in English and Arabic: A comparative study
It is noticed that Arabic speakers of English and English speaker of Arabic encounter some difficulty
in the order of the adjectives especially when there are numerals. This study aims to compare and
contrast the construction, behaviour and order of the different types of numeral adjectives among the
other adjectives in the languages in question. In order to achieve the objectives of the study and find
out answers to the questions under investigation, the study follows the prescriptive method. It
examines analytically the similarities and differences in order to find out what is linguistic specific for
Arabic and for English with regard to the order of numerals and other adjectives. It is hoped that the
study will contribute in enriching the morpho-syntactic field and computational linguistics.
Definitions, structures and current examples including numerals and other adjectives in English and
in Arabic are presented and analysed structurally. The data are compared and contrasted in order to
find out reasons behind the problems and difficulties that face Arabic and English speakers in the
order of numeral and other adjectives. Results and recommendations will set the remedies needed
for these major challenges towards the perfection of the linguistic knowledge.PSAU-2022/02/19983 Deanship of Scientific Research at Prince Sattam Bin Abdulaziz Universit
Abstract syntax as interlingua: Scaling up the grammatical framework from controlled languages to robust pipelines
Syntax is an interlingual representation used in compilers. Grammatical Framework (GF) applies the abstract syntax idea to natural languages. The development of GF started in 1998, first as a tool for controlled language implementations, where it has gained an established position in both academic and commercial projects. GF provides grammar resources for over 40 languages, enabling accurate generation and translation, as well as grammar engineering tools and components for mobile and Web applications. On the research side, the focus in the last ten years has been on scaling up GF to wide-coverage language processing. The concept of abstract syntax offers a unified view on many other approaches: Universal Dependencies, WordNets, FrameNets, Construction Grammars, and Abstract Meaning Representations. This makes it possible for GF to utilize data from the other approaches and to build robust pipelines. In return, GF can contribute to data-driven approaches by methods to transfer resources from one language to others, to augment data by rule-based generation, to check the consistency of hand-annotated corpora, and to pipe analyses into high-precision semantic back ends. This article gives an overview of the use of abstract syntax as interlingua through both established and emerging NLP applications involving GF
Treebank-Based Deep Grammar Acquisition for French Probabilistic Parsing Resources
Motivated by the expense in time and other resources to produce hand-crafted grammars, there has been increased interest in wide-coverage grammars automatically obtained from treebanks. In particular, recent years have seen a move
towards acquiring deep (LFG, HPSG and CCG) resources that can represent information absent from simple CFG-type structured treebanks and which are considered to produce more language-neutral linguistic representations, such
as syntactic dependency trees. As is often the case in early pioneering work in natural language processing, English has been the focus of attention in the first efforts towards acquiring treebank-based deep-grammar resources, followed by treatments of, for example, German, Japanese, Chinese and Spanish. However, to date no comparable large-scale automatically acquired deep-grammar resources have been obtained for French. The goal of the research presented in this thesis is to develop, implement, and evaluate treebank-based deep-grammar acquisition techniques for French. Along the way towards achieving this goal, this thesis presents the derivation of a new treebank for French from the Paris 7 Treebank, the Modified French Treebank, a cleaner, more coherent treebank with several transformed structures and new linguistic analyses. Statistical parsers trained on this data outperform those trained on the original Paris 7 Treebank, which has five times the amount of data. The Modified French Treebank is the data source used for the development of treebank-based automatic deep-grammar acquisition for LFG parsing resources
for French, based on an f-structure annotation algorithm for this treebank. LFG CFG-based parsing architectures are then extended and tested, achieving a competitive best f-score of 86.73% for all features. The CFG-based parsing architectures are then complemented with an alternative dependency-based statistical parsing approach, obviating the CFG-based parsing step, and instead directly
parsing strings into f-structures
vocoids and their prosodic distribution, with special reference to Italian and Arabic
This study attempts to characterize vocoids, i.e. vowels and semivowels, as a unified class of segments. In order to do so, it investigates the main phenomena concerning the quantitative distribution of these sounds, namely syllabic alternation, length alternations, deletion and insertion. Such phenomena are best analyzed by making reference to prosodic structure, and syllable structure in particular. Therefore, both frameworks adopted in this thesis take into consideration this type of representation. The main approach, which I refer to generally as Derivational Theory (DT), is based on the notion that surface phonetic forms are derived from underlying forms through a series of structural changes taking place at different levels of representation. This model is contrasted with the recently introduced (Prince and Smolensky 1993) Optimality Theory (OT), an output-oriented paradigm based on the parallel evaluation of candidate forms by means of universal but violable constraints. This thesis shows that OT offers some valuable insights into the phenomena under analysis, although there are areas in which it requires integration with derivational tools. This study also makes specific reference to two languages: Ammani Arabic and Standard Italian. These diverge in their treatment of vocoids, but clear general trends may be detected which have also been found in other languages
- …