Linguistically informed and corpus informed morphological analysis of Arabic
Standard English PoS-taggers generally involve tag-assignment (via dictionary lookup, etc.) followed by tag-disambiguation (via a context model, e.g. PoS-ngrams or Brill transformations). We want to PoS-tag our Arabic Corpus, but evaluation of existing PoS-taggers has highlighted shortcomings; in particular, about a quarter of all word tokens are not assigned a fully correct morphological analysis. Tag-assignment is significantly more complex for Arabic. An Arabic lemmatiser program can extract the stem or root, but this is not enough for full PoS-tagging; words should be decomposed into five parts: proclitics, prefixes, stem or root, suffixes and postclitics. The morphological analyser should then add the appropriate linguistic information to each of these parts of the word; in effect, instead of a tag for a word, we need a subtag for each part (and possibly multiple subtags if there are multiple proclitics, prefixes, suffixes and postclitics). Many challenges face the implementation of Arabic morphology: the rich "root-and-pattern" nonconcatenative (or nonlinear) morphology and the highly complex word formation process of roots and patterns, especially if one or two long vowels are part of the root letters. Moreover, Arabic raises orthographic issues such as short vowels ( َ ُ ِ ), Hamzah (ء أ إ ؤ ئ), Taa' Marboutah ( ة ) and Haa' ( ه ), Yaa' ( ي ) and Alif Maksorah ( ى ), Shaddah ( ّ ) or gemination, and Maddah ( آ ) or extension, which is a compound letter of Hamzah and Alif ( أا ). Our morphological analyzer uses linguistic knowledge of the language as well as corpora to verify the linguistic information. To understand the problem, we started by analyzing fifteen established Arabic language dictionaries, to build a broad-coverage lexicon which contains not only roots and single words but also multi-word expressions, idioms, collocations requiring special part-of-speech assignment, and words with special part-of-speech tags.
The next stage of research was a detailed analysis and classification of Arabic language roots to address the "tail" of hard cases for existing morphological analyzers, and analysis of the roots, word-root combinations and the coverage of each root category of the Qur'an and the word-root information stored in our lexicon. From authoritative Arabic grammar books, we extracted and generated comprehensive lists of affixes, clitics and patterns. These lists were then cross-checked by analyzing words of three corpora: the Qur'an, the Corpus of Contemporary Arabic and Penn Arabic Treebank (as well as our Lexicon, considered as a fourth cross-check corpus). We also developed a novel algorithm that generates the correct pattern of the words, which deals with the orthographic issues of the Arabic language and other word derivation issues, such as the elimination or substitution of root letters.
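The five-part decomposition described in this abstract (proclitics, prefixes, stem or root, suffixes, postclitics, each with its own subtag) can be illustrated with a minimal sketch. This is not the authors' analyser or lexicon: the tiny inventories, the tag labels, the `Analysis` class and the `analyse` function below are all hypothetical, and the search is a simple brute-force enumeration that ignores the orthographic and root-and-pattern issues the abstract raises.

```python
from dataclasses import dataclass

# Toy inventories (hypothetical and far from complete): surface form -> subtag.
PROCLITICS = {"و": "conjunction", "س": "future particle"}
PREFIXES = {"ي": "imperfect, 3rd person"}
SUFFIXES = {"ون": "masculine plural"}
POSTCLITICS = {"ها": "object pronoun, 3rd fem. sing."}
STEMS = {"كتب": "verb stem (write)"}


@dataclass
class Analysis:
    """One candidate analysis: a (form, subtag) pair for every word part."""
    proclitics: list
    prefixes: list
    stem: tuple
    suffixes: list
    postclitics: list


def strip_front(word, inventory, max_items):
    """Yield (items, rest) for each way to peel 0..max_items forms off the front."""
    yield [], word
    if max_items == 0:
        return
    for form, tag in inventory.items():
        if word.startswith(form):
            for items, rest in strip_front(word[len(form):], inventory, max_items - 1):
                yield [(form, tag)] + items, rest


def strip_back(word, inventory, max_items):
    """Yield (items, rest) for each way to peel 0..max_items forms off the end."""
    yield [], word
    if max_items == 0:
        return
    for form, tag in inventory.items():
        if word.endswith(form):
            for items, rest in strip_back(word[:-len(form)], inventory, max_items - 1):
                yield items + [(form, tag)], rest


def analyse(word):
    """Yield every segmentation whose remaining core is a known stem."""
    for procs, rest1 in strip_front(word, PROCLITICS, 2):
        for prefs, rest2 in strip_front(rest1, PREFIXES, 1):
            for posts, rest3 in strip_back(rest2, POSTCLITICS, 1):
                for sufs, core in strip_back(rest3, SUFFIXES, 1):
                    if core in STEMS:
                        yield Analysis(procs, prefs, (core, STEMS[core]), sufs, posts)


# "and they will write it": conjunction + future + prefix + stem + suffix + postclitic
word = "و" + "س" + "ي" + "كتب" + "ون" + "ها"
for analysis in analyse(word):
    print(analysis)
```

Enumerating all candidate splits, rather than stripping greedily, keeps ambiguous words ambiguous, so a later disambiguation step can choose among the analyses. A realistic system would replace the toy dictionaries with the affix, clitic and pattern lists the abstract mentions.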
Segmenting Publics
This research synthesis was commissioned by the National Co-ordinating Centre for Public Engagement (NCCPE) and the Economic and Social Research Council (ESRC) to examine audience segmentation methods and tools in the area of public engagement. It provides resources for assessing the ways in which segmentation tools might be used to enhance the various activities through which models of public engagement in higher education are implemented. Understanding the opinions, values, and motivations of members of the public is a crucial feature of successful engagement. Segmentation methods can offer potential resources to help understand the complex set of interests and attitudes that the public have towards higher education.
Key findings:
A number of existing segmentations address many of the areas of activity found in universities and HEIs. These include segmentations which inform strategic planning of communications; segmentations which inform the design of collaborative engagement activities by museums, galleries, and libraries; and segmentations that are used to identify under-represented users and consumers.
Segmentation is, on its own, only a tool, used in different ways in different contexts. The broader strategic rationale shaping the application and design of segmentation methods is a crucial factor in determining the utility of segmentation tools.
Four issues emerged of particular importance:
1. Segmentation exercises are costly and technically complex. Undertaking segmentations therefore requires significant commitment of financial and professional resources by HEIs; the appropriate interpretation, analysis, and application of segmentation exercises also require high levels of professional capacity and expertise.
2. Undertaking a segmentation exercise has implications for the internal organisational operations of HEIs, not only for how they engage with external publics and stakeholders.
3. Segmentation tools are adopted to inform interventions of various sorts, and superficially to differentiate and sometimes discriminate between how groups of people are addressed and engaged.
4. For HEIs, the ethical issues and reputational risks which have been identified in this Research Synthesis as endemic to the application of segmentation methods for public purposes are particularly relevant.
Individual emergence in contextual analysis
Located within the tradition of Hermeneutic Dialectics (HD), this paper offers an approach which can further an analysis of a fit between information and organizational systems. Drawing upon Information Systems Development projects, a relationship between theory and practice is aided through a multi-disciplinary approach to sense making activity. Using a contemporary version of contextual analysis to understand a way in which individuals construct, adapt and create meaning from their environment offers a route to improve a systems analysis process. This type of enquiry into contextual dependencies of knowledge creation can help direct a development of systems that have the intention to serve specific organizational actors and their needs. Combining methods outside of a traditional polar divide, sense making research undertaken within a systems thinking arena can enrich understanding by complementing qualitative and/or quantitative analysis with reflective depth. Drawing together interdisciplinary strands through a critical systems thinking approach offers new levels of professionalism for computer and management practitioners or researchers in the 21st century.
The role of motion analysis in elite soccer
The optimal physical preparation of elite soccer (association football) players has become an indispensable part of the professional game, especially due to the increased physical demands of match-play. The monitoring of players' work-rate profiles during competition is now feasible through computer-aided motion analysis. Traditional methods of motion analysis were extremely labour intensive and were largely restricted to university-based research projects. Recent technological developments have meant that sophisticated systems, capable of quickly recording and processing the data of all players' physical contributions throughout an entire match, are now being used in elite club environments. In recognition of the important role motion analysis now plays as a tool for measuring the physical performance of soccer players, this review critically appraises various motion analysis methods currently employed in elite soccer and explores research conducted using these methods. This review therefore aims to increase the awareness of both practitioners and researchers of the various motion analysis systems available, identify practical implications of the established body of knowledge, while highlighting areas that require further exploration.