Search CORE

60 research outputs found

Exploring The Value Of Folksonomies For Creating Semantic Metadata

Author: Al-Khalifa Hend S.
Davis Hugh C.
Publication venue
Publication date: 01/01/2007
Field of study

Finding good keywords to describe resources is an on-going problem: typically we select such words manually from a thesaurus of terms, or they are created using automatic keyword extraction techniques. Folksonomies are an increasingly well populated source of unstructured tags describing web resources. This paper explores the value of the folksonomy tags as potential source of keyword metadata by examining the relationship between folksonomies, community produced annotations, and keywords extracted by machines. The experiment has been carried-out in two ways: subjectively, by asking two human indexers to evaluate the quality of the generated keywords from both systems; and automatically, by measuring the percentage of overlap between the folksonomy set and machine generated keywords set. The results of this experiment show that the folksonomy tags agree more closely with the human generated keywords than those automatically generated. The results also showed that the trained indexers preferred the semantics of folksonomy tags compared to keywords extracted automatically. These results can be considered as evidence for the strong relationship of folksonomies to the human indexer’s mindset, demonstrating that folksonomies used in the del.icio.us bookmarking service are a potential source for generating semantic metadata to annotate web resources

CiteSeerX

Southampton (e-Prints Soton)

Creating structure from disorder: using folksonomies to create semantic metadata

Author: Al-Khalifa Hend S.
Davis Hugh C.
Gilbert Lester
Publication venue
Publication date: 01/01/2007
Field of study

This paper reports on an on-going research project to create educational semantic metadata out of folksonomies. The paper describes a simple scenario for the usage of the generated semantic metadata in teaching, and describes the ‘FolksAnnotation’ tool which applies an organization scheme to tags in a specific domain of interest. The contribution of this paper is to describe an evaluation framework which will allow us to validate our claim that folksonomies are potentially a rich source of metadata

Southampton (e-Prints Soton)

Handwritten Arabic Character Recognition for Children Writ-ing Using Convolutional Neural Network and Stroke Identification

Author: Al-Khalifa Hend
Al-Matham Rawan
Alheraki Mais
Publication venue
Publication date: 03/11/2022
Field of study

Automatic Arabic handwritten recognition is one of the recently studied problems in the field of Machine Learning. Unlike Latin languages, Arabic is a Semitic language that forms a harder challenge, especially with variability of patterns caused by factors such as writer age. Most of the studies focused on adults, with only one recent study on children. Moreover, much of the recent Machine Learning methods focused on using Convolutional Neural Networks, a powerful class of neural networks that can extract complex features from images. In this paper we propose a convolutional neural network (CNN) model that recognizes children handwriting with an accuracy of 91% on the Hijja dataset, a recent dataset built by collecting images of the Arabic characters written by children, and 97% on Arabic Handwritten Character Dataset. The results showed a good improvement over the proposed model from the Hijja dataset authors, yet it reveals a bigger challenge to solve for children Arabic handwritten character recognition. Moreover, we proposed a new approach using multi models instead of single model based on the number of strokes in a character, and merged Hijja with AHCD which reached an averaged prediction accuracy of 96%.Comment: 1

arXiv.org e-Print Archive

Directory of Open Access Journals

Overview of OSACT4 Arabic Offensive Language Detection Shared Task

Author: Al-Khalifa Hend
Darwish Kareem
Elsayed Tamer
Magdy Walid
Mubarak Hamdy
Publication venue
Publication date: 12/05/2020
Field of study

Edinburgh Research Explorer

Meta-Evaluation of Sentence Simplification Metrics

Author: Al-Khalifa Hend
Alfear Noof
Kazakov Dimitar Lubomirov
Publication venue
Publication date: 25/05/2024
Field of study

Automatic Text Simplification (ATS) is one of the major Natural Language Processing (NLP) tasks, which aims to help people understand text that is above their reading abilities and comprehension. ATS models reconstruct the text into a simpler format by deletion, substitution, addition or splitting, while preserving the original meaning and maintaining correct grammar. Simplified sentences are usually evaluated by human experts based on three main factors: simplicity, adequacy and fluency or by calculating automatic evaluation metrics. In this paper, we conduct a meta-evaluation of reference-based automatic metrics for English sentence simplification using high-quality, human-annotated dataset, NEWSELA-LIKERT. We study the behavior of several evaluation metrics at sentence level across four different sentence simplification models. All the models were trained on the NEWSELA-AUTO dataset. The correlation between the metrics’ scores and human judgements was analyzed and the results used to recommend the most appropriate metrics for this task

White Rose Research Online

AraCore: An Arabic learning object metadata for indexing learning resources

Author: Al-Khalifa Hend
Davis Hugh
Publication venue
Publication date: 01/01/2005
Field of study

In the era of information systems globalization the need to have educational metadata to index and describe digital learning resources for easy searching, retrieving and reusing them quickly and efficiently is becoming an essential research topic in learning technologies discipline. In this paper we will present a brief overview of metadata standards, protocols and application profiles. Then we will discuss the issues related to the need for an Arabic learning object metadata. Also we will urge for the formation of a community of practitioners to identify guidelines for metadata implementers, creators and users in the use of metadata in e-learning content among Arabs. Finally, we propose a sample metadata application profile called AraCore which will be based on the IEEE 1484.12.1-2002 standard. This will be our first attempt to help the Arab community to think about creating an Arabic learning object metadata application profile to be used in assessing the exchange of Arabic learning objects

CiteSeerX

Southampton (e-Prints Soton)

FAsTA: A Folksonomy-Based Automatic Metadata Generator

Author: Al-Khalifa Hend S.
Davis Hugh C.
Publication venue
Publication date: 01/01/2007
Field of study

Folksonomies provide a free source of keywords describing web resources, however, these keywords are free form and unstructured. In this paper, we describe a novel tool that converts folksonomy tags into semantic metadata, and present a case study consisting of a framework for evaluating the usefulness of this metadata within the context of a particular eLearning application. The evaluation shows the number of ways in which the generated semantic metadata adds value to the raw folksonomy tags

CiteSeerX

Southampton (e-Prints Soton)

Towards Better Understanding of Folksonomic Patterns

Author: Al-Khalifa Hend S.
Davis Hugh C.
Publication venue
Publication date: 01/01/2007
Field of study

Folksonomies provide a free source of keywords describing web resources; however, these keywords are free form and their semantics spans multiple contextual dimension. In this paper, we present a pragmatic experiment that analyzes folksonomy tags using three classification categories: Personal, Factual and Subjective, in order to gain more understanding of the types of tags used in the social tagging process. The rational for this work was to measure the potential portion of folksonomy tags that might be helpful when considering the creation of structured metadata

CiteSeerX

Southampton (e-Prints Soton)

Crossref

Automatic document-level semantic metadata annotation using folksonomies and domain ontologies

Author: Al-Khalifa Hend S.
Publication venue
Publication date: 01/01/2007
Field of study

The last few years have witnessed a fast growth of the concept of Social Software. Be it video sharing such as YouTube, photo sharing such as Flickr, community building such as MySpace, or social bookmarking such as del.icio.us. These websites contain valuable user-generated metadata called folksonomies. Folksonomies are ad hoc, light-weight knowledge representation artefacts to describe web resources using people’s own vocabulary. The cheap metadata contained in such websites presents potential opportunities for us (researchers) to benefit from. This thesis presents a novel tool that uses folksonomies to automatically generate metadata with educational semantics in an attempt to provide semantic annotations to bookmarked web resources, and to help in making the vision of the Semantic Web a reality. The tool comprises two components: the tags normalisation process and the semantic annotation process. The tool uses the del.icio.us social bookmarking service as a source for folksonomy tags. The tool was applied to a case study consisting of a framework for evaluating the usefulness of the generated semantic metadata within the context of a particular eLearning application. This implementation of the tool was evaluated over three dimensions: the quality, the searchability and the representativeness of the generated semantic metadata. The results show that folksonomy tags were acceptable for creating semantic metadata. Moreover, folksonomy tags showed the power of aggregating people’s intelligence. The novel contribution of this work is the design of a tool that utilises folksonomy tags to automatically generate metadata with fine gained and extra educational semantics

Southampton (e-Prints Soton)

OpenGrey Repository