60 research outputs found

    Exploring The Value Of Folksonomies For Creating Semantic Metadata

    No full text
    Finding good keywords to describe resources is an on-going problem: typically we select such words manually from a thesaurus of terms, or they are created using automatic keyword extraction techniques. Folksonomies are an increasingly well populated source of unstructured tags describing web resources. This paper explores the value of the folksonomy tags as potential source of keyword metadata by examining the relationship between folksonomies, community produced annotations, and keywords extracted by machines. The experiment has been carried-out in two ways: subjectively, by asking two human indexers to evaluate the quality of the generated keywords from both systems; and automatically, by measuring the percentage of overlap between the folksonomy set and machine generated keywords set. The results of this experiment show that the folksonomy tags agree more closely with the human generated keywords than those automatically generated. The results also showed that the trained indexers preferred the semantics of folksonomy tags compared to keywords extracted automatically. These results can be considered as evidence for the strong relationship of folksonomies to the human indexer’s mindset, demonstrating that folksonomies used in the del.icio.us bookmarking service are a potential source for generating semantic metadata to annotate web resources

    Creating structure from disorder: using folksonomies to create semantic metadata

    No full text
    This paper reports on an on-going research project to create educational semantic metadata out of folksonomies. The paper describes a simple scenario for the usage of the generated semantic metadata in teaching, and describes the ‘FolksAnnotation’ tool which applies an organization scheme to tags in a specific domain of interest. The contribution of this paper is to describe an evaluation framework which will allow us to validate our claim that folksonomies are potentially a rich source of metadata

    Handwritten Arabic Character Recognition for Children Writ-ing Using Convolutional Neural Network and Stroke Identification

    Full text link
    Automatic Arabic handwritten recognition is one of the recently studied problems in the field of Machine Learning. Unlike Latin languages, Arabic is a Semitic language that forms a harder challenge, especially with variability of patterns caused by factors such as writer age. Most of the studies focused on adults, with only one recent study on children. Moreover, much of the recent Machine Learning methods focused on using Convolutional Neural Networks, a powerful class of neural networks that can extract complex features from images. In this paper we propose a convolutional neural network (CNN) model that recognizes children handwriting with an accuracy of 91% on the Hijja dataset, a recent dataset built by collecting images of the Arabic characters written by children, and 97% on Arabic Handwritten Character Dataset. The results showed a good improvement over the proposed model from the Hijja dataset authors, yet it reveals a bigger challenge to solve for children Arabic handwritten character recognition. Moreover, we proposed a new approach using multi models instead of single model based on the number of strokes in a character, and merged Hijja with AHCD which reached an averaged prediction accuracy of 96%.Comment: 1

    Meta-Evaluation of Sentence Simplification Metrics

    Get PDF
    Automatic Text Simplification (ATS) is one of the major Natural Language Processing (NLP) tasks, which aims to help people understand text that is above their reading abilities and comprehension. ATS models reconstruct the text into a simpler format by deletion, substitution, addition or splitting, while preserving the original meaning and maintaining correct grammar. Simplified sentences are usually evaluated by human experts based on three main factors: simplicity, adequacy and fluency or by calculating automatic evaluation metrics. In this paper, we conduct a meta-evaluation of reference-based automatic metrics for English sentence simplification using high-quality, human-annotated dataset, NEWSELA-LIKERT. We study the behavior of several evaluation metrics at sentence level across four different sentence simplification models. All the models were trained on the NEWSELA-AUTO dataset. The correlation between the metrics’ scores and human judgements was analyzed and the results used to recommend the most appropriate metrics for this task

    AraCore: An Arabic learning object metadata for indexing learning resources

    No full text
    In the era of information systems globalization the need to have educational metadata to index and describe digital learning resources for easy searching, retrieving and reusing them quickly and efficiently is becoming an essential research topic in learning technologies discipline. In this paper we will present a brief overview of metadata standards, protocols and application profiles. Then we will discuss the issues related to the need for an Arabic learning object metadata. Also we will urge for the formation of a community of practitioners to identify guidelines for metadata implementers, creators and users in the use of metadata in e-learning content among Arabs. Finally, we propose a sample metadata application profile called AraCore which will be based on the IEEE 1484.12.1-2002 standard. This will be our first attempt to help the Arab community to think about creating an Arabic learning object metadata application profile to be used in assessing the exchange of Arabic learning objects

    FAsTA: A Folksonomy-Based Automatic Metadata Generator

    No full text
    Folksonomies provide a free source of keywords describing web resources, however, these keywords are free form and unstructured. In this paper, we describe a novel tool that converts folksonomy tags into semantic metadata, and present a case study consisting of a framework for evaluating the usefulness of this metadata within the context of a particular eLearning application. The evaluation shows the number of ways in which the generated semantic metadata adds value to the raw folksonomy tags

    Towards Better Understanding of Folksonomic Patterns

    No full text
    Folksonomies provide a free source of keywords describing web resources; however, these keywords are free form and their semantics spans multiple contextual dimension. In this paper, we present a pragmatic experiment that analyzes folksonomy tags using three classification categories: Personal, Factual and Subjective, in order to gain more understanding of the types of tags used in the social tagging process. The rational for this work was to measure the potential portion of folksonomy tags that might be helpful when considering the creation of structured metadata

    Automatic document-level semantic metadata annotation using folksonomies and domain ontologies

    Get PDF
    The last few years have witnessed a fast growth of the concept of Social Software. Be it video sharing such as YouTube, photo sharing such as Flickr, community building such as MySpace, or social bookmarking such as del.icio.us. These websites contain valuable user-generated metadata called folksonomies. Folksonomies are ad hoc, light-weight knowledge representation artefacts to describe web resources using people’s own vocabulary. The cheap metadata contained in such websites presents potential opportunities for us (researchers) to benefit from. This thesis presents a novel tool that uses folksonomies to automatically generate metadata with educational semantics in an attempt to provide semantic annotations to bookmarked web resources, and to help in making the vision of the Semantic Web a reality. The tool comprises two components: the tags normalisation process and the semantic annotation process. The tool uses the del.icio.us social bookmarking service as a source for folksonomy tags. The tool was applied to a case study consisting of a framework for evaluating the usefulness of the generated semantic metadata within the context of a particular eLearning application. This implementation of the tool was evaluated over three dimensions: the quality, the searchability and the representativeness of the generated semantic metadata. The results show that folksonomy tags were acceptable for creating semantic metadata. Moreover, folksonomy tags showed the power of aggregating people’s intelligence. The novel contribution of this work is the design of a tool that utilises folksonomy tags to automatically generate metadata with fine gained and extra educational semantics
    corecore