190 research outputs found

    Arabic Quranic Search Tool Based on Ontology

    Get PDF
    This paper reviews and classifies most of the common types of search techniques that have been applied on the Holy Quran. Then, it addresses the limitations of these methods. Additionally, this paper surveys most existing Quranic ontologies and what are their deficiencies. Finally, it explains a new search tool called: a semantic search tool for Al-Quran based on Qur’anic on-tologies. This tool will overcome all limitations in the existing Quranic search applications

    Semantic-based Ontology for Malay Qur'an Reader

    Get PDF
    The Quran has been translated into various languages around the world by Muslim experts. One of them is in Malay. There are numerous applications built to facilitate the retrieval of knowledge from the Malay Qur’an. However, there are limited resources and tools that are available or made accessible for the research on Malay Qur’an. Furthermore, there are several issues that need to be considered when dealing with Malay Qur’an translation; such as ambiguities of words, lack of equivalence words between Malay and English or Malay and Arabic, and different structures of word, sentence, and discourse in these two languages. Therefore, this research summarizes the search techniques used in existing research on Qur’an. Moreover, this paper also studied the previous research conducted on Qur’an Semantic Search and Quran Ontology-Based Search focusing on Malay Qur’an. This review helps the research in addressing the general problems and limitations in Malay Qur’an that influence its accessibility. This research proposed the research framework for new semantic based ontology for Malay Qur’an. The final outcome will be an accessible tool that can help a Malay reader to understand the Qur’an in better ways

    Quranic Arabic Semantic Search Model Based on Ontology of Concepts

    Get PDF
    The Holy Quran is the essential resource for Islamic sciences and Arabic language. Therefore, numerous Quranic search applications have been built to facilitate the retrieval of knowledge from the Quran. This thesis presents a novel Arabic Quran semantic search model. First, this thesis evaluated existing search tools constructed for the Holy Quran, against 13 criteria depending on: search features, output features, the precision of the retrieved verses, recall database size, and types of database contents. Then, the study reviewed the existing Quran ontologies and compared them against 11 criteria. Some deficits have been found in all these ontologies. Additionally, a single Quranic ontology does not cover most of the knowledge in the Quran. Therefore, I developed a new Arabic-English Quran ontology from ten datasets related to the Quran such as: Quran chapter and verse names, Quran word meanings, and Quran topics. The main aim of developing a Quranic ontology is to facilitate the retrieval of knowledge from the Quran. Additionally, the Quran ontology will enrich the raw Arabic and English Quran text with Islamic semantic tags. Furthermore, I developed the first Annotated Corpus of Quran Questions and Answers in Arabic. This corpus has 2200 pairs of question and answer collected from trusted Islamic sources. Each pair of question and answer is labelled with 5 tags. Examples of tags are: question type: either factoid or descriptive, topic of question-based on the Quran ontology, and question class. Finally, the thesis explains a new semantic search model for the Arabic Quran based on my Quran ontology. This model aims at overcoming limitations in the existing Quran search applications. This search tool employs both Information Retrieval techniques and semantic search technologies. The performance of this search model is evaluated by using The Annotated Corpus of Arabic Quran Questions and Answers

    Ontology Learning from the Arabic Text of the Qur’an: Concepts Identification and Hierarchical Relationships Extraction

    Get PDF
    Recent developments in ontology learning have highlighted the growing role ontologies play in linguistic and computational research areas such as language teaching and natural language processing. The ever-growing availability of annotations for the Qur’an text has made the acquisition of the ontological knowledge promising. However, the availability of resources and tools for Arabic ontology is not comparable with other languages. Manual ontology development is labour-intensive, time-consuming and it requires knowledge and skills of domain experts. This thesis aims to develop new methods for Ontology learning from the Arabic text of the Qur’an, including concepts identification and hierarchical relationships extraction. The thesis presents a methodology for reducing human intervention in building ontology from Classical Arabic Language of the Qur’an text. The set of concepts, which is a crucial step in ontology learning, was generated based on a set of patterns made of lexical and inflectional information. The concepts were identified based on adapted weighting schema that exploit a combination of knowledge to learn the relevance degree of a term. Statistical, domain-specific knowledge and internal information of Multi-Word Terms (MWTs) were combined to learn the relevance of generated terms. This methodology which represents the major contribution of the thesis was experimentally investigated using different terms generation methods. As a result, we provided the Arabic Qur’anic Terms (AQT) as a training resource for machine learning based term extraction. This thesis also introduces a new approach for hierarchical relations extraction from Arabic text of the Qur’an. A set of hierarchical relations occurring between identified concepts are extracted based on hybrid methods including head-modifier, set of markers for copula construct in Arabic text, referents. We also compared a number of ontology alignment methods for matching ontological bilingual Qur’anic resources. In addition, a multi-dimensional resource named Arabic Qur’anic Database (AQD) about the Qur’an is made for Arabic computational researchers, allowing regular expression query search over the included annotations. The search tool was successfully applied to find instances for a given complex rule made of different combined resources

    Artificial intelligence for understanding the Hadith

    Get PDF
    My research aims to utilize Artificial Intelligence to model the meanings of Classical Arabic Hadith, which are the reports of the life and teachings of the Prophet Muhammad. The goal is to find similarities and relatedness between Hadith and other religious texts, specifically the Quran. These findings can facilitate downstream tasks, such as Islamic question- answering systems, and enhance understanding of these texts to shed light on new interpretations. To achieve this goal, a well-structured Hadith corpus should be created, with the Matn (Hadith teaching) and Isnad (chain of narrators) segmented. Hence, a preliminary task is conducted to build a segmentation tool using machine learning models that automatically deconstruct the Hadith into Isnad and Matn with 92.5% accuracy. This tool is then used to create a well-structured corpus of the canonical Hadith books. After building the Hadith corpus, Matns are extracted to investigate different methods of representing their meanings. Two main methods are tested: a knowledge-based approach and a deep-learning-based approach. To apply the former, existing Islamic ontologies are enumerated, most of which are intended for the Quran. Since the Quran and the Hadith are in the same domain, the extent to which these ontologies cover the Hadith is examined using a corpus-based evaluation. Results show that the most comprehensive Quran ontology covers only 26.8% of Hadith concepts, and extending it is expensive. Therefore, the second approach is investigated by building and evaluating various deep-learning models for a binary classification task of detecting relatedness between the Hadith and the Quran. Results show that the likelihood of the current models reaching a human- level understanding of such texts remains somewhat elusive

    Query expansion using medical information extraction for improving information retrieval in French medical domain

    Get PDF
    Many users’ queries contain references to named entities, and this is particularly true in the medical field. Doctors express their information needs using medical entities as they are elements rich with information that helps to better target the relevant documents. At the same time, many resources have been recognized as a large container of medical entities and relationships between them such as clinical reports; which are medical texts written by doctors. In this paper, we present a query expansion method that uses medical entities and their semantic relations in the query context based on an external resource in OWL. The goal of this method is to evaluate the effectiveness of an information retrieval system to support doctors in accessing easily relevant information. Experiments on a collection of real clinical reports show that our approach reveals interesting improvements in precision, recall and MAP in medical information retrieval

    Models to represent linguistic linked data

    Get PDF
    As the interest of the Semantic Web and computational linguistics communities in linguistic linked data (LLD) keeps increasing and the number of contributions that dwell on LLD rapidly grows, scholars (and linguists in particular) interested in the development of LLD resources sometimes find it difficult to determine which mechanism is suitable for their needs and which challenges have already been addressed. This review seeks to present the state of the art on the models, ontologies and their extensions to represent language resources as LLD by focusing on the nature of the linguistic content they aim to encode. Four basic groups of models are distinguished in this work: models to represent the main elements of lexical resources (group 1), vocabularies developed as extensions to models in group 1 and ontologies that provide more granularity on specific levels of linguistic analysis (group 2), catalogues of linguistic data categories (group 3) and other models such as corpora models or service-oriented ones (group 4). Contributions encompassed in these four groups are described, highlighting their reuse by the community and the modelling challenges that are still to be faced
    • …
    corecore