47 research outputs found

    Validation of Tagging Suggestion Models for a Hotel Ticketing Corpus

    Get PDF
    This paper investigates methods for the prediction of tags on a textual corpus that describes hotel staff inputs in a ticketing system. The aim is to improve the tagging process and find the most suitable method for suggesting tags for a new text entry. The paper consists of two parts: (i) exploration of existing sample data, which includes statistical analysis and visualisation of the data to provide an overview, and (ii) evaluation of tag prediction approaches. We have included different approaches from different research fields in order to cover a broad spectrum of possible solutions. As a result, we have tested a machine learning model for multi-label classification (using gradient boosting), a statistical approach (using frequency heuristics), and two simple similarity-based classification approaches (Nearest Centroid and k-Nearest Neighbours). The experiment which compares the approaches uses recall to measure the quality of results. Finally, we provide a recommendation of the modelling approach which produces the best accuracy in terms of tag prediction on the sample data

    WatsaQ: Repository of Al Hadith in Bahasa (Case Study: Hadith Bukhari)

    Get PDF
    The Hadith is one of the two sources of Islamic law after the Qur'an. It is a fact that there are a number of false hadith, recognised by Muslim scholars since the end of the first century of Hijra, and even earlier. In addition to the breadth of false hadith circulating among the public at this  time,  it  is difficult to determine the source of authenticity and distinguish false  from genuine.  This  is  due  to  the  configuration of  the genuine documents which are revealed in Arabic. To that end, the  authors  have  built  a  repository  collection  of  hadith al- Bukhari in the Indonesian language. The hadith chosen have secured originality and standardisation has been applied that can assist users in learning the content of the hadith. The authors implemented a repository of translation in Bahasa of Bukhari Hadith using XML schema. To study the repository performance, we use a web presentation using PHP employing brute-force string match algorithms to display the search results based on keywords entered by the user. We analyse the results of our proposed repository implementation average searching time is faster by 0.85 milliseconds compared with the repository based on the unstructured one

    Efficient skyline processing algorithm over dynamic and incomplete database

    Get PDF
    The notion of skyline processing is to discover the data items that are not dominated by any other data items. It is a well-known technique that is utilised to determine the best results that meet the user’s preferences. However, the rapid growth and frequent changes of data make the process of identifying skyline points no longer a trivial task. Most of the existing skyline approaches assume that the database is complete and static. However, in real world scenario, this assumption is not valid especially in multidimensional databases in which some dimensions have missing values while they are dynamic due to the continual modifications made towards them. Blindly examining the whole database after changes are made to identify the skyline points is inappropriate as not all data items are affected by the changes. Hence, in this study we propose a skyline algorithm, DyIn-Skyline, which is capable of identifying skyline points over dynamic and incomplete databases, by exploiting only those data items that are affected by the changes. Several experiments have been conducted and the results show that our proposed algorithm outperforms the previous work by reducing the number of pairwise comparisons in the range of 50% to 73%

    A Hybrid Model Schema Matching Using Constraint-Based and Instance-Based

    Get PDF
    Schema matching is an important process in the Enterprise Information Integration (EII) which is at the level of the back end to solve the problems due to the schematic heterogeneity. This paper is a summary of preliminary result work of the model development stage as part of research on the development of models and prototype of hybrid schema matching that combines two methods, namely constraint-based and instance-based. The discussion includes a general description of the proposed models and the development of models, start from requirement analysis, data type conversion, matching mechanism, database support, constraints and instance extraction, matching and compute the similarity, preliminary result, user verification, verified result, dataset for testing, as well as the performance measurement. Based on result experiment on 36 datasets of heterogeneous RDBMS, it obtained the highest P value is 100.00% while the lowest is 71.43%; The highest R value is 100.00% while the lowest is 75.00%; and F-Measure highest value is 100.00% while the lowest is 81.48%. Unsuccessful matching on the model still happens, including use of an id attribute with data type as autoincrement; using codes that are defined in the same way but different meanings; and if encountered in common instance with the same definition but different meaning

    The Development Of Natural Potential Of Ponggok Village As Recreational Sports And Water Sports Tourism For Regional People In Klaten Regency

    Get PDF
    The main objective of this research is to explain the management of the natural potential of Ponggok Village Klaten, describe the development of recreational sports and water sports tourism in Ponggok Village Klaten, and explain the role of Ponggok Village to support tourism development in Klaten Regency. This research is conducted in Ponggok Village, Klaten Regency, Central Java Province, using qualitative research and phenomenology approach. The data collection techniques used in this study are observation, deep interviews, and documentation. The results of the research are summarized as follows: 1) The natural potential of Ponggok Village Klaten are in the agricultural sector and water sector, that is supported by springs or usually known as umbuls. In the agricultural sector, people mostly plant rice. Meanwhile, the springs are used for fishery and tourism. In the fishery sector, there are several freshwater fish farming ponds. The tourism sector in Ponggok Village utilizes several umbuls including Umbul Ponggok, Umbul Ponggok Ciblon, Umbul Kapilaler, Umbul Sigedang, and Umbul Besuki. Most of the umbuls in Ponggok Village are used for recreational sports, such as swimming. Umbul Ponggok is also used for swimming, diving, and snorkeling. 2) The development of recreational sports and water sports tourism in Umbul Ponggok that was only used for swimming by visitors around Ponggok Village, now is even more developing. There are some more facilities such as rental mask snorkels and buoys for snorkeling, scuba set for diving, and Ponggok Walker for walking under water. There is also Ponggok Warior as mini outbound on water for children. Besides, there is a slide for stimulating adrenaline which is directly connected to the umbul. In Umbul Ponggok Ciblon, there are three swimming pools and parks that can be used for outbound activities. People also often use ponds in Umbul Ponggok Ciblon to swim, usually in the afternoon. In Umbul Kapilaler that is surrounded by big trees, visitors usually come to swim and play with water. 3) Ponggok Village is already quite popular among the President, the Ministers, the House of Representative members etc. The delegation of 20 countries of The Asian Productivity Organization (APO) also visit Ponggok Village. The Minister of Rural Development Malaysia also come to Ponggok Village and made 10 Village Heads of Malaysia wanted to take a closer look at Ponggok Village. Ponggok village certainly brings the name of Klaten Regency well known domestically and internationally. Ponggok has the potential that might be here only. People can enjoy the beauty and take pictures under fresh water. The water is very clear; it is hard to find elsewhere. Ponggok Village has its own uniqueness as a tourist attraction

    Survey: Models and Prototypes of Schema Matching

    Get PDF
    Schema matching is critical problem within many applications to integration of data/information, to achieve interoperability, and other cases caused by schematic heterogeneity. Schema matching evolved from manual way on a specific domain, leading to a new models and methods that are semi-automatic and more general, so it is able to effectively direct the user within generate a mapping among elements of two the schema or ontologies better. This paper is a summary of literature review on models and prototypes on schema matching within the last 25 years to describe the progress of and research chalenge and opportunities on a new models, methods, and/or prototypes

    Digital Humanities Data Processing

    Get PDF
    The editorial of this first issue of volume 9, corresponding to 2016, is devoted to digital humanities data processing

    Technological Ecosystems

    Get PDF

    Sensor Technologies for Caring People with Disabilities

    Get PDF
    Today, the population uses technology for every daily activity involving business, education, communication, entertainment, etc. Technologymay also help us to take care of peoplewho suffer some kind of disability. Complex technological ecosystems with pervasive and intelligent capabilities get along with us, facilitating the vigilance of those who need special attention or assisted living cares due to their health limitations. The advances in sensor research have enriched the powerful of these ecosystems to achieve more sophisticated monitoring and alarm systems, also taking into account the balance between the level of assistance and the people’s privacy. The Special Issue on “Sensor Technologies for Caring People with Disabilities” aims to present recent developments on sensor technologies for caring people with disabilities, focusing on the different configurations that can be used and novel applications in the field
    corecore