1,041 research outputs found

    Measuring the Propagation of Information in Partial Evaluation

    Get PDF
    We present the first measurement-based analysis of the information propagated by a partial evaluator. Our analysis is based on measuring implementations of string-matching algorithms, based on the observation that the sequence of character comparisons accurately reflects maintained information. Notably, we can easily prove matchers to be different and we show that they display more variety and finesse than previously believed. As a consequence, we are able to pinpoint differences and inaccuracies in many results previously considered equivalent. Our analysis includes a framework that lets us obtain string matchers - notably the family of Boyer-Moore algorithms - in a systematic formalism-independent way from a few information-propagation primitives. By leveraging the existing research in string matching, we show that the landscape of information propagation is non-trivial in the sense that small changes in information propagation may dramatically change the properties of the resulting string matchers. We thus expect that this work will prove useful as a test and feedback mechanism for information propagation in the development of advanced program transformations, such as GPC or Supercompilation

    PARALLEL PROCESSING OUTCOMES OF E-ABDULRAZZAQ ALGORITHM USING MULTI-CORE TECHNIQUE

    Get PDF
    The string matching problem is considered one of the substantial problems in the fields of computer science like speech and pattern recognition, signal and image processing, and artificial intelligence (AI). The increase in the speedup of performance is considered an important factor in meeting the growth rate of databases, Subsequently, one of the determinations to address this issue is the parallelization for exact string matching algorithms. In this study, the E-Abdulrazzaq string matching algorithm is chosen to be executed with the multi-core environment utilizing the OpenMP paradigm which can be utilized to decrease the execution time and increase the speedup of the algorithm. The parallelization algorithm got positive results within the parallel execution time, and excellent speeding-up capabilities, in comparison to the successive result. The Protein database showed optimal results in parallel execution time, and when utilizing short and long pattern lengths. The DNA database showed optimal speedup execution when utilizing short and long pattern lengths, while no specific database obtained the worst results

    Implementasi Algoritma Boyer Moore Pada Aplikasi Kamus Istilah Kebidanan Berbasis Web

    Get PDF
    The lack of understanding in obstetrics and limit of instructional media has become one of the factors in the making of dictionary application of midwifery. The current dictionary is still a thick book with many terms in it and difficult to use. dictionary midwifery terms have a weakness in the search process, because users should search for words and terms manually by opening pages per page on the dictionary and existing data could not be changed.Keywords: Algorithm, Boyer Moore, Midwifery Dictionary

    An Enhanced Boyer-Moore Algorithm for WorstCase Running Time

    Get PDF
    This article adderesses the exact string matching problem which consists in finding all occurrences of a given pattern in a text.It is an extensively studied problem in the field of computer science mainly due to despite its popularity in diverse area of application such as cluster computing, image and signal processing, speech analysis and recognition, information retrieval, data compression,computational biology,intrusion detection and virus scanning detection.In the last decade several new algorithm has been proposed.In this paper we compares all improved of the Boyer-Moore algorithm with my enhanced Boyer-Moore algorithm practically and theoretically result.It is not only generate the largest distance but also produces the minimum shifting and frequency of comparisons steps.By this enhanced algorithm we can reduce the number of comparisons frequency and number of shifting steps during the searching process.Moreover result of this enhanced Boyer-Moore algorithm reveals the efficiency is higher than of previous improved Boyer-Moore algorithms and time complexity is reduced in the concept of worst case analysis and lower than BM algorithm.Our enhanced algorithm 16% boost-up than previous improved Boyer-Moore algorithm when executed on the CPU.This enhanced Boyer-Moore algorithm can be plays an important role in finding extremely fast genetic moleculer and complex sequence pattern of interested database alignment of DNA

    A Survey of Software-based String Matching Algorithms for Forensic Analysis

    Get PDF
    Employing a fast string matching algorithm is essential for minimizing the overhead of extracting structured files from a raw disk image. In this paper, we summarize the concept, implementation, and main features of ten software-based string matching algorithms, and evaluate their applicability for forensic analysis. We provide comparisons between the selected software-based string matching algorithms from the perspective of forensic analysis by conducting their performance evaluation for file carving. According to the experimental results, the Shift-Or algorithm (R. Baeza-Yates & Gonnet, 1992) and the Karp-Rabin algorithm (Karp & Rabin, 1987) have the minimized search time for identifying the locations of specified headers and footers in the target disk. Keywords: string matching algorithm, forensic analysis, file carving, Scalpel, data recover

    Detecting Indonesian ambiguous sentences using Boyer-Moore algorithm

    Get PDF
    Ambiguous sentences are divided into 3 types namely phonetic, lexical, and grammatical. This study focuses on grammatical ambiguous sentences, grammatical ambiguous sentences are ambiguities that occur due to incorrect grammar, but this ambiguity will disappear once it is used within a sentence.  Ambiguous sentences become a big problem when they are processed by a computer. In order for the computer to interpret ambiguous words correctly, this study seeks to develop detection of Indonesian ammbiguous sentences using Boyer Moore algorithm. This algorithm matches ambiguous sentences that are inserted as input with the data set. Then the sentence is being detected whether it contains ambiguous sentences, by calculating the percentage of similarity using cosine similarity method. Cosine similarity system is able to find out the meaning of the sentence. In the data set, the number of ambiguous sentences that can be collected is 50 words. The 50 words consist of ambiguous words data, ambiguous sentences, and ambiguous sentence meanings. This system trial was carried out for 200 times and the accuracy level was 0.935, precision was 0.9320, and Recall was 0.8. While the F-Measure was 0.8061. While the speed for word search 0.003275 seconds

    Similarity Detection for Hadith of Fiqh of Women using Cosine Similarity and Boyer Moore Method

    Get PDF
    Nowadays, people can get information easily including about fiqh and hadith as a source of Islamic law. The problem is, there are so many articles about jurisprudence whose understanding refers to the laws or rules relating to the hadith whose validity cannot be ascertained. The study aims to determine the degree of similarity between the hadith contained in articles with reliable sources such as books and books. One of the outputs of this study is an application that can determine the similarity of hadith using Cosine Similarity and Boyer Moore by matching strings starting from the right position to the leftmost position and using the cosine similarity method to determine the similarity based on the calculation of the distance between vectors A and B that produce angles cosine x between the two vectors. In the testing phase, the proposed model can run as planned. In one test scenario, the number of keywords tested was 9 cases compared to the categories in the database with an accuracy of 80%. And determine the similarity of two or more objects Using the cosine similarity method with weights The percentage of similarity is proportional to the sample of words entered, which is equal to 36%

    Generate fuzzy string-matching to build self attention on Indonesian medical-chatbot

    Get PDF
    Chatbot is a form of interactive conversation that requires quick and precise answers. The process of identifying answers to users’ questions involves string matching and handling incorrect spelling. Therefore, a system that can independently predict and correct letters is highly necessary. The approach used to address this issue is to enhance the fuzzy string-matching method by incorporating several features for self-attention. The combination of fuzzy string-matching methods employed includes Jaro Winkler distance + Levenshtein Damerau distance and Damerau Levenshtein + Rabin Carp. The reason for using this combination is their ability not only to match strings but also to correct word typing errors. This research contributes by developing a self-attention mechanism through a modified fuzzy string-matching model with enhanced word feature structures. The goal is to utilize this self-attention mechanism in constructing the Indonesian medical bidirectional encoder representations from transformers (IM-BERT). This will serve as a foundation for additional features to provide accurate answers in the Indonesian medical question and answer system, achieving an exact match of 85.7% and an F1-score of 87.6%

    String Matching Problems with Parallel Approaches An Evaluation for the Most Recent Studies

    Get PDF
    In recent years string matching plays a functional role in many application like information retrieval, gene analysis, pattern recognition, linguistics, bioinformatics etc. For understanding the functional requirements of string matching algorithms, we surveyed the real time parallel string matching patterns to handle the current trends. Primarily, in this paper, we focus on present developments of parallel string matching, and the central ideas of the algorithms and their complexities. We present the performance of the different algorithms and their effectiveness. Finally this analysis helps the researchers to develop the better techniques

    Implementasi Algoritma Boyer Moore Pada Aplikasi Kamus Istilah Kebidanan Berbasis Web

    Get PDF
    The lack of understanding in obstetrics and limit of instructional media has become one of the factors in the making of dictionary application of midwifery. The current dictionary is still a thick book with many terms in it and difficult to use. dictionary midwifery terms have a weakness in the search process, because users should search for words and terms manually by opening pages per page on the dictionary and existing data could not be changed.Keywords: Algorithm, Boyer Moore, Midwifery Dictionary
    • …
    corecore