123,015 research outputs found

    External inverse pattern matching

    Get PDF
    We consider {\sl external inverse pattern matching} problem. Given a text \t of length nn over an ordered alphabet Σ\Sigma, such that Σ=σ|\Sigma|=\sigma, and a number mnm\le n. The entire problem is to find a pattern \pe\in \Sigma^m which is not a subword of \t and which maximizes the sum of Hamming distances between \pe and all subwords of \t of length mm. We present optimal O(nlogσ)O(n\log\sigma)-time algorithm for the external inverse pattern matching problem which substantially improves the only known polynomial O(nmlogσ)O(nm\log\sigma)-time algorithm introduced by Amir, Apostolico and Lewenstein. Moreover we discuss a fast parallel implementation of our algorithm on the CREW PRAM model

    Prospects and limitations of full-text index structures in genome analysis

    Get PDF
    The combination of incessant advances in sequencing technology producing large amounts of data and innovative bioinformatics approaches, designed to cope with this data flood, has led to new interesting results in the life sciences. Given the magnitude of sequence data to be processed, many bioinformatics tools rely on efficient solutions to a variety of complex string problems. These solutions include fast heuristic algorithms and advanced data structures, generally referred to as index structures. Although the importance of index structures is generally known to the bioinformatics community, the design and potency of these data structures, as well as their properties and limitations, are less understood. Moreover, the last decade has seen a boom in the number of variant index structures featuring complex and diverse memory-time trade-offs. This article brings a comprehensive state-of-the-art overview of the most popular index structures and their recently developed variants. Their features, interrelationships, the trade-offs they impose, but also their practical limitations, are explained and compared

    Low Space External Memory Construction of the Succinct Permuted Longest Common Prefix Array

    Full text link
    The longest common prefix (LCP) array is a versatile auxiliary data structure in indexed string matching. It can be used to speed up searching using the suffix array (SA) and provides an implicit representation of the topology of an underlying suffix tree. The LCP array of a string of length nn can be represented as an array of length nn words, or, in the presence of the SA, as a bit vector of 2n2n bits plus asymptotically negligible support data structures. External memory construction algorithms for the LCP array have been proposed, but those proposed so far have a space requirement of O(n)O(n) words (i.e. O(nlogn)O(n \log n) bits) in external memory. This space requirement is in some practical cases prohibitively expensive. We present an external memory algorithm for constructing the 2n2n bit version of the LCP array which uses O(nlogσ)O(n \log \sigma) bits of additional space in external memory when given a (compressed) BWT with alphabet size σ\sigma and a sampled inverse suffix array at sampling rate O(logn)O(\log n). This is often a significant space gain in practice where σ\sigma is usually much smaller than nn or even constant. We also consider the case of computing succinct LCP arrays for circular strings

    A Chatbot Framework for Yioop

    Get PDF
    Over the past few years, messaging applications have become more popular than Social networking sites. Instead of using a specific application or website to access some service, chatbots are created on messaging platforms to allow users to interact with companies’ products and also give assistance as needed. In this project, we designed and implemented a chatbot Framework for Yioop. The goal of the Chatbot Framework for Yioop project is to provide a platform for developers in Yioop to build and deploy chatbot applications. A chatbot is a web service that can converse with users using artificial intelligence in messaging platforms. Chatbots feel more like a human and it changes the interaction between people and computers. The Chatbot Framework enables developers to create chatbots and allows users to connect with them in the user chosen Yioop discussion channel. A developer can incorporate language skills within a chatbot by creating a knowledge base so that the chatbot understands user messages and reacts to them like a human. A knowledge base is created by using a language understanding web interface in Yioop

    Phase retrieval via regularization in self-diffraction based spectral interferometry

    Full text link
    A novel variant of spectral phase interferometry for direct electric-field reconstruction (SPIDER) is introduced and experimentally demonstrated. Other than most previously demonstrated variants of SPIDER, our method is based on a third-order nonlinear optical effect, namely self-diffraction, rather than the second-order effect of sum-frequency generation. On one hand, self-diffraction (SD) substantially simplifies phase-matching capabilities for multi-octave spectra that cannot be hosted by second-order processes, given manufacturing limitations of crystal lengths in the few-micrometer range. On the other hand, however, SD SPIDER imposes an additional constraint as it effectively measures the spectral phase of a self-convolved spectrum rather than immediately measuring the fundamental phase. Reconstruction of the latter from the measured phase and the spectral amplitude of the fundamental turns out to be an ill-posed problem, which we address by a regularization approach. We discuss the numerical implementation in detail and apply it to measured data from a Ti:sapphire amplifier system. Our experimental demonstration used 40-fs pulses and a 500 μ\mum thick BaF2{}_2 crystal to show that the SD SPIDER signal is sufficiently strong to be separable from stray light. Extrapolating these measurements to the thinnest conceivable nonlinear media, we predict that bandwidths well above two optical octaves can be measured by a suitably adapted SD SPIDER apparatus, enabling the direct characterization of pulses down to single-femtosecond pulse durations. Such characteristics appear out of range for any currently established pulse measurement technique

    Evolutionary robustness of differentiation in genetic regulatory networks

    Get PDF
    We investigate the ability of artificial Genetic Regulatory Networks (GRNs) to evolve differentiation. The proposed GRN model supports non-linear interaction between regulating factors, thereby facilitating the realization of complex regulatory logics. As a proof of concept we evolve GRNs of this kind to follow different pathways, producing two kinds of periodic dynamics in response to minimal differences in external input. Furthermore we find that successive increases in environmental pressure for differentiation, allowing a lineage to adapt gradually, compared to an immediate requirement for a switch between behaviors, yields better results on average. Apart from better success there is also less variability in performance, the latter indicating an increase in evolutionary robustness
    corecore