604 research outputs found

    Fast Algorithm for Partial Covers in Words

    Get PDF
    A factor uu of a word ww is a cover of ww if every position in ww lies within some occurrence of uu in ww. A word ww covered by uu thus generalizes the idea of a repetition, that is, a word composed of exact concatenations of uu. In this article we introduce a new notion of Ξ±\alpha-partial cover, which can be viewed as a relaxed variant of cover, that is, a factor covering at least Ξ±\alpha positions in ww. We develop a data structure of O(n)O(n) size (where n=∣w∣n=|w|) that can be constructed in O(nlog⁑n)O(n\log n) time which we apply to compute all shortest Ξ±\alpha-partial covers for a given Ξ±\alpha. We also employ it for an O(nlog⁑n)O(n\log n)-time algorithm computing a shortest Ξ±\alpha-partial cover for each Ξ±=1,2,…,n\alpha=1,2,\ldots,n

    On Quasiperiodic Morphisms

    Full text link
    Weakly and strongly quasiperiodic morphisms are tools introduced to study quasiperiodic words. Formally they map respectively at least one or any non-quasiperiodic word to a quasiperiodic word. Considering them both on finite and infinite words, we get four families of morphisms between which we study relations. We provide algorithms to decide whether a morphism is strongly quasiperiodic on finite words or on infinite words.Comment: 12 page

    Dictionary Matching with One Gap

    Full text link
    The dictionary matching with gaps problem is to preprocess a dictionary DD of dd gapped patterns P1,…,PdP_1,\ldots,P_d over alphabet Ξ£\Sigma, where each gapped pattern PiP_i is a sequence of subpatterns separated by bounded sequences of don't cares. Then, given a query text TT of length nn over alphabet Ξ£\Sigma, the goal is to output all locations in TT in which a pattern Pi∈DP_i\in D, 1≀i≀d1\leq i\leq d, ends. There is a renewed current interest in the gapped matching problem stemming from cyber security. In this paper we solve the problem where all patterns in the dictionary have one gap with at least Ξ±\alpha and at most Ξ²\beta don't cares, where Ξ±\alpha and Ξ²\beta are given parameters. Specifically, we show that the dictionary matching with a single gap problem can be solved in either O(dlog⁑d+∣D∣)O(d\log d + |D|) time and O(dlog⁑Ρd+∣D∣)O(d\log^{\varepsilon} d + |D|) space, and query time O(n(Ξ²βˆ’Ξ±)log⁑log⁑dlog⁑2min⁑{d,log⁑∣D∣}+occ)O(n(\beta -\alpha )\log\log d \log ^2 \min \{ d, \log |D| \} + occ), where occocc is the number of patterns found, or preprocessing time and space: O(d2+∣D∣)O(d^2 + |D|), and query time O(n(Ξ²βˆ’Ξ±)+occ)O(n(\beta -\alpha ) + occ), where occocc is the number of patterns found. As far as we know, this is the best solution for this setting of the problem, where many overlaps may exist in the dictionary.Comment: A preliminary version was published at CPM 201

    Activity of peroxisomal enzymes, and levels of polyamines in LPA-transgenic mice on two different diets

    Get PDF
    BACKGROUND: In man, elevated levels of plasma lipoprotein (a)(Lp(a)) is a cardiovascular risk factor, and oxidized phospholipids are believed to play a role as modulators of inflammatory processes such as atherosclerosis. Polyamines are potent antioxidants and anti-inflammatory agents. It was therefore of interest to examine polyamines and their metabolism in LPA transgenic mice. Concentration of the polyamines putrescine, spermidine and spermine as well as the activity of peroxisomal polyamine oxidase and two other peroxisomal enzymes, acyl-CoA oxidase and catalase were measured. The mice were fed either a standard diet or a diet high in fat and cholesterol (HFHC). Some of the mice in each feeding group were in addition given aminoguanidine (AG), a specific inhibitor of diamine oxidase, which catalyses degradation of putrescine, and also inhibits non-enzymatic glycosylation of protein which is implicated in the aetiology of atherosclerosis in diabetic patients. Non-transgenic mice were used as controls. RESULTS: Intestinal peroxisomal polyamine oxidase activity was significantly higher in LPA transgenic mice than in the non-transgenic mice, while intestinal peroxisomal catalase activity was significantly lower. Hepatic Ξ²-oxidation increased in Lp(a) transgenic mice fed the HFHC diet, but not in those on standard diet. Hepatic spermidine concentration was increased in all mice fed the HFHC diet compared to those fed a standard diet, while spermine concentration was decreased. With exception of the group fed only standard diet, transgenic mice showed a lower degree of hepatic steatosis than non-transgenic mice. AG had no significant effect on hepatic steatosis. CONCLUSION: The present results indicate a connection between peroxisomal enzyme activity and the presence of the human LPA gene in the murine genome. The effect may be a result of changes in oxidative processes in lipid metabolism rather than resulting from a direct effect of the LPA construct on the peroximal gene expression

    Computing Covers under Substring Consistent Equivalence Relations

    Full text link
    Covers are a kind of quasiperiodicity in strings. A string CC is a cover of another string TT if any position of TT is inside some occurrence of CC in TT. The shortest and longest cover arrays of TT have the lengths of the shortest and longest covers of each prefix of TT, respectively. The literature has proposed linear-time algorithms computing longest and shortest cover arrays taking border arrays as input. An equivalence relation β‰ˆ\approx over strings is called a substring consistent equivalence relation (SCER) iff Xβ‰ˆYX \approx Y implies (1) ∣X∣=∣Y∣|X| = |Y| and (2) X[i:j]β‰ˆY[i:j]X[i:j] \approx Y[i:j] for all 1≀i≀jβ‰€βˆ£X∣1 \le i \le j \le |X|. In this paper, we generalize the notion of covers for SCERs and prove that existing algorithms to compute the shortest cover array and the longest cover array of a string TT under the identity relation will work for any SCERs taking the accordingly generalized border arrays.Comment: 16 page

    Succinct Data Structures for Families of Interval Graphs

    Full text link
    We consider the problem of designing succinct data structures for interval graphs with nn vertices while supporting degree, adjacency, neighborhood and shortest path queries in optimal time in the Θ(log⁑n)\Theta(\log n)-bit word RAM model. The degree query reports the number of incident edges to a given vertex in constant time, the adjacency query returns true if there is an edge between two vertices in constant time, the neighborhood query reports the set of all adjacent vertices in time proportional to the degree of the queried vertex, and the shortest path query returns a shortest path in time proportional to its length, thus the running times of these queries are optimal. Towards showing succinctness, we first show that at least nlog⁑nβˆ’2nlog⁑log⁑nβˆ’O(n)n\log{n} - 2n\log\log n - O(n) bits are necessary to represent any unlabeled interval graph GG with nn vertices, answering an open problem of Yang and Pippenger [Proc. Amer. Math. Soc. 2017]. This is augmented by a data structure of size nlog⁑n+O(n)n\log{n} +O(n) bits while supporting not only the aforementioned queries optimally but also capable of executing various combinatorial algorithms (like proper coloring, maximum independent set etc.) on the input interval graph efficiently. Finally, we extend our ideas to other variants of interval graphs, for example, proper/unit interval graphs, k-proper and k-improper interval graphs, and circular-arc graphs, and design succinct/compact data structures for these graph classes as well along with supporting queries on them efficiently

    A Simple Linear-Space Data Structure for Constant-Time Range Minimum Query

    Full text link
    Abstract. We revisit the range minimum query problem and present a new O(n)-space data structure that supports queries in O(1) time. Although previous data structures exist whose asymptotic bounds match ours, our goal is to introduce a new solution that is simple, intuitive, and practical without increasing asymptotic costs for query time or space

    Brain Structural Networks Associated with Intelligence and Visuomotor Ability

    Get PDF
    Increasing evidence indicates that multiple structures in the brain are associated with intelligence and cognitive function at the network level. The association between the grey matter (GM) structural network and intelligence and cognition is not well understood. We applied a multivariate approach to identify the pattern of GM and link the structural network to intelligence and cognitive functions. Structural magnetic resonance imaging was acquired from 92 healthy individuals. Source-based morphometry analysis was applied to the imaging data to extract GM structural covariance. We assessed the intelligence, verbal fluency, processing speed, and executive functioning of the participants and further investigated the correlations of the GM structural networks with intelligence and cognitive functions. Six GM structural networks were identified. The cerebello-parietal component and the frontal component were significantly associated with intelligence. The parietal and frontal regions were each distinctively associated with intelligence by maintaining structural networks with the cerebellum and the temporal region, respectively. The cerebellar component was associated with visuomotor ability. Our results support the parieto-frontal integration theory of intelligence by demonstrating how each core region for intelligence works in concert with other regions. In addition, we revealed how the cerebellum is associated with intelligence and cognitive functions

    Reconstructing phylogenies from noisy quartets in polynomial time with a high success probability

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In recent years, quartet-based phylogeny reconstruction methods have received considerable attentions in the computational biology community. Traditionally, the accuracy of a phylogeny reconstruction method is measured by simulations on synthetic datasets with known "true" phylogenies, while little theoretical analysis has been done. In this paper, we present a new model-based approach to measuring the accuracy of a quartet-based phylogeny reconstruction method. Under this model, we propose three efficient algorithms to reconstruct the "true" phylogeny with a high success probability.</p> <p>Results</p> <p>The first algorithm can reconstruct the "true" phylogeny from the input quartet topology set without quartet errors in <it>O</it>(<it>n</it><sup>2</sup>) time by querying at most (<it>n </it>- 4) log(<it>n </it>- 1) quartet topologies, where <it>n </it>is the number of the taxa. When the input quartet topology set contains errors, the second algorithm can reconstruct the "true" phylogeny with a probability approximately 1 - <it>p </it>in <it>O</it>(<it>n</it><sup>4 </sup>log <it>n</it>) time, where <it>p </it>is the probability for a quartet topology being an error. This probability is improved by the third algorithm to approximately <inline-formula><m:math name="1748-7188-3-1-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mfrac><m:mn>1</m:mn><m:mrow><m:mn>1</m:mn><m:mo>+</m:mo><m:msup><m:mi>q</m:mi><m:mn>2</m:mn></m:msup><m:mo>+</m:mo><m:mfrac><m:mn>1</m:mn><m:mn>2</m:mn></m:mfrac><m:msup><m:mi>q</m:mi><m:mn>4</m:mn></m:msup><m:mo>+</m:mo><m:mfrac><m:mn>1</m:mn><m:mrow><m:mn>16</m:mn></m:mrow></m:mfrac><m:msup><m:mi>q</m:mi><m:mn>5</m:mn></m:msup></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF"> MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqcfa4aaSaaaeaacqaIXaqmaeaacqaIXaqmcqGHRaWkcqWGXbqCdaahaaqabeaacqaIYaGmaaGaey4kaSYaaSaaaeaacqaIXaqmaeaacqaIYaGmaaGaemyCae3aaWbaaeqabaGaeGinaqdaaiabgUcaRmaalaaabaGaeGymaedabaGaeGymaeJaeGOnaydaaiabdghaXnaaCaaabeqaaiabiwda1aaaaaaaaa@3D5A@</m:annotation></m:semantics></m:math></inline-formula>, where <inline-formula><m:math name="1748-7188-3-1-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>q</m:mi><m:mo>=</m:mo><m:mfrac><m:mi>p</m:mi><m:mrow><m:mn>1</m:mn><m:mo>βˆ’</m:mo><m:mi>p</m:mi></m:mrow></m:mfrac></m:mrow><m:annotation encoding="MathType-MTEF"> MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemyCaeNaeyypa0tcfa4aaSaaaeaacqWGWbaCaeaacqaIXaqmcqGHsislcqWGWbaCaaaaaa@3391@</m:annotation></m:semantics></m:math></inline-formula>, with running time of <it>O</it>(<it>n</it><sup>5</sup>), which is at least 0.984 when <it>p </it>< 0.05.</p> <p>Conclusion</p> <p>The three proposed algorithms are mathematically guaranteed to reconstruct the "true" phylogeny with a high success probability. The experimental results showed that the third algorithm produced phylogenies with a higher probability than its aforementioned theoretical lower bound and outperformed some existing phylogeny reconstruction methods in both speed and accuracy.</p
    • …
    corecore