Search CORE

482,862 research outputs found

Car Sequencing with respect to Regular Expressions and Linear Bounds

Author: Drótos Márton
Kis Tamás
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014
Field of study

In this paper we introduce a new model and a computational approach for sequencing assembly lines with two types of constraints: (i) patterns described by regular expressions and (ii) linear bounds on the number of certain products that may occur in pre-specified intervals. If we restrict the problem to the second type of constraints only we obtain a generalization of the familiar car sequencing problem, whereas constraints of type (i) may be useful to add extra structure. Constraints of both types may have priorities and can be violated, and a Pareto optimal solution is sought minimizing the violation of constraints in the given priority order. We describe a computational method based on mathematical programming and genetic algorithms for finding suboptimal solutions

Crossref

SZTAKI Publication Repository

Repository of the Academy's Library

Advancing the iid Test Based on Integration across the Correlation Integral: Ranges, Competition, and Power

Author: Evzen Kocenda
Lubos Briatka
Publication venue
Publication date
Field of study

This paper builds on Kočenda (2001) and extends it in two ways. First, two new intervals of the proximity parameter ε (over which the correlation integral is calculated) are specified. For these ε- ranges new critical values for various lengths of the data sets are introduced and through Monte Carlo studies it is shown that within new ε-ranges the test is even more powerful than within the original ε-range. A sensitivity analysis of the critical values with respect to ε-range choice is also given. Second, a comparison with existing results of the controlled competition of Barnett et al. (1997) as well as broad power tests on various nonlinear and chaotic data are provided. The results of the comparison strongly favor our robust procedure and confirm the ability of the test in finding nonlinear dependencies. An empirical comparison of the new ε-ranges with the original one shows that the test within the new ε-ranges is able to detect hidden patterns with much higher precision. Finally, new user-friendly and fast software is introduced.chaos, nonlinear dynamics, correlation integral, Monte Carlo, single-blind competition, power tests, high-frequency economic and financial data

Research Papers in Economics

Proactive Anomaly Detection in Large-Scale Cloud-Native Databases

Author: Chang Thomas
Dong Helen
Elsayed Engy
Lin Suzhen
Sherkat Reza
Publication venue: Technical Disclosure Commons
Publication date: 27/09/2023
Field of study

This disclosure describes techniques to identify anomalous patterns in customer workloads from database logs and to enable timely, corrective action that ensures uninterrupted operation of the database. Examples of anomalies include sudden increases (bursts) in the number of error messages written to a log file. An adaptive behavior norm is defined for each message type. Time instances or periods when the gap between messages of a given type in the database log deviate from the expected behavior norms are detected. A deviation from the behavior norm is a potential indicator of database problems. An anomaly detection tool outputs a ranked list of log statements exhibiting spikes of activity along with their time intervals that a database administrator (DBA) can examine to take corrective action. By automating anomaly detection, the valuable time of DBAs can be spent acting on issues rather than finding them

Technical Disclosure Common

Zoom-SVD: Fast and Memory Efficient Method for Extracting Key Patterns in an Arbitrary Time Range

Author: Choi Dongjin
Jang Jun-Gi
Jung Jinhong
Kang U
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 20/12/2018
Field of study

Given multiple time series data, how can we efficiently find latent patterns in an arbitrary time range? Singular value decomposition (SVD) is a crucial tool to discover hidden factors in multiple time series data, and has been used in many data mining applications including dimensionality reduction, principal component analysis, recommender systems, etc. Along with its static version, incremental SVD has been used to deal with multiple semi infinite time series data and to identify patterns of the data. However, existing SVD methods for the multiple time series data analysis do not provide functionality for detecting patterns of data in an arbitrary time range: standard SVD requires data for all intervals corresponding to a time range query, and incremental SVD does not consider an arbitrary time range. In this paper, we propose Zoom-SVD, a fast and memory efficient method for finding latent factors of time series data in an arbitrary time range. Zoom-SVD incrementally compresses multiple time series data block by block to reduce the space cost in storage phase, and efficiently computes singular value decomposition (SVD) for a given time range query in query phase by carefully stitching stored SVD results. Through extensive experiments, we demonstrate that Zoom-SVD is up to 15x faster, and requires 15x less space than existing methods. Our case study shows that Zoom-SVD is useful for capturing past time ranges whose patterns are similar to a query time range.Comment: 10 pages, 2018 ACM Conference on Information and Knowledge Management (CIKM 2018

arXiv.org e-Print Archive

Crossref

Longest Common Separable Pattern between Permutations

Author: Bouvel Mathilde
Rossin Dominique
Vialette Stephane
Publication venue
Publication date: 01/01/2007
Field of study

In this article, we study the problem of finding the longest common separable pattern between several permutations. We give a polynomial-time algorithm when the number of input permutations is fixed and show that the problem is NP-hard for an arbitrary number of input permutations even if these permutations are separable. On the other hand, we show that the NP-hard problem of finding the longest common pattern between two permutations cannot be approximated better than within a ratio of

sqrt{Opt}

(where

Opt

is the size of an optimal solution) when taking common patterns belonging to pattern-avoiding classes of permutations.Comment: 15 page

arXiv.org e-Print Archive

HAL-CentraleSupelec

CiteSeerX

Hal-Diderot

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

HAL-Rennes 1

Longest Common Pattern between two Permutations

Author: Alice Libardi
Daciberg Lima Gonçalves
Koschorke
Koschorke
Libardi
Libardi
Oziride Manzoli
Randall
Publication venue: 'Mathematical Sciences Publishers'
Publication date: 22/11/2006
Field of study

In this paper, we give a polynomial (O(n^8)) algorithm for finding a longest common pattern between two permutations of size n given that one is separable. We also give an algorithm for general permutations whose complexity depends on the length of the longest simple permutation involved in one of our permutations

arXiv.org e-Print Archive

Crossref

Finding patterns in strings using suffix arrays

Author: Stehouwer H.
Van Zaanen M.
Publication venue
Publication date: 01/01/2010
Field of study

Finding regularities in large data sets requires implementations of systems that are efﬁcient in both time and space requirements. Here, we describe a newly developed system that exploits the internal structure of the enhanced sufﬁxarray to ﬁnd signiﬁcant patterns in a large collection of sequences. The system searches exhaustively for all signiﬁcantly compressing patterns where patterns may consist of symbols and skips or wildcards. We demonstrate a possible application of the system by detecting interesting patterns in a Dutch and an English corpus

MPG.PuRe

Mining Heterogeneous Multivariate Time-Series for Learning Meaningful Patterns: Application to Home Health Telecare

Author: Duchene Florence
Garbay Catherine
Rialle Vincent
Publication venue
Publication date: 25/11/2004
Field of study

For the last years, time-series mining has become a challenging issue for researchers. An important application lies in most monitoring purposes, which require analyzing large sets of time-series for learning usual patterns. Any deviation from this learned profile is then considered as an unexpected situation. Moreover, complex applications may involve the temporal study of several heterogeneous parameters. In that paper, we propose a method for mining heterogeneous multivariate time-series for learning meaningful patterns. The proposed approach allows for mixed time-series -- containing both pattern and non-pattern data -- such as for imprecise matches, outliers, stretching and global translating of patterns instances in time. We present the early results of our approach in the context of monitoring the health status of a person at home. The purpose is to build a behavioral profile of a person by analyzing the time variations of several quantitative or qualitative parameters recorded through a provision of sensors installed in the home

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes