482,862 research outputs found
Car Sequencing with respect to Regular Expressions and Linear Bounds
In this paper we introduce a new model and a computational approach for sequencing
assembly lines with two types of constraints: (i) patterns described by regular expressions and
(ii) linear bounds on the number of certain products that may occur in pre-specified intervals.
If we restrict the problem to the second type of constraints only we obtain a generalization
of the familiar car sequencing problem, whereas constraints of type (i) may be useful to add
extra structure. Constraints of both types may have priorities and can be violated, and a Pareto
optimal solution is sought minimizing the violation of constraints in the given priority order. We
describe a computational method based on mathematical programming and genetic algorithms
for finding suboptimal solutions
Advancing the iid Test Based on Integration across the Correlation Integral: Ranges, Competition, and Power
This paper builds on Kočenda (2001) and extends it in two ways. First, two new intervals of the proximity parameter ε (over which the correlation integral is calculated) are specified. For these ε- ranges new critical values for various lengths of the data sets are introduced and through Monte Carlo studies it is shown that within new ε-ranges the test is even more powerful than within the original ε-range. A sensitivity analysis of the critical values with respect to ε-range choice is also given. Second, a comparison with existing results of the controlled competition of Barnett et al. (1997) as well as broad power tests on various nonlinear and chaotic data are provided. The results of the comparison strongly favor our robust procedure and confirm the ability of the test in finding nonlinear dependencies. An empirical comparison of the new ε-ranges with the original one shows that the test within the new ε-ranges is able to detect hidden patterns with much higher precision. Finally, new user-friendly and fast software is introduced.chaos, nonlinear dynamics, correlation integral, Monte Carlo, single-blind competition, power tests, high-frequency economic and financial data
Proactive Anomaly Detection in Large-Scale Cloud-Native Databases
This disclosure describes techniques to identify anomalous patterns in customer workloads from database logs and to enable timely, corrective action that ensures uninterrupted operation of the database. Examples of anomalies include sudden increases (bursts) in the number of error messages written to a log file. An adaptive behavior norm is defined for each message type. Time instances or periods when the gap between messages of a given type in the database log deviate from the expected behavior norms are detected. A deviation from the behavior norm is a potential indicator of database problems. An anomaly detection tool outputs a ranked list of log statements exhibiting spikes of activity along with their time intervals that a database administrator (DBA) can examine to take corrective action. By automating anomaly detection, the valuable time of DBAs can be spent acting on issues rather than finding them
Zoom-SVD: Fast and Memory Efficient Method for Extracting Key Patterns in an Arbitrary Time Range
Given multiple time series data, how can we efficiently find latent patterns
in an arbitrary time range? Singular value decomposition (SVD) is a crucial
tool to discover hidden factors in multiple time series data, and has been used
in many data mining applications including dimensionality reduction, principal
component analysis, recommender systems, etc. Along with its static version,
incremental SVD has been used to deal with multiple semi infinite time series
data and to identify patterns of the data. However, existing SVD methods for
the multiple time series data analysis do not provide functionality for
detecting patterns of data in an arbitrary time range: standard SVD requires
data for all intervals corresponding to a time range query, and incremental SVD
does not consider an arbitrary time range. In this paper, we propose Zoom-SVD,
a fast and memory efficient method for finding latent factors of time series
data in an arbitrary time range. Zoom-SVD incrementally compresses multiple
time series data block by block to reduce the space cost in storage phase, and
efficiently computes singular value decomposition (SVD) for a given time range
query in query phase by carefully stitching stored SVD results. Through
extensive experiments, we demonstrate that Zoom-SVD is up to 15x faster, and
requires 15x less space than existing methods. Our case study shows that
Zoom-SVD is useful for capturing past time ranges whose patterns are similar to
a query time range.Comment: 10 pages, 2018 ACM Conference on Information and Knowledge Management
(CIKM 2018
Longest Common Separable Pattern between Permutations
In this article, we study the problem of finding the longest common separable
pattern between several permutations. We give a polynomial-time algorithm when
the number of input permutations is fixed and show that the problem is NP-hard
for an arbitrary number of input permutations even if these permutations are
separable. On the other hand, we show that the NP-hard problem of finding the
longest common pattern between two permutations cannot be approximated better
than within a ratio of (where is the size of an optimal
solution) when taking common patterns belonging to pattern-avoiding classes of
permutations.Comment: 15 page
Longest Common Pattern between two Permutations
In this paper, we give a polynomial (O(n^8)) algorithm for finding a longest
common pattern between two permutations of size n given that one is separable.
We also give an algorithm for general permutations whose complexity depends on
the length of the longest simple permutation involved in one of our
permutations
Finding patterns in strings using suffix arrays
Finding regularities in large data sets requires implementations of systems that are efficient in both time and space requirements. Here, we describe a newly developed system that exploits the internal structure of the enhanced suffixarray to find significant patterns in a large collection of sequences. The system searches exhaustively for all significantly compressing patterns where patterns may consist of symbols and skips or wildcards. We demonstrate a possible application of the system by detecting interesting patterns in a Dutch and an English corpus
Mining Heterogeneous Multivariate Time-Series for Learning Meaningful Patterns: Application to Home Health Telecare
For the last years, time-series mining has become a challenging issue for
researchers. An important application lies in most monitoring purposes, which
require analyzing large sets of time-series for learning usual patterns. Any
deviation from this learned profile is then considered as an unexpected
situation. Moreover, complex applications may involve the temporal study of
several heterogeneous parameters. In that paper, we propose a method for mining
heterogeneous multivariate time-series for learning meaningful patterns. The
proposed approach allows for mixed time-series -- containing both pattern and
non-pattern data -- such as for imprecise matches, outliers, stretching and
global translating of patterns instances in time. We present the early results
of our approach in the context of monitoring the health status of a person at
home. The purpose is to build a behavioral profile of a person by analyzing the
time variations of several quantitative or qualitative parameters recorded
through a provision of sensors installed in the home
- …