Search CORE

536 research outputs found

Challenging Issues of Spatio-Temporal Data Mining

Author: Hossain Md. Anwar
Rashid A.N.M. Bazlur
Publication venue: The International Institute for Science, Technology and Education (IISTE)
Publication date: 31/03/2012
Field of study

The spatio-temporal database (STDB) has received considerable attention during the past few years, due to the emergence of numerous applications (e.g., flight control systems, weather forecast, mobile computing, etc.) that demand efficient management of moving objects. These applications record objects' geographical locations (sometimes also shapes) at various timestamps and support queries that explore their historical and future (predictive) behaviors. The STDB significantly extends the traditional spatial database, which deals with only stationary data and hence is inapplicable to moving objects, whose dynamic behavior requires re-investigation of numerous topics including data modeling, indexes, and the related query algorithms. In many application areas, huge amounts of data are generated, explicitly or implicitly containing spatial or spatiotemporal information. However, the ability to analyze these data remains inadequate, and the need for adapted data mining tools becomes a major challenge. In this paper, we have presented the challenging issues of spatio-temporal data mining. Keywords: database, data mining, spatial, temporal, spatio-tempora

International Institute for Science, Technology and Education (IISTE): E-Journals

Efficient MaxCount and threshold operators of moving objects

Author: Anderson Scot
Revesz Peter
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2008
Field of study

Calculating operators of continuously moving objects presents some unique challenges, especially when the operators involve aggregation or the concept of congestion, which happens when the number of moving objects in a changing or dynamic query space exceeds some threshold value. This paper presents the following six d-dimensional moving object operators: (1) MaxCount (or MinCount), which finds the Maximum (or Minimum) number of moving objects simultaneously present in the dynamic query space at any time during the query time interval. (2) CountRange, which finds a count of point objects whose trajectories intersect the dynamic query space during the query time interval. (3) ThresholdRange, which finds the set of time intervals during which the dynamic query space is congested. (4) ThresholdSum, which finds the total length of all the time intervals during which the dynamic query space is congested. (5) ThresholdCount, which finds the number of disjoint time intervals during which the dynamic query space is congested. And (6) ThresholdAverage, which finds the average length of time of all the time intervals when the dynamic query space is congested. For these operators separate algorithms are given to find only estimate or only precise values. Experimental results from more than 7,500 queries indicate that the estimation algorithms produce fast, efficient results with error under 5%

DigitalCommons@University of Nebraska

Southern Adventist University

Springer - Publisher Connector

QuickSel: Quick Selectivity Learning with Mixture Models

Author: Aboulnaga A.
Agrawal S.
Anagnostopoulos C.
Asparouhov T.
Chaudhuri S.
Gupta A.
Jagadish H.
Jagadish H. V.
Khachatryan A.
Kraska T.
Lam E.
Lim L.
Lin X.
Lynch C. A.
Markl V.
Markl V.
Rubner Y.
Ré C.
Ré C.
Stillger M.
Sun J.
Tzoumas K.
Van Gelder A.
Wu Y.
Yang J.
Zhang Q.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 10/04/2020
Field of study

Estimating the selectivity of a query is a key step in almost any cost-based query optimizer. Most of today's databases rely on histograms or samples that are periodically refreshed by re-scanning the data as the underlying data changes. Since frequent scans are costly, these statistics are often stale and lead to poor selectivity estimates. As an alternative to scans, query-driven histograms have been proposed, which refine the histograms based on the actual selectivities of the observed queries. Unfortunately, these approaches are either too costly to use in practice---i.e., require an exponential number of buckets---or quickly lose their advantage as they observe more queries. In this paper, we propose a selectivity learning framework, called QuickSel, which falls into the query-driven paradigm but does not use histograms. Instead, it builds an internal model of the underlying data, which can be refined significantly faster (e.g., only 1.9 milliseconds for 300 queries). This fast refinement allows QuickSel to continuously learn from each query and yield increasingly more accurate selectivity estimates over time. Unlike query-driven histograms, QuickSel relies on a mixture model and a new optimization algorithm for training its model. Our extensive experiments on two real-world datasets confirm that, given the same target accuracy, QuickSel is 34.0x-179.4x faster than state-of-the-art query-driven histograms, including ISOMER and STHoles. Further, given the same space budget, QuickSel is 26.8%-91.8% more accurate than periodically-updated histograms and samples, respectively

arXiv.org e-Print Archive

Crossref

In-Memory Trajectory Indexing for On-The-Fly Travel-Time Estimation

Author: Waury Robert
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2019
Field of study

VBN

Dynamic-parinet (D-parinet) : indexing present and future trajectories in networks

Author: Nandi Mou
Publication venue: Digital Commons @ NJIT
Publication date: 31/05/2011
Field of study

While indexing historical trajectories is a hot topic in the field of moving objects (MO) databases for many years, only a few of them consider that the objects movements are constrained. DYNAMIC-PARINET (D-PATINET) is designed for capturing of trajectory data flow in multiple discrete small time interval efficiently and to predict a MO’s movement or the underlying network state at a future time. The cornerstone of D-PARINET is PARINET, an efficient index for historical trajectory data. The structure of PARINET is based on a combination of graph partitioning and a set of composite B+-tree local indexes tuned for a given query load and a given data distribution in the network space. D-PARINET studies continuous update of trajectory data and use interpolation to predict future MO movement in the network. PARINET and D-PARINET can easily be integrated into any RDBMS, which is an essential asset particularly for industrial or commercial applications. The experimental evaluation under an off-the-shelf DBMS using simulated traffic data shows that DPARINET is robust and significantly outperforms the R-tree based access methods

Digital Commons @ New Jersey Institute of Technology (NJIT)

PolyFit: Polynomial-based Indexing Approach for Fast Approximate Range Aggregate Queries

Author: Chan Tsz Nam
Jensen Christian S.
Li Zhe
Yiu Man Lung
Publication venue
Publication date: 01/01/2021
Field of study

Range aggregate queries find frequent application in data analytics. In some use cases, approximate results are preferred over accurate results if they can be computed rapidly and satisfy approximation guarantees. Inspired by a recent indexing approach, we provide means of representing a discrete point data set by continuous functions that can then serve as compact index structures. More specifically, we develop a polynomial-based indexing approach, called PolyFit, for processing approximate range aggregate queries. PolyFit is capable of supporting multiple types of range aggregate queries, including COUNT, SUM, MIN and MAX aggregates, with guaranteed absolute and relative error bounds. Experiment results show that PolyFit is faster and more accurate and compact than existing learned index structures.Comment: 13 page

arXiv.org e-Print Archive

VBN

PolyFit:Polynomial-based indexing approach for fast approximate range aggregate queries

Author: Chan Tsz Nam
Jensen Christian S.
Li Zhe
Yiu Man Lung
Publication venue: OpenProceedings.org
Publication date: 01/01/2021
Field of study

VBN

Efficient MaxCount and threshold operators of moving objects

Author: A Civilis
D Pfoser
D Zhang
G Marsaglia
G Trajcevski
H Samet
H Samet
M Nascimento
M Pelanis
O Wolfson
P Revesz
P Rigaux
Peter Revesz
RH Guting
Scot Anderson
T Tzouramanis
TH Cormen
VT Almeida de
Y Tao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref