Search CORE

26 research outputs found

Tractability in probabilistic databases

Author: Dan Suciu
Dan Suciu
See Profile
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2011
Field of study

All in-text references underlined in blue are linked to publications on ResearchGate, letting you access and read them immediately

CiteSeerX

Crossref

AN ESTIMATION ALGORITHM FOR MISSING DATA IN WIRELESS SENSOR NETWORKS

Author
Publication venue: 'Exeley, Inc.'
Publication date: 01/01/2013
Field of study

Crossref

Approximation trade-offs in Markovian stream processing: An empirical study

Author: Christopher Ré
Julie Letchner
Magdalena Balazinska
Matthai Philipose
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/06/2010
Field of study

A large amount of the world’s data is both sequential and imprecise. Such data is commonly modeled as Markovian streams; examples include words/sentences inferred from raw audio signals, or discrete location sequences inferred from RFID or GPS data. The rich semantics and large volumes of these streams make them difficult to query efficiently. In this paper, we study the effects—on both efficiency and accuracy—of two common stream approximations. Through experiments on a realworld RFID data set, we identify conditions under which these approximations can improve performance by several orders of magnitude, with only minimal effects on query results. We also identify cases when the full rich semantics are necessary

CiteSeerX

Crossref

Capturing Data Uncertainty in High-Volume Stream Processing

Author: Diao Yanlei
Li Boduo
Liu Anna
Peng Liping
Sutton Charles
Tran Thanh
Zink Michael
Publication venue
Publication date: 01/01/2009
Field of study

We present the design and development of a data stream system that captures data uncertainty from data collection to query processing to final result generation. Our system focuses on data that is naturally modeled as continuous random variables. For such data, our system employs an approach grounded in probability and statistical theory to capture data uncertainty and integrates this approach into high-volume stream processing. The first component of our system captures uncertainty of raw data streams from sensing devices. Since such raw streams can be highly noisy and may not carry sufficient information for query processing, our system employs probabilistic models of the data generation process and stream-speed inference to transform raw data into a desired format with an uncertainty metric. The second component captures uncertainty as data propagates through query operators. To efficiently quantify result uncertainty of a query operator, we explore a variety of techniques based on probability and statistical theory to compute the result distribution at stream speed. We are currently working with a group of scientists to evaluate our system using traces collected from the domains of (and eventually in the real systems for) hazardous weather monitoring and object tracking and monitoring.Comment: CIDR 200

arXiv.org e-Print Archive

CiteSeerX

ScholarWorks@UMass Amherst

Edinburgh Research Explorer

Extending Event-Driven Architecture for Proactive Systems

Author: Alexander Kofman
Anastasios Skarlatidis
Fabiana Fournier
Inna Skarbovsky
Publication venue
Publication date: 11/04/2020
Field of study

ABSTRACT Proactive Event-Driven Computing is a new paradigm, in which a decision is not made due to explicit users' requests nor is it made as a response to past events. Rather, the decision is autonomously triggered by forecasting future states. Proactive event-driven computing requires a departure from current event-driven architectures to ones capable of handling uncertainty and future events, and real-time decision making. We present a proactive event-driven architecture for Scalable Proactive Event-Driven Decision-making (SPEEDD), which combines these capabilities. The proposed architecture is composed of three main components: complex event processing, real-time decision making, and visualization. This architecture is instantiated by a real use case from the traffic management domain. In the future, the results of actual implementations of the use case will help us revise and refine the proposed architecture

CiteSeerX

RFID-Based Indoor Spatial Query Evaluation with Bayesian Filtering Techniques

Author: Gong Zhitao
Hui Bo
Ku Wei-Shinn
Lu Hua
Sun Min-Te
Wang Wenlu
Yu Jiao
Publication venue
Publication date: 01/04/2022
Field of study

People spend a significant amount of time in indoor spaces (e.g., office buildings, subway systems, etc.) in their daily lives. Therefore, it is important to develop efficient indoor spatial query algorithms for supporting various location-based applications. However, indoor spaces differ from outdoor spaces because users have to follow the indoor floor plan for their movements. In addition, positioning in indoor environments is mainly based on sensing devices (e.g., RFID readers) rather than GPS devices. Consequently, we cannot apply existing spatial query evaluation techniques devised for outdoor environments for this new challenge. Because Bayesian filtering techniques can be employed to estimate the state of a system that changes over time using a sequence of noisy measurements made on the system, in this research, we propose the Bayesian filtering-based location inference methods as the basis for evaluating indoor spatial queries with noisy RFID raw data. Furthermore, two novel models, indoor walking graph model and anchor point indexing model, are created for tracking object locations in indoor environments. Based on the inference method and tracking models, we develop innovative indoor range and k nearest neighbor (kNN) query algorithms. We validate our solution through use of both synthetic data and real-world data. Our experimental results show that the proposed algorithms can evaluate indoor spatial queries effectively and efficiently. We open-source the code, data, and floor plan at https://github.com/DataScienceLab18/IndoorToolKit

arXiv.org e-Print Archive

Probabilistic management of OCR data using an RDBMS

Author: Allauzen C.
Baeza-Yates R. A.
Bishop C. M.
Cho J.
Cowell R. G.
Gupta R.
Hopcroft J. E.
Jordan M. I.
Kimura H.
Lafferty J.
Mori S.
Widom J.
Yen J. Y.
Zobel J.
Publication venue: 'VLDB Endowment'
Publication date
Field of study

Crossref