Morning surface temperature inversions (MSTIs) from Allegheny County, PA to Beijing, China: formation factors, health effects, and applications

Author: Gao Yifeng
Publication date: 10/12/2018

In recent years, concern about air quality has increased as we better understand the relationship between air pollution and health, not only of humans and animals but also of the environment. This essay looks at the ways that scientists categorize air pollution and at the types of air pollution, and explores some of the health effects of exposure to air pollutants. Morning surface temperature inversions (MSTIs) are described, and the ways in which air dispersion conditions might increase the severity and duration of air pollution are explored. A method that uses MSTIs to forecast potentially dangerous air pollution, developed in Allegheny County, Pennsylvania, is applied to data from Beijing, China, an approach that is supported by the similar geographical and environmental conditions the two regions share. This paper uses only the MSTI detection method, one part of the full forecasting method, as a preliminary test of whether the method is applicable to Beijing. The emergency heavy pollution alerts in Beijing are introduced as a potential application of the MSTI forecasting method. Results of the preliminary test on Beijing data indicate that applying the Allegheny County MSTI method to Beijing is promising, provided that some criteria are adjusted to fit Beijing's situation. Full-time, consistently calibrated equipment is suggested to collect more accurate data for future MSTI research. Results of the use of the MSTI method show that some opportunities to warn Beijing residents of potentially dangerous air pollution were missed, and that alerts can sometimes be issued when none is warranted. Public Health Statement: Information sharing is one way that governments can help protect the health of their citizens; establishing policies that limit polluting emissions is another.
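The abstract does not state the detection criteria, so the following is only a minimal sketch of the general idea behind a surface-based inversion: temperature rising with height from the ground up. The function name `surface_inversion`, the input layout, and the sample sounding values are all hypothetical and are not taken from the Allegheny County method.

```python
from typing import List, Optional, Tuple


def surface_inversion(
    levels: List[Tuple[float, float]]
) -> Optional[Tuple[float, float]]:
    """Return (depth_m, strength_c) of a surface-based inversion, or None.

    `levels` holds (height_m, temperature_c) pairs sorted by increasing
    height, with the first entry at the surface. This layout is an
    assumption made for illustration.
    """
    if len(levels) < 2:
        return None
    base_h, base_t = levels[0]
    top_h, top_t = base_h, base_t
    for h, t in levels[1:]:
        if t <= top_t:
            # Temperature stopped rising, so the inversion layer ends here.
            break
        top_h, top_t = h, t
    if top_h == base_h:
        # Temperature falls immediately above the surface: no inversion.
        return None
    return top_h - base_h, top_t - base_t


# Hypothetical morning sounding: warming up to 250 m, cooling above.
sounding = [(0.0, 2.0), (100.0, 4.0), (250.0, 5.5), (400.0, 5.0)]
result = surface_inversion(sounding)
```

A forecasting rule would then compare the depth and strength against thresholds; the abstract notes that such criteria would need adjusting for Beijing.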

D-Scholarship@Pitt

Adaptive Resolution Loss: An Efficient and Effective Loss for Time Series Self-Supervised Learning Framework

Author: Gao Yifeng
Garcia Kevin
Perez Juan Manuel
Publication venue: ScholarWorks @ UTRGV
Publication date: 01/01/2023

Time series data is a crucial form of information that offers vast opportunities. With the widespread use of sensor networks, large-scale time series data has become ubiquitous. One of the most prominent problems in time series data mining is representation learning. Recently, with the introduction of self-supervised learning (SSL) frameworks, much research has focused on designing an effective SSL framework for time series data. One of the current state-of-the-art SSL frameworks for time series is TS2Vec. TS2Vec uses a specially designed hierarchical contrastive learning framework with loss-based training, and it performs outstandingly on benchmark tests. However, the computational cost of TS2Vec is often significantly greater than that of other SSL frameworks. In this paper, we present a new self-supervised learning loss named adaptive resolution loss. The proposed solution uses score functions to reduce the number of resolutions used for training the model, leading to an efficient adaptive resolution learning algorithm. The proposed method preserves the original model's integrity while significantly reducing its training time.

Scholarworks@UTRGV Univ. of Texas RioGrande Valley

PMP: Privacy-Aware Matrix Profile against Sensitive Pattern Inference

Author: Ding Jiahao
Gao Yifeng
Lin Jessica
Zhang Li
Publication venue: ScholarWorks @ UTRGV
Publication date: 01/01/2023

The recent rapid development of sensor technology has allowed massive fine-grained time series (TS) data to be collected and has set the foundation for the development of data-driven services and applications. This process often involves data sharing, so that third-party modelers can perform specific time series data mining (TSDM) tasks based on the needs of the data owner. The high resolution of TS brings new challenges in protecting privacy. While meaningful information in high-resolution TS shifts from concrete point values to local shape-based segments, numerous studies have found that long shape-based patterns could contain more sensitive information and may potentially be extracted and misused by a malicious third party. However, the privacy issue for TS patterns is surprisingly seldom explored in the privacy-preserving literature. In this work, we consider a new privacy-preserving problem: preventing malicious inference on long shape-based patterns while preserving short segment information for the performance of the utility task. To address this challenge, we investigate an alternative approach: sharing the Matrix Profile (MP), which is a non-linear transformation of the original data and a versatile data structure that supports many data mining tasks. We found that while MP can prevent concrete shape leakage, the canonical correlation in the MP index can still reveal the location of a sensitive long pattern. Based on this observation, we design two attacks, named Location Attack and Entropy Attack, to extract the pattern location from MP. To further protect MP from these two attacks, we propose a Privacy-Aware Matrix Profile (PMP) that perturbs the local correlation and breaks the canonical correlation in the MP index vector. We evaluate the proposed PMP against baseline noise-adding methods through quantitative analysis and real-world case studies to show the effectiveness of the proposed method.
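For readers unfamiliar with the data structure, the sketch below computes a Matrix Profile and its index vector by brute force, using z-normalized Euclidean distance. It is an illustrative sketch only, not the PMP method or either attack from the paper. The function names, the exclusion-zone width of `m // 2`, and the toy series are assumptions.

```python
import math
from typing import List, Sequence, Tuple


def znorm(x: Sequence[float]) -> List[float]:
    """Z-normalize a subsequence; a constant subsequence maps to zeros."""
    n = len(x)
    mean = sum(x) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in x) / n)
    if std == 0:
        return [0.0] * n
    return [(v - mean) / std for v in x]


def matrix_profile(ts: Sequence[float], m: int) -> Tuple[List[float], List[int]]:
    """Brute-force Matrix Profile for subsequence length m.

    profile[i] is the distance from subsequence i to its nearest
    non-trivial neighbor; index[i] is that neighbor's start position.
    """
    n_sub = len(ts) - m + 1
    subs = [znorm(ts[i:i + m]) for i in range(n_sub)]
    excl = max(1, m // 2)  # assumed exclusion zone to skip trivial matches
    profile = [math.inf] * n_sub
    index = [-1] * n_sub
    for i in range(n_sub):
        for j in range(n_sub):
            if abs(i - j) < excl:
                continue  # too close to i: would be a trivial match
            d = math.dist(subs[i], subs[j])
            if d < profile[i]:
                profile[i], index[i] = d, j
    return profile, index


# Toy series: the same 5-point shape appears at positions 0 and 10.
series = [0, 1, 3, 1, 0, 5, 5, 4, 6, 5, 0, 1, 3, 1, 0]
profile, index = matrix_profile(series, m=5)
```

In this toy case the index vector points position 0 at position 10 and vice versa. That illustrates the abstract's observation: the index can reveal where a repeated pattern lies even though the raw shape is not shared.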

Scholarworks@UTRGV Univ. of Texas RioGrande Valley

Robust Time Series Chain Discovery with Incremental Nearest Neighbors

Author: Gao Yifeng
Lin Jessica
Zhang Li
Zhu Yan
Publication venue: ScholarWorks @ UTRGV
Publication date: 03/11/2022

Time series motif discovery has been a fundamental task for identifying meaningful repeated patterns in time series. Recently, time series chains were introduced as an expansion of time series motifs to identify continuously evolving patterns in time series data. Informally, a time series chain (TSC) is a temporally ordered set of time series subsequences, in which every subsequence is similar to the one that precedes it, but the last and the first can be arbitrarily dissimilar. TSCs have been shown to reveal latent, continuously evolving trends in time series and to identify precursors of unusual events in complex systems. Despite their promising interpretability, we have observed that existing TSC definitions lack the ability to accurately cover the evolving part of a time series: the discovered chains can be easily cut by noise and can include non-evolving patterns, making them impractical in real-world applications. Inspired by a recent work that tracks how the nearest neighbor of a time series subsequence changes over time, we introduce a new TSC definition that is much more robust to noise in the data, in the sense that it can better locate the evolving patterns while excluding the non-evolving ones. We further propose two new quality metrics to rank the discovered chains. With extensive empirical evaluations, we demonstrate that the proposed TSC definition is significantly more robust to noise than the state of the art, and that the top-ranked chains discovered can reveal meaningful regularities in a variety of real-world datasets.
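The informal chain property in the abstract (each subsequence is close to its predecessor, yet the first and last may differ greatly) can be illustrated with a toy example. This sketch only measures that property on subsequences that are already given. It is not the chain discovery algorithm or either quality metric from the paper. The helper name `chain_drift` and the data are hypothetical, and plain Euclidean distance is used for simplicity.

```python
import math
from typing import List, Sequence, Tuple


def chain_drift(subsequences: Sequence[Sequence[float]]) -> Tuple[float, float]:
    """Return (largest consecutive distance, first-to-last distance)."""
    steps = [
        math.dist(a, b) for a, b in zip(subsequences, subsequences[1:])
    ]
    return max(steps), math.dist(subsequences[0], subsequences[-1])


# Toy chain: a single peak that grows by 0.5 at every step.
chain: List[List[float]] = [[0.0, 1.0 + 0.5 * k, 0.0] for k in range(5)]
max_step, end_to_end = chain_drift(chain)
```

Here every step is small while the end-to-end distance is four times larger, which is the gradual drift that a chain is meant to capture.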

Scholarworks@UTRGV Univ. of Texas RioGrande Valley

Robust Time Series Chain Discovery with Incremental Nearest Neighbors

Author: Gao Yifeng
Lin Jessica
Zhang Li
Zhu Yan
Publication date: 03/11/2022

arXiv.org e-Print Archive

Scholarworks@UTRGV Univ. of Texas RioGrande Valley