181 research outputs found
A Discrete Dislocation Study of Thin Film Interfacial Fracture
Ph.DDOCTOR OF PHILOSOPH
PUPAE: Intuitive and Actionable Explanations for Time Series Anomalies
In recent years there has been significant progress in time series anomaly
detection. However, after detecting an (perhaps tentative) anomaly, can we
explain it? Such explanations would be useful to triage anomalies. For example,
in an oil refinery, should we respond to an anomaly by dispatching a hydraulic
engineer, or an intern to replace the battery on a sensor? There have been some
parallel efforts to explain anomalies, however many proposed techniques produce
explanations that are indirect, and often seem more complex than the anomaly
they seek to explain. Our review of the literature/checklists/user-manuals used
by frontline practitioners in various domains reveals an interesting
near-universal commonality. Most practitioners discuss, explain and report
anomalies in the following format: The anomaly would be like normal data A, if
not for the corruption B. The reader will appreciate that is a type of
counterfactual explanation. In this work we introduce a domain agnostic
counterfactual explanation technique to produce explanations for time series
anomalies. As we will show, our method can produce both visual and text-based
explanations that are objectively correct, intuitive and in many circumstances,
directly actionable.Comment: 9 Page Manuscript, 1 Page Supplementary (Supplement not published in
conference proceedings.
Matrix Profile XXVII: A Novel Distance Measure for Comparing Long Time Series
The most useful data mining primitives are distance measures. With an
effective distance measure, it is possible to perform classification,
clustering, anomaly detection, segmentation, etc. For single-event time series
Euclidean Distance and Dynamic Time Warping distance are known to be extremely
effective. However, for time series containing cyclical behaviors, the semantic
meaningfulness of such comparisons is less clear. For example, on two separate
days the telemetry from an athlete workout routine might be very similar. The
second day may change the order in of performing push-ups and squats, adding
repetitions of pull-ups, or completely omitting dumbbell curls. Any of these
minor changes would defeat existing time series distance measures. Some
bag-of-features methods have been proposed to address this problem, but we
argue that in many cases, similarity is intimately tied to the shapes of
subsequences within these longer time series. In such cases, summative features
will lack discrimination ability. In this work we introduce PRCIS, which stands
for Pattern Representation Comparison in Series. PRCIS is a distance measure
for long time series, which exploits recent progress in our ability to
summarize time series with dictionaries. We will demonstrate the utility of our
ideas on diverse tasks and datasets.Comment: Accepted at IEEE ICKG 2022. (Previously entitled IEEE ICBK.) Abridged
abstract as per arxiv's requirement
a mixed-method approach
Background Sedentary behaviours (SB) can be characterized by low energy
expenditure in a reclining position (e.g., sitting) often associated with work
and transport. Prolonged SB is associated with increased risk for chronic
conditions, and due to technological advances, the working population is in
office settings with high occupational exposure to SB. This study aims to
assess SB among office workers, as well as barriers and strategies towards
reducing SB in the work setting. Methods Using a mixed-methods approach guided
by the socio-ecological framework, non-academic office workers from a
professional school in a large public university were recruited. Of 180
eligible office workers, 40 enrolled and completed all assessments. Self-
reported and objectively measured SB and activity levels were captured. Focus
group discussion (FGD) were conducted to further understand perceptions,
barriers, and strategies to reducing workplace SB. Environmental factors were
systematically evaluated by trained research staff using an adapted version of
the Checklist for Health Promotion Environments at Worksites (CHEW). Thematic
analysis of FGD was conducted and descriptive analysis of quantitative data
was performed. Results The sample was mostly Chinese (n = 33, 80 %) with a
total of 24 (60 %) female participants. Most participants worked five days a
week for about 9.5(0.5) hrs/day. Accelerometer data show that participants
spend the majority of their days in sedentary activities both on workdays
(76.9 %) and non-workdays (69.5 %). Self-report data confirm these findings
with median sitting time of 420(180) minutes at work. From qualitative
analyses, major barriers to reducing SB emerged, including the following
themes: workplace social and cultural norms, personal factors, job scope, and
physical building/office infrastructure. CHEW results confirm a lack of
support from the physical infrastructure and information environment to
reducing SB. Conclusions There is high SB among office workers in this sample.
We identified multiple levels of influence for prolonged occupational SB, with
a particular emphasis on workplace norms and infrastructure as important
barriers to reducing SB and increasing PA. A larger, representative sample of
the Singaporean population is needed to confirm our findings but it seems that
any intervention aimed at reducing SB in the workplace should target
individual, environmental, and organizational levels
Multitask Learning for Time Series Data with 2D Convolution
Multitask learning (MTL) aims to develop a unified model that can handle a
set of closely related tasks simultaneously. By optimizing the model across
multiple tasks, MTL generally surpasses its non-MTL counterparts in terms of
generalizability. Although MTL has been extensively researched in various
domains such as computer vision, natural language processing, and
recommendation systems, its application to time series data has received
limited attention. In this paper, we investigate the application of MTL to the
time series classification (TSC) problem. However, when we integrate the
state-of-the-art 1D convolution-based TSC model with MTL, the performance of
the TSC model actually deteriorates. By comparing the 1D convolution-based
models with the Dynamic Time Warping (DTW) distance function, it appears that
the underwhelming results stem from the limited expressive power of the 1D
convolutional layers. To overcome this challenge, we propose a novel design for
a 2D convolution-based model that enhances the model's expressiveness.
Leveraging this advantage, our proposed method outperforms competing approaches
on both the UCR Archive and an industrial transaction TSC dataset
Toward a Foundation Model for Time Series Data
A foundation model is a machine learning model trained on a large and diverse
set of data, typically using self-supervised learning-based pre-training
techniques, that can be adapted to various downstream tasks. However, current
research on time series pre-training has mostly focused on models pre-trained
solely on data from a single domain, resulting in a lack of knowledge about
other types of time series. However, current research on time series
pre-training has predominantly focused on models trained exclusively on data
from a single domain. As a result, these models possess domain-specific
knowledge that may not be easily transferable to time series from other
domains. In this paper, we aim to develop an effective time series foundation
model by leveraging unlabeled samples from multiple domains. To achieve this,
we repurposed the publicly available UCR Archive and evaluated four existing
self-supervised learning-based pre-training methods, along with a novel method,
on the datasets. We tested these methods using four popular neural network
architectures for time series to understand how the pre-training methods
interact with different network designs. Our experimental results show that
pre-training improves downstream classification tasks by enhancing the
convergence of the fine-tuning process. Furthermore, we found that the proposed
pre-training method, when combined with the Transformer model, outperforms the
alternatives
Morphological and Molecular Defects in Human Three-Dimensional Retinal Organoid Model of X-Linked Juvenile Retinoschisis
X-linked juvenile retinoschisis (XLRS), linked to mutations in the RS1 gene, is a degenerative retinopathy with a retinal splitting phenotype. We generated human induced pluripotent stem cells (hiPSCs) from patients to study XLRS in a 3D retinal organoid in vitro differentiation system. This model recapitulates key features of XLRS including retinal splitting, defective retinoschisin production, outer-segment defects, abnormal paxillin turnover, and impaired ER-Golgi transportation. RS1 mutation also affects the development of photoreceptor sensory cilia and results in altered expression of other retinopathy-associated genes. CRISPR/Cas9 correction of the disease-associated C625T mutation normalizes the splitting phenotype, outer-segment defects, paxillin dynamics, ciliary marker expression, and transcriptome profiles. Likewise, mutating RS1 in control hiPSCs produces the disease-associated phenotypes. Finally, we show that the C625T mutation can be repaired precisely and efficiently using a base-editing approach. Taken together, our data establish 3D organoids as a valid disease model
An Efficient Content-based Time Series Retrieval System
A Content-based Time Series Retrieval (CTSR) system is an information
retrieval system for users to interact with time series emerged from multiple
domains, such as finance, healthcare, and manufacturing. For example, users
seeking to learn more about the source of a time series can submit the time
series as a query to the CTSR system and retrieve a list of relevant time
series with associated metadata. By analyzing the retrieved metadata, users can
gather more information about the source of the time series. Because the CTSR
system is required to work with time series data from diverse domains, it needs
a high-capacity model to effectively measure the similarity between different
time series. On top of that, the model within the CTSR system has to compute
the similarity scores in an efficient manner as the users interact with the
system in real-time. In this paper, we propose an effective and efficient CTSR
model that outperforms alternative models, while still providing reasonable
inference runtimes. To demonstrate the capability of the proposed method in
solving business problems, we compare it against alternative models using our
in-house transaction data. Our findings reveal that the proposed model is the
most suitable solution compared to others for our transaction data problem
IMI – Interventions myopia institute:Interventions for controlling myopia onset and progression report
Myopia has been predicted to affect approximately 50% of the world’s population based on trending myopia prevalence figures. Critical to minimizing the associated adverse visual consequences of complicating ocular pathologies are interventions to prevent or delay the onset of myopia, slow its progression, and to address the problem of mechanical instability of highly myopic eyes. Although treatment approaches are growing in number, evidence of treatment efficacy is variable. This article reviews research behind such interventions under four categories: optical, pharmacological, environmental (behavioral), and surgical. In summarizing the evidence of efficacy, results from randomized controlled trials have been given most weight, although such data are very limited for some treatments. The overall conclusion of this review is that there are multiple avenues for intervention worthy of exploration in all categories, although in the case of optical, pharmacological, and behavioral interventions for preventing or slowing progression of myopia, treatment efficacy at an individual level appears quite variable, with no one treatment being 100% effective in all patients. Further research is critical to understanding the factors underlying such variability and underlying mechanisms, to guide recommendations for combined treatments. There is also room for research into novel treatment options
- …