25,555 research outputs found
On mining complex sequential data by means of FCA and pattern structures
Nowadays data sets are available in very complex and heterogeneous ways.
Mining of such data collections is essential to support many real-world
applications ranging from healthcare to marketing. In this work, we focus on
the analysis of "complex" sequential data by means of interesting sequential
patterns. We approach the problem using the elegant mathematical framework of
Formal Concept Analysis (FCA) and its extension based on "pattern structures".
Pattern structures are used for mining complex data (such as sequences or
graphs) and are based on a subsumption operation, which in our case is defined
with respect to the partial order on sequences. We show how pattern structures
along with projections (i.e., a data reduction of sequential structures), are
able to enumerate more meaningful patterns and increase the computing
efficiency of the approach. Finally, we show the applicability of the presented
method for discovering and analyzing interesting patient patterns from a French
healthcare data set on cancer. The quantitative and qualitative results (with
annotations and analysis from a physician) are reported in this use case which
is the main motivation for this work.
Keywords: data mining; formal concept analysis; pattern structures;
projections; sequences; sequential data.Comment: An accepted publication in International Journal of General Systems.
The paper is created in the wake of the conference on Concept Lattice and
their Applications (CLA'2013). 27 pages, 9 figures, 3 table
Mining Heterogeneous Multivariate Time-Series for Learning Meaningful Patterns: Application to Home Health Telecare
For the last years, time-series mining has become a challenging issue for
researchers. An important application lies in most monitoring purposes, which
require analyzing large sets of time-series for learning usual patterns. Any
deviation from this learned profile is then considered as an unexpected
situation. Moreover, complex applications may involve the temporal study of
several heterogeneous parameters. In that paper, we propose a method for mining
heterogeneous multivariate time-series for learning meaningful patterns. The
proposed approach allows for mixed time-series -- containing both pattern and
non-pattern data -- such as for imprecise matches, outliers, stretching and
global translating of patterns instances in time. We present the early results
of our approach in the context of monitoring the health status of a person at
home. The purpose is to build a behavioral profile of a person by analyzing the
time variations of several quantitative or qualitative parameters recorded
through a provision of sensors installed in the home
Relating Objective and Subjective Performance Measures for AAM-based Visual Speech Synthesizers
We compare two approaches for synthesizing visual speech using Active Appearance Models (AAMs): one that utilizes acoustic features as input, and one that utilizes a phonetic transcription as input. Both synthesizers are trained using the same data and the performance is measured using both objective and subjective testing. We investigate the impact of likely sources of error in the synthesized visual speech by introducing typical errors into real visual speech sequences and subjectively measuring the perceived degradation. When only a small region (e.g. a single syllable) of ground-truth visual speech is incorrect we find that the subjective score for the entire sequence is subjectively lower than sequences generated by our synthesizers. This observation motivates further consideration of an often ignored issue, which is to what extent are subjective measures correlated with objective measures of performance? Significantly, we find that the most commonly used objective measures of performance are not necessarily the best indicator of viewer perception of quality. We empirically evaluate alternatives and show that the cost of a dynamic time warp of synthesized visual speech parameters to the respective ground-truth parameters is a better indicator of subjective quality
Chance and Necessity in Evolution: Lessons from RNA
The relationship between sequences and secondary structures or shapes in RNA
exhibits robust statistical properties summarized by three notions: (1) the
notion of a typical shape (that among all sequences of fixed length certain
shapes are realized much more frequently than others), (2) the notion of shape
space covering (that all typical shapes are realized in a small neighborhood of
any random sequence), and (3) the notion of a neutral network (that sequences
folding into the same typical shape form networks that percolate through
sequence space). Neutral networks loosen the requirements on the mutation rate
for selection to remain effective. The original (genotypic) error threshold has
to be reformulated in terms of a phenotypic error threshold. With regard to
adaptation, neutrality has two seemingly contradictory effects: It acts as a
buffer against mutations ensuring that a phenotype is preserved. Yet it is
deeply enabling, because it permits evolutionary change to occur by allowing
the sequence context to vary silently until a single point mutation can become
phenotypically consequential. Neutrality also influences predictability of
adaptive trajectories in seemingly contradictory ways. On the one hand it
increases the uncertainty of their genotypic trace. At the same time neutrality
structures the access from one shape to another, thereby inducing a topology
among RNA shapes which permits a distinction between continuous and
discontinuous shape transformations. To the extent that adaptive trajectories
must undergo such transformations, their phenotypic trace becomes more
predictable.Comment: 37 pages, 14 figures; 1998 CNLS conference; high quality figures at
http://www.santafe.edu/~walte
Behavior analysis for aging-in-place using similarity heatmaps
The demand for healthcare services for an increasing population of older adults is faced with the shortage of skilled caregivers and a constant increase in healthcare costs. In addition, the strong preference of the elderly to live independently has been driving much research on "ambient-assisted living" (AAL) systems to support aging-in-place. In this paper, we propose to employ a low-resolution image sensor network for behavior analysis of a home occupant. A network of 10 low-resolution cameras (30x30 pixels) is installed in a service flat of an elderly, based on which the user's mobility tracks are extracted using a maximum likelihood tracker. We propose a novel measure to find similar patterns of behavior between each pair of days from the user's detected positions, based on heatmaps and Earth mover's distance (EMD). Then, we use an exemplar-based approach to identify sleeping, eating, and sitting activities, and walking patterns of the elderly user for two weeks of real-life recordings. The proposed system achieves an overall accuracy of about 94%
Life course approaches to intergenerational inequality
This dissertation examines intergenerational inequalities from a life course perspective. Different outcomes and paths to outcomes are analyzed using register data. The thesis consists of four research articles and a conclusion.
Intergenerational inequality is one of the most researched subjects in sociological literature. We know that family background impacts practically every major decision and event in a person’s life. Even in the Finnish context of an extensive welfare state, we can only mitigate intergenerational inequality, not eliminate it. One could say that parents’ will and ability to transmit advantages to their children to different extents is almost as fundamental as the inevitable inequality in societies and social systems.
In this dissertation, the transmission of inequalities is studied from the life course perspective over many outcomes. The aim is to provide a comprehensive and holistic picture of intergenerational transmissions. Outcomes range from more traditional educational and economic ones to family formation patterns and homeownership. In all cases, family background has a clear impact.
Two register-based data sets from Statistics Finland are used in the articles (Finnish Growth Environment Panel and Finnish Census Panel). Both include information on the siblings and parents of the sample persons. Because they are register based, the data sets are comprehensive and reliable. We not only can match people with their relatives but also have detailed information on siblings and parents with respect to studied phenomena and background information.
In the first article, we established that early day care is associated with higher educational outcomes in early adulthood. We did not find any parental education-based variation in the strength of the association. In the second article, we found that early socioeconomic trajectories are influenced by family background. Although parental education and income in one’s childhood family are leading determinants of intergenerational transmissions, a large share of the family background effect remains unexplained. Furthermore, we show that only about half of the family of origin effect on early socioeconomic trajectories is associated with the end outcomes at age 35. Therefore, family background has a clear effect on early socioeconomic trajectories that could not be observed by analyzing the end outcomes. In the third article, we found that family formation trajectories are affected by family background, disadvantaged paths the most. We could not identify a large share of the family background determinants underlying the effects. In the last article, which analyzes homeownership, we found a strong overall effect of family background in the form of sibling correlations. We also found that for men, the effect varied between those living with or without a partner. The strongest sibling correlations were found for men living alone. Overall, social origin broadly influenced many aspects of a person’s life course
- …