26,394 research outputs found
Discovering Tight Space-Time Sequences
International audienceThe problem of discovering spatiotemporal sequential patterns affects a broad range of applications. Many initiatives find sequences constrained by space and time. This paper addresses an appealing new challenge for this domain: find tight space-time sequences, i.e., find within the same process: i) frequent sequences constrained in space and time that may not be frequent in the entire dataset and ii) the time interval and space range where these sequences are frequent. The discovery of such patterns along with their constraints may lead to extract valuable knowledge that can remain hidden using traditional methods since their support is extremely low over the entire dataset. We introduce a new Spatio-Temporal Sequence Miner (ST SM) algorithm to discover tight space-time sequences. We evaluate ST SM using a proof of concept use case. When compared with general spatial-time sequence mining algorithms (GST SM), ST SM allows for new insights by detecting maximal space-time areas where each pattern is frequent. To the best of our knowledge, this is the first solution to tackle the problem of identifying tight space-time sequences
Mutual Enrichment in Ranked Lists and the Statistical Assessment of Position Weight Matrix Motifs
Statistics in ranked lists is important in analyzing molecular biology
measurement data, such as ChIP-seq, which yields ranked lists of genomic
sequences. State of the art methods study fixed motifs in ranked lists. More
flexible models such as position weight matrix (PWM) motifs are not addressed
in this context. To assess the enrichment of a PWM motif in a ranked list we
use a PWM induced second ranking on the same set of elements. Possible orders
of one ranked list relative to the other are modeled by permutations. Due to
sample space complexity, it is difficult to characterize tail distributions in
the group of permutations. In this paper we develop tight upper bounds on tail
distributions of the size of the intersection of the top of two uniformly and
independently drawn permutations and demonstrate advantages of this approach
using our software implementation, mmHG-Finder, to study PWMs in several
datasets.Comment: Peer-reviewed and presented as part of the 13th Workshop on
Algorithms in Bioinformatics (WABI2013
Chaotic Crystallography: How the physics of information reveals structural order in materials
We review recent progress in applying information- and computation-theoretic
measures to describe material structure that transcends previous methods based
on exact geometric symmetries. We discuss the necessary theoretical background
for this new toolset and show how the new techniques detect and describe novel
material properties. We discuss how the approach relates to well known
crystallographic practice and examine how it provides novel interpretations of
familiar structures. Throughout, we concentrate on disordered materials that,
while important, have received less attention both theoretically and
experimentally than those with either periodic or aperiodic order.Comment: 9 pages, two figures, 1 table;
http://csc.ucdavis.edu/~cmg/compmech/pubs/ChemOpinion.ht
- …