4,185 research outputs found
Initial Draft of a Possible Declarative Semantics for the Language
This article introduces a preliminary declarative semantics for a subset of the language Xcerpt (so-called
grouping-stratifiable programs) in form of a classical (Tarski style) model theory, adapted to the specific
requirements of Xcerpt’s constructs (e.g. the various aspects of incompleteness in query terms, grouping
constructs in rule heads, etc.). Most importantly, the model theory uses term simulation as a replacement
for term equality to handle incomplete term specifications, and an extended notion of substitutions in
order to properly convey the semantics of grouping constructs. Based upon this model theory, a fixpoint
semantics is also described, leading to a first notion of forward chaining evaluation of Xcerpt program
Graph edit distance from spectral seriation
This paper is concerned with computing graph edit distance. One of the criticisms that can be leveled at existing methods for computing graph edit distance is that they lack some of the formality and rigor of the computation of string edit distance. Hence, our aim is to convert graphs to string sequences so that string matching techniques can be used. To do this, we use a graph spectral seriation method to convert the adjacency matrix into a string or sequence order. We show how the serial ordering can be established using the leading eigenvector of the graph adjacency matrix. We pose the problem of graph-matching as a maximum a posteriori probability (MAP) alignment of the seriation sequences for pairs of graphs. This treatment leads to an expression in which the edit cost is the negative logarithm of the a posteriori sequence alignment probability. We compute the edit distance by finding the sequence of string edit operations which minimizes the cost of the path traversing the edit lattice. The edit costs are determined by the components of the leading eigenvectors of the adjacency matrix and by the edge densities of the graphs being matched. We demonstrate the utility of the edit distance on a number of graph clustering problems
Starling flock networks manage uncertainty in consensus at low cost
Flocks of starlings exhibit a remarkable ability to maintain cohesion as a
group in highly uncertain environments and with limited, noisy information.
Recent work demonstrated that individual starlings within large flocks respond
to a fixed number of nearest neighbors, but until now it was not understood why
this number is seven. We analyze robustness to uncertainty of consensus in
empirical data from multiple starling flocks and show that the flock
interaction networks with six or seven neighbors optimize the trade-off between
group cohesion and individual effort. We can distinguish these numbers of
neighbors from fewer or greater numbers using our systems-theoretic approach to
measuring robustness of interaction networks as a function of the network
structure, i.e., who is sensing whom. The metric quantifies the disagreement
within the network due to disturbances and noise during consensus behavior and
can be evaluated over a parameterized family of hypothesized sensing strategies
(here the parameter is number of neighbors). We use this approach to further
show that for the range of flocks studied the optimal number of neighbors does
not depend on the number of birds within a flock; rather, it depends on the
shape, notably the thickness, of the flock. The results suggest that robustness
to uncertainty may have been a factor in the evolution of flocking for
starlings. More generally, our results elucidate the role of the interaction
network on uncertainty management in collective behavior, and motivate the
application of our approach to other biological networks.Comment: 19 pages, 3 figures, 9 supporting figure
The State-of-the-Art of Set Visualization
Sets comprise a generic data model that has been used in a variety of data analysis problems. Such problems involve analysing and visualizing set relations between multiple sets defined over the same collection of elements. However, visualizing sets is a non-trivial problem due to the large number of possible relations between them. We provide a systematic overview of state-of-the-art techniques for visualizing different kinds of set relations. We classify these techniques into six main categories according to the visual representations they use and the tasks they support. We compare the categories to provide guidance for choosing an appropriate technique for a given problem. Finally, we identify challenges in this area that need further research and propose possible directions to address these challenges. Further resources on set visualization are available at http://www.setviz.net
VoG: Summarizing and Understanding Large Graphs
How can we succinctly describe a million-node graph with a few simple
sentences? How can we measure the "importance" of a set of discovered subgraphs
in a large graph? These are exactly the problems we focus on. Our main ideas
are to construct a "vocabulary" of subgraph-types that often occur in real
graphs (e.g., stars, cliques, chains), and from a set of subgraphs, find the
most succinct description of a graph in terms of this vocabulary. We measure
success in a well-founded way by means of the Minimum Description Length (MDL)
principle: a subgraph is included in the summary if it decreases the total
description length of the graph.
Our contributions are three-fold: (a) formulation: we provide a principled
encoding scheme to choose vocabulary subgraphs; (b) algorithm: we develop
\method, an efficient method to minimize the description cost, and (c)
applicability: we report experimental results on multi-million-edge real
graphs, including Flickr and the Notre Dame web graph.Comment: SIAM International Conference on Data Mining (SDM) 201
- …