5,741 research outputs found
A Streaming Algorithm for the Undirected Longest Path Problem
We present the first streaming algorithm for the longest path problem in undirected graphs. The input graph is given as a stream of edges and RAM is limited to only a linear number of edges at a time (linear in the number of vertices n). We prove a per-edge processing time of O(n), where a naive solution would have required Omega(n^2). Moreover, we give a concrete linear upper bound on the number of bits of RAM that are required.
On a set of graphs with various structure, we experimentally compare our algorithm with three leading RAM algorithms: Warnsdorf (1823), Pohl-Warnsdorf (1967), and Pongrasz (2012). Although conducting only a small constant number of passes over the input, our algorithm delivers competitive results: with the exception of preferential attachment graphs, we deliver at least 71% of the solution of the best RAM algorithm. The same minimum relative performance of 71% is observed over all graph classes after removing the 10% worst cases. This comparison has strong meaning, since for each instance class there is one algorithm that on average delivers at least 84% of a Hamilton path. In some cases we deliver even better results than any of the RAM algorithms
On a Bounded Budget Network Creation Game
We consider a network creation game in which each player (vertex) has a fixed
budget to establish links to other players. In our model, each link has unit
price and each agent tries to minimize its cost, which is either its local
diameter or its total distance to other players in the (undirected) underlying
graph of the created network. Two versions of the game are studied: in the MAX
version, the cost incurred to a vertex is the maximum distance between the
vertex and other vertices, and in the SUM version, the cost incurred to a
vertex is the sum of distances between the vertex and other vertices. We prove
that in both versions pure Nash equilibria exist, but the problem of finding
the best response of a vertex is NP-hard. We take the social cost of the
created network to be its diameter, and next we study the maximum possible
diameter of an equilibrium graph with n vertices in various cases. When the sum
of players' budgets is n-1, the equilibrium graphs are always trees, and we
prove that their maximum diameter is Theta(n) and Theta(log n) in MAX and SUM
versions, respectively. When each vertex has unit budget (i.e. can establish
link to just one vertex), the diameter of any equilibrium graph in either
version is Theta(1). We give examples of equilibrium graphs in the MAX version,
such that all vertices have positive budgets and yet the diameter is
Omega(sqrt(log n)). This interesting (and perhaps counter-intuitive) result
shows that increasing the budgets may increase the diameter of equilibrium
graphs and hence deteriorate the network structure. Then we prove that every
equilibrium graph in the SUM version has diameter 2^O(sqrt(log n)). Finally, we
show that if the budget of each player is at least k, then every equilibrium
graph in the SUM version is k-connected or has diameter smaller than 4.Comment: 28 pages, 3 figures, preliminary version appeared in SPAA'1
Near-Linear Time Insertion-Deletion Codes and (1+)-Approximating Edit Distance via Indexing
We introduce fast-decodable indexing schemes for edit distance which can be
used to speed up edit distance computations to near-linear time if one of the
strings is indexed by an indexing string . In particular, for every length
and every , one can in near linear time construct a string
with , such that, indexing
any string , symbol-by-symbol, with results in a string where for which edit
distance computations are easy, i.e., one can compute a
-approximation of the edit distance between and any other
string in time.
Our indexing schemes can be used to improve the decoding complexity of
state-of-the-art error correcting codes for insertions and deletions. In
particular, they lead to near-linear time decoding algorithms for the
insertion-deletion codes of [Haeupler, Shahrasbi; STOC `17] and faster decoding
algorithms for list-decodable insertion-deletion codes of [Haeupler, Shahrasbi,
Sudan; ICALP `18]. Interestingly, the latter codes are a crucial ingredient in
the construction of fast-decodable indexing schemes
NetLSD: Hearing the Shape of a Graph
Comparison among graphs is ubiquitous in graph analytics. However, it is a
hard task in terms of the expressiveness of the employed similarity measure and
the efficiency of its computation. Ideally, graph comparison should be
invariant to the order of nodes and the sizes of compared graphs, adaptive to
the scale of graph patterns, and scalable. Unfortunately, these properties have
not been addressed together. Graph comparisons still rely on direct approaches,
graph kernels, or representation-based methods, which are all inefficient and
impractical for large graph collections.
In this paper, we propose the Network Laplacian Spectral Descriptor (NetLSD):
the first, to our knowledge, permutation- and size-invariant, scale-adaptive,
and efficiently computable graph representation method that allows for
straightforward comparisons of large graphs. NetLSD extracts a compact
signature that inherits the formal properties of the Laplacian spectrum,
specifically its heat or wave kernel; thus, it hears the shape of a graph. Our
evaluation on a variety of real-world graphs demonstrates that it outperforms
previous works in both expressiveness and efficiency.Comment: KDD '18: The 24th ACM SIGKDD International Conference on Knowledge
Discovery & Data Mining, August 19--23, 2018, London, United Kingdo
Computational methods for finding long simple cycles in complex networks
© 2017 Elsevier B.V. Detection of long simple cycles in real-world complex networks finds many applications in layout algorithms, information flow modelling, as well as in bioinformatics. In this paper, we propose two computational methods for finding long cycles in real-world networks. The first method is an exact approach based on our own integer linear programming formulation of the problem and a data mining pipeline. This pipeline ensures that the problem is solved as a sequence of integer linear programs. The second method is a multi-start local search heuristic, which combines an initial construction of a long cycle using depth-first search with four different perturbation operators. Our experimental results are presented for social network samples, graphs studied in the network science field, graphs from DIMACS series, and protein-protein interaction networks. These results show that our formulation leads to a significantly more efficient exact approach to solve the problem than a previous formulation. For 14 out of 22 networks, we have found the optimal solutions. The potential of heuristics in this problem is also demonstrated, especially in the context of large-scale problem instances
- …