Search CORE

13,527 research outputs found

Convolutional Neural Networks over Tree Structures for Programming Language Processing

Author: Jin Zhi
Li Ge
Mou Lili
Wang Tao
Zhang Lu
Publication venue
Publication date: 08/12/2015
Field of study

Programming language processing (similar to natural language processing) is a hot research topic in the field of software engineering; it has also aroused growing interest in the artificial intelligence community. However, different from a natural language sentence, a program contains rich, explicit, and complicated structural information. Hence, traditional NLP models may be inappropriate for programs. In this paper, we propose a novel tree-based convolutional neural network (TBCNN) for programming language processing, in which a convolution kernel is designed over programs' abstract syntax trees to capture structural information. TBCNN is a generic architecture for programming language processing; our experiments show its effectiveness in two different program analysis tasks: classifying programs according to functionality, and detecting code snippets of certain patterns. TBCNN outperforms baseline methods, including several neural models for NLP.Comment: Accepted at AAAI-1

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Geometry-Oblivious FMM for Compressing Dense SPD Matrices

Author: Biros George
Levitt James
Reiz Severin
Yu Chenhan D.
Publication venue
Publication date: 01/07/2017
Field of study

We present GOFMM (geometry-oblivious FMM), a novel method that creates a hierarchical low-rank approximation, "compression," of an arbitrary dense symmetric positive definite (SPD) matrix. For many applications, GOFMM enables an approximate matrix-vector multiplication in

N \log N

or even

N

time, where

N

is the matrix size. Compression requires

N \log N

storage and work. In general, our scheme belongs to the family of hierarchical matrix approximation methods. In particular, it generalizes the fast multipole method (FMM) to a purely algebraic setting by only requiring the ability to sample matrix entries. Neither geometric information (i.e., point coordinates) nor knowledge of how the matrix entries have been generated is required, thus the term "geometry-oblivious." Also, we introduce a shared-memory parallel scheme for hierarchical matrix computations that reduces synchronization barriers. We present results on the Intel Knights Landing and Haswell architectures, and on the NVIDIA Pascal architecture for a variety of matrices.Comment: 13 pages, accepted by SC'1

arXiv.org e-Print Archive

Crossref

Faster Shortest Paths in Dense Distance Graphs, with Applications

Author: Mozes Shay
Nussbaum Yahav
Weimann Oren
Publication venue
Publication date: 03/04/2014
Field of study

We show how to combine two techniques for efficiently computing shortest paths in directed planar graphs. The first is the linear-time shortest-path algorithm of Henzinger, Klein, Subramanian, and Rao [STOC'94]. The second is Fakcharoenphol and Rao's algorithm [FOCS'01] for emulating Dijkstra's algorithm on the dense distance graph (DDG). A DDG is defined for a decomposition of a planar graph

G

into regions of at most

r

vertices each, for some parameter

r < n

. The vertex set of the DDG is the set of

\Theta(n/\sqrt r)

vertices of

G

that belong to more than one region (boundary vertices). The DDG has

\Theta(n)

arcs, such that distances in the DDG are equal to the distances in

G

. Fakcharoenphol and Rao's implementation of Dijkstra's algorithm on the DDG (nicknamed FR-Dijkstra) runs in

O(n\log(n) r^{-1/2} \log r)

time, and is a key component in many state-of-the-art planar graph algorithms for shortest paths, minimum cuts, and maximum flows. By combining these two techniques we remove the

\log n

dependency in the running time of the shortest-path algorithm, making it

O(n r^{-1/2} \log^2r)

. This work is part of a research agenda that aims to develop new techniques that would lead to faster, possibly linear-time, algorithms for problems such as minimum-cut, maximum-flow, and shortest paths with negative arc lengths. As immediate applications, we show how to compute maximum flow in directed weighted planar graphs in

O(n \log p)

time, where

p

is the minimum number of edges on any path from the source to the sink. We also show how to compute any part of the DDG that corresponds to a region with

r

vertices and

k

boundary vertices in

O(r \log k)

time, which is faster than has been previously known for small values of

k

arXiv.org e-Print Archive

CiteSeerX

Flow trees for exploring spatial trajectories

Author: Dykes J.
Radburn R.
Slingsby A.
Wood J.
Publication venue
Publication date: 01/01/2009
Field of study

A trajectory is a directed path that defines a link between two spatial locations. That path may be as simple as the Euclidean shortest distance between a start and end point, or may involve a more complex traversal through time and space to travel from start to end. Within GI analysis, trajectories are used to represent phenomena such as movement of people as migration and commuting, good

CiteSeerX

City Research Online