Search CORE

5,032 research outputs found

Convolutional Neural Fabrics

Author: Saxena Shreyas
Verbeek Jakob
Publication venue
Publication date: 04/12/2016
Field of study

Despite the success of CNNs, selecting the optimal architecture for a given task remains an open problem. Instead of aiming to select a single optimal architecture, we propose a "fabric" that embeds an exponentially large number of architectures. The fabric consists of a 3D trellis that connects response maps at different layers, scales, and channels with a sparse homogeneous local connectivity pattern. The only hyper-parameters of a fabric are the number of channels and layers. While individual architectures can be recovered as paths, the fabric can in addition ensemble all embedded architectures together, sharing their weights where their paths overlap. Parameters can be learned using standard methods based on back-propagation, at a cost that scales linearly in the fabric size. We present benchmark results competitive with the state of the art for image classification on MNIST and CIFAR10, and for semantic segmentation on the Part Labels dataset.Comment: Corrected typos (In proceedings of NIPS16

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Parallel Batch-Dynamic Graph Connectivity

Author: Awerbuch B.
Iyer A.
JaJa J.
Kejlberg-Rasmussen C.
Nanongkai D.
Reif J.
Reif J. H.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 17/05/2020
Field of study

In this paper, we study batch parallel algorithms for the dynamic connectivity problem, a fundamental problem that has received considerable attention in the sequential setting. The most well known sequential algorithm for dynamic connectivity is the elegant level-set algorithm of Holm, de Lichtenberg and Thorup (HDT), which achieves

O(\log^2 n)

amortized time per edge insertion or deletion, and

O(\log n / \log\log n)

time per query. We design a parallel batch-dynamic connectivity algorithm that is work-efficient with respect to the HDT algorithm for small batch sizes, and is asymptotically faster when the average batch size is sufficiently large. Given a sequence of batched updates, where

\Delta

is the average batch size of all deletions, our algorithm achieves

O(\log n \log(1 + n / \Delta))

expected amortized work per edge insertion and deletion and

O(\log^3 n)

depth w.h.p. Our algorithm answers a batch of

k

connectivity queries in

O(k \log(1 + n/k))

expected work and

O(\log n)

depth w.h.p. To the best of our knowledge, our algorithm is the first parallel batch-dynamic algorithm for connectivity.Comment: This is the full version of the paper appearing in the ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), 201

arXiv.org e-Print Archive

Crossref

Efficient and Robust Weighted Least-Squares Cell-Average Gradient Construction Methods for the Simulation of Scramjet Flows

Author: Baurle Robert A.
Nishikawa Hiroaki
White Jeffery A.
Publication venue
Publication date
Field of study

The ability to solve the equations governing the hypersonic turbulent flow of a real gas on unstructured grids using a spatially-elliptic, 2nd-order accurate, cell-centered, finite-volume method has been recently implemented in the VULCAN-CFD code. The construction of cell-average gradients using a weighted linear least-squares method and the use of these gradients in the construction of the inviscid fluxes is the focus of this paper. A comparison of least-squares stencil construction methodologies is presented and approaches designed to minimize the number of cells used to augment/stabilize the least-squares stencil while preserving accuracy are explored. Due to our interest in hypersonic flow, a robust multidimensional cell-average gradient limiter procedure that is consistent with the stencil used to construct the cellaverage gradients is described. Canonical problems are computed to illustrate the challenges and investigate the accuracy, robustness and convergence behavior of the cell-average gradient methods on unstructured cell-centered finite-volume grids. Finally, thermally perfect, chemically frozen, Mach 7.8 turbulent flow of air through a scramjet engine flowpath is computed and compared with experimental data to demonstrate the robustness, accuracy and convergence behavior of the preferred gradient method for a realistic 3-D geometry on a non-hex-dominant grid

NASA Technical Reports Server

cISP: A Speed-of-Light Internet Service Provider

Author: Aguirre Anthony
Aqeel Waqar
Bhattacherjee Debopam
Bozkurt Ilker Nadi
Chandrasekaran Balakrishnan
Godfrey P. Brighten
Jyothi Sangeetha Abdu
Laughlin Gregory P.
Maggs Bruce M.
Singla Ankit
Tirmazi Muhammad
Publication venue
Publication date: 01/01/2018
Field of study

Low latency is a requirement for a variety of interactive network applications. The Internet, however, is not optimized for latency. We thus explore the design of cost-effective wide-area networks that move data over paths very close to great-circle paths, at speeds very close to the speed of light in vacuum. Our cISP design augments the Internet's fiber with free-space wireless connectivity. cISP addresses the fundamental challenge of simultaneously providing low latency and scalable bandwidth, while accounting for numerous practical factors ranging from transmission tower availability to packet queuing. We show that instantiations of cISP across the contiguous United States and Europe would achieve mean latencies within 5% of that achievable using great-circle paths at the speed of light, over medium and long distances. Further, we estimate that the economic value from such networks would substantially exceed their expense

arXiv.org e-Print Archive

MPG.PuRe