Search CORE

82,280 research outputs found

Learning to infer: RL-based search for DNN primitive selection on Heterogeneous Embedded Systems

Author: abadi
anderson
baker
chetlur
cortes
dong
he
hsu
kim
li
real
sutton
tan
watkins
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 18/11/2018
Field of study

Deep Learning is increasingly being adopted by industry for computer vision applications running on embedded devices. While Convolutional Neural Networks' accuracy has achieved a mature and remarkable state, inference latency and throughput are a major concern especially when targeting low-cost and low-power embedded platforms. CNNs' inference latency may become a bottleneck for Deep Learning adoption by industry, as it is a crucial specification for many real-time processes. Furthermore, deployment of CNNs across heterogeneous platforms presents major compatibility issues due to vendor-specific technology and acceleration libraries. In this work, we present QS-DNN, a fully automatic search based on Reinforcement Learning which, combined with an inference engine optimizer, efficiently explores through the design space and empirically finds the optimal combinations of libraries and primitives to speed up the inference of CNNs on heterogeneous embedded devices. We show that, an optimized combination can achieve 45x speedup in inference latency on CPU compared to a dependency-free baseline and 2x on average on GPGPU compared to the best vendor library. Further, we demonstrate that, the quality of results and time "to-solution" is much better than with Random Search and achieves up to 15x better results for a short-time search

arXiv.org e-Print Archive

Crossref

Better Optimism By Bayes: Adaptive Planning with Rich Models

Author: Dayan Peter
Guez Arthur
Silver David
Publication venue
Publication date: 01/02/2014
Field of study

The computational costs of inference and planning have confined Bayesian model-based reinforcement learning to one of two dismal fates: powerful Bayes-adaptive planning but only for simplistic models, or powerful, Bayesian non-parametric models but using simple, myopic planning strategies such as Thompson sampling. We ask whether it is feasible and truly beneficial to combine rich probabilistic models with a closer approximation to fully Bayesian planning. First, we use a collection of counterexamples to show formal problems with the over-optimism inherent in Thompson sampling. Then we leverage state-of-the-art techniques in efficient Bayes-adaptive planning and non-parametric Bayesian methods to perform qualitatively better than both existing conventional algorithms and Thompson sampling on two contextual bandit-like problems.Comment: 11 pages, 11 figure

arXiv.org e-Print Archive

MPG.PuRe

Stick-Breaking Policy Learning in Dec-POMDPs

Author: Amato Christopher
Carin Lawrence
How Jonathan P.
Liao Xuejun
Liu Miao
Publication venue
Publication date: 01/07/2015
Field of study

Expectation maximization (EM) has recently been shown to be an efficient algorithm for learning finite-state controllers (FSCs) in large decentralized POMDPs (Dec-POMDPs). However, current methods use fixed-size FSCs and often converge to maxima that are far from optimal. This paper considers a variable-size FSC to represent the local policy of each agent. These variable-size FSCs are constructed using a stick-breaking prior, leading to a new framework called \emph{decentralized stick-breaking policy representation} (Dec-SBPR). This approach learns the controller parameters with a variational Bayesian algorithm without having to assume that the Dec-POMDP model is available. The performance of Dec-SBPR is demonstrated on several benchmark problems, showing that the algorithm scales to large problems while outperforming other state-of-the-art methods

arXiv.org e-Print Archive

DSpace@MIT

Multi-agent knowledge integration mechanism using particle swarm optimization

Author: Axelrod
Axelrod
Axtell
Chen
Choudhury
Clarke
Cunningham
Dalkey
Eden
Epstein
Epstein
Geschka
Ghanem
Gilbert
Habin Lee
Heckbert
Huhns
Johnson
Kennedy
Khan
Klein
Kosko
Kosko
Kun Chang Lee
Kwahk
Lee
Lee
Liu
Liu
Mateou
Mintzberg
Namho Lee
Nelson
Noh
Parsopoulos
Prusty
Ramaprasad
Rodriguez-Repiso
Saaty
Salmeron
Schelling
Stylos
Thorleuchter
Thorleuchter
Trelea
Ulengin
Woodridge
Wu
Xirogiannis
Xiwei
Zhang
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/03/2012
Field of study

This is the post-print version of the final paper published in Technological Forecasting and Social Change. The published article is available from the link below. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. Copyright @ 2011 Elsevier B.V.Unstructured group decision-making is burdened with several central difficulties: unifying the knowledge of multiple experts in an unbiased manner and computational inefficiencies. In addition, a proper means of storing such unified knowledge for later use has not yet been established. Storage difficulties stem from of the integration of the logic underlying multiple experts' decision-making processes and the structured quantification of the impact of each opinion on the final product. To address these difficulties, this paper proposes a novel approach called the multiple agent-based knowledge integration mechanism (MAKIM), in which a fuzzy cognitive map (FCM) is used as a knowledge representation and storage vehicle. In this approach, we use particle swarm optimization (PSO) to adjust causal relationships and causality coefficients from the perspective of global optimization. Once an optimized FCM is constructed an agent based model (ABM) is applied to the inference of the FCM to solve real world problem. The final aggregate knowledge is stored in FCM form and is used to produce proper inference results for other target problems. To test the validity of our approach, we applied MAKIM to a real-world group decision-making problem, an IT project risk assessment, and found MAKIM to be statistically robust.Ministry of Education, Science and Technology (Korea

Crossref

Brunel University Research Archive