Search CORE

3,158 research outputs found

Interpretable Neural Architecture Search via Bayesian Optimisation with Weisfeiler-Lehman Kernels

Author: Dong Xiaowen
Osborne Michael
Ru Binxin
Wan Xingchen
Publication venue
Publication date: 12/01/2021
Field of study

Current neural architecture search (NAS) strategies focus only on finding a single, good, architecture. They offer little insight into why a specific network is performing well, or how we should modify the architecture if we want further improvements. We propose a Bayesian optimisation (BO) approach for NAS that combines the Weisfeiler-Lehman graph kernel with a Gaussian process surrogate. Our method optimises the architecture in a highly data-efficient manner: it is capable of capturing the topological structures of the architectures and is scalable to large graphs, thus making the high-dimensional and graph-like search spaces amenable to BO. More importantly, our method affords interpretability by discovering useful network features and their corresponding impact on the network performance. Indeed, we demonstrate empirically that our surrogate model is capable of identifying useful motifs which can guide the generation of new architectures. We finally show that our method outperforms existing NAS approaches to achieve the state of the art on both closed- and open-domain search spaces.Comment: ICLR 2021. 9 pages, 5 figures, 1 table (23 pages, 14 figures and 3 tables including references and appendices

arXiv.org e-Print Archive

Oxford University Research Archive

Interpretable neural architecture search via Bayesian optimisation with Weisfeiler-Lehman kernels

Author: Dong X
Osborne Ma
Ru B
Wan X
Publication venue: OpenReview
Publication date: 12/01/2021
Field of study

Oxford University Research Archive

Bayesian optimization on non-conventional search spaces

Author: Oh C.
Publication venue
Publication date: 01/01/2023
Field of study

International Migration, Integration and Social Cohesion online publications

Bayesian optimization on non-conventional search spaces

Author: Oh C.
Publication venue
Publication date: 01/01/2023
Field of study

International Migration, Integration and Social Cohesion online publications

Assessing hyper parameter optimization and speedup for convolutional neural networks

Author: A.Krizhevsky
D. L.Tutorial
E.Bochinski
E.Real
J.Bergstra
J.Deng
K.He
L.Xie
N.Srivastava
S.Ioffe
T.Domhan
W. Y.Lee
Z.Zhong
Publication venue: 'IGI Global'
Publication date: 01/01/2020
Field of study

The increased processing power of graphical processing units (GPUs) and the availability of large image datasets has fostered a renewed interest in extracting semantic information from images. Promising results for complex image categorization problems have been achieved using deep learning, with neural networks comprised of many layers. Convolutional neural networks (CNN) are one such architecture which provides more opportunities for image classification. Advances in CNN enable the development of training models using large labelled image datasets, but the hyper parameters need to be specified, which is challenging and complex due to the large number of parameters. A substantial amount of computational power and processing time is required to determine the optimal hyper parameters to define a model yielding good results. This article provides a survey of the hyper parameter search and optimization methods for CNN architectures

LSBU Research Open

Crossref

ResearchOnline@GCU