Search CORE

120,745 research outputs found

Replica Symmetry Breaking and the Kuhn-Tucker Cavity Method in simple and multilayer Perceptrons

Author: D- Gottingen
F. Gerl
U. Kreyß
Publication venue: 'EDP Sciences'
Publication date: 22/10/1996
Field of study

Within a Kuhn-Tucker cavity method introduced in a former paper, we study optimal stability learning for situations, where in the replica formalism the replica symmetry may be broken, namely (i) the case of a simple perceptron above the critical loading, and (ii) the case of two-layer AND-perceptrons, if one learns with maximal stability. We find that the deviation of our cavity solution from the replica symmetric one in these cases is a clear indication of the necessity of replica symmetry breaking. In any case the cavity solution tends to underestimate the storage capabilities of the networks.Comment: 32 pages, LaTex Source with 9 .eps-files enclosed, accepted by J. Phys I (France

arXiv.org e-Print Archive

CiteSeerX

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

The IoT Tree of Life

Author: Bremner Duncan
Publication venue: Knowledge Transfer Network, Sensors & Instrumentation Leadership Committee
Publication date: 01/01/2015
Field of study

The Internet of Things (IoT) has come to mean all things to all people. Combined with the huge amount of interest and investment into this emerging opportunity, there is a real possibility that the arising confusion will hamper adoption by the mass market. The SILC team have used their extensive Sensor Systems market and technical knowledge in an attempt to clarify the situation for individuals interested in understanding IoT, and the underpinning role of Sensor Systems. This paper proposes a phased model of the IoT ecosystem, starting with infrastructure establishment, and culminating in exploitation through the creation of new companies and business models. It does not attempt to quantify the emerging opportunities, relying instead on the many publications dedicated to detailed market analysis. The focus is to place the opportunities in context, demonstrate the importance of sensor system technology underpinning the emerging IoT revolution, and suggests areas where the UK could establish leadership positions. Throughout the paper, examples of the likely protagonists have been used by way of illustration

Enlighten

Smart Grid Communications: Overview of Research Challenges, Solutions, and Standardization Activities

Author: Chin Woon Hau
Efthymiou Costas
Fan Zhong
Gormus Sedat
Kalogridis Georgios
Kulkarni Parag
Lambotharan Sangarapillai
Sooriyabandara Mahesh
Zhu Ziming
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 15/12/2011
Field of study

Optimization of energy consumption in future intelligent energy networks (or Smart Grids) will be based on grid-integrated near-real-time communications between various grid elements in generation, transmission, distribution and loads. This paper discusses some of the challenges and opportunities of communications research in the areas of smart grid and smart metering. In particular, we focus on some of the key communications challenges for realizing interoperable and future-proof smart grid/metering networks, smart grid security and privacy, and how some of the existing networking technologies can be applied to energy management. Finally, we also discuss the coordinated standardization efforts in Europe to harmonize communications standards and protocols.Comment: To be published in IEEE Communications Surveys and Tutorial

arXiv.org e-Print Archive

Crossref

Characterizing Deep-Learning I/O Workloads in TensorFlow

Author: Chien Steven W. D.
Herman Pawel
Laure Erwin
Markidis Stefano
Narasimhamurthy Sai
Santos Luis
Sishtla Chaitanya Prasad
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 06/10/2018
Field of study

The performance of Deep-Learning (DL) computing frameworks rely on the performance of data ingestion and checkpointing. In fact, during the training, a considerable high number of relatively small files are first loaded and pre-processed on CPUs and then moved to accelerator for computation. In addition, checkpointing and restart operations are carried out to allow DL computing frameworks to restart quickly from a checkpoint. Because of this, I/O affects the performance of DL applications. In this work, we characterize the I/O performance and scaling of TensorFlow, an open-source programming framework developed by Google and specifically designed for solving DL problems. To measure TensorFlow I/O performance, we first design a micro-benchmark to measure TensorFlow reads, and then use a TensorFlow mini-application based on AlexNet to measure the performance cost of I/O and checkpointing in TensorFlow. To improve the checkpointing performance, we design and implement a burst buffer. We find that increasing the number of threads increases TensorFlow bandwidth by a maximum of 2.3x and 7.8x on our benchmark environments. The use of the tensorFlow prefetcher results in a complete overlap of computation on accelerator and input pipeline on CPU eliminating the effective cost of I/O on the overall performance. The use of a burst buffer to checkpoint to a fast small capacity storage and copy asynchronously the checkpoints to a slower large capacity storage resulted in a performance improvement of 2.6x with respect to checkpointing directly to slower storage on our benchmark environment.Comment: Accepted for publication at pdsw-DISCS 201

arXiv.org e-Print Archive

Crossref

Eyeriss v2: A Flexible Accelerator for Emerging Deep Neural Networks on Mobile Devices

Author: Chen Yu-Hsin
Emer Joel
Sze Vivienne
Yang Tien-Ju
Publication venue
Publication date: 20/05/2019
Field of study

A recent trend in DNN development is to extend the reach of deep learning applications to platforms that are more resource and energy constrained, e.g., mobile devices. These endeavors aim to reduce the DNN model size and improve the hardware processing efficiency, and have resulted in DNNs that are much more compact in their structures and/or have high data sparsity. These compact or sparse models are different from the traditional large ones in that there is much more variation in their layer shapes and sizes, and often require specialized hardware to exploit sparsity for performance improvement. Thus, many DNN accelerators designed for large DNNs do not perform well on these models. In this work, we present Eyeriss v2, a DNN accelerator architecture designed for running compact and sparse DNNs. To deal with the widely varying layer shapes and sizes, it introduces a highly flexible on-chip network, called hierarchical mesh, that can adapt to the different amounts of data reuse and bandwidth requirements of different data types, which improves the utilization of the computation resources. Furthermore, Eyeriss v2 can process sparse data directly in the compressed domain for both weights and activations, and therefore is able to improve both processing speed and energy efficiency with sparse models. Overall, with sparse MobileNet, Eyeriss v2 in a 65nm CMOS process achieves a throughput of 1470.6 inferences/sec and 2560.3 inferences/J at a batch size of 1, which is 12.6x faster and 2.5x more energy efficient than the original Eyeriss running MobileNet. We also present an analysis methodology called Eyexam that provides a systematic way of understanding the performance limits for DNN processors as a function of specific characteristics of the DNN model and accelerator design; it applies these characteristics as sequential steps to increasingly tighten the bound on the performance limits.Comment: accepted for publication in IEEE Journal on Emerging and Selected Topics in Circuits and Systems. This extended version on arXiv also includes Eyexam in the appendi

arXiv.org e-Print Archive

DSpace@MIT

Proceedings of Abstracts Engineering and Computer Science Research Conference 2019

Author: Adams Roderick
Amafabia Daerefa-a
Barker Trevor
Beka Nathan
Bhavsar Ronakben
Bonivart Agnes
Canoville Paul
Cañamero Lola
CHEN Yong Kang
CHEN Yong Kang
Chrysanthou Andreas
Counsell Nathan
Crook Brian
Davey Neil
David-West Opukuro
Denai Mouloud
Dhakal Hom
Drix Damien
Goncharenko Julia
Grasso Marzio
Hafner Verena Vanessa
Hall Samantha
Haritos George
Hassan Eheda
Helian Na
Herfatmanesh Mohammad Reza
Ismail Sikiru O.
Johnston Ian
Johnston Ian
Kadir Shabnam
Kaye Richard
Khan Imran
Kirner Raimund
Kirner Raimund
Klaholz Ingo
Klusak Jan
Kourtessis Pandelis
Lane Peter
Lekkala Himayasri Rao
Lilley Mariana
Mayor David
McCluskey Daniel
McCluskey Daniel
Menon Catherine
Metzner Christoph
Miko Rebecca
Montalvão Diogo
Mporas Iosif
Munro Ian
Nehaniv Chrystopher
Newman James
Nwawe Richard
Panday Deepak
Partou Helen
Pissanidis Georgios
Polani Daniel
Ren Guogang
Robinson Matthew
Rosiello Vincenzo
Sayers Paul
Schilstra Maria
Schirmer Pascal
Schmuker Michael
Siadati Rana
Sinha Ankur
Skaltsas Grigorios
Steffert Tony
Steuber Volker
Steuber Volker
Suckow Bjorn
Sun Yi
Sun Yichuang
Sunmola Funlade
Sutton Samuel
te Boekhorst Rene
Toffe Gilles
Tracey Mark
Tveretina Olga
Veneziano Vito
Verma Alok
Wang Yuan
Wernick Paul
Publication venue: University of Hertfordshire
Publication date: 01/09/2019
Field of study

© 2019 The Author(s). This is an open-access work distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. For further details please see https://creativecommons.org/licenses/by/4.0/. Note: Keynote: Fluorescence visualisation to evaluate effectiveness of personal protective equipment for infection control is © 2019 Crown copyright and so is licensed under the Open Government Licence v3.0. Under this licence users are permitted to copy, publish, distribute and transmit the Information; adapt the Information; exploit the Information commercially and non-commercially for example, by combining it with other Information, or by including it in your own product or application. Where you do any of the above you must acknowledge the source of the Information in your product or application by including or linking to any attribution statement specified by the Information Provider(s) and, where possible, provide a link to this licence: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/This book is the record of abstracts submitted and accepted for presentation at the Inaugural Engineering and Computer Science Research Conference held 17th April 2019 at the University of Hertfordshire, Hatfield, UK. This conference is a local event aiming at bringing together the research students, staff and eminent external guests to celebrate Engineering and Computer Science Research at the University of Hertfordshire. The ECS Research Conference aims to showcase the broad landscape of research taking place in the School of Engineering and Computer Science. The 2019 conference was articulated around three topical cross-disciplinary themes: Make and Preserve the Future; Connect the People and Cities; and Protect and Care

University of Hertfordshire Research Archive

Shaping the learning landscape in neural networks around wide flat minima

Author: Baldassi Carlo
Pittorino Fabrizio
Zecchina Riccardo
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 01/01/2020
Field of study

Learning in Deep Neural Networks (DNN) takes place by minimizing a non-convex high-dimensional loss function, typically by a stochastic gradient descent (SGD) strategy. The learning process is observed to be able to find good minimizers without getting stuck in local critical points, and that such minimizers are often satisfactory at avoiding overfitting. How these two features can be kept under control in nonlinear devices composed of millions of tunable connections is a profound and far reaching open question. In this paper we study basic non-convex one- and two-layer neural network models which learn random patterns, and derive a number of basic geometrical and algorithmic features which suggest some answers. We first show that the error loss function presents few extremely wide flat minima (WFM) which coexist with narrower minima and critical points. We then show that the minimizers of the cross-entropy loss function overlap with the WFM of the error loss. We also show examples of learning devices for which WFM do not exist. From the algorithmic perspective we derive entropy driven greedy and message passing algorithms which focus their search on wide flat regions of minimizers. In the case of SGD and cross-entropy loss, we show that a slow reduction of the norm of the weights along the learning process also leads to WFM. We corroborate the results by a numerical study of the correlations between the volumes of the minimizers, their Hessian and their generalization performance on real data.Comment: 37 pages (16 main text), 10 figures (7 main text

arXiv.org e-Print Archive

Archivio istituzionale della Ricerca - Bocconi

Archivio istituzionale della ricerca - Politecnico di Milano

PESH/OSHA Standards: Information for Workers

Author: New York State Public Employees Federation AFL-CIO
Publication venue: DigitalCommons@ILR
Publication date: 01/09/2003
Field of study

[Excerpt] Health and safety for union members on the job is a top priority for the Public Employees Federation. Our members face the risk of on-the-job injuries every working day. It is a known fact that the injury and illness rates for public employees far exceed that of private sector employees. Our union’s Health and Safety Department has prepared this handbook to assist PEF members in recognizing the workplace hazards that are most frequently cited by PESH and OSHA. This handbook gives you an overview of the standards related to those hazards as well as a reference guide to do any further research

DigitalCommons@ILR

eCommons@Cornell