
200 research outputs found

Machine perception and computer vision

Author: Δήμας Γεώργιος Ι.
Publication date: 01/01/2022

Understanding from Machine Learning Models

Author: Sullivan Emily
Publication date: 01/01/2022

Simple idealized models seem to provide more understanding than opaque, complex, and hyper-realistic models. However, an increasing number of scientists are going in the opposite direction by utilizing opaque machine learning models to make predictions and draw inferences, suggesting that scientists are opting for models that have less potential for understanding. Are scientists trading understanding for some other epistemic or pragmatic good when they choose a machine learning model? Or are the assumptions behind why minimal models provide understanding misguided? In this paper, using the case of deep neural networks, I argue that it is not the complexity or black-box nature of a model that limits how much understanding the model provides. Instead, it is a lack of scientific and empirical evidence supporting the link that connects a model to the target phenomenon that primarily prohibits understanding.

PhilPapers

Efficient Deep Learning in Network Compression and Acceleration

Author: Ge Shiming
Publication venue: 'IntechOpen'
Publication date: 05/11/2018

While deep learning delivers state-of-the-art accuracy on many artificial intelligence tasks, it comes at the cost of high computational complexity due to the large number of parameters. It is important to design or develop efficient methods to support deep learning and enable its scalable deployment, particularly on embedded devices such as mobile phones, Internet of Things (IoT) devices, and drones. In this chapter, I will present a comprehensive survey of several advanced approaches for efficient deep learning in network compression and acceleration. I will describe the central ideas behind each approach and explore the similarities and differences between different methods. Finally, I will present some future directions in this field.