Dual Precision Deep Neural Network

Abstract

On-line Precision scalability of the deep neural networks(DNNs) is a critical feature to support accuracy and complexity trade-off during the DNN inference. In this paper, we propose dual-precision DNN that includes two different precision modes in a single model, thereby supporting an on-line precision switch without re-training. The proposed two-phase training process optimizes both low- and high-precision modes.Comment: 5 pages, 4 figures, 2 table

    Similar works

    Full text

    thumbnail-image

    Available Versions