
    Is Integer Arithmetic Enough for Deep Learning Training?

    The ever-increasing computational complexity of deep learning models makes their training and deployment difficult on various cloud and edge platforms. Replacing floating-point arithmetic with low-bit integer arithmetic is a promising approach to reduce the energy consumption, memory footprint, and latency of deep learning models. As such, quantization has attracted the attention of researchers in recent years. However, a fully functional integer training pipeline, covering the forward pass, back-propagation, and stochastic gradient descent, has not been studied in detail. Our empirical and mathematical results reveal that integer arithmetic is enough to train deep learning models. Unlike recent proposals, instead of quantization, we directly switch the number representation of the computations. Our novel training method forms a fully integer training pipeline that does not change the loss and accuracy trajectories compared to floating-point training, nor does it need any special hyper-parameter tuning, distribution adjustment, or gradient clipping. Our experimental results show that our proposed method is effective in a wide variety of tasks such as classification (including vision transformers), object detection, and semantic segmentation.
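    The abstract does not spell out the number format, so the following is only a rough sketch of what a fully integer training step could look like, assuming a fixed-point (scaled-integer) representation. FRAC_BITS, the layer sizes, and the learning-rate encoding are illustrative choices, not the authors' method.

```python
# Sketch: one linear layer trained with integer arithmetic only (fixed-point).
# The fraction length and rescaling scheme below are assumptions for illustration.
import numpy as np

FRAC_BITS = 8                 # assumed fixed-point fraction length
SCALE = 1 << FRAC_BITS

def to_fixed(x):
    """Encode a float array as scaled int32 (fixed-point)."""
    return np.round(np.asarray(x) * SCALE).astype(np.int32)

def fixed_matmul(a, b):
    """Integer matmul, rescaled back to FRAC_BITS fractional bits."""
    return (a.astype(np.int64) @ b.astype(np.int64) >> FRAC_BITS).astype(np.int32)

def fixed_mul(a, b):
    """Elementwise fixed-point multiply with the same rescaling."""
    return (a.astype(np.int64) * b.astype(np.int64) >> FRAC_BITS).astype(np.int32)

rng = np.random.default_rng(0)
W = to_fixed(rng.normal(0.0, 0.1, (4, 3)))    # integer weights
x = to_fixed(rng.normal(0.0, 1.0, (1, 3)))    # integer input
t = to_fixed(rng.normal(0.0, 1.0, (1, 4)))    # integer target
lr = to_fixed(0.05)                           # integer learning rate

for _ in range(10):
    y = fixed_matmul(x, W.T)                  # forward pass, integers only
    err = y - t                               # gradient of 0.5*||y - t||^2 w.r.t. y
    grad_W = fixed_matmul(err.T, x)           # back-propagation to the weights
    W -= fixed_mul(lr, grad_W)                # integer SGD update
```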

    Integer Fine-tuning of Transformer-based Models

    Transformer-based models achieve state-of-the-art performance on various deep learning tasks. Since transformer-based models have large numbers of parameters, fine-tuning them on downstream tasks is computationally intensive and energy-hungry. Automatic mixed-precision FP32/FP16 fine-tuning of such models has previously been used to lower the compute resource requirements. However, with recent advances in low-bit integer back-propagation, it is possible to further reduce the computation and memory footprint. In this work, we explore a novel integer training method that uses integer arithmetic for both forward propagation and gradient computation of the linear, convolutional, layer-norm, and embedding layers in transformer-based models. Furthermore, we study the effect of various integer bit-widths to find the minimum bit-width required for integer fine-tuning of transformer-based models. We fine-tune BERT and ViT models on popular downstream tasks using integer layers. We show that 16-bit integer models match the floating-point baseline performance. Reducing the bit-width to 10 bits, we observe an average score drop of 0.5 points. Finally, further reducing the bit-width to 8 bits yields an average score drop of 1.7 points.
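    As an illustration of the idea (not the paper's actual quantizer), the sketch below fine-tunes a single linear layer with b-bit integer matmuls for both the forward pass and the weight gradient. The symmetric per-tensor scaling, the bit-width of 10, and the float master weights are assumptions made for this example.

```python
# Sketch: b-bit integer matmuls for forward and weight-gradient computation of a
# linear layer; scaling scheme and master-weight handling are assumptions.
import numpy as np

def quantize(x, bits):
    """Map a float tensor to signed integers of the given bit-width plus a scale."""
    qmax = 2 ** (bits - 1) - 1
    scale = np.max(np.abs(x)) / qmax + 1e-12
    return np.round(x / scale).astype(np.int32), scale

def int_matmul(a_q, a_s, b_q, b_s):
    """Integer matmul; the float scales are applied once to the integer accumulator."""
    acc = a_q.astype(np.int64) @ b_q.astype(np.int64)
    return acc.astype(np.float64) * (a_s * b_s)

rng = np.random.default_rng(0)
W = rng.normal(0, 0.02, (8, 16))          # float master weights
x = rng.normal(0, 1.0, (4, 16))           # layer input
g_y = rng.normal(0, 1.0, (4, 8))          # incoming gradient from the layers above
bits = 10                                 # bit-width under study (e.g. 16, 10, 8)

x_q, x_s = quantize(x, bits)
W_q, W_s = quantize(W, bits)
g_q, g_s = quantize(g_y, bits)

y = int_matmul(x_q, x_s, W_q.T, W_s)      # forward: y = x @ W.T
grad_W = int_matmul(g_q.T, g_s, x_q, x_s) # backward: dL/dW = g_y.T @ x
W -= 1e-3 * grad_W                        # SGD update on the float master copy
```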

    Patch-based image reconstruction for PET using prior-image derived dictionaries

    This collection contains figures and reconstructed images in .mat format associated with the manuscript titled "Patch-based image reconstruction for PET using prior-image derived dictionaries". The file Data_Fig9-10.zip contains the reconstructed images associated with Figs. 9 and 10 as a function of iteration for different methods. Data_Fig10-12.zip contains reconstructed images of the real data for different methods.
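    Since the collection ships the reconstructed images as .mat files inside zip archives, a small inspection script such as the sketch below may help when exploring the data. The archive name is taken from the description; the variable names stored in each file are not listed there, so the script only prints them.

```python
# Sketch: list the variables stored in each .mat file of one of the shared archives.
import io
from zipfile import ZipFile
from scipy.io import loadmat

with ZipFile("Data_Fig9-10.zip") as zf:       # archive name from the description
    for name in zf.namelist():
        if name.endswith(".mat"):
            contents = loadmat(io.BytesIO(zf.read(name)))
            print(name, [k for k in contents if not k.startswith("__")])
```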

    MR-guided PET image denoising
