9 research outputs found
HW/SW Codesign for Robust and Efficient Binarized SNNs by Capacitor Minimization
Using accelerators based on analog computing is an efficient way to process
the immensely large workloads in Neural Networks (NNs). One example of an
analog computing scheme for NNs is Integrate-and-Fire (IF) Spiking Neural
Networks (SNNs). However, to achieve high inference accuracy in IF-SNNs, the
analog hardware needs to represent current-based multiply-accumulate (MAC)
levels as spike times, for which a large membrane capacitor needs to be charged
for a certain amount of time. A large capacitor results in high energy use,
considerable area cost, and long latency, constituting one of the major
bottlenecks in analog IF-SNN implementations. In this work, we propose a HW/SW
Codesign method, called CapMin, for capacitor size minimization in analog
computing IF-SNNs. CapMin minimizes the capacitor size by reducing the number
of spike times needed for accurate operation of the HW, based on the absolute
frequency of MAC level occurrences in the SW. To increase the operation of
IF-SNNs to current variation, we propose the method CapMin-V, which trades
capacitor size for protection based on the reduced capacitor size found in
CapMin. In our experiments, CapMin achieves more than a 14 reduction in
capacitor size over the state of the art, while CapMin-V achieves increased
variation tolerance in the IF-SNN operation, requiring only a small increase in
capacitor size.Comment: 9 pages, 9 figure
FeFET-based Binarized Neural Networks Under Temperature-dependent Bit Errors
Ferroelectric FET (FeFET) is a highly promising emerging non-volatile memory (NVM) technology, especially for binarized neural network (BNN) inference on the low-power edge. The reliability of such devices, however, inherently depends on temperature. Hence, changes in temperature during run time manifest themselves as changes in bit error rates. In this work, we reveal the temperature-dependent bit error model of FeFET memories, evaluate its effect on BNN accuracy, and propose countermeasures. We begin on the transistor level and accurately model the impact of temperature on bit error rates of FeFET. This analysis reveals temperature-dependent asymmetric bit error rates. Afterwards, on the application level, we evaluate the impact of the temperature-dependent bit errors on the accuracy of BNNs. Under such bit errors, the BNN accuracy drops to unacceptable levels when no countermeasures are employed. We propose two countermeasures: (1) Training BNNs for bit error tolerance by injecting bit flips into the BNN data, and (2) applying a bit error rate assignment algorithm (BERA) which operates in a layer-wise manner and does not inject bit flips during training. In experiments, the BNNs, to which the countermeasures are applied to, effectively tolerate temperature-dependent bit errors for the entire range of operating temperature
Nanoparticle Classification Using Frequency Domain Analysis on Resource-Limited Platforms
A mobile system that can detect viruses in real time is urgently needed, due to the combination of virus emergence and evolution with increasing global travel and transport. A biosensor called PAMONO (for Plasmon Assisted Microscopy of Nano-sized Objects) represents a viable technology for mobile real-time detection of viruses and virus-like particles. It could be used for fast and reliable diagnoses in hospitals, airports, the open air, or other settings. For analysis of the images provided by the sensor, state-of-the-art methods based on convolutional neural networks (CNNs) can achieve high accuracy. However, such computationally intensive methods may not be suitable on most mobile systems. In this work, we propose nanoparticle classification approaches based on frequency domain analysis, which are less resource-intensive. We observe that on average the classification takes 29 μ s per image for the Fourier features and 17 μ s for the Haar wavelet features. Although the CNN-based method scores 1–2.5 percentage points higher in classification accuracy, it takes 3370 μ s per image on the same platform. With these results, we identify and explore the trade-off between resource efficiency and classification performance for nanoparticle classification of images provided by the PAMONO sensor
Nanoparticle classification using frequency domain analysis on resource-limited platforms
A mobile system that can detect viruses in real time is urgently needed, due to the combination of virus emergence and evolution with increasing global travel and transport. A biosensor called PAMONO (for Plasmon Assisted Microscopy of Nano-sized Objects) represents a viable technology for mobile real-time detection of viruses and virus-like particles. It could be used for fast and reliable diagnoses in hospitals, airports, the open air, or other settings. For analysis of the images provided by the sensor, state-of-the-art methods based on convolutional neural networks (CNNs) can achieve high accuracy. However, such computationally intensive methods may not be suitable on most mobile systems. In this work, we propose nanoparticle classification approaches based on frequency domain analysis, which are less resource-intensive. We observe that on average the classification takes 29 μ s per image for the Fourier features and 17 μ s for the Haar wavelet features. Although the CNN-based method scores 1–2.5 percentage points higher in classification accuracy, it takes 3370 μ s per image on the same platform. With these results, we identify and explore the trade-off between resource efficiency and classification performance for nanoparticle classification of images provided by the PAMONO sensor