59 research outputs found
Leveraging Continuously Differentiable Activation Functions for Learning in Quantized Noisy Environments
Real-world analog systems intrinsically suffer from noise that can impede
model convergence and accuracy on a variety of deep learning models. We
demonstrate that differentiable activations like GELU and SiLU enable robust
propagation of gradients which help to mitigate analog quantization error that
is ubiquitous to all analog systems. We perform analysis and training of
convolutional, linear, and transformer networks in the presence of quantized
noise. Here, we are able to demonstrate that continuously differentiable
activation functions are significantly more noise resilient over conventional
rectified activations. As in the case of ReLU, the error in gradients are 100x
higher than those in GELU near zero. Our findings provide guidance for
selecting appropriate activations to realize performant and reliable hardware
implementations across several machine learning domains such as computer
vision, signal processing, and beyond
Plasmonic nanogap enhanced phase change devices with dual electrical-optical functionality
Modern-day computers use electrical signaling for processing and storing data
which is bandwidth limited and power-hungry. These limitations are bypassed in
the field of communications, where optical signaling is the norm. To exploit
optical signaling in computing, however, new on-chip devices that work
seamlessly in both electrical and optical domains are needed. Phase change
devices can in principle provide such functionality, but doing so in a single
device has proved elusive due to conflicting requirements of size-limited
electrical switching and diffraction-limited photonic devices. Here, we combine
plasmonics, photonics and electronics to deliver a novel integrated
phase-change memory and computing cell that can be electrically or optically
switched between binary or multilevel states, and read-out in either mode, thus
merging computing and communications technologies
In-memory photonic dot-product engine with electrically programmable weight banks
Electronically reprogrammable photonic circuits based on phase-change chalcogenides present an avenue to resolve the von-Neumann bottleneck; however, implementation of such hybrid photonic–electronic processing has not achieved computational success. Here, we achieve this milestone by demonstrating an in-memory photonic–electronic dot-product engine, one that decouples electronic programming of phase-change materials (PCMs) and photonic computation. Specifically, we develop non-volatile electronically reprogrammable PCM memory cells with a record-high 4-bit weight encoding, the lowest energy consumption per unit modulation depth (1.7 nJ/dB) for Erase operation (crystallization), and a high switching contrast (158.5%) using non-resonant silicon-on-insulator waveguide microheater devices. This enables us to perform parallel multiplications for image processing with a superior contrast-to-noise ratio (≥87.36) that leads to an enhanced computing accuracy (standard deviation σ ≤ 0.007). An in-memory hybrid computing system is developed in hardware for convolutional processing for recognizing images from the MNIST database with inferencing accuracies of 86% and 87%
Parallel convolution processing using an integrated photonic tensor core
With the proliferation of ultra-high-speed mobile networks and
internet-connected devices, along with the rise of artificial intelligence, the
world is generating exponentially increasing amounts of data - data that needs
to be processed in a fast, efficient and smart way. These developments are
pushing the limits of existing computing paradigms, and highly parallelized,
fast and scalable hardware concepts are becoming progressively more important.
Here, we demonstrate a computational specific integrated photonic tensor core -
the optical analog of an ASIC-capable of operating at Tera-Multiply-Accumulate
per second (TMAC/s) speeds. The photonic core achieves parallelized photonic
in-memory computing using phase-change memory arrays and photonic chip-based
optical frequency combs (soliton microcombs). The computation is reduced to
measuring the optical transmission of reconfigurable and non-resonant passive
components and can operate at a bandwidth exceeding 14 GHz, limited only by the
speed of the modulators and photodetectors. Given recent advances in hybrid
integration of soliton microcombs at microwave line rates, ultra-low loss
silicon nitride waveguides, and high speed on-chip detectors and modulators,
our approach provides a path towards full CMOS wafer-scale integration of the
photonic tensor core. While we focus on convolution processing, more generally
our results indicate the major potential of integrated photonics for parallel,
fast, and efficient computational hardware in demanding AI applications such as
autonomous driving, live video processing, and next generation cloud computing
services
- …