Deep neural networks (DNNs) are reshaping the field of information
processing. With their exponential growth challenging existing electronic
hardware, optical neural networks (ONNs) are emerging to process DNN tasks in
the optical domain with high clock rates, parallelism and low-loss data
transmission. However, to explore the potential of ONNs, it is necessary to
investigate the full-system performance incorporating the major DNN elements,
including matrix algebra and nonlinear activation. Existing challenges to ONNs
are high energy consumption due to low electro-optic (EO) conversion
efficiency, low compute density due to large device footprint and channel
crosstalk, and long latency due to the lack of inline nonlinearity. Here we
experimentally demonstrate an ONN system that simultaneously overcomes all
these challenges. We exploit neuron encoding with volume-manufactured
micron-scale vertical-cavity surface-emitting laser (VCSEL) transmitter arrays
that exhibit high EO conversion (<5 attojoule/symbol with Vπ​=4 mV), high
operation bandwidth (up to 25 GS/s), and compact footprint (<0.01 mm2 per
device). Photoelectric multiplication allows low-energy matrix operations at
the shot-noise quantum limit. Homodyne detection-based nonlinearity enables
nonlinear activation with instantaneous response. The full-system energy
efficiency and compute density reach 7 femtojoules per operation (fJ/OP) and 25
TeraOP/(mm2â‹… s), both representing a >100-fold improvement over
state-of-the-art digital computers, with substantially several more orders of
magnitude for future improvement. Beyond neural network inference, its feature
of rapid weight updating is crucial for training deep learning models. Our
technique opens an avenue to large-scale optoelectronic processors to
accelerate machine learning tasks from data centers to decentralized edge
devices.Comment: 10 pages, 5 figure