Tsinghua Science and Technology
Volume 27

Issue 3

Article 2

2022

Memristor-Based Signal Processing for Edge Computing
Han Zhao
School of Integrated Circuits, Beijing National Research Center for Information Science and Technology,
Tsinghua University, Beijing 100084, China;Department of Microelectronics Science and Technology,
Harbin Institute of Technology, Harbin 150001, China

Zhengwu Liu
School of Integrated Circuits, Beijing National Research Center for Information Science and Technology,
Tsinghua University, Beijing 100084, China

Jianshi Tang
School of Integrated Circuits, Beijing National Research Center for Information Science and Technology,
Tsinghua University, Beijing 100084, China;Beijing Innovation Center for Future Chips, Tsinghua University,
Beijing 100084, China

Bin Gao
School of Integrated Circuits, Beijing National Research Center for Information Science and Technology,
Tsinghua University, Beijing 100084, China;Beijing Innovation Center for Future Chips, Tsinghua University,
Beijing 100084, China

Yufeng Zhang
Department of Microelectronics Science and Technology, Harbin Institute of Technology, Harbin 150001,
China

Recommended Citation

Zhao, Han; Liu, Zhengwu; Tang, Jianshi; Gao, Bin; Zhang, Yufeng; Qian, He; and Wu, Huaqiang (2022)
"Memristor-Based Signal Processing for Edge Computing," Tsinghua Science and Technology: Vol. 27 :
Iss. 3 , Article 2.
DOI: 10.26599/TST.2021.9010043
Available at: https://dc.tsinghuajournals.com/tsinghua-science-and-technology/vol27/iss3/2

This Special Section on Integrated Circuit is brought to you for free and open access by Tsinghua University Press:
Journals Publishing. It has been accepted for inclusion in Tsinghua Science and Technology by an authorized editor
of Tsinghua University Press: Journals Publishing.


TSINGHUA SCIENCE AND TECHNOLOGY
ISSN 1007-0214 02/16 pp455–471
DOI: 10.26599/TST.2021.9010043
Volume 27, Number 3, June 2022

Memristor-Based Signal Processing for Edge Computing
Han Zhao, Zhengwu Liu, Jianshi Tang, Bin Gao, Yufeng Zhang, He Qian, and Huaqiang Wu
Abstract: The rapid growth of the Internet of Things (IoT) has resulted in an explosive increase in data, and thus has raised new challenges for data processing units. Edge computing, which performs signal processing and computing tasks at the edge of networks rather than uploading data to the cloud, can reduce the amount of data for transmission and is a promising solution to address these challenges. One of the potential candidates for edge computing is the memristor, an emerging nonvolatile memory device that is capable of in-memory computing. In this article, from the perspective of edge computing, we review recent progress on memristor-based signal processing methods, especially on the aspects of signal preprocessing and feature extraction. Then, we describe memristor-based signal classification and regression, and end-to-end signal processing. In all these applications, memristors serve as critical accelerators that greatly improve the overall system performance, such as power efficiency and processing speed. Finally, we discuss existing challenges and the future outlook for memristor-based signal processing systems.
Key words: memristor; signal processing; edge computing; Internet of Things (IoT); in-memory computing

• Han Zhao is with the School of Integrated Circuits, Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084, China, and also with the Department of Microelectronics Science and Technology, Harbin Institute of Technology, Harbin 150001, China. E-mail: 1172100403@stu.hit.edu.cn.
• Zhengwu Liu is with the School of Integrated Circuits, Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084, China. E-mail: liuzw18@mails.tsinghua.edu.cn.
• Jianshi Tang, Bin Gao, He Qian, and Huaqiang Wu are with the School of Integrated Circuits, Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084, China, and also with Beijing Innovation Center for Future Chips, Tsinghua University, Beijing 100084, China. E-mail: {jtang, gaob1, qianh, wuhq}@tsinghua.edu.cn.
• Yufeng Zhang is with the Department of Microelectronics Science and Technology, Harbin Institute of Technology, Harbin 150001, China. E-mail: yufeng zhang@hit.edu.cn.
• Han Zhao and Zhengwu Liu contributed equally to this paper.
• To whom correspondence should be addressed.
Manuscript received: 2021-02-27; revised: 2021-06-15; accepted: 2021-06-27.

1 Introduction

The explosive growth of Internet of Things (IoT) devices for massive data collection in emerging applications, such as autonomous driving and unmanned factories, poses huge challenges for data processing. In conventional computing architectures, the massive data collected from IoT sensors may be first converted to digital data and then transmitted to the cloud for processing, which can lead to bottlenecks in latency, energy consumption, and communication bandwidth, and also raises privacy and security issues[1]. In addition to the time- and energy-consuming data transfer process, the conventional data processing module is inefficient because it is implemented in the digital domain, whereas sensory data in edge devices are directly collected in analog form. The conversion process incurs large hardware and energy overheads and increases circuit complexity. All these issues hamper the deployment of sensor-rich IoT platforms, so new computing architectures for efficient signal processing are in demand.
One potential solution is edge computing, where signal processing tasks are performed at the edge of networks rather than in the cloud[2]. By processing data close to where they are produced, the amount of data to be transferred can be greatly reduced, saving considerable time and energy. More importantly, as the computing units are now placed in edge devices and close to sensors, analog computing can be directly employed with analog sensory data as the input. The better computing flexibility and lower circuit complexity of analog computing lead to much better computing performance[3–5]. As a result, edge computing is quite attractive, and novel computing paradigms and emerging devices are highly desired to process data with high power efficiency and fast processing speed.

© The author(s) 2022. The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).
Memristors, including Resistive switching Random-Access Memory (RRAM)[6], Phase-Change Memory (PCM)[7], magnetic random-access memory[8], and ferroelectric devices[9], have been considered highly suitable for analog edge signal processing. As illustrated in Fig. 1a, data generated from IoT devices are transferred to memristor-based edge computing units for preliminary processing, and then sent to the cloud. The advantages of memristor-based edge computing systems can be mainly attributed to several aspects. On the device level, the memristor offers a compact size down to a few nanometers[10–12], fast switching speed[13–15], and low energy consumption[16–20]. On the architecture level, memristor-based in-memory computing performs computational tasks where the data are stored, eliminating the time-consuming and energy-intensive data transfer between memory and processing units[21, 22]. In addition, the analog switching behavior, where the memristor conductance can be continuously modulated by external pulses, enables computations in the analog domain[5, 21] and can decrease the energy consumption of data processing. Based on the nonvolatile characteristics and analog computing ability, memristor arrays can be utilized to store data and perform the analog Matrix-Vector Multiplication (MVM) operation in a highly parallel and energy-efficient manner. Therefore, memristors have shown huge potential in applications such as signal encoding, transformation, and filtering (Figs. 1b–1g).
Fig. 1 Illustration of the typical applications of memristors for signal processing. (a) Illustration of the memristor array playing a key role in edge computing, and (b)–(g) several typical signal processing methods that can be implemented using memristor arrays.

In the literature, there have been many review articles related to memristor-based computing from various perspectives. For example, Ielmini and Wong[21] summarized in-memory computing schemes in both digital and analog domains, based on different kinds of memristors. Zidan et al.[23] reviewed memristor-based electronics for on-chip memory and computing, the latter of which is mainly focused on the progress and challenges of memristor-based bio-inspired computing. Xia and Yang[3] captured the research progress in this field and mainly focused on the efforts in building memristor arrays for bio-inspired computing. Zhu et al.[24] reviewed emerging neuromorphic devices in nine different categories and discussed their applications for brain-inspired computing. Sebastian et al.[22] investigated various memristor-based computational primitives and their applications. Also, memristor-based in-memory computing chips and systems were revisited, and key metrics for benchmarking different technologies from device to system levels were proposed[25, 26]. Different from all these prior works, in this article we review memristor-based signal processing from the perspective of edge computing and discuss the progress and challenges of memristor-based signal processing primitives, especially on the aspects of signal preprocessing and feature extraction.

The rest of this paper is organized as follows. Section 2 briefly introduces the intrinsic characteristics of memristors and the principle of MVM on a memristor array. Section 3 describes the specific architectures as well as algorithms used in several memristor-based signal processing systems. Section 4 summarizes the prior works and discusses the existing challenges with a future outlook for memristor-based signal processing.

2 Memristor Device and Array

A memristor is a two-terminal electronic device that typically consists of a metal-insulator-metal stack (Fig. 2a). When utilized in signal processing, a memristor, regardless of its type, can be simplified as a tunable resistor. Because the switching mechanisms of different types of memristors are not the focus of this review, we take the bipolar RRAM as a typical example to illustrate the conductance switching process of memristors. The conductance switching of RRAM is based on Conductive Filaments (CFs), typically made of oxygen vacancies or metal ions. Driven by the voltage applied on the electrodes, CFs inside the memristor are formed or ruptured depending on the polarity of the applied voltage[27, 28]. During the SET process, CFs are connected and the conductance of the memristor increases accordingly, bringing it to a high-conductance state. Conversely, in the RESET process, the rupture of CFs leads to a decrease in memristor conductance, bringing the memristor to a low-conductance state. The typical Current-Voltage (I-V) curve of a bipolar analog RRAM, which shows analog conductance switching behaviors in the SET and RESET processes, is shown in Fig. 2b. By applying a series of voltage pulses to the analog RRAM, the memristor conductance can be modulated continuously, i.e., in an analog way, as shown in Fig. 2c.

Fig. 2 Fundamentals of the memristor. (a) Illustration of the device structure of the memristor (top electrode, dielectric layer with conductive filament, and bottom electrode), taking RRAM as an example. (b) Typical I-V curve of a bipolar analog RRAM, with SET and RESET branches. (c) Analog conductance tuning behavior of the memristor (conductance versus pulse number for SET and RESET pulses). (d) Illustration of the MVM operation in a neural network. (e) Implementation of the MVM operation on the memristor array.

A memristor array, in a cross-point design with one memristor at each intersection[29], can be employed to greatly accelerate the MVM operation (Fig. 2d), which is the most frequent and yet most time- and energy-consuming computation in artificial neural networks and is also widely used in many signal processing algorithms for edge computing. As shown in Fig. 2e, the vector and matrix can be mapped to the input voltages and the conductances of the memristor array, respectively. By virtue of Ohm's law for multiplication and Kirchhoff's current law for accumulation, MVM results can be obtained in a single step[21, 30], leading to potentially dramatic improvements in computing speed and energy consumption[31]. Moreover, the Transpose MVM (TMVM) operation can also be performed using the same memristor array by exchanging the input and output sides. As a result, the memristor array shows promising prospects as a hardware accelerator for MVM-rich signal processing algorithms.
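The mapping described above can be illustrated numerically. The following is a minimal sketch of an ideal crossbar, assuming no wire resistance, sneak paths, or device noise; the function names and the conductance and voltage values are hypothetical and chosen only for illustration.

```python
import numpy as np

def crossbar_mvm(G, v_rows):
    """Column currents of an ideal crossbar: I_j = sum_i v_i * G[i, j]."""
    # Ohm's law gives each cell current v_i * G[i, j]; Kirchhoff's current
    # law sums the cell currents that share column j.
    return v_rows @ G

def crossbar_tmvm(G, v_cols):
    """Transpose MVM: drive the columns and read the row currents instead."""
    return G @ v_cols

# Hypothetical 3x2 conductance matrix (siemens) and input voltages (volts).
G = np.array([[1e-6, 2e-6],
              [3e-6, 4e-6],
              [5e-6, 6e-6]])

i_cols = crossbar_mvm(G, np.array([0.1, 0.2, 0.3]))   # one current per column
i_rows = crossbar_tmvm(G, np.array([0.1, 0.2]))       # one current per row
```

Both results come from the same stored matrix `G`, which is the property that lets a single array serve MVM and TMVM.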

3 Memristor-Based Signal Processing

Over the past few years, tremendous progress has been made in the fabrication of memristors and their integration with Complementary Metal-Oxide-Semiconductor (CMOS) circuits, so large memristor arrays with superior electrical performance have become available. As a result, various signal processing algorithms have been demonstrated on memristor arrays with encouraging performance advantages over conventional CMOS-based systems. In general, these algorithms can be divided into three main categories: signal preprocessing and feature extraction, signal classification and regression, and end-to-end signal processing. In the following parts, the principles and exemplary works on these algorithms are discussed in detail.
3.1 Signal preprocessing and feature extraction

The raw signals collected by IoT sensors usually contain noise and therefore need to be preprocessed. Through signal preprocessing and feature extraction, only the key information is preserved, while noise and other background information are filtered out. These steps have been proven essential for improving accuracy and reducing the hardware cost of the whole signal processing procedure.
3.1.1 Signal filtering
Signal filtering, an important signal preprocessing method, can be employed to filter out certain frequency components of the original signal. The Infinite Impulse Response (IIR) filter and the Finite Impulse Response (FIR) filter are two basic digital filters that are widely used in the processing of sound and image signals. The implementation of the filtering process is mainly based on the convolution operation, and hence can be accelerated with memristor-based computing systems. Maleki et al.[32] proposed an IIR filter design with memristor arrays, but this work remains at the simulation level and does not consider the impact of device nonidealities of actual memristors. Figure 3a illustrates the implementation of memristor-based direct-form second-order IIR filters[32]. Different from IIR filters, FIR filters have no feedback loops (Fig. 3b), and their response to an impulse input is of finite duration. FIR filters can provide linear phase characteristics, and hence can be utilized in much wider applications. The feasibility of memristor-based FIR filters was first proven through simulations[34, 35], and a six-tap FIR filter was then experimentally implemented with six memristors[36]. More recently, a long-tap FIR filter, which is more useful in practice, has been experimentally demonstrated by Liu et al.[33] Four 120-order FIR filters were implemented on a 1K-memristor array to filter neural signals into four specific frequency bands for the identification of epilepsy-related brain activities with a high accuracy of 93.46%, showing that the power efficiency can be improved by two orders of magnitude compared with state-of-the-art CMOS systems. The filtering results of memristor-based FIR filter banks are shown in Figs. 3c and 3d. The above demonstrations suggest that memristor-based signal filtering systems, such as FIR and IIR filters, have tremendous potential in biomedical applications[37–39]. Notably, memristor device metrics, such as the number of conductance levels and device noise, could largely affect the computing accuracy of signal processing algorithms. Future device optimization is expected to further enhance the filtering system performance.

Fig. 3 Illustration of memristor-based filters. (a) Memristor-based two-tap IIR filter. (b) Memristor-based six-tap FIR filter. (c) and (d) Typical implementation of a memristor-based FIR filter bank. The input neural signals (from left to right in (c): neural signals of normal, interictal, and ictal brain states) are sent to the FIR filter bank, and the corresponding filtered results (d) are then obtained. The experimental results (purple lines) and software-calculated results (blue lines) match well, demonstrating the excellent performance of the memristor-based FIR filter bank. Adapted with permission from Ref. [33].
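Because FIR filtering is a convolution, each output sample is a dot product between the filter taps and the most recent input samples, which is the form an MVM engine evaluates. The sketch below shows that equivalence in software. The function name `fir_as_mvm`, the moving-average taps, and the random input are illustrative assumptions and are not taken from the cited works.

```python
import numpy as np

def fir_as_mvm(x, taps):
    """Filter x with FIR coefficients `taps`, one dot product per output sample."""
    n_taps = len(taps)
    # Zero-pad the past so the first outputs see an empty delay line.
    padded = np.concatenate([np.zeros(n_taps - 1), x])
    # Row n holds the delay-line content [x[n], x[n-1], ..., x[n-n_taps+1]].
    windows = np.stack([padded[n:n + n_taps][::-1] for n in range(len(x))])
    # The taps play the role of one stored column of weights.
    return windows @ taps

rng = np.random.default_rng(0)
x = rng.standard_normal(256)          # hypothetical input signal
taps = np.ones(6) / 6.0               # hypothetical six-tap moving average
y = fir_as_mvm(x, taps)
```

Stacking several tap vectors as columns of a matrix would evaluate a filter bank with the same windowed inputs.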
3.1.2 Signal transformation

Signal transformation converts signals from one domain into another, enabling more effective signal processing. Based on memristor arrays, the Discrete Fourier Transform (DFT), which performs the critical time-frequency transformation, has been implemented in simulations[40, 41], and also experimentally employed for the feature extraction of voice signals for speech recognition[42]. The computing process of the memristor-based DFT is shown in Fig. 4a. In addition to the DFT, the feasibility of the memristor-based Discrete Cosine Transform (DCT), a Fourier-related transform with better energy compaction[43], has also been demonstrated[5]. The above designs all show significant improvements in speed and power efficiency over conventional CMOS-based systems.

Fig. 4 Illustration of the memristor-based signal transformation. (a) Computing process of the memristor-based DFT. Real and imaginary parts are calculated separately as $\mathrm{Re}(X) = \mathrm{Re}(x)\,\mathrm{Re}(W) - \mathrm{Im}(x)\,\mathrm{Im}(W)$ and $\mathrm{Im}(X) = \mathrm{Re}(x)\,\mathrm{Im}(W) + \mathrm{Im}(x)\,\mathrm{Re}(W)$, where $x$ and $X$ represent the signal before and after transformation, respectively, and $W$ represents the transform matrix. Reproduced with permission from Ref. [40]. (b) Two-step MVM operations for a 2D signal transformation, such as the widely used 2D DWT and 2D DCT. The computing process can be expressed as $C = W^{\mathrm{T}} I W$, where $C$ represents the processed image; $W$ and $W^{\mathrm{T}}$ represent the transform matrix and its transpose, respectively; and $I$ represents the input image. (c)–(e) Typical implementation of memristor-based 2D DWT. The original image (c) is first processed column by column (d) and then row by row (e). Adapted with permission from Ref. [44].
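The separation of the DFT into real-valued products can be checked with a short sketch. It assumes ideal arithmetic and ignores how signed matrix entries would be encoded in conductances; the function name `dft_with_real_mvms` is hypothetical.

```python
import numpy as np

def dft_with_real_mvms(x):
    """DFT computed from four real-valued matrix-vector products."""
    n = len(x)
    k = np.arange(n)
    W = np.exp(-2j * np.pi * np.outer(k, k) / n)   # DFT transform matrix
    W_re, W_im = W.real, W.imag                    # two real-valued matrices
    x_re, x_im = np.real(x), np.imag(x)
    X_re = x_re @ W_re - x_im @ W_im               # Re(X)
    X_im = x_re @ W_im + x_im @ W_re               # Im(X)
    return X_re + 1j * X_im

rng = np.random.default_rng(0)
x = rng.standard_normal(16) + 1j * rng.standard_normal(16)
X = dft_with_real_mvms(x)
```

For a real-valued input, the two products involving the imaginary part of `x` vanish, so only two MVMs remain.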
In addition, two-dimensional (2D) signal transformations, such as the 2D DCT and the 2D Discrete Wavelet Transform (DWT), are also under study for memristor-based implementations because they are widely used in image compression and processing. As shown in Fig. 4b, a 2D image transformation can be implemented with a two-step computational paradigm. First, a raw image matrix is divided into several vectors row by row or column by column, converted into voltage pulses, and then applied to the rows of the first memristor array. Second, the intermediate matrix formed by the output vectors from the first array is transposed, and then the same process as the first step is carried out on the second memristor array. Finally, the output matrix of the 2D signal transformation is obtained from the second array. In fact, the matrices mapped onto the two crossbar arrays are usually the same, so only one crossbar array is needed, saving considerable hardware costs. Moreover, an inverse 2D signal transformation can be performed with the same paradigm. Specifically, memristor-based 2D DWT has been demonstrated in simulations, and Figs. 4c–4e show a typical example based on that system[44]. The 2D DCT, with better energy compaction than the 2D DWT, was experimentally implemented by Li et al.[5], and several optimization approaches have been proposed since then to improve system robustness and reconstructed image quality[45]. After the image transformation, in practice, a small portion of the largest coefficients of the transformed image is usually retained, and the rest are discarded. In this way, image compression can be efficiently performed. However, because of the small array size, input images must first be divided into several patches and then processed patch by patch. Such an image division method degrades the quality of transformed images because the spatial information on patch edges is damaged. Hence, in the future, larger memristor arrays and new signal transformation algorithms with novel image division methods need to be further explored.
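The two-step paradigm and the coefficient-dropping step can be sketched as follows for one image patch. This is an idealized software illustration, assuming an orthonormal DCT-II matrix as the transform and a random 8 × 8 patch as the input; the helper names (`dct_matrix`, `transform_2d`, `keep_largest`) are hypothetical.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II basis arranged so that C = W.T @ I @ W."""
    k = np.arange(n).reshape(-1, 1)      # frequency index
    m = np.arange(n).reshape(1, -1)      # sample index
    D = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * m + 1) * k / (2 * n))
    D[0, :] /= np.sqrt(2.0)
    return D.T                           # columns of W are the basis vectors

def transform_2d(image, W):
    step1 = image @ W                    # first pass: image rows as input vectors
    step2 = step1.T @ W                  # transpose, then second pass with the same W
    return step2.T                       # equals W.T @ image @ W

def keep_largest(C, n_keep):
    """Zero all but the n_keep largest-magnitude coefficients."""
    thresh = np.sort(np.abs(C).ravel())[-n_keep]
    return np.where(np.abs(C) >= thresh, C, 0.0)

rng = np.random.default_rng(0)
patch = rng.random((8, 8))               # hypothetical image patch
W = dct_matrix(8)
C = transform_2d(patch, W)               # forward 2D transform
restored = W @ C @ W.T                   # inverse transform with the same matrix
compressed = keep_largest(C, 16)         # retain a quarter of the coefficients
```

Because `W` is orthogonal, the inverse transform reuses the same stored matrix, consistent with the single-array observation in the text.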
3.1.3 Signal encoding

Signal encoding, especially signal compression and sparse representation, can be effectively implemented with memristor arrays. Compressed Sensing (CS), a very popular signal compression method with a sub-Nyquist sampling rate, can be divided into two processes, i.e., compression and reconstruction or, in other words, encoding and decoding. As illustrated in Fig. 5a, the input vector $x$ can be decomposed into $\Psi S$, where $\Psi$ is a transform matrix with a size of $N \times N$ and $S$ is a sparse vector. During the signal compression process, the compressed signal $y$ is obtained by multiplying the input vector $x$ by the sensing matrix $\Phi$. In turn, the signal reconstruction process refers to the recovery of the original signal $x$ from the compressed signal $y$. Memristor-based implementations of CS have been intensively researched. For example, Qian et al.[48, 49] proposed that the intrinsic randomness of memristors can be used to generate the sensing matrix $\Phi$ and demonstrated the feasibility of CS compression using a single memristor-based MVM operation in simulations. Liu et al.[50] further suggested that the CS reconstruction process could also be realized on memristor arrays. Experimental demonstrations of CS compression and reconstruction were performed by Gallo et al.[46, 51] using a 256K-PCM array, where the Approximate Message Passing (AMP) algorithm was utilized for the CS reconstruction process. Figures 5c–5e show the typical compression and reconstruction processes. Because the two most computation-intensive operations associated with the AMP algorithm, namely, MVM and TMVM, can both be accelerated with memristor arrays, they achieved an O(N)-fold complexity reduction over a conventional software implementation. Considering that CS plays a critical role in IoT devices[52], memristor-based CS for edge computing appears to be quite attractive, and future works should focus on developing customized CS algorithms for memristor arrays for much better acceleration and optimization.
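The roles of MVM and TMVM in CS can be made concrete with a small software sketch. Note that it does not implement AMP: iterative soft thresholding (ISTA) is swapped in as a simpler reconstruction method that relies on the same two operations. The sketch also assumes that the signal is already sparse (the transform matrix is the identity) and that the sensing matrix is Gaussian; all sizes and parameters are hypothetical.

```python
import numpy as np

def soft_threshold(u, lam):
    return np.sign(u) * np.maximum(np.abs(u) - lam, 0.0)

def ista_reconstruct(y, Phi, lam=0.01, n_iter=1000):
    """Recover a sparse vector from y = Phi @ s by iterative soft thresholding."""
    step = 1.0 / np.linalg.norm(Phi, 2) ** 2        # 1 / (largest singular value)^2
    s = np.zeros(Phi.shape[1])
    for _ in range(n_iter):
        residual = y - Phi @ s                      # forward product (MVM)
        s = soft_threshold(s + step * (Phi.T @ residual), step * lam)  # TMVM
    return s

rng = np.random.default_rng(0)
N, M, K = 64, 32, 4                                 # signal length, measurements, sparsity
Phi = rng.standard_normal((M, N)) / np.sqrt(M)      # random sensing matrix
x = np.zeros(N)
x[rng.choice(N, size=K, replace=False)] = rng.choice([-1.0, 1.0], size=K)

y = Phi @ x                                         # compression: a single MVM
x_hat = ista_reconstruct(y, Phi)                    # reconstruction: repeated MVM and TMVM
rel_err = np.linalg.norm(x_hat - x) / np.linalg.norm(x)
```

Each iteration needs one product with `Phi` and one with its transpose, which is why storing the sensing matrix once in an array can serve both compression and reconstruction.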
Besides CS, Sparse Coding (SC) is another useful signal encoding algorithm because replacing the input signal with a sparse representation can help reduce the input complexity and make the signal processing much more efficient. As shown in Fig. 5b, in the SC algorithm, the input signal $x$ can be decomposed into $D a$, where $D$ is an over-complete dictionary and $a$ is a sparse vector. With a 32 × 32 memristor array, a locally competitive algorithm, an iterative approach mainly based on MVM and TMVM, was employed to implement SC[47]. The SC of simple inputs and natural images has been demonstrated, and a dictionary was trained online using a “winner-take-all” approach and Oja’s learning rule[53]. Figures 5f–5j show a typical example of the SC process of a natural image. The whole memristor-based system exhibited a 16× improvement in power efficiency compared to the state-of-the-art digital computing system. In addition, many new memristor-based algorithms[54–57] and spintronic devices[56] have been explored to implement the SC algorithms. Future exploration and optimization of devices, architectures, and algorithms are needed to realize a much more practical and widely used memristor-based signal encoding system.

Fig. 5 Illustrations of memristor-based signal encoding: (a) CS process (encoding and decoding). (b) SC process (encoding and decoding). (c)–(e) Typical implementation of CS. The original image (c) is first compressed (d) and then reconstructed (e). Adapted with permission from Ref. [46]. (f)–(j) Typical implementation of memristor-based SC. The original image is first divided into multiple non-overlapping 4 × 4 patches (f), and each patch (g) is processed with the SC algorithm (h). The network dynamics during the SC process, where the membrane potential changes as a function of the iteration number, are shown in (i). The reconstructed image (j) is based on the reconstructed patches. Adapted with permission from Ref. [47].
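A locally competitive algorithm can be sketched in a few lines to show where the forward and backward products enter. This is a generic textbook-style formulation with a soft-threshold activation, not the exact update scheme of Ref. [47]; the random dictionary, the threshold `lam`, the step size `eta`, and the function names are assumptions made for illustration.

```python
import numpy as np

def lca_sparse_code(x, D, lam=0.1, eta=0.1, n_iter=500):
    """Locally competitive algorithm with a soft-threshold activation."""
    u = np.zeros(D.shape[1])                       # membrane potentials
    a = np.zeros(D.shape[1])                       # sparse activations
    for _ in range(n_iter):
        residual = x - D @ a                       # forward pass (MVM)
        drive = D.T @ residual                     # backward pass (TMVM)
        u += eta * (-u + drive + a)                # leaky integration step
        a = np.sign(u) * np.maximum(np.abs(u) - lam, 0.0)
    return a

def sc_objective(x, D, a, lam):
    """Reconstruction error plus sparsity penalty."""
    return 0.5 * np.sum((x - D @ a) ** 2) + lam * np.sum(np.abs(a))

rng = np.random.default_rng(0)
D = rng.standard_normal((16, 32))                  # hypothetical over-complete dictionary
D /= np.linalg.norm(D, axis=0)                     # unit-norm atoms
a_true = np.zeros(32)
a_true[[3, 17]] = 1.0
x = D @ a_true                                     # input built from two atoms

a = lca_sparse_code(x, D)
x_rec = D @ a                                      # reconstruction from the sparse code
```

The membrane potentials `u` correspond to the quantity plotted against the iteration number in Fig. 5i.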
3.1.4 Component analysis

Component analysis is critical in image processing, dimensionality reduction, and other signal processing applications. One of the most widely used component analysis strategies, Principal Component Analysis (PCA), has been demonstrated with memristors[58, 59]. Unlike conventional methods that solve for the eigenvectors of the covariance matrix, unsupervised online learning was employed to obtain the Principal Components (PCs). Figure 6a shows the typical implementation of a memristor-based PCA primitive. The standard breast cancer screening database from the University of Wisconsin Hospital was used for demonstration. With the learned PCs, nine-dimensional (9D) raw data can be projected into a 2D space, as illustrated in Fig. 6c. Moreover, the output of PCA, that is, the 2D data (Fig. 6d), was then utilized for further signal classification. The high classification accuracy of 97.1% indicates the outstanding performance of such a memristor-based PCA. With ferroelectric material-based memristors, Raj et al.[60] further suggested in simulations that improved PCA performance can be achieved. Moreover, techniques such as online learning or hybrid training[61] could accommodate device nonidealities, and thus they can be developed and employed for memristor-based signal processing systems to further improve performance.

Fig. 6 Illustrations of the memristor-based component analysis. (a) Typical implementation of the memristor-based PCA. Reproduced with permission from Ref. [58]. (b) Typical ICA application for blind source separation (Sources I and II, Records I and II). The sound of a speaker and bell can be separated based on the signals simultaneously recorded by two microphones. (c) and (d) Typical implementation of a memristor-based PCA. The 9D raw breast cancer data projected in the 2D space (c) cannot be linearly separated, whereas the data after the PCA (d) can be linearly separated. Adapted with permission from Ref. [58].
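Online learning of principal components can be sketched with Sanger's rule (the generalized Hebbian algorithm). The text above only states that unsupervised online learning was used, so the choice of Sanger's rule here is an assumption, as are the synthetic 9D data, the learning rate, and the function name `sanger_pca`.

```python
import numpy as np

def sanger_pca(X, n_components=2, lr=0.001, n_epochs=20, seed=0):
    """Learn principal components sample by sample with Sanger's rule."""
    rng = np.random.default_rng(seed)
    W = rng.normal(scale=0.1, size=(n_components, X.shape[1]))
    for _ in range(n_epochs):
        for x in X[rng.permutation(len(X))]:
            y = W @ x                                   # projection (one MVM)
            # Hebbian term minus a lower-triangular decorrelation term.
            W += lr * (np.outer(y, x) - np.tril(np.outer(y, y)) @ W)
    return W

rng = np.random.default_rng(1)
stds = np.array([3.0, 1.5] + [0.3] * 7)                 # synthetic 9D data, two dominant axes
X = rng.standard_normal((2000, 9)) * stds
X -= X.mean(axis=0)

W = sanger_pca(X)                                       # rows approximate the top two PCs
projected = X @ W.T                                     # 9D -> 2D projection
```

No covariance matrix or eigendecomposition is formed during learning; each update uses only the current sample and the current weights.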
In practice, PCA usually serves as the preprocessing procedure for Independent Component Analysis (ICA), which is an unsupervised learning approach for blind source separation. Figure 6b illustrates a typical ICA application, where the sound of a speaker and a bell can be separated based on the signals recorded by two microphones. In fact, ICA is widely used in numerous applications, such as electrocardiogram signal processing and image denoising. On the simulation level, the feasibility of memristor-based ICA implementation was demonstrated[62, 63]. However, older versions of ICA, which are inefficient and require a large amount of memory, were utilized in the abovementioned works rather than the more advanced entropy-based ICA. Recently, an experimental setup of entropy-based ICA was attempted by Reddy et al.[64], paving the way for future memristor-based component analyses. Nevertheless, the experimental demonstration was limited to a very small memristor array (2 × 2), and a more complicated ICA with a larger array still needs to be demonstrated. In this case, the impact of sneak paths and parasitic voltage drop due to the interconnect resistance (i.e., IR drop) should be carefully evaluated.

3.2 Classification and regression

Signals after preprocessing are usually passed on to post-processing, such as classification or regression. The optimization of signal classification and regression processes is very important because these processes directly give the qualitative or quantitative results for a certain problem and determine the output accuracy.
A Support Vector Machine (SVM), as shown in
Fig. 7a, is a useful classifier that can serve as a wake-up system in edge computing
and IoT applications. SVM primitives have been implemented with memtransistors[66]
and RRAM[67] in simulations. Here, a memtransistor is typically a three-terminal device
that combines the concepts of a memristor and a transistor, with electrical
characteristics similar to those of typical memristors. Apart from SVMs, most works
focus on implementing two other signal classification and regression methods, the
Single-Layer Perceptron (SLP) and the Multiple-Layer Perceptron (MLP). A memristor-based
SLP was first experimentally employed to classify small-scale black/white patterns[68].
However, the memristor array in that work was limited to 12 × 12, and more complicated
tasks demand larger arrays. For example, a 1K-memristor array with excellent
bidirectional analog conductance modulation was employed for grayscale face
classification[69]. High classification accuracy was achieved both with and without
write-verify methods for online weight updates. While an SLP can only be used for
linearly separable datasets, an MLP has broader applications as a more flexible
classifier or regressor. As shown in Fig. 7b, a typical MLP includes at least one hidden
layer, which differentiates it from an SLP. Based on an 8K-memristor array, an MLP was
implemented using an in-situ learning algorithm that is self-adaptive to memristor
imperfections[65]. In Figs. 7c–7e, a representative example of recognizing handwritten
digits is illustrated. This work achieved over 90% accuracy in the classification of a
down-sampled MNIST dataset[70], with much-improved power efficiency, demonstrating
the feasibility and superior performance of memristor-based platforms for
classification. Moreover, an MLP implemented on a flash array was demonstrated, showing
100× and 10 000× improvements in computing speed and energy efficiency, respectively,
over digital circuits[71]. In addition, to achieve high accuracy, Xiang et al.[72]
proposed an optimized synaptic cell and a refresh operation scheme to mitigate device
nonidealities in deep neural networks. In the future, more effort should be devoted to
cross-layer co-design from devices to algorithms[26] to achieve classification accuracy
comparable to that of state-of-the-art CMOS systems.

Fig. 7 Illustrations of memristor-based classification and regression. (a) SVM for data classification. (b) Example of MLP
or Fully Connected Neural Network (FCNN). (c)–(e) Implementation of handwritten digit recognition based on a memristor-based
MLP. The image of the handwritten digit (c) is fed into the network, and the raw currents of each output neuron are obtained
(d). Then, the Bayesian probabilities of each digit are calculated (e), and the handwritten digit is recognized as number 9.
Adapted with permission from Ref. [65].
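
The SLP described above can be illustrated with a small sketch. It assumes, beyond what the text states, that each signed weight is stored as the difference of two conductances (a differential pair) and that inputs are applied as normalized read voltages. The conductance window `G_MIN`/`G_MAX`, the update step `STEP`, and the toy dataset are hypothetical values chosen for illustration only; conductances are kept as integers in microsiemens so the arithmetic is exact.

```python
# Sketch: single-layer perceptron with weights held as differential conductance pairs.
# All numeric values below are illustrative assumptions, not taken from the cited works.

G_MIN, G_MAX = 2, 100   # hypothetical conductance window, in microsiemens
STEP = 5                # hypothetical conductance change per update pulse


def clip(g):
    """Keep a conductance inside the programmable window."""
    return max(G_MIN, min(G_MAX, g))


def column_current(voltages, g_pos, g_neg):
    """Ohm's law per device, Kirchhoff's current law on the differential column pair."""
    return sum(v * (gp - gn) for v, gp, gn in zip(voltages, g_pos, g_neg))


def predict(x, g_pos, g_neg):
    """Apply the input plus a constant bias voltage and threshold the net current."""
    v = list(x) + [1]
    return 1 if column_current(v, g_pos, g_neg) > 0 else -1


def train_slp(samples, labels, epochs=20):
    """Perceptron rule: on each error, move G+ and G- by one step in opposite directions."""
    n = len(samples[0]) + 1                  # one extra row for the bias input
    mid = (G_MIN + G_MAX) // 2
    g_pos, g_neg = [mid] * n, [mid] * n
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            if predict(x, g_pos, g_neg) != y:
                for i, vi in enumerate(list(x) + [1]):
                    delta = STEP * y * vi
                    g_pos[i] = clip(g_pos[i] + delta)
                    g_neg[i] = clip(g_neg[i] - delta)
    return g_pos, g_neg


# Linearly separable toy problem: logical AND with labels in {-1, +1}.
X = [(0, 0), (0, 1), (1, 0), (1, 1)]
Y = [-1, -1, -1, 1]
g_pos, g_neg = train_slp(X, Y)
predictions = [predict(x, g_pos, g_neg) for x in X]
```

Because the AND problem is linearly separable, a single layer suffices; a problem such as XOR would not converge under this rule, which is the limitation that motivates the hidden layer of an MLP.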

3.3 End-to-end signal processing

Tasks that would otherwise be handled step by step can be performed in a single step
with end-to-end signal processing algorithms, which accept raw data as the input and
directly output processed results, covering both signal preprocessing and
post-processing procedures. This end-to-end approach allows more flexibility for signal
processing models; hence, it can simplify the manual feature selection process and help
optimize the overall system performance.
The Convolutional Neural Network (CNN, see Fig. 8a), a representative end-to-end signal
processing algorithm, is widely used for voice and image recognition. The
memristor-based implementation of the convolution operation of CNNs has been
demonstrated[73, 74]. More recently, based on eight 2K-cell memristor arrays, a fully
hardware-implemented CNN was demonstrated[61]. In this work, the high-yield fabrication
of high-performance memristors and the hybrid training method contributed to the high
classification accuracy of MNIST image recognition. Power efficiency was improved by
more than two orders of magnitude compared to typical graphics processing units, with a
parallel convolution technique and replication of identical kernels. Moreover, the key
convolution operations have also been demonstrated on a NOR flash array, showing high
computing speed and energy efficiency[75]. In addition to CNNs, Recurrent Neural
Networks (RNNs, see Fig. 8b) are another kind of end-to-end signal processing algorithm
that can be employed for sequence data processing. However, classical RNNs have
difficulty processing very long sequences. To address this drawback, Long Short-Term
Memory (LSTM) has been proposed, which enables the remembering and forgetting of the
sequence history and is widely applied in machine translation[76] and natural language
understanding[77]. Experimentally, a memristor-based LSTM layer was implemented and
connected with a fully connected neural network for regression and classification
tasks[78]. Figures 8c–8e illustrate a typical example of LSTM-based recognition of
handwritten digit sequences. Memristor-based multilayer RNNs with an LSTM layer and a
fully connected layer can serve as a promising hardware platform for end-to-end edge
computing, with fast processing speed and high power efficiency. Compared to standard
RNNs, Reservoir Computing (RC) is a promising alternative for temporal signal processing
because it can significantly reduce the training cost. The rich dynamics and nonlinear
behavior of memristors can be utilized to generate rich reservoir states[79, 80]. Moon
et al.[81] demonstrated that memristors could be employed for RC with a 50× improvement
in energy consumption compared with an FPGA implementation. Zhong et al.[82] suggested
that even a single dynamic memristor can be utilized as a reservoir for RC using the
developed mask process. Such a system was utilized for spoken-digit recognition and
time-series prediction with a low error rate of 0.4% and a low normalized root mean
square error of 0.046. In the future, multiple memristors with inner connections can
be used to implement complicated RC systems.

Fig. 8 Illustrations of memristor-based end-to-end signal processing. (a) Example of CNN. (b) Example of RNN. (c)–(e) Typical
implementation of LSTM-based handwritten digit sequence recognition. The handwritten digit sequence (c) is first fed into the
network, and the output currents from the output neurons are obtained (d). Then, the associated Bayesian probabilities are
calculated (e) to recognize the sequence as 1-2-3. Adapted with permission from Ref. [73].
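
The mapping of a CNN convolution onto a memristor array, including the replication of identical kernels mentioned above, can be sketched as follows. This is an illustrative model under stated assumptions rather than the scheme of any cited work: each kernel is flattened into one array column, each image patch is applied as one input vector, and the hypothetical parameter `copies` stands for the number of replicated kernel arrays that process patches in parallel. Signed weights are used directly; in hardware they would need, for example, differential conductance pairs.

```python
# Sketch: convolution (cross-correlation, as used in CNNs) expressed as repeated MVM.
# Function names and the `copies` parameter are hypothetical, for illustration only.


def im2col(image, k):
    """Unroll every k-by-k patch of a 2-D list into one flat input vector."""
    rows, cols = len(image), len(image[0])
    patches = []
    for r in range(rows - k + 1):
        for c in range(cols - k + 1):
            patches.append([image[r + i][c + j] for i in range(k) for j in range(k)])
    return patches


def crossbar_mvm(vector, columns):
    """One analog read: each column output is the dot product of input and column weights."""
    return [sum(v * w for v, w in zip(vector, col)) for col in columns]


def conv2d_on_crossbar(image, kernels, copies=1):
    """Valid convolution with stride 1; returns outputs and the number of read steps."""
    k = len(kernels[0])
    columns = [[w for row in kern for w in row] for kern in kernels]
    patches = im2col(image, k)
    outputs, steps = [], 0
    for start in range(0, len(patches), copies):
        batch = patches[start:start + copies]     # patches handled by the replicas at once
        outputs.extend(crossbar_mvm(p, columns) for p in batch)
        steps += 1
    return outputs, steps


image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
kernels = [[[1, 0], [0, -1]], [[1, 1], [1, 1]]]
serial_out, serial_steps = conv2d_on_crossbar(image, kernels, copies=1)
parallel_out, parallel_steps = conv2d_on_crossbar(image, kernels, copies=2)
```

Replicating the kernels leaves the result unchanged and only reduces the number of sequential read steps, at the cost of additional array area.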

4 Discussion and Outlook

To date, a variety of memristor-based signal processing algorithms have been
demonstrated either in experiments or in simulations, showing at least one
order of magnitude improvement in energy efficiency or computing speed over
conventional CMOS-based computing systems, as summarized in Table 1. Several
signal processing algorithms, including the IIR filter, DWT, and SVM, have yet
to be experimentally demonstrated on memristors.

Table 1 Summary of memristor-based signal processing implementations.

| Category | Subcategory | Algorithm | S/E | Application | Performance | Improvements vs. CMOS systems |
|---|---|---|---|---|---|---|
| Signal preprocessing and feature extraction | Signal filtering | FIR filter[33, 34] | E | Identification of epilepsy-related brain states[33] | Power efficiency: 1.4 W/class[33] | 400× in power efficiency[33] |
| Signal preprocessing and feature extraction | Signal filtering | IIR filter[32] | S | Filtering of mixed signals of two frequencies | N/A | N/A |
| Signal preprocessing and feature extraction | Signal transform | DFT[40–42] | E | Time-frequency transformation[40] and speech recognition[42] | N/A | 10× in speed, 109.8× in power efficiency[40] |
| Signal preprocessing and feature extraction | Signal transform | DCT[5, 45] | E | Image compression and processing[5] | Energy efficiency: 119.7 TOP s⁻¹ W⁻¹[5] | N/A |
| Signal preprocessing and feature extraction | Signal transform | DWT[44] | S | Image compression[44] | Energy: 6.4 nJ/image; Time: 15 s/image[44] | 11× in energy efficiency, 1.28× in speed[44] |
| Signal preprocessing and feature extraction | Signal encoding | CS[46, 48, 49, 51] | E | Image compression and reconstruction[51] | Power dissipation: 16.2 mW/read[51] | 50× in power consumption[51] |
| Signal preprocessing and feature extraction | Signal encoding | SC[47, 54–56] | E | Sparse representation of natural images[47] | Energy: 719 J/image; Time: 0.036 s/image[47] | 16× in energy consumption[47] |
| Signal preprocessing and feature extraction | Component analysis | PCA[58–60] | E | Classification of breast cancer[60] | Power dissipation: 0.27 W/feature[60] | N/A |
| Signal preprocessing and feature extraction | Component analysis | ICA[62–64] | E | Blind image source separation[64] | N/A | N/A |
| Classification and regression | N/A | SVM[66, 67] | S | Wake-up system[66] | Energy: 0.7 nJ for potentiation, 0.5 pJ for depression[66] | N/A |
| Classification and regression | N/A | SLP[68, 69] | E | Face classification[69] | Energy: 30 nJ/epoch[69] | 1000× in energy consumption[69] |
| Classification and regression | N/A | MLP[65] | E | MNIST handwritten digit recognition[65] | N/A | N/A |
| End-to-end signal processing | N/A | CNN[61] | E | MNIST image recognition[61] | Energy efficiency: 11.0 TOP s⁻¹ W⁻¹[61] | >100× in energy efficiency[61] |
| End-to-end signal processing | N/A | RNN[77] | E | Gait identification[77] | N/A | N/A |
| End-to-end signal processing | N/A | RC[81, 82] | E | Spoken-digit recognition and time-series prediction[81, 82] | Energy: 3.0 nJ/input[81] | 50× in energy consumption[81] |

Notes: (1) S/E, simulation only (S) or experimental demonstration (E). N/A, not applicable or not available.
(2) Algorithm name abbreviation: FIR, finite impulse response. IIR, infinite impulse response. DFT, discrete Fourier transform. DCT,
discrete cosine transform. DWT, discrete wavelet transform. CS, compressed sensing. SC, sparse coding. PCA, principal component
analysis. ICA, independent component analysis. SVM, support vector machine. SLP, single-layer perceptron. MLP, multiple-layer
perceptron. CNN, convolutional neural network. RNN, recurrent neural network. RC, reservoir computing.

The appealing advantages shown by memristor-based implementations can be mainly
attributed to two key factors. First, the MVM operation can be efficiently
implemented on memristor arrays in the analog domain with low power and fast speed.
Second, as a nonvolatile memory, memristor arrays can achieve in-memory
computing, reducing the data transfer overhead and computing latency. As a result,
memristor-based novel computing paradigms have been demonstrated to be
quite attractive in edge computing systems.

In particular, the memristor device performance
plays an important role in all memristor-based signal
processing implementations. For example, the memristor
array size limits the complexity and capability of
algorithms, and the number of analog conductance
states, intrinsic conductance fluctuations, on/off
ratio, and device yield all influence the processing
accuracy. Because the performance of actual hardware
implementations is largely affected by the device
nonidealities of current-stage memristors, the potential
of the above algorithms needs to be carefully verified by
experiments and tested in practical applications. From
this perspective, signal filtering algorithms, especially
the FIR filter that has been experimentally demonstrated
to identify seizure-related brain states for future brain-machine interfaces, could be considered one of the most
promising algorithms.
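
How the number of analog conductance states and intrinsic conductance fluctuations affect processing accuracy can be illustrated with a small numerical experiment. The sketch below is a simplified model under assumptions of our own: weights are normalized to [-1, 1], programming snaps each weight to the nearest of `levels` evenly spaced states, and fluctuations are modeled as additive Gaussian noise with standard deviation `sigma`. None of these parameters come from the cited works.

```python
# Sketch: effect of limited conductance states and read noise on MVM accuracy.
# The noise model and all parameter values are illustrative assumptions.
import random


def quantize(weight, levels, w_max=1.0):
    """Snap a weight in [-w_max, w_max] to the nearest of `levels` evenly spaced states."""
    step = 2 * w_max / (levels - 1)
    idx = round((weight + w_max) / step)
    return -w_max + idx * step


def noisy_mvm(weights, x, levels, sigma, rng):
    """MVM in which every stored weight is quantized and perturbed by Gaussian noise."""
    out = []
    for row in weights:
        acc = 0.0
        for w, xi in zip(row, x):
            acc += (quantize(w, levels) + rng.gauss(0.0, sigma)) * xi
        out.append(acc)
    return out


def rms_error(levels, sigma, n_in=8, n_out=4, trials=200, seed=0):
    """Root-mean-square deviation of the non-ideal MVM from the ideal result."""
    rng = random.Random(seed)
    sq_sum, count = 0.0, 0
    for _ in range(trials):
        weights = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)]
        x = [rng.uniform(-1, 1) for _ in range(n_in)]
        ideal = [sum(w * xi for w, xi in zip(row, x)) for row in weights]
        actual = noisy_mvm(weights, x, levels, sigma, rng)
        for a, b in zip(ideal, actual):
            sq_sum += (a - b) ** 2
            count += 1
    return (sq_sum / count) ** 0.5


err_coarse = rms_error(levels=4, sigma=0.0)      # few conductance states, no noise
err_fine = rms_error(levels=256, sigma=0.0)      # many conductance states, no noise
err_noisy = rms_error(levels=256, sigma=0.1)     # many states, fluctuating conductance
```

In this model, increasing the number of states reduces the error only until the fluctuation term dominates, which is one way to see why several device metrics have to improve together.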
In addition to the MVM-based algorithms mentioned
above, memristors can also be applied to signal
processing in other ways. For example, nonvolatile
logic, using resistance states instead of electron charge
as the information carrier[83] , can realize Boolean
logic operations[84] , and has been proven to be
cascadable[85, 86] , showing great potential for low-power
computing with memristors[87–91] .
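
As a concrete illustration of stateful logic and its cascadability, the sketch below builds a NAND gate from material implication (the operation named in the title of Ref. [83]) together with a reset-to-zero step. Device states are abstracted to the bits 0 and 1, with the convention, assumed here, that the result of each step is stored in the second operand device. The function names are hypothetical.

```python
# Sketch: NAND from material implication (IMPLY) and FALSE, with bits standing in
# for resistance states. Names and conventions are illustrative assumptions.


def imply(p, q):
    """Material implication: the new state of device q becomes (NOT p) OR q."""
    return int((not p) or q)


def nand_via_imply(p, q):
    """Two cascaded IMPLY steps on a work device that starts at logic 0."""
    s = 0                # FALSE operation: reset the work device
    s = imply(p, s)      # s now holds NOT p
    s = imply(q, s)      # s now holds (NOT q) OR (NOT p), i.e. NAND(p, q)
    return s


truth_table = {(p, q): nand_via_imply(p, q) for p in (0, 1) for q in (0, 1)}
```

Since NAND is functionally complete, any Boolean function can in principle be composed from such steps, with the stored output of one step serving directly as the input of the next.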
The rich dynamic characteristics of memristors can
also be utilized for temporal signal processing, such as
temporal correlation detection[92] . Liu et al.[93] proposed
that the key information of neural signals can be
extracted and encoded into memristor conductance
changes, based on which multi-channel neural
signal processing was realized for the experimental
demonstration of seizure prediction. Lastly, memristors
can be exploited as an associative memory, which
compares input data with stored data and then
returns the closest match result[94] . Memristor-based
associative memory has been demonstrated to implement
hyperdimensional computing[95] . All these algorithms
deserve further studies for efficient signal processing.
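
The associative-memory operation described above, comparing an input with stored data and returning the closest match, can be written as a short sketch. It assumes binary patterns and Hamming distance as the similarity measure; both are choices made here for illustration, and the stored patterns and labels are hypothetical.

```python
# Sketch: associative recall by nearest Hamming distance over stored binary patterns.
# Patterns, labels, and the distance metric are illustrative assumptions.


def hamming(a, b):
    """Number of positions at which two equal-length bit patterns differ."""
    return sum(x != y for x, y in zip(a, b))


def recall(query, stored):
    """Return the label of the stored pattern closest to the query."""
    return min(stored, key=lambda label: hamming(query, stored[label]))


stored = {
    "A": [1, 0, 1, 1, 0, 0, 1, 0],
    "B": [0, 1, 0, 0, 1, 1, 0, 1],
}
noisy_a = [1, 0, 1, 1, 0, 1, 1, 0]    # pattern "A" with one bit flipped
match = recall(noisy_a, stored)
```

In a memristor array, all stored patterns could be compared with the query in parallel, whereas this software version scans them one by one.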
In the future, many challenges still need to be
solved to take full advantage of memristor-based
arrays for signal processing. First, in terms of
memristor devices, further improvements could be
realized by increasing the number of conductance states
and switching speed, reducing the operating voltage
and power consumption, improving the switching
linearity/symmetry and device yield, and suppressing
the device noise and variability. More importantly,
atomic-level characterization techniques, aided by theoretical modeling[96–98], are
urgently needed for a better understanding of the physical mechanisms of
memristor switching processes. How the CF evolves and how to improve the
reliability are also two key issues to be resolved[99]. On the circuit level, the
demand for power-hungry analog-to-digital and digital-to-analog converters limits
the overall performance of memristor-based systems. The conversion process
consumes a lot of energy, and the conversion precision
also has a large impact on the accuracy of final results.
Fully analog computing is expected to overcome these
challenges for further performance enhancement[100] . In
addition, the IR drop and sneak path greatly hinder
the employment of large memristor arrays[22] . As
for architecture and algorithms, future works are
needed to explore new operators and signal processing
algorithms beyond MVM for memristors, such as matrix
inverse[31] , nonvolatile logic[83] , and hyperdimensional
computing[101] algorithms. Considering the non-ideal
characteristics of memristor devices, especially large
variability and limited precision, compensations from
circuit designs and error-tolerant algorithms are
equally essential. On the system level, most of the
above-mentioned algorithms still require CMOS-based
peripheral circuits or digital computing units to complete
the matrix addition and nonlinear operations, such as
the activation functions in neural networks. Future
memristor-based systems demand a full hardware
implementation to fully realize their potential. To do
so, more memristor arrays would be employed, and
thus, efficient scheduling strategies between various
memristor arrays are also important. In addition, by
employing more computing operators besides MVM,
complex algorithms and complicated problems can be
tackled using memristor-implemented hardware, with
which promising prospects can be expected for future
signal processing systems.
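
The influence of conversion precision on the final result, noted above for the circuit level, can be made concrete with an idealized converter model. The sketch assumes a uniform analog-to-digital converter with `bits` of resolution over a range from 0 to `full_scale`; the function name and the example values are hypothetical.

```python
# Sketch: idealized uniform ADC applied to an analog column output.
# The converter model and the numbers used are illustrative assumptions.


def adc(value, bits, full_scale):
    """Clip to [0, full_scale], round to the nearest code, return (code, reconstructed value)."""
    levels = 2 ** bits - 1
    clipped = max(0.0, min(full_scale, value))
    code = round(clipped / full_scale * levels)
    return code, code * full_scale / levels


analog = 0.30                          # normalized column current (hypothetical)
code_2b, value_2b = adc(analog, bits=2, full_scale=1.0)
code_8b, value_8b = adc(analog, bits=8, full_scale=1.0)
error_2b = abs(value_2b - analog)
error_8b = abs(value_8b - analog)
```

Each additional bit halves the worst-case rounding error in this model, while in practice it also raises converter energy and area, which is the trade-off behind the interest in fully analog computing.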
Acknowledgment
This work was supported in part by the National
Science and Technology Major Project of China (No.
2017ZX02315001-005) and the National Natural Science
Foundation of China (Nos. 91964104 and 61974081).

References
[1] D. Wei, H. Ning, F. Shi, Y. Wan, J. Xu, S. Yang, and L. Zhu,
Dataflow management in the internet of things: Sensing,
control, and security, Tsinghua Science and Technology, vol.
26, no. 6, pp. 918–930, 2021.
[2] W. S. Shi, J. Cao, Q. Zhang, Y. Li, and L. Y. Xu, Edge
computing: Vision and challenges, IEEE Int. Things J.,
vol. 3, no. 5, pp. 637–646, 2016.
[3] Q. F. Xia and J. J. Yang, Memristive crossbar arrays for
brain-inspired computing, Nat. Mater., vol. 18, no. 4, pp.
309–323, 2019.
[4] S. Ambrogio, P. Narayanan, H. Tsai, R. M. Shelby, I.
Boybat, C. Di Nolfo, S. Sidler, M. Giordano, M. Bodini,
N. C. P. Farinha, et al., Equivalent-accuracy accelerated
neural-network training using analogue memory, Nature,
vol. 558, no. 7708, pp. 60–67, 2018.
[5] C. Li, M. Hu, Y. N. Li, H. Jiang, N. Ge, E. Montgomery,
J. M. Zhang, W. H. Song, N. Dávila, C. E. Graves, et al.,
Analogue signal and image processing with large memristor
crossbars, Nat. Electr., vol. 1, no. 1, pp. 52–59, 2018.
[6] R. Waser and M. Aono, Nanoionics-based resistive
switching memories, Nat. Mater., vol. 6, no. 11, pp. 833–840, 2007.
[7] S. Raoux, W. Welnic, and D. Ielmini, Phase change
materials and their application to nonvolatile memories,
Chem. Rev., vol. 110, no. 1, pp. 240–267, 2010.
[8] A. D. Kent and D. C. Worledge, A new spin on magnetic
memories, Nat. Nanotechnol., vol. 10, no. 3, pp. 187–191,
2015.
[9] A. Chanthbouala, V. Garcia, R. O. Cherifi, K. Bouzehouane,
S. Fusil, X. Moya, S. Xavier, H. Yamada, C. Deranlot, N. D.
Mathur, et al., A ferroelectric memristor, Nat. Mater., vol.
11, no. 10, pp. 860–864, 2012.
[10] C. H. Ho, C. L. Hsu, C. C. Chen, J. T. Liu, C. S. Wu, C. C.
Huang, C. M. Hu, and F. L. Yang, 9 nm half-pitch functional
resistive memory cell with <1 μA programming current
using thermally oxidized sub-stoichiometric WOx film, in
Proc. 2010 Int. Electron Devices Meeting, San Francisco,
CA, USA, 2010, pp. 19.1.1–19.1.4.
[11] B. Govoreanu, G. S. Kar, Y. Chen, V. Paraschiv, S. Kubicek,
A. Fantini, I. P. Radu, L. Goux, S. Clima, R. Degraeve, et al.,
10×10 nm² Hf/HfOx crossbar resistive ram with excellent
performance, reliability and low-energy operation, in Proc.
2011 Int. Electron Devices Meeting, Washington, DC, USA,
2011, pp. 31.6.1–31.6.4.
[12] A. Redolfi, L. Goux, N. Jossart, F. Yamashita, E. Nishimura,
D. Urayama, K. Fujimoto, T. Witters, F. Lazzarino, and M.
Jurczak, A novel cbram integration using subtractive
dry-etching process of cu enabling high-performance memory
scaling down to 10nm node, in Proc. 2015 Sympo. VLSI
Technology (VLSI Technology), Kyoto, Japan, 2015, pp.
T134&T135.
[13] N. Banno, M. Tada, T. Sakamoto, M. Miyamura, K.
Okamoto, N. Iguchi, T. Nohisa, and H. Hada, A fast
and low-voltage cu complementary-atom-switch 1mb array
with high-temperature retention, in Proc. 2014 Symp. VLSI
Technology (VLSI-Technology): Digest of Technical Papers,
Honolulu, HI, USA, 2014, pp. 1&2.
[14] B. Govoreanu, L. Di Piazza, J. Ma, T. Conard, A.
Vanleenhove, A. Belmonte, D. Radisic, M. Popovici, A.
Velea, A. Redolfi, et al., Advanced a-VMCO resistive
switching memory through inner interface engineering with
wide (>10²) on/off window, tunable μA-range switching
current and excellent variability, in Proc. 2016 IEEE Symp.
VLSI Technology, Honolulu, HI, USA, 2016, pp. 1–2.
[15] K. Tsunoda, K. Kinoshita, H. Noshiro, Y. Yamazaki, T.
Iizuka, Y. Ito, A. Takahashi, A. Okano, Y. Sato, T. Fukano,
et al., Low power and high speed switching of Ti-doped
NiO ReRAM under the unipolar voltage source of less than
3 V, in Proc. 2007 IEEE Int. Electron Devices Meeting,
Washington, DC, USA, 2007, pp. 767–770.
[16] C. H. Cheng, C. Y. Tsai, A. Chin, and F. S. Yeh, High
performance ultra-low energy rram with good retention and
endurance, in Proc. 2010 Int. Electron Devices Meeting,
San Francisco, CA, USA, 2010, pp. 19.4.1–19.4.4.
[17] L. Goux, A. Fantini, G. Kar, Y. Y. Chen, N. Jossart,
R. Degraeve, S. Clima, B. Govoreanu, G. Lorenzo, G.
Pourtois, et al., Ultralow sub-500nA operating current
high-performance TiN\Al2O3\HfO2\Hf\TiN bipolar RRAM
achieved through understanding-based stack-engineering,
in Proc. 2012 Symp. VLSI Technology (VLSIT), Honolulu,
HI, USA, 2012, pp. 159–160.
[18] B. Govoreanu, A. Redolfi, L. Zhang, C. Adelmann, M.
Popovici, S. Clima, H. Hody, V. Paraschiv, I. P. Radu,
A. Franquet, et al., Vacancy-modulated conductive oxide
resistive ram (VMCO-RRAM): An area-scalable switching
current, self-compliant, highly nonlinear and wide
on/off-window resistive switching cell, in Proc. 2013 IEEE Int.
Electron Devices Meeting, Washington, DC, USA, 2013, pp.
10.2.1–10.2.4.
[19] Q. Luo, X. X. Xu, H. T. Liu, H. B. Lv, T. C. Gong,
S. B. Long, Q. Liu, H. T. Sun, W. Banerjee, L. Li, et
al., Demonstration of 3D vertical RRAM with ultra
low-leakage, high-selectivity and self-compliance memory cells,
in Proc. 2015 IEEE Int. Electron Devices Meeting (IEDM),
Washington, DC, USA, 2015, pp. 10.2.1–10.2.4.
[20] E. Vianello, O. Thomas, G. Molas, O. Turkyilmaz, N.
Jovanović, D. Garbin, G. Palma, M. Alayan, C. Nguyen, J.
Coignus, et al., Resistive memories for ultra-low-power
embedded computing design, in Proc. 2014 IEEE Int.
Electron Devices Meeting, San Francisco, CA, USA, 2014,
pp. 6.3.1–6.3.4.
[21] D. Ielmini and H. S. P. Wong, In-memory computing with
resistive switching devices, Nat. Electr., vol. 1, no. 6, pp.
333–343, 2018.
[22] A. Sebastian, M. Le Gallo, R. Khaddam-Aljameh, and
E. Eleftheriou, Memory devices and applications for
in-memory computing, Nat. Nanotechnol., vol. 15, no. 7, pp.
529–544, 2020.
[23] M. A. Zidan, J. P. Strachan, and W. D. Lu, The future of
electronics based on memristive systems, Nat. Electr., vol. 1,
no. 1, pp. 22–29, 2018.
[24] J. D. Zhu, T. Zhang, Y. C. Yang, and R. Huang, A
comprehensive review on emerging artificial neuromorphic
devices, Appl. Phys. Rev., vol. 7, no. 1, p. 011312, 2020.
[25] Y. Xi, B. Gao, J. S. Tang, A. Chen, M. F. Chang, X. S. Hu,
J. Van Der Spiegel, H. Qian, and H. Q. Wu, In-memory
learning with analog resistive switching memory: A review
and perspective, Proc. IEEE, vol. 109, no. 1, pp. 14–42,
2021.

[26] W. Q. Zhang, B. Gao, J. S. Tang, P. Yao, S. M. Yu, M. F.
Chang, H. J. Yoo, H. Qian, and H. Q. Wu, Neuro-inspired
computing chips, Nat. Electr., vol. 3, no. 7, pp. 371–382,
2020.
[27] D. Ielmini, Modeling the universal set/reset characteristics
of bipolar RRAM by field- and temperature-driven filament
growth, IEEE Trans. Electr. Dev., vol. 58, no. 12, pp. 4309–
4317, 2011.
[28] R. Waser, R. Dittmann, G. Staikov, and K. Szot, Redox-based
resistive switching memories-nanoionic mechanisms,
prospects, and challenges, Adv. Mater., vol. 21, nos. 25&26,
pp. 2632–2663, 2009.
[29] S. H. Jo, K. H. Kim, and W. Lu, High-density crossbar
arrays based on a Si memristive system, Nano Lett., vol. 9,
no. 2, pp. 870–874, 2009.
[30] S. N. Truong and K. S. Min, New memristor-based crossbar
array architecture with 50-% area reduction and 48-%
power saving for matrix-vector multiplication of analog
neuromorphic computing, J. Semicond. Technol. Sci.,
vol. 14, no. 3, pp. 356–363, 2014.
[31] Z. Sun, G. Pedretti, E. Ambrosi, A. Bricalli, W. Wang, and
D. Ielmini, Solving matrix equations in one step with
cross-point resistive arrays, Proc. Natl. Acad. Sci. USA, vol. 116,
no. 10, pp. 4123–4128, 2019.
[32] A. Maleki, V. Rashtchi, and J. Mazloum, Design and
simulation of an infinite impulse response (IIR) filter with
memristor, Majl. J. Electr. Eng., vol. 12, no. 3, pp. 23–34,
2018.
[33] Z. W. Liu, J. S. Tang, B. Gao, P. Yao, X. Y. Li, D. K. Liu, Y.
Zhou, H. Qian, B. Hong, and H. Q. Wu, Neural signal
analysis with memristor arrays towards high-efficiency
brain-machine interfaces, Nat. Commun., vol. 11, no. 1,
p. 4234, 2020.
[34] S. N. Mirebrahimi and F. Merrikh-Bayat, Programmable
discrete-time type I and type II FIR filter design on the
memristor crossbar structure, Analog Integr. Circu. Sign.
Proc., vol. 79, no. 3, pp. 529–541, 2014.
[35] M. Nourazar, V. Rashtchi, F. Merrikh-Bayat, and A.
Azarpeyvand, Towards memristor-based approximate
accelerator: application to complex-valued fir filter bank,
Analog Integr. Circu. Sign. Proc., vol. 96, no. 3, pp. 577–
588, 2018.
[36] F. Merrikh-Bayat, F. Alibart, L. G. Gao, and D. B. Strukov,
A reconfigurable FIR filter with memristor-based weights,
arXiv preprint arXiv:1608.05445, 2016.
[37] K. Abdelhalim and R. Genov, 915-mhz wireless 64-channel
neural recording SoC with programmable mixed-signal FIR
filters, in Proc. 2011 Proc. ESSCIRC (ESSCIRC), Helsinki,
Finland, 2011, pp. 223–226.
[38] G. K. Anumanchipalli, J. Chartier, and E. F. Chang, Speech
synthesis from neural decoding of spoken sentences, Nature,
vol. 568, no. 7753, pp. 493–498, 2019.
[39] H. Higashi and T. Tanaka, Simultaneous design of FIR filter
banks and spatial patterns for EEG signal classification,
IEEE Trans. Biomed. Eng., vol. 60, no. 4, pp. 1100–1110,
2012.
[40] R. Z. Cai, A. Ren, Y. Z. Wang, and B. Yuan, Memristor-based
discrete Fourier transform for improving performance
and energy efficiency, in Proc. 2016 IEEE Computer Society
Annu. Symp. VLSI (ISVLSI), Pittsburgh, PA, USA, 2016,
pp. 643–648.
[41] M. Hu and J. P. Strachan, Accelerating discrete Fourier
transforms with dot-product engine, in Proc. 2016 IEEE Int.
Conf. Rebooting Computing (ICRC), San Diego, CA, USA,
2016, pp. 1–5.
[42] S. F. Gao, G. J. Yang, X. Qiu, C. Yang, C. Zhang, B. H. Li,
C. Gao, H. Jiang, Z. X. Wang, J. Hu, et al., Programmable
linear RAM: A new flash memory-based memristor for
artificial synapses and its application to speech recognition
system, in Proc. 2019 IEEE Int. Electron Devices Meeting
(IEDM), San Francisco, CA, USA, 2019, pp. 14.1.1–14.1.4.
[43] Y. Halawani, B. Mohammad, M. Al-Qutayri, and S. F.
Al-Sarawi, Memristor-based hardware accelerator for image
compression, IEEE Trans. Very Large Scale Integr. Syst.,
vol. 26, no. 12, pp. 2749–2758, 2018.
[44] K. R. Rao and P. Yip, Discrete Cosine Transform:
Algorithms, Advantages, Applications. San Diego, CA,
USA: Academic Press Professional, Inc., 1990.
[45] B. G. Zhang, N. Uysal, and R. Ewetz, Computational
restructuring: Rethinking image processing using memristor
crossbar arrays, in Proc. 2020 Design, Automation & Test
in Europe Conf. Exhibition, Grenoble, France, 2020, pp.
1594–1597.
[46] M. Le Gallo, A. Sebastian, G. Cherubini, H. Giefers,
and E. Eleftheriou, Compressed sensing recovery using
computational memory, in Proc. 2017 IEEE Int. Electron
Devices Meeting (IEDM), San Francisco, CA, USA, 2017,
pp. 28.3.1–28.3.4.
[47] P. M. Sheridan, F. X. Cai, C. Du, W. Ma, Z. Y. Zhang, and
W. D. Lu, Sparse coding with memristor networks, Nat.
Nanotechnol., vol. 12, no. 8, pp. 784–789, 2017.
[48] F. Y. Qian, Y. P. Gong, G. X. Huang, M. Anwar, and L.
Wang, Exploiting memristors for compressive sampling of
sensory signals, IEEE Trans. Very Large Scale Integr. Syst.,
vol. 26, no. 12, pp. 2737–2748, 2018.
[49] F. Y. Qian, Y. P. Gong, and L. Wang, A memristor based
image sensor exploiting compressive measurement for
low-power video streaming, in Proc. 2017 IEEE Int. Symp.
Circuits and Systems (ISCAS), Baltimore, MD, USA, 2017,
pp. 2026–2029.
[50] S. J. Liu, Y. Z. Wang, M. Fardad, and P. K. Varshney,
A memristor-based optimization framework for artificial
intelligence applications, IEEE Circuits Syst. Mag., vol. 18,
no. 1, pp. 29–44, 2018.
[51] M. Le Gallo, A. Sebastian, G. Cherubini, H. Giefers, and E.
Eleftheriou, Compressed sensing with approximate message
passing using in-memory computing, IEEE Trans. Electr.
Dev., vol. 65, no. 10, pp. 4304–4312, 2018.
[52] M. Amarlingam, P. K. Mishra, K. V. V. D. Prasad, and P.
Rajalakshmi, Compressed sensing for different sensors: A
real scenario for WSN and IoT, in Proc. 2016 IEEE 3rd
World Forum on Internet of Things (WF-IoT), Reston, VA,
USA, 2016, pp. 289–294.
[53] P. M. Sheridan, C. Du, and W. D. Lu, Feature extraction
using memristor networks, IEEE Trans. Neural Netw.
Learning Syst., vol. 27, no. 11, pp. 2327–2336, 2016.

[54] X. Ji, X. F. Hu, Y. Zhou, Z. K. Dong, and S. K. Duan,
Adaptive sparse coding based on memristive neural network
with applications, Cognit. Neurodynam., vol. 13, no. 5,
pp. 475–488, 2019.
[55] W. Woods and C. Teuscher, Fast and accurate sparse coding
of visual stimuli with a simple, ultralow-energy spiking
architecture, IEEE Trans. Neural Netw. Learning Syst., vol.
30, no. 7, pp. 2173–2187, 2019.
[56] D. M. Zhang, Y. C. Hou, L. Zeng, and W. S. Zhao, Hardware
acceleration implementation of sparse coding algorithm
with spintronic devices, IEEE Trans. Nanotechnol., vol. 18,
pp. 518–531, 2019.
[57] J. S. Seo, B. B. Lin, M. Kim, P. Y. Chen, D. Kadetotad,
Z. H. Xu, A. Mohanty, S. Vrudhula, S. M. Yu, J. P. Ye, et
al., On-chip sparse learning acceleration with CMOS and
resistive synaptic devices, IEEE Trans. Nanotechnol., vol.
14, no. 6, pp. 969–979, 2015.
[58] F. X. Cai, J. M. Correll, S. H. Lee, Y. Lim, V. Bothra, Z.
Y. Zhang, M. P. Flynn, and W. D. Lu, A fully integrated
reprogrammable memristor-CMOS system for efficient
multiply-accumulate operations, Nat. Electr., vol. 2, no. 7,
pp. 290–299, 2019.
[59] S. Choi, J. H. Shin, J. Lee, P. Sheridan, and W. D.
Lu, Experimental demonstration of feature extraction and
dimensionality reduction using memristor networks, Nano
Lett., vol. 17, no. 5, pp. 3113–3118, 2017.
[60] P. M. P. Raj, V. J. Louis, S. K. Chatterjee, S. Kanungo,
and S. Kundu, Ferroelectric memristive networks for
dimensionality reduction: A process for effectively
classifying cancer datasets, Integr. Ferroelectr., vol. 201,
no. 1, pp. 126–141, 2019.
[61] P. Yao, H. Q. Wu, B. Gao, J. S. Tang, Q. T. Zhang,
W. Q. Zhang, J. J. Yang, and H. Qian, Fully hardware-implemented memristor convolutional neural network,
Nature, vol. 577, no. 7792, pp. 641–646, 2020.
[62] M. E. Fouda, E. Neftci, A. Eltawil, and F. Kurdahi,
Independent component analysis using RRAMs, IEEE
Trans. Nanotechnol., vol. 18, pp. 611–615, 2019.
[63] A. Rak and G. Cserey, Independent component analysis
by memristor based neural networks, in Proc. 2014 14th
Int. Workshop on Cellular Nanoscale Networks and their
Applications (CNNA), Notre Dame, IN, USA, 2014, pp. 1–2.
[64] P. K. Reddy, V. J. Louis, V. Subramaniam, R. K. Tripathy,
S. Banerjee, and S. Kundu, Implementation of fast ICA
using memristor crossbar arrays for blind image source
separations, IET Circu. Dev. Syst., vol. 14, no. 4, pp. 484–
489, 2020.
[65] C. Li, D. Belkin, Y. N. Li, P. Yan, M. Hu, N. Ge, H. Jiang,
E. Montgomery, P. Lin, Z. R. Wang, et al., Efficient and
self-adaptive in-situ learning in multilayer memristor neural
networks, Nat. Commun., vol. 9, no. 1, p. 2385, 2018.
[66] P. Kumar, A. R. Nair, O. Chatterjee, T. Paul, A. Ghosh, S.
Chakrabartty, and C. S. Thakur, Neuromorphic in-memory
computing framework using memtransistor cross-bar based
support vector machines, in Proc. 2019 IEEE 62nd Int.
Midwest Symp. Circuits and Systems (MWSCAS), Dallas,
TX, USA, 2019, pp. 311–314.
[67] L. X. Xia, P. Gu, B. X. Li, T. Q. Tang, X. L. Yin, W. Q.
Huangfu, S. M. Yu, Y. Cao, Y. Wang, and H. Z. Yan,
Technological exploration of RRAM crossbar array for
matrix-vector multiplication, J. Comput. Sci. Technol., vol.
31, no. 1, pp. 3–19, 2016.
[68] M. Prezioso, F. Merrikh-Bayat, B. D. Hoskins, G. C. Adam,
K. K. Likharev, and D. B. Strukov, Training and operation of
an integrated neuromorphic network based on metal-oxide
memristors, Nature, vol. 521, no. 7550, pp. 61–64, 2015.
[69] P. Yao, H. Q. Wu, B. Gao, S. B. Eryilmaz, X. Y. Huang, W.
Q. Zhang, Q. T. Zhang, N. Deng, L. P. Shi, H. S. P. Wong,
et al., Face classification using electronic synapses, Nat.
Commun., vol. 8, p. 15199, 2017.
[70] Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proc. IEEE,
vol. 86, no. 11, pp. 2278–2324, 1998.
[71] X. Guo, F. M. Bayat, M. Bavandpour, M. Klachko, M.
R. Mahmoodi, M. Prezioso, K. K. Likharev, and D. B.
Strukov, Fast, energy-efficient, robust, and reproducible
mixed-signal neuromorphic classifier based on embedded
nor flash memory technology, in Proc. 2017 IEEE Int.
Electron Devices Meeting (IEDM), San Francisco, CA,
USA, 2017, pp. 6.5.1–6.5.4.
[72] Y. C. Xiang, P. Huang, Y. D. Zhao, M. R. Zhao, B. Gao,
H. Q. Wu, H. Qian, X. Y. Liu, and J. F. Kang, Impacts of
state instability and retention failure of filamentary analog
RRAM on the performance of deep neural network, IEEE
Trans. Electr. Dev., vol. 66, no. 11, pp. 4517–4522, 2019.
[73] Z. R. Wang, C. Li, P. Lin, M. Y. Rao, Y. Y. Nie, W. H.
Song, Q. R. Qiu, Y. N. Li, P. Yan, J. P. Strachan, et al., In
situ training of feed-forward and recurrent convolutional
memristor networks, Nat. Mach. Intellig., vol. 1, no. 9, pp.
434–442, 2019.
[74] L. G. Gao, P. Y. Chen, and S. M. Yu, Demonstration of
convolution kernel operation on resistive cross-point array,
IEEE Electr. Dev. Lett., vol. 37, no. 7, pp. 870–873, 2016.
[75] R. Z. Han, P. Huang, Y. C. Xiang, C. Liu, Z. Dong, Z. Q.
Su, Y. B. Liu, L. Liu, X. Y. Liu, and J. F. Kang, A novel
convolution computing paradigm based on nor flash array
with high computing speed and energy efficiency, IEEE
Trans. Circ. Syst. I: Regular Papers, vol. 66, no. 5, pp. 1692–
1703, 2019.
[76] Y. H. Wu, M. Schuster, Z. F. Chen, Q. V. Le, M. Norouzi, W.
Macherey, M. Krikun, Y. Cao, Q. Gao, K. Macherey, et al.,
Google’s neural machine translation system: Bridging the
gap between human and machine translation, arXiv preprint
arXiv: 1609.08144v2, 2016.
[77] R. Jia and P. Liang, Data recombination for neural semantic
parsing, arXiv preprint arXiv: 1606.03622, 2016.
[78] C. Li, Z. R. Wang, M. Y. Rao, D. Belkin, W. H. Song, H.
Jiang, P. Yan, Y. N. Li, P. Lin, M. Hu, et al., Long short-term
memory networks in memristor crossbar arrays, Nat. Mach.
Intellig., vol. 1, no. 1, pp. 49–57, 2019.
[79] T. Chang, S. H. Jo, and W. Lu, Short-term memory to long-term memory transition in a nanoscale memristor, ACS
Nano, vol. 5, no. 9, pp. 7669–7676, 2011.
[80] Z. R. Wang, S. Joshi, S. E. Savel’ev, H. Jiang, R. Midya,
P. Lin, M. Hu, N. Ge, J. P. Strachan, Z. Li, et al.,
Memristors with diffusive dynamics as synaptic emulators
for neuromorphic computing, Nat. Mater., vol. 16, no. 1,
pp. 101–108, 2017.
[81] J. Moon, W. Ma, J. H. Shin, F. X. Cai, C. Du, S. H. Lee, and
W. D. Lu, Temporal data classification and forecasting
using a memristor-based reservoir computing system, Nat.
Electron., vol. 2, no. 10, pp. 480–487, 2019.
[82] Y. N. Zhong, J. S. Tang, X. Y. Li, B. Gao, H. Qian, and H.
Q. Wu, Dynamic memristor-based reservoir computing for
high-efficiency temporal signal processing, Nat. Commun.,
vol. 12, no. 1, p. 408, 2021.
[83] J. Borghetti, G. S. Snider, P. J. Kuekes, J. J. Yang, D. R.
Stewart, and R. S. Williams, ‘Memristive’ switches enable
‘stateful’ logic operations via material implication, Nature,
vol. 464, no. 7290, pp. 873–876, 2010.
[84] S. Kvatinsky, D. Belousov, S. Liman, G. Satat, N. Wald,
E. G. Friedman, A. Kolodny, and U. C. Weiser,
MAGIC-memristor-aided logic, IEEE Trans. Circ. Syst. II: Exp.
Briefs, vol. 61, no. 11, pp. 895–899, 2014.
[85] K. M. Kim, N. Xu, X. L. Shao, K. J. Yoon, H. J. Kim, R. S.
Williams, and C. S. Hwang, Single-cell stateful logic using
a dual-bit memristor, Phys. Status Solidi (RRL)-Rapid Res.
Lett., vol. 13, no. 3, p. 1800629, 2019.
[86] N. Xu, L. Fang, K. M. Kim, and C. S. Hwang, Time-efficient
stateful dual-bit-memristor logic, Phys. Status Solidi (RRL)-Rapid
Res. Lett., vol. 13, no. 6, p. 1900033, 2019.
[87] J. J. Yang, D. Strukov, and D. Stewart, Memristive devices
for computing, Nat. Nanotechnol., vol. 8, no. 1, pp. 13–24,
2013.
[88] S. Balatti, S. Ambrogio, and D. Ielmini, Normally-off logic
based on resistive switches—part I: Logic gates, IEEE Trans.
Electr. Dev., vol. 62, no. 6, pp. 1831–1838, 2015.
[89] B. Chen, F. X. Cai, J. T. Zhou, W. Ma, P. Sheridan, and W. D.
Lu, Efficient in-memory computing architecture based on
crossbar arrays, in Proc. 2015 IEEE Int. Electron Devices
Meeting (IEDM), Washington, DC, USA, 2015, pp. 17.5.1–
17.5.4.
[90] M. Miyamura, M. Tada, T. Sakamoto, N. Banno, K.
Okamoto, N. Iguchi, and H. Hada, First demonstration
of logic mapping on nonvolatile programmable cell using
complementary atom switch, in Proc. 2012 Int. Electron
Devices Meeting, San Francisco, CA, USA, 2012, pp.
10.6.1–10.6.4.
[91] R. Rosezin, E. Linn, C. Kugeler, R. Bruchhaus, and R.
Waser, Crossbar logic using bipolar and complementary
resistive switches, IEEE Electr. Dev. Lett., vol. 32, no. 6,
pp. 710–712, 2011.
[92] A. Sebastian, T. Tuma, N. Papandreou, M. Le Gallo, L.
Kull, T. Parnell, and E. Eleftheriou, Temporal correlation
detection using computational phase-change memory, Nat.
Commun., vol. 8, no. 1, p. 1115, 2017.
[93] Z. W. Liu, J. S. Tang, B. Gao, X. Y. Li, P. Yao, Y. D. Lin,
D. K. Liu, B. Hong, H. Qian, and H. Q. Wu, Multichannel
parallel processing of neural signals in memristor arrays,
Sci. Adv., vol. 6, no. 41, p. eabc4797, 2020.
[94] R. Karam, R. Puri, S. Ghosh, and S. Bhunia, Emerging
trends in design and applications of memory-based
computing and content-addressable memories, Proc. IEEE,
vol. 103, no. 8, pp. 1311–1330, 2015.
[95] T. F. Wu, H. T. Li, P. C. Huang, A. Rahimi, G. Hills, B.
Hodson, W. Hwang, J. M. Rabaey, H. P. Wong, M. M.
Shulaker, et al., Hyperdimensional computing exploiting
carbon nanotube FETs, resistive RAM, and their monolithic 3D
integration, IEEE J. Solid-State Circ., vol. 53, no. 11, pp.
3183–3196, 2018.
[96] L. L. Cai, W. Y. Chen, Y. D. Zhao, X. Y. Liu, J. F. Kang,
X. Zhang, and P. Huang, Insight into effects of oxygen
reservoir layer and operation schemes on data retention
of HfO2-based RRAM, IEEE Trans. Electr. Dev., vol. 66,
no. 9, pp. 3822–3827, 2019.
[97] X. Mu, J. Yu, and S. Wang, The extended linear-drift
model of memristor and its piecewise linear approximation,
Tsinghua Science and Technology, vol. 19, no. 3, pp. 307–
313, 2014.
[98] M. R. Zhao, H. Q. Wu, B. Gao, Q. T. Zhang, W. Wu,
S. Wang, Y. Xi, D. Wu, N. Deng, S. M. Yu, et al.,
Investigation of statistical retention of filamentary analog
RRAM for neuromorphic computing, in Proc. 2017 IEEE Int.
Electron Devices Meeting (IEDM), San Francisco, CA,
USA, 2017, pp. 39.4.1–39.4.4.
[99] H. Q. Wu, X. H. Wang, B. Gao, N. Deng, Z. H. Lu, B.
Haukness, G. Bronner, and H. Qian, Resistive random
access memory for future information processing system,
Proc. IEEE, vol. 105, no. 9, pp. 1770–1789, 2017.
[100] Z. R. Wang, S. Joshi, S. Savel’ev, W. H. Song, R. Midya,
Y. N. Li, M. Y. Rao, P. Yan, S. Asapu, Y. Zhuo, et al.,
Fully memristive neural networks for pattern classification
with unsupervised learning, Nat. Electron., vol. 1, no. 2, pp.
137–145, 2018.
[101] P. Kanerva, Hyperdimensional computing: An introduction
to computing in distributed representation with high-dimensional
random vectors, Cognit. Comput., vol. 1, no. 2,
pp. 139–159, 2009.

Han Zhao received the BEng degree from
Harbin Institute of Technology, Harbin,
China in 2021. He is currently a PhD
candidate at the School of Integrated
Circuits, Tsinghua University. His research
interests include neuromorphic computing,
medical image processing, and brain-machine interfaces.

Zhengwu Liu received the BEng degree
from University of Electronic Science and
Technology of China (UESTC), Chengdu,
China in 2018. He is currently a PhD candidate
at the School of Integrated Circuits, Tsinghua
University. His research interests include
neuromorphic computing, brain-machine
interfaces, and biomedical signal processing.

Jianshi Tang received the BEng degree
from Tsinghua University, Beijing, China
in 2008, and the PhD degree from the
University of California at Los Angeles
(UCLA), Los Angeles, CA, USA in 2014.
From 2015 to 2019, he worked at the IBM
T. J. Watson Research Center, Ossining, NY,
USA. He is currently an assistant professor
at the School of Integrated Circuits, Tsinghua University. His
research interests include emerging memory, neuromorphic
computing, and carbon electronics.

Bin Gao received the BS degree in physics
and the PhD degree in microelectronics
from Peking University, Beijing, China
in 2008 and 2013, respectively. He is
currently an associate professor at the
School of Integrated Circuits, Tsinghua
University, Beijing. His research interests
include fabrication, characterization, and
modeling of emerging semiconductor devices, especially RRAM.

Yufeng Zhang received the BEng, MEng,
and PhD degrees from Harbin Institute
of Technology, Harbin, China in 1999,
2001, and 2004, respectively, where he is
currently a professor and the chair of the
Department of Microelectronics Science and
Technology. His research interests include
integrated circuit design and intelligent
microsystems.

He Qian received the PhD degree in
microelectronics from Xi’an Jiaotong
University, Xi’an, China in 1990. From
1990 to 2006, he was at the Institute
of Microelectronics, Chinese Academy of
Sciences, Beijing, China, where he became
a professor in 1996 and the director in 2001.
From 2006 to 2008, he was the director at
the Samsung Semiconductor China Research and Development
Center. In 2009, he joined the School of Integrated Circuits,
Tsinghua University, Beijing, as a professor. His research work
covers RRAM, 3-D NAND, and neuromorphic computing based
on RRAM arrays.

Huaqiang Wu received two BEng degrees
in material science and engineering and
enterprise management from Tsinghua
University, Beijing, China in 2000, and
the PhD degree in electrical engineering
from Cornell University, Ithaca, NY,
USA in 2005. From 2006 to 2008, he
was a senior engineer at Spansion LLC,
Sunnyvale, CA, USA. In 2009, he joined the School of Integrated
Circuits, Tsinghua University, as an associate professor. He was
promoted to full professor in 2018. He is currently the dean of
the School of Integrated Circuits, Tsinghua University. His research
interests include advanced memory and brain-inspired computing
technologies.

