Design of a pixel-parallel feature extraction VLSI system for biologically-inspired object recognition method by Takashi Morie et al.
DESIGN OF A PIXEL-PARALLEL FEATURE EXTRACTION
VLSI SYSTEM FOR BIOLOGICALLY-INSPIRED
OBJECT RECOGNITION METHODS
Takashi Morie, Makoto Nagata, and Atsushi Iwata
Graduate School of Advanced Sciences of Matter, Hiroshima University
Higashi-Hiroshima, 739-8526 JAPAN
morie@dsl.hiroshima-u.ac.jp
Abstract—
This paper proposes a biologically-inspired feature extraction method, which consists of coarse region segmentation by a resistive-fuse network and feature extraction by Gabor wavelet transforms. Their pixel-parallel VLSI implementation based on the pulse modulation circuit architecture is described, and measurement results of Gabor filter operation on a test LSI chip with a 1-D array of 20 pixels are presented.
I. INTRODUCTION
To go beyond the silicon retina [1] and mimic the functions of the primary visual cortex, simple feature extraction by Gabor wavelet transforms, which extract specific spatial frequency components with given directions, should be implemented first. In the brain, these transforms are performed by simple cortical cells, and massively parallel processing is achieved in hypercolumn structures.
However, it is very difficult for present VLSIs to perform massively parallel processing with 3-D matrix structures and interconnections as in the hypercolumn. This situation differs from that of the retina, which has a roughly 2-D structure. Fabrication technology for 3-D VLSIs is being developed [2], but it can construct structures of at most several layers.
In this paper, we first propose a feature extraction method suitable for present 2-D VLSIs. In the proposed method, coarse region segmentation, in which small shaded areas are ignored, is performed first; each segmented region is then extracted, and Gabor wavelet transforms are applied to each extracted region. We briefly describe the pixel-parallel VLSI implementation based on our pulse modulation circuit architecture. We then focus on the VLSI implementation of Gabor wavelet transforms and describe the design and measurement results of a test LSI chip for 1-D Gabor filters.
II. DESIGN OF A FEATURE EXTRACTION VLSI
SYSTEM
A. Pixel-parallel VLSI implementation
In order to perform information processing in 3-D structures by 2-D VLSIs, time-division processing should be introduced. We adopt a pixel-parallel architecture, which has a 2-D processing unit array corresponding to the pixel array. This approach is promising because integrating on-chip image sensors with it will yield highly functional vision chips.
Because the ordinary digital circuit approach requires a large circuit area, it is difficult to adopt a pixel-parallel architecture with it. In contrast, the analog circuit approach makes the circuit compact, but large-scale integration of the pixel units is difficult from the viewpoint of calculation precision and controllability.
We have proposed a pulse modulation circuit architecture, which has the advantages of both the digital and analog approaches [3, 4]. However, even when this architecture is used, we can only design a VLSI chip with at most 100 × 100 pixels by using the present VLSI technology. In order to process natural scene images with megapixels, introducing a segmentation process is inevitable.
B. Feature extraction scheme with image segmentation
As biologically-inspired models for image recognition, hierarchical architectures for integrating local features have been proposed, e.g., the Neocognitron [5]. However, this type of architecture requires a large amount of hardware resources.

Here, we do not adopt such an architecture. Instead, we propose a combination of coarse region segmentation and feature extraction in each segmented region, which is a realistic solution for present VLSI systems.
In this approach, the original image should be segmented into recognition target objects, and each object is then processed for recognition. However, precise segmentation requires the recognition result of the whole image, which cannot be obtained before segmentation. To solve this dilemma, we temporarily segment the image by using the brightness of local areas, and then try to recognize each segmented region. If a meaningful result is obtained, we judge that the segmentation succeeded. If not, we redo the segmentation process with different parameters.

[Figure 1: Feature extraction process of natural scene images. The target objects are human faces. Block labels: Original Image; Image Segmentation (Resistive-fuse Network Model); Region Extraction (Oscillator Network, Cellular Automaton); Feature Extraction (Cellular Neural Network); Output (edge data, extracted area, Gabor wavelet); Memorized faces.]
Figure 1 shows the proposed feature extraction process. The target objects are human faces in this case.
1) By using the resistive-fuse network, a gray-level natural scene image is segmented into regions in each of which the pixels have nearly the same brightness, regardless of small parts [6]. For example, whole face regions are segmented so that small parts such as the eyes, brows, nose, and mouth are eliminated.
2) Each segmented region is extracted one by one. For this process, the nonlinear oscillator network called LEGION can be used [7], and we are also developing a cellular-automaton-type region extraction method, which can be implemented by digital circuits.
3) Feature extraction is performed for the extracted regions by using Gabor wavelet transforms.
4) The obtained Gabor coefficients are used for the object recognition process.
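The control flow of steps 1) to 4), together with the retry on failed segmentation, can be sketched as follows. This is only an illustrative outline: the function name `recognize_with_retry` and the four callables passed to it are hypothetical placeholders for the resistive-fuse segmentation, region extraction, Gabor feature extraction, and recognition stages, none of which is specified at this level in the text.

```python
def recognize_with_retry(image, parameter_sets, segment, extract_regions,
                         gabor_features, recognize):
    """Try segmentation parameter sets in order until a region is recognized.

    All four callables are hypothetical stand-ins for the processing stages:
      segment(image, params)          -> segmented image (step 1)
      extract_regions(segmented)      -> iterable of regions (step 2)
      gabor_features(image, region)   -> feature vector (step 3)
      recognize(features)             -> result, or None if not meaningful (step 4)
    """
    for params in parameter_sets:
        segmented = segment(image, params)
        for region in extract_regions(segmented):
            features = gabor_features(image, region)
            result = recognize(features)
            if result is not None:
                # A meaningful result means this segmentation is accepted.
                return result, params
    # No parameter set produced a meaningful recognition result.
    return None


if __name__ == "__main__":
    # Toy stand-ins: the "segmentation" only succeeds for params >= 3.
    outcome = recognize_with_retry(
        image="scene",
        parameter_sets=[1, 2, 3, 4],
        segment=lambda image, params: params,
        extract_regions=lambda segmented: [segmented],
        gabor_features=lambda image, region: region,
        recognize=lambda features: "face" if features >= 3 else None,
    )
    print(outcome)
```

Passing the stages in as callables keeps the sketch independent of how each stage is realized in hardware.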
Since the details of the resistive-fuse networks and the oscillator networks are described in our other papers [6, 7], the rest of this paper describes the VLSI implementation of the Gabor filter.
III. PIXEL-PARALLEL GABOR-TYPE FILTER
CIRCUIT
A. Analog circuit for CNN-based Gabor-type ﬁlters
A 2-D cellular neural network (CNN) circuit performing Gabor-type filtering, shown in Fig. 2, has been proposed by B. E. Shi [8]. Each pixel consists of two nodes corresponding to the real and imaginary parts of the Gabor coefficient. The convolution kernel for pixel coordinate $\mathbf{r}$ is

$$ g(\mathbf{r}) \propto e^{-\lambda |\mathbf{r}|}\, e^{j \boldsymbol{\omega}_0 \cdot \mathbf{r}} \tag{1} $$

where $\lambda$ is the decay constant of the exponential envelope function, and $\boldsymbol{\omega}_0 = (\omega_{x0}, \omega_{y0})$ is a frequency vector with a specific direction. This direction is determined by the ratio of $\omega_{x0}$ to $\omega_{y0}$; i.e., $\tan\theta = \omega_{y0}/\omega_{x0}$, where $\theta$ is the angle between the frequency vector and the $x$-axis. This is a unique and important feature of this circuit: Gabor-type filtering with arbitrary frequencies and directions can be achieved only by changing $\omega_{x0}$ and $\omega_{y0}$, i.e., the conductances of the network.

[Figure 2: 2-D CNN circuit performing Gabor-type filtering, where $I_n$ is the input for node $n$; the remaining symbol definitions in the caption are illegible in the source. Labels in the figure: real and imaginary node voltages $V_r(m,n)$ and $V_i(m,n)$ with their four neighbors, conductances G0, G1x, G1y, G2x, and G2y, and an input source $\lambda^2 I_n$.]
An example of numerical simulation results for impulse
responses of the 2-D Gabor-type ﬁlter circuit is shown in
Fig. 3. This result is nearly identical with the mathematical
result obtained from Eq. 1.
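The kernel described by Eq. 1 can be evaluated directly for comparison with such simulations. The sketch below assumes the unnormalized form exp(-λ|r|)·exp(j(ω_x0·m + ω_y0·n)), i.e., an exponential envelope multiplied by a complex sinusoid; the normalization factor, the function names, and the parameter values are assumptions made for illustration only.

```python
import cmath
import math


def gabor_kernel(m, n, lam, wx0, wy0):
    """Assumed unnormalized Gabor-type kernel at pixel offset (m, n).

    lam is the decay constant of the exponential envelope and
    (wx0, wy0) is the frequency vector in radians per pixel.
    """
    envelope = math.exp(-lam * math.hypot(m, n))
    carrier = cmath.exp(1j * (wx0 * m + wy0 * n))
    return envelope * carrier


def direction_deg(wx0, wy0):
    """Angle between the frequency vector and the x-axis, in degrees."""
    return math.degrees(math.atan2(wy0, wx0))


if __name__ == "__main__":
    lam, wx0, wy0 = 0.3, math.pi / 4, math.pi / 4  # illustrative values
    print("direction [deg]:", direction_deg(wx0, wy0))
    for n in range(-2, 3):
        row = [gabor_kernel(m, n, lam, wx0, wy0) for m in range(-2, 3)]
        print(" ".join(f"{v.real:+.2f}{v.imag:+.2f}j" for v in row))
```

The real and imaginary parts of the returned value correspond to the two nodes of each pixel.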
B. PWM pixel unit circuit for pixel-parallel
implementation of analog CNNs
In order to implement the analog CNN circuit shown in Fig. 2, we have proposed a pixel unit circuit with arbitrary nonlinear functions based on pulse modulation techniques [4]. Our pixel unit circuit emulates the operation of a node in analog CNN circuits by discrete-time dynamics based on clock operation. A change in each node voltage is calculated using Kirchhoff's law, and the steady state is obtained by repeating the updating process. The dynamics of pixel node $i$ implemented is given by Eq. 2.

[Equation (2) is illegible in the source.]

In Eq. 2, $O_i$ and $I_i$ are the node voltage and the input voltage, respectively; the neighbor nodes of $i$ include node $i$ itself; and the remaining coefficient is a constant. The two functions appearing in Eq. 2 are arbitrary odd functions; although the Gabor-type filter circuit requires only linear functions, this pixel unit circuit can implement arbitrary odd functions as required for resistive-fuse networks.

[Figure 3: Impulse response of 2-D Gabor-type filter circuit (numerical simulation result). Panels: real part, imaginary part.]

[Figure 4: Pixel unit circuit using PWM signals. (SF is a source follower circuit used as an analog buffer; other circuit blocks are described in the text.)]
Our pixel unit circuit is shown in Fig. 4. Voltages $O_i$ and $I_i$ are temporarily stored at capacitors $C_{On}$ and $C_{In}$, respectively, and $O_i$ is updated according to the discrete-time dynamics. The circuit shown in Fig. 4 is slightly different from the previously proposed one [4]. Since subtraction circuit SUB directly receives voltage inputs, the processing time can be shortened compared with the previous circuit. The details of SUB are described in Ref. [6].
The updating process proceeds as follows:
1) Selector SEL selects a set of signals to be calculated, which corresponds to one term on the right side of Eq. 2.
2) Circuit SUB calculates the absolute value and the sign of the difference between the two selected signals. A pulse-width modulation (PWM) signal whose pulse width is proportional to the absolute value appears at node Diff, and the sign bit appears at node Sign. The sign bit determines the updating direction of $O_i$.
3) The PWM pulse switches the voltage source $V_{non}$, whose voltage waveform in the time domain is the same as the corresponding (nonlinear) function in Eq. 2. Thus, capacitor $C_3$ holds the value of the corresponding term in Eq. 2.
4) The voltage stored in capacitor $C_3$ is again converted into a PWM pulse, which switches the current source $I_+$ or $I_-$. Output $O_i$ is thus updated by the corresponding term.
5) By repeating the above steps, $O_i$ is updated by another term.

[Figure 5: Pixel unit circuit layout and micro-photograph of a part of the PWM CNN test chip. Layout labels: SEL, SUB, NONL, UPDT, SF, INIT, $O_i$, $I_i$; dimension labels 177 µm and 110 µm; 1-D array of 20 pixels.]
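The updating steps above can be mimicked in software. The following is a minimal behavioral sketch, not a model of the fabricated circuit: it assumes that each update adds eps·[Σ_j f(O_j − O_i) + g(I_i − O_i)] to O_i over the two nearest neighbors j in a 1-D array, which is an assumed form and not Eq. 2 itself. The names `sub_pwm`, `sample_waveform`, `update_node`, and `relax`, and the value of `eps`, are hypothetical. It illustrates a plain resistive network, not the two-node Gabor-type filter.

```python
def sub_pwm(a, b, gain=1.0):
    """SUB: return (pulse width proportional to |a - b|, sign bit as +1 or -1)."""
    diff = a - b
    return gain * abs(diff), (1 if diff >= 0 else -1)


def sample_waveform(width, func):
    """NONL: a pulse of the given width samples a waveform shaped like func,
    so the held value equals func(width)."""
    return func(width)


def update_node(o, inputs, i, eps, f, g):
    """One update of node i, processed term by term as SEL would select them."""
    terms = [(o[j], o[i], f) for j in (i - 1, i + 1) if 0 <= j < len(o)]
    terms.append((inputs[i], o[i], g))
    delta = 0.0
    for a, b, func in terms:
        width, sign = sub_pwm(a, b)
        held = sample_waveform(width, func)
        # For an odd func, sign * func(|d|) equals func(d).
        delta += sign * eps * held
    return o[i] + delta


def relax(inputs, f, g, eps=0.1, iterations=300):
    """Repeat synchronous updates from a zero initial state."""
    o = [0.0] * len(inputs)
    for _ in range(iterations):
        o = [update_node(o, inputs, i, eps, f, g) for i in range(len(o))]
    return o


if __name__ == "__main__":
    def linear(x):
        return x  # linear conductance; any odd function could be used instead

    impulse = [0.0] * 4 + [1.0] + [0.0] * 4
    print([round(v, 4) for v in relax(impulse, linear, linear)])
```

Because the magnitude and the sign of each difference are handled separately in this sketch, the function is only ever evaluated for non-negative arguments, which is sufficient for odd functions.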
C. Design and measurement results of fabricated test
chip
We designed a PWM pixel unit circuit using 0.6 µm CMOS technology, and constructed a 1-D 20-pixel network. The layout image of the pixel unit circuit and a photograph of the fabricated chip are shown in Fig. 5. Each pixel has two unit circuits because the Gabor-type filter requires two nodes as described above. The supply voltage was 3.3 V, and the power consumption was 33 µW per pixel unit circuit.
This chip can also operate as a resistive-fuse network [6]; here we show the measurement results for Gabor-type filter operation in Fig. 6. The figure shows an impulse response for a period ($2\pi/\omega_{x0}$) of 4 pixels; the convergence time was about 600 µs. The outputs were nearly identical with the numerical results. These results verify that our chip successfully operates as a Gabor-type filter.
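A period of 4 pixels corresponds to a spatial frequency of 2π/4 = π/2 rad per pixel. As a rough check of what the ideal response then looks like, the sketch below evaluates an assumed 1-D kernel exp(-λ|n|)·exp(jπn/2) at integer offsets n from the impulse position; the kernel form, the function names, and λ = 0.3 are illustrative assumptions and are not taken from the measurement.

```python
import cmath
import math

PERIOD_PIXELS = 4
OMEGA = 2 * math.pi / PERIOD_PIXELS  # pi/2 rad per pixel


def ideal_response(offset, lam=0.3):
    """Assumed ideal 1-D Gabor-type impulse response at an integer pixel offset."""
    return math.exp(-lam * abs(offset)) * cmath.exp(1j * OMEGA * offset)


def nonzero_parts(offset, lam=0.3, tol=1e-9):
    """Return (real part is non-zero, imaginary part is non-zero) within tol."""
    value = ideal_response(offset, lam)
    return abs(value.real) > tol, abs(value.imag) > tol


if __name__ == "__main__":
    impulse_cell = 11  # the impulse is applied at pixel #11
    for cell in range(6, 18):
        value = ideal_response(cell - impulse_cell)
        print(f"cell {cell:2d}: Re={value.real:+.3f} Im={value.imag:+.3f}")
```

Under this assumption the real part is non-zero only at even offsets and the imaginary part only at odd offsets. With the impulse at pixel #11, this appears consistent with Fig. 6, where the real-part traces are labeled with odd-numbered cells and the imaginary-part traces with even-numbered cells.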
IV. CONCLUSION
We proposed a feature extraction method consisting of coarse region segmentation by a resistive-fuse network and feature extraction by Gabor wavelet transforms. The pixel-parallel VLSI implementation based on the pulse modulation circuit architecture was described. We designed a pixel unit circuit and a test LSI chip with a 1-D array of 20 pixels using 0.6 µm CMOS technology, and verified the basic operation by measuring the impulse response of the test LSI chip.
ACKNOWLEDGMENT
The authors wish to thank M. Miyake and S. Nishijima for
their collaboration on design and measurement of the test
chip. This work was supported by the Ministry of Educa-
tion, Science, Sports, and Culture under Grant-in-Aid for
Scientiﬁc Research on Priority Areas (A). The LSI chip
was fabricated in the chip fabrication program of VDEC
with the collaboration by Rohm Corporation and Toppan
Printing Corporation.
References
[1] C. Mead, Analog VLSI and Neural Systems, Addison-Wesley, 1989.
[2] H. Kurino, M. Nakagawa, K. W. Lee, T. Nakamura, Y. Ya-
mada, K. T. Park, and M. Koyanagi, “Smart vision chip fabri-
cated using three dimensional integration technology,” Ad-
vances in Neural Information Processing Systems 13, pp.
720–726. MIT Press, 2001.
[3] A. Iwata and M. Nagata, “A concept of analog-digital merged
circuit architecture for future VLSI’s,” IEICE Trans. Funda-
mentals., vol. E79-A, no. 2, pp. 145–157, 1996.
[4] T. Morie, M. Miyake, S. Nishijima, M. Nagata, and A. Iwata,
“A multi-functional cellular neural network circuit using
pulse modulation signals for image recognition,” Proc. Int.
Conf. on Neural Information Processing (ICONIP), pp. 613–
617, Taejon, Korea, Nov. 2000.
[5] K. Fukushima, “Neocognitron: A self-organizing neural net-
work model for a mechanism of pattern recognition unaf-
fected by shift in position,” Biol. Cybern., vol. 36, pp. 193–
202, 1980.
[6] T. Morie, M. Miyake, M. Nagata, and A. Iwata, “A 1-D CMOS PWM cellular neural network circuit and resistive-fuse network operation,” Ext. Abs. of Int. Conf. on Solid State Devices and Materials, D-1-2, Tokyo, Sept. 2001.
[7] H. Ando, M. Miyake, T. Morie, M. Nagata, and A. Iwata, “A
nonlinear oscillator network for gray-level image segmenta-
tion and PWM/PPM circuits for its VLSI implementation,”
IEICE Trans. Fundamentals., vol. E83-A, no. 2, pp. 329–336,
2000.
[8] B. E. Shi, “2D focal plane steerable and scalable cortical ﬁl-
ters,” Proc. Int. Conf. on Microelectronics for Neural, Fuzzy
and Bio-inspired Systems (MicroNeuro), pp. 232–239, 1999.
[Figure 6: Measurement results of Gabor-type filter composed of the PWM pixel unit circuits: impulse response when the initial non-zero value is given only at pixel #11. Panels: real part (traces Vr(7), Vr(9), Vr(11), Vr(13), Vr(15), Vr(17)) and imaginary part (traces Vi(6), Vi(8), Vi(10), Vi(12), Vi(14), Vi(16)) versus iteration (0 to 200); real (Re) and imaginary (Im) parts versus cell position (5 to 20) at 200 iterations. Vertical axes: output pulse width [µs], from -0.4 to 0.6.]