128-bit multicomparator by Mead, Carver A. et al.
692 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. SC-11, NO. 5, OCTOBER 1976
128-Bit Multicomparator
CARVER A. MEAD, RICHARD D. PASHLEY, MEMBER, IEEE, LEE D. BRITTON, YOSHIAKI T. DAIMON,
AND STEWART F. SANDO, JR., MEMBER, IEEE
Abstract–A 128-bit multicomparator was designed to perform the
search-sort function on arbitrary length data strings. Devices can be
cascaded for longer block lengths or paralleled for bit-parallel,
word-serial applications. The circuit utilizes a 3-phase static-dynamic shift
register cell for data handling and a unique gated EXCLUSIVE-NOR circuit
to accomplish the compare function. The compare operation is performed
bit parallel between a “data” register and a “key” register, with
a third “mask” register containing DON’T CARE bits that disable the
comparator. The multicomparator was fabricated using p-channel silicon-gate
metal-oxide-semiconductor (MOS) technology on a 107 × 150
mil chip containing 3350 devices. With transistor-transistor logic
(TTL) input, data rates in excess of 2 MHz have been attained. The
average power dissipation was 250 mW in the dynamic mode and 300
mW in the static mode.
INTRODUCTION
OVER the past several years, a significant amount of energy has been
devoted to the fabrication of larger
and faster semiconductor memories and conventional
central processing units (CPU’s) in chip form. In the process,
many other applications of large-scale integration (LSI) to
computer architecture have been neglected [1]. LSI has removed
the technological distinction between logic and
memory. It is now economically feasible to decentralize the
CPU of a computer by replacing much of its maintenance
software with functional hardware to improve system efficiency.
Presently, an inordinate amount of processing time is
spent on organizing and accessing files in peripherals. Peripherals
are usually controlled directly by the CPU and have
little or no associated logic of their own. A great improvement
in this situation can be made by developing peripheral
logic units. This would allow each peripheral to accomplish
its own internal processing and thus reduce CPU housekeeping
duties. This paper describes a 128-bit multicomparator that is
designed to perform the search-sort function.
The block diagram of the multicomparator is shown in Fig. 1.
The circuit consists of three independently clocked static-dynamic
shift registers with associated EXCLUSIVE-NOR gating.
In operation, the device indicates a match between the
data word and the unmasked bits of the key word. The
multicomparator is loaded with a key word by serially shifting the
word into the key register and locking the register in static
mode. While the key word is being loaded, the comparator is
enabled by entering zeros¹ in the appropriate locations of the
Manuscript received March 15, 1976; revised July 18, 1976.
C. A. Mead is with the California Institute of Technology, Pasadena,
CA 91125.
R. D. Pashley and S. F. Sando, Jr., are with the Intel Corporation,
Santa Clara, CA.
L. D. Britton is with the Hewlett-Packard Laboratories, Cupertino,
CA.
Y. T. Daimon is with the Sony Corporation, Tokyo, Japan.
¹Voltage convention: high–“1”–VDD, low–“0”–VCC. Note that
since VDD is negative for p-channel MOS and positive for n-MOS,
transistor-transistor logic (TTL) levels may or may not have reverse
polarity depending on the processing used.
Fig. 1. Block diagram of multicomparator.
Fig. 2. Possible connections of multicomparator. (a) Cascaded. (b) Bit-parallel, word-serial.
mask register. Masking allows the multicomparator to search
for bit strings of varying length and composition. For example,
assume it is necessary to search for all words containing a
specific 113-bit code. By entering the 113-bit code in the key
register and masking out the rest of the comparator, the
multicomparator is configured to search for this code wherever it
occurs in the data file. Once the multicomparator is loaded
with “key” and “mask” words, the file being searched is
serially shifted through the data register. The data words are
compared in bit parallel with the unmasked bits of the key
word as they pass through the data register. When a match is
found, the compare output goes high.
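The search operation just described can be illustrated with a short behavioral model. The following Python sketch is not taken from the chip design; the class name `Multicomparator`, the helper `find_pattern`, and the convention that a mask bit of 1 means DON'T CARE are assumptions chosen to agree with the text (zeros in the mask register enable the comparator). Registers are modeled as lists, with the most recently shifted bit at index 0.

```python
class Multicomparator:
    """Behavioral sketch of three serial shift registers with a masked compare."""

    def __init__(self, width=128):
        self.width = width
        self.key = [0] * width
        self.mask = [1] * width   # assumed: 1 = DON'T CARE, 0 = comparator enabled
        self.data = [0] * width

    def shift_key_mask(self, key_bit, mask_bit):
        # Key and mask are loaded serially, one bit per clock.
        self.key = [key_bit] + self.key[:-1]
        self.mask = [mask_bit] + self.mask[:-1]

    def shift_data(self, bit):
        # Shift one data bit in and return the compare output for this clock.
        self.data = [bit] + self.data[:-1]
        return self.compare_output()

    def compare_output(self):
        # High (1) only if every unmasked bit of the key equals the data bit.
        return int(all(m or d == k
                       for d, k, m in zip(self.data, self.key, self.mask)))


def find_pattern(stream, pattern, width=128):
    """Return the stream indices at which the compare output goes high."""
    mc = Multicomparator(width)
    for _ in range(width - len(pattern)):
        mc.shift_key_mask(0, 1)        # unused cells are masked out
    for bit in pattern:
        mc.shift_key_mask(bit, 0)      # pattern cells are enabled
    return [i for i, bit in enumerate(stream) if mc.shift_data(bit)]


stream = [0, 1, 0, 1, 1, 0, 1, 0, 1, 1, 1]
print(find_pattern(stream, [1, 0, 1, 1]))   # [4, 9]
```

The reported indices are those of the last bit of each occurrence, since the compare output goes high on the clock that completes the alignment. A pattern beginning with zeros can match against the initially cleared data register before the file has fully entered; the model reproduces that behavior rather than hiding it.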
Large multicomparators can be constructed of the 128-bit
circuits. Cascaded [Fig. 2(a)], the comparator can be used to
search for words longer than 128 bits. By implementing multicomparators
in parallel [Fig. 2(b)], a word-serial, bit-parallel
organization is achieved which is completely compatible with
byte oriented machines. For example, an arbitrary length
string of ASCII characters (such as one line of a source listing)
can be easily searched using this technique, making the device
ideal for indexing, test, etc.
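As an illustration of the bit-parallel, word-serial organization, the sketch below searches an ASCII string using one comparator per bit of the character code. It is a behavioral model under stated assumptions: the function names `bit_plane` and `find_string`, the `wildcard` parameter used to mark DON'T CARE characters, and the choice to AND the per-plane compare outputs are illustrative and are not specified in the text.

```python
def bit_plane(text, b):
    """Bit b of every character: the serial stream seen by comparator b."""
    return [(ord(ch) >> b) & 1 for ch in text]


def find_string(text, pattern, wildcard=None, bits=8):
    """Return the indices of the last character of each match of pattern."""
    n = len(pattern)
    if n == 0:
        return []
    # One mask word shared by all planes: 1 = DON'T CARE character position.
    mask = [1 if ch == wildcard else 0 for ch in pattern]
    hits = []
    for end in range(n - 1, len(text)):
        window = text[end - n + 1:end + 1]
        plane_outputs = []
        for b in range(bits):
            key = bit_plane(pattern, b)
            data = bit_plane(window, b)
            plane_outputs.append(
                all(m or d == k for d, k, m in zip(data, key, mask)))
        # Assumed combination of compare outputs: every plane must match.
        if all(plane_outputs):
            hits.append(end)
    return hits


print(find_string("the cat sat on the mat", "?at", wildcard="?"))  # [6, 10, 21]
```

Each plane here plays the role of one 128-bit device, so a real arrangement of this kind would be limited to patterns of at most 128 characters per device.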
There are many applications for the multicomparator in
machine construction. As a result of its serial-search nature,
the multicomparator is ideally suited to interface serial-access
devices such as tapes or disks. For example, a “smart disk” can
be constructed which has, as a part of its instruction set, “find
the following string.” The specification of the string of course
would allow DON’T CARES to mask out certain fields. For
example, bin sorting could be accomplished in this manner.
This capability coupled with a standard microprocessor would
provide a disk capable of doing a large fraction of the string
manipulation now done by CPU’s. Such string manipulation
capability on the word level would be useful in text processing
applications. Many of the Snobol-type instructions can be
directly implemented using the search chip as input to the
processor.
CHIP DESIGN
The multicomparator was fabricated using p-channel silicon-gate
metal-oxide-semiconductor (MOS) technology [2]. Since
the circuit was designed with applications in computer peripherals
in mind, an MOS clock rate of 2 MHz was adequate, and
MOS technology allowed a high packing density in combination
with high yield. n-MOS processing can be used with the
same design, with an approximately 2½× performance advantage.
The multicomparator utilizes a 3-phase static-dynamic shift
register cell [Fig. 3(a)]. The static feature is attained by
adding a clocked feedback transistor to a standard 2-phase
dynamic shift register cell. The clocking diagram is shown in
Fig. 3(b). In dynamic operation, φ1 and φ2 are clocked
sequentially while φ3 is kept low. φ3 serves as the clamping
clock which holds data in the shift register. During static
operation, φ1 is low (isolating each cell of the shift register)
and both φ2 and φ3 are high. φ3 gates the output of an
element back to the input, and hence during the transition
between modes of operation, φ3 must be delayed somewhat
from φ2 to avoid an undefined data state. In practice, one
may use a delayed φ2 as φ3 for both static and dynamic
operation.
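The two modes of the register can be summarized at the behavioral level. The sketch below abstracts away the individual clock edges and charge storage; the function name `step` and the `static_mode` flag are illustrative assumptions, standing in for the clock conditions described above (φ1 and φ2 clocked with φ3 low for dynamic operation; φ1 low with φ2 and φ3 high for static operation).

```python
def step(cells, serial_in, static_mode):
    """One clock period of a static-dynamic shift register (behavioral only)."""
    if static_mode:
        # phi1 low isolates the cells; phi3 feeds each output back to its
        # own input, so every cell simply keeps its bit.
        return list(cells)
    # Dynamic mode: the phi1/phi2 sequence moves every bit one cell along
    # and admits the serial input into the first cell.
    return [serial_in] + cells[:-1]


register = [0, 0, 0, 0]
for bit in (1, 0, 1):
    register = step(register, bit, static_mode=False)
print(register)                                   # [1, 0, 1, 0]

held = step(register, 1, static_mode=True)        # input ignored while locked
print(held)                                       # [1, 0, 1, 0]
```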
A unique five-device activated EXCLUSIVE-NOR gate was
implemented to perform the compare operation [Fig. 4(a)].
The cross-coupled structure was chosen over the standard
EXCLUSIVE-NOR circuit for layout simplicity. However, the
operation is similar to an EXCLUSIVE-NOR gate with the addition
of an activating transistor [Fig. 4(b)]. When the activating
transistor is on, the comparator is operational. When off,
the comparator is disabled and the compare output indicates
a match irrespective of comparator inputs. Note that the
match state is a logical ONE; hence when several EXCLUSIVE-NOR
gates are connected in parallel to construct a comparator,
only one not-compare element is necessary to pull the compare
line down to the low state.
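The logical behavior of the gated gate and of the shared compare line can be written out explicitly. The sketch below is derived from the description in the text rather than from the truth table of Fig. 4(b); the function names and the convention that a mask bit of 1 turns the activating transistor off are assumptions.

```python
def gated_xnor(data_bit, key_bit, mask_bit):
    """Return 1 for match. A mask bit of 1 (assumed) disables the gate."""
    if mask_bit:
        return 1                      # disabled: match irrespective of inputs
    return int(data_bit == key_bit)   # enabled: ordinary EXCLUSIVE-NOR


def compare_line(data, key, mask):
    """Shared line held high by a pull-up; any not-compare cell pulls it low."""
    line = 1
    for d, k, m in zip(data, key, mask):
        if gated_xnor(d, k, m) == 0:
            line = 0
    return line


# Truth table of the gated gate, columns: mask, data, key, output.
for m in (0, 1):
    for d in (0, 1):
        for k in (0, 1):
            print(m, d, k, gated_xnor(d, k, m))
```

Because a single mismatching cell is sufficient to pull the line low, the parallel connection behaves as a logical AND of the individual gate outputs.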
The maximum operating speed of the comparator is set by
the compare line capacitance and its associated pull-up device
resistance. A single not-compare bit must be able to pull
Fig. 3. Basic shift register cell. (a) Schematic. (b) Clock timing.
Fig. 4. Gated EXCLUSIVE-NOR gate. (a) Schematic. (b) Truth table.
Fig. 5. Full schematic of one bit slice of the multicomparator.
the compare line down in one clock cycle. With the comparator
shown, the compare line capacitance for one row of 32
cells is approximately 2 pF and the ON resistance of the 3
series comparator transistors is about 6 kΩ. Hence the time
constant of a low going not-compare output from one row is
approximately 12 ns. However, for a high going (compare)
signal, the time constant is set by the 60 kΩ pull-up resistor
at the end of the compare line. Thus the high going time constant
is much larger (~120 ns). For this reason, the chip was
arranged in four independent rows of 32 bits each. The four
compare lines were NANDed to avoid paralleling the line
capacitances and hence maintain the data rate achievable with the
basic 32-bit row.
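The time constants quoted above follow directly from the product RC. The short calculation below reproduces them; the last figure, for a hypothetical single 128-bit compare line, assumes that the line capacitance scales linearly with the number of cells, which is an illustrative assumption and not a measured value.

```python
def time_constant_ns(resistance_kohm, capacitance_pf):
    # kilohms times picofarads gives nanoseconds directly.
    return resistance_kohm * capacitance_pf


ROW_CAPACITANCE_PF = 2.0   # one row of 32 cells (from the text)
PULL_DOWN_KOHM = 6.0       # three series comparator transistors (from the text)
PULL_UP_KOHM = 60.0        # pull-up at the end of the compare line (from the text)

fall_ns = time_constant_ns(PULL_DOWN_KOHM, ROW_CAPACITANCE_PF)
rise_ns = time_constant_ns(PULL_UP_KOHM, ROW_CAPACITANCE_PF)

# Hypothetical: all 128 cells on one line, capacitance assumed proportional.
single_line_rise_ns = time_constant_ns(PULL_UP_KOHM, 4 * ROW_CAPACITANCE_PF)

print(fall_ns, rise_ns, single_line_rise_ns)   # 12.0 120.0 480.0
```

Under that assumption a single undivided line would have a high going time constant comparable to the 500 ns clock period at 2 MHz, which is consistent with the decision to split the array into four rows.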
The schematic of one bit of the multicomparator is shown in
Fig. 5. The circuit is composed of three static-dynamic shift
registers for data handling and an EXCLUSIVE-NOR gate to
accomplish the compare function. The unit cell was laid out
using conservative design rules (10-μ metal lines with 10-μ
Fig. 6. Photomicrograph of multicomparator chip.
Fig. 7. Oscilloscope photo of multicomparator output with pseudorandom
data input. Traces: data input and compare output. Note: Compare pulse
when data are aligned with internal stored-bit pattern.
metal-to-metal spacing). On silicon, the unit cell measures
91 × 470 μ. Photomicrographs of the 128-bit multicomparator
unit cell and chip layout are shown in Fig. 6. Each shift
register has independent clocks and separate output pads. The
3350 device multicomparator is 107 × 150 mils and has 18
bonding pads.
PERFORMANCE CHARACTERISTICS
A typical device characterization was carried out by entering
a 128-bit data word composed of a pseudorandom sequence
of ones and zeros in the key register. The comparator
was enabled by simultaneously entering zeros in the mask
register. The 128-bit data word was then cycled through the
data register. Only when alignment between the cycling data
word and its stored counterpart was achieved did a match occur
(Fig. 7). As expected, the compare pulses had a width equal
to the time increment between φ2 and φ1 and recurred every
128 clock pulses. When all ones are entered in the mask register,
the comparator is disabled and the compare output is high
(indicating a match) since there are no unmasked bits to disagree
with the data register.
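The characterization procedure can be mimicked in software. In the sketch below, the function name `match_times`, the random seed, and the list-based register model are assumptions; the mask is taken to be all zeros, so every bit is compared. The model counts clock periods only and says nothing about pulse width.

```python
import random


def match_times(word, cycles=3):
    """Clock indices at which the cycling data word aligns with the key."""
    width = len(word)
    # Serial loading leaves the last-entered bit in the first cell (index 0).
    key = list(reversed(word))
    data = [0] * width
    hits = []
    for t in range(cycles * width):
        data = [word[t % width]] + data[:-1]
        if data == key:            # all-zero mask: every bit must agree
            hits.append(t)
    return hits


rng = random.Random(1976)          # arbitrary seed, for repeatability only
word = [rng.getrandbits(1) for _ in range(128)]
hits = match_times(word)
print(hits)
print([b - a for a, b in zip(hits, hits[1:])])   # spacing between matches
```

For a word that does not coincide with any rotation of itself, the spacing between successive matches is the word length, in agreement with the observation that the compare pulse recurs once per pass of the data word.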
The multicomparator operates over a wide range of voltages
with a TTL input. Fig. 8 shows the operational range of the
multicomparator at a clock frequency of 2 MHz. The clocks had
a 25 percent duty cycle and were symmetrically spaced in time.
VCC was set at +5 V with Vφ referenced to it. The minimum
supply voltage is set by two MOS thresholds (approximately
Fig. 8. Operational range of p-channel multicomparator chip. (Horizontal axis: VCC − VDD in volts; φ1, φ2 at 25% duty cycle.)
TABLE I

Parameter                         Performance(a)
Clock rate                        0.0001-2 MHz
Dynamic supply current            25 mA
Static supply current             30 mA
Clock leakage current (φ1)        120 nA
Clock leakage current (φ2)        300 nA
Clock capacitance (φ1)            40 pF
Clock capacitance (φ2)            60 pF
Clock capacitance (φ3)            40 pF
Interlock capacitance             7 pF
Input capacitance                 10 pF
Output capacitance                10 pF

(a) Test conditions: T = 23°C, VCC = 5 V, VDD = −5 V,
VφL = +5 V, VφH = −5 V, Vinput = 0.5 V.
2 V in the p-MOS process used). The maximum clock voltage
was set by the onset of inversion under the field oxide. At
high power supply voltages, the clock voltages were approximately
one threshold less than VCC − VDD.
The multicomparator performance characteristics are summarized
in Table I.
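The supply currents in Table I can be related to the power figures quoted in the abstract. The calculation below assumes that the average power is simply the supply current multiplied by the total supply voltage VCC − VDD under the stated test conditions; the function name `power_mw` is illustrative.

```python
def power_mw(supply_current_ma, vcc_volts=5.0, vdd_volts=-5.0):
    # Milliamperes times volts gives milliwatts.
    return supply_current_ma * (vcc_volts - vdd_volts)


dynamic_mw = power_mw(25.0)   # dynamic supply current from Table I
static_mw = power_mw(30.0)    # static supply current from Table I
print(dynamic_mw, static_mw)  # 250.0 300.0
```

These values agree with the 250 mW (dynamic) and 300 mW (static) dissipation stated in the abstract.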
ACKNOWLEDGMENT
The authors are pleased to acknowledge Intel Corporation
for wafer fabrication.
REFERENCES
[1] T. C. Chen, “Distributed intelligence for user-oriented computing,”
in 1972 Fall Joint Comput. Conf., AFIPS Conf. Proc., vol. 41.
Montvale, NJ: AFIPS Press, 1972, pp. 1049-1056.
[2] F. Faggin and T. Klein, “Silicon gate technology,” Solid-State
Electron., vol. 13, pp. 1125-1144, 1970.
Carver A. Mead received the B.S., M.S., and
Ph.D. degrees from the California Institute of
Technology, Pasadena, in 1956, 1957, and
1959, respectively.
He has been a member of the faculty of the
California Institute of Technology, Pasadena,
CA, since 1957. His research has contributed
to the understanding of tunneling in solids,
current flow mechanisms in thin dielectric
films, metal-semiconductor barriers, band energies
in semiconductors, and electronic processes
in insulators. He has proposed and demonstrated the operation of a
number of new solid-state electronic devices and holds several U.S.
patents.
Dr. Mead is a Fellow of the American Physical Society and a member
of Sigma Xi.
Richard D. Pashley (M’75) was born in Ft.
Belvoir, VA, on September 15, 1947. He received
the B.A. degree in physics from the
University of Colorado, Boulder, CO, in 1969
and the M.S. and Ph.D. degrees in electrical
engineering from the California Institute of
Technology, Pasadena, CA, in 1970 and 1974,
respectively.
In 1973 he joined the staff of Intel Corporation,
Santa Clara, CA, where his work has
included MOS device modeling, process development,
and static RAM design. He is presently Manager of high
performance MOS technology development at Intel.
Dr. Pashley is a member of Phi Beta Kappa, Sigma Xi, and Bohmische
Physikalische Gesellschaft.
Lee D. Britton received the B.S.E.E. degree from
the California Institute of Technology, Pasa-
dena, in 1973.
He then invented and marketed a computer-
aided memory for theater lighting control, and
now works for Hewlett-Packard Laboratories,
Cupertino, CA, designing and testing LSI inte-
grated circuits for mini-computers.
Yoshiaki T. Daimon was born in Kyoto, Japan,
on July 4, 1948. He received the B.S., M.S.,
and Ph.D. degrees from the California Institute
of Technology, Pasadena, in 1971, 1972, and
1975, respectively.
Since 1967, he has served several research
groups in the Institute. He worked as a Data
Processor in hydraulics from 1967 to 1969, engaging
in the analysis of the pressure distribution
of solitary waves, the influence of the
geological features of a harbor upon the induced
standing-wave amplitude in the harbor, and the diffusion mechanism
of polluted objects in moving fluids. From 1969 to 1971, he
worked as an Experimentalist in the Material Science Department and
studied the switching and other electronic properties of newly developed
amorphous alloys from the low temperature of 4 K to room temperature.
From 1971 to 1975 he was a Research and Teaching Assistant
both in the Electrical Engineering and Physics Departments at the
California Institute of Technology. In the summers of 1971 and 1973, he
visited Sony Corporation, Tokyo, Japan, as a Product-Appraisal Engineer
at the Atsui plant and engaged in developments and applications
of bipolar technologies in video and power integrated circuits. He is
presently with the Sony Corporation, Tokyo, Japan. His interests lie
in the areas of digital and linear integrated circuit designs, the physics
of microelectronics, and artificial intelligence.
Stewart F. Sando, Jr. (M’73) was born in Hartford, CT, in 1949. He re-
ceived the B.S. degree in physics in 1971 and the M.S.E.E. degree in
1972 from the California Institute of Technology, Pasadena. He re-
ceived the M.B.A. degree from Stanford University, Stanford, CA, in
1974.
In 1973 he joined Siliconix, Inc., Santa Clara, CA, where he was em-
ployed as an Applications Engineer. In 1974 he joined Intel Corpora-
tion, Santa Clara, CA, where he is presently a Memory Components
Product Marketing Engineer.
