Large-Scale Field-Programmable Analog Arrays for Analog Signal Processing by Paul Hasler et al.
2298 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 52, NO. 11, NOVEMBER 2005
Large-Scale Field-Programmable Analog Arrays for
Analog Signal Processing
Tyson S. Hall, Member, IEEE, Christopher M. Twigg, Student Member, IEEE, Jordan D. Gray, Student Member, IEEE,
Paul Hasler, Senior Member, IEEE, and David V. Anderson, Senior Member, IEEE
Abstract—Field-programmable analog arrays (FPAAs) provide
a method for rapidly prototyping analog systems. Currently avail-
able commercial and academic FPAAs are typically based on op-
erational ampliﬁers (or other similar analog primitives) with only
a few computational elements per chip. While their speciﬁc ar-
chitectures vary, their small sizes and often restrictive intercon-
nect designs leave current FPAAs limited in functionality and ﬂex-
ibility. For FPAAs to enter the realm of large-scale reconﬁgurable
devices such as modern ﬁeld-programmable gate arrays (FPGAs),
new technologies must be explored to provide area-efﬁcient accu-
ratelyprogrammable analogcircuitry thatcanbeeasily integrated
into a larger digital/mixed-signal system. Recent advances in the
area of ﬂoating-gate transistors have led to a core technology that
exhibits many of these qualities, and current research promises a
digitally controllable analog technology that can be directly mated
tocommercialFPGAs.Byleveragingtheseadvances,anewgenera-
tionof FPAAsis introducedin thispaper thatwilldramatically ad-
vancethecurrent state of theart in terms of size, functionality, and
ﬂexibility. FPAAs have been fabricated using ﬂoating-gate transis-
tors as the sole programmable element, and the results of charac-
terization and system-level experiments on the most recent FPAA
are shown.
IndexTerms—Analogarrays,ﬁeld-programmableanalogarrays
(FPAAs), ﬂoating gate, reconﬁgurable.
I. LOW-POWER SIGNAL PROCESSING
T
HE GROWING demand for complex information pro-
cessing on portable devices has motivated signiﬁcant
research in the design of power efﬁcient signal processing
systems. One method for achieving low-power designs is to
move processing on system inputs from the digital processor to
analog hardware situated before the analog-to-digital converter
(ADC). However, for analog systems to be desirable to digital
signal processing engineers, they need to provide a signiﬁcant
advantage in terms of size and power and yet still remain
relatively easy to use and integrate into a larger digital system.
Reconﬁgurable analog arrays, dubbed ﬁeld-programmable
analog arrays (FPAAs), can speed the transition of systems
from digital to analog by providing the ability to rapidly imple-
ment advanced, low-power signal processing systems.
Manuscript received August 8, 2004; revised April 3, 2005. This work was
supportedin partbytheNationalScienceFoundationunderGrant0347792, and
Grant 0083172.
T. S. Hall is with the School of Computing, Southern Adventist University,
Collegedale, TN 37315-0370 USA (e-mail: tyson@southern.edu).
C. M. Twigg, J. D. Gray, P. Hasler, and D. V. Anderson are with the School of
Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta,
GA 30332-0250 USA (e-mail: ctwigg@ece.gatech.edu; jgray@ece.gatech.edu;
phasler@ece.gatech.edu; dva@ece.gatech.edu).
Digital Object Identiﬁer 10.1109/TCSI.2005.853401
Gene’s law postulates that the power consumption in digital
signal processing microprocessors, as measured in milliwatts
per million multiply-accumulate (mW/MMAC) operations, is
halved about every 18 months. These advances largely follow
Moore’s law, and they are achieved by using decreased fea-
ture size and other reﬁnements, such as intelligent clock gating.
Myriad applications only dreamed of a few years ago are pos-
siblebecauseofthesegains,andtheyhaveincreasedthedemand
for more advanced signal processing systems. Unfortunately, a
problem looms on the horizon: the power consumption of the
ADC does not follow Gene’s law and will soon dominate the
totalpowerbudgetofdigitalsystems.WhileADCresolutionhas
been increasing at roughly 1.5 bits every ﬁve years, the power
performance has remained the same, and soon, physical limits
will further slow progress.
Most current signal processing systems that generate digital
output place the ADC as close to the analog input signal as pos-
sible to take advantage of the computational ﬂexibilityavailable
in digital processors. However, the development of large-scale
FPAAs and the computer-aided design (CAD) tools needed for
theireaseofusewouldallowengineerstheoptionofperforming
some of the computations in reconﬁgurable analog hardware
prior to the ADC. This results in both a simpler ADC and a sub-
stantially reduced computational load on the digital processors
thatfollow.Furthermore,theanalogprocessorandADCmaybe
combined to form a specialized ADC tailored to the application
at hand.
FPAAs have been of interest for some time, but historically,
these devices have had very few programmable elements and
limited interconnect capabilities, making them limited in their
usefulness and versatility. Currently available commercial and
academic FPAAs are typically based on op-amp circuits with
only relatively few op-amps per chip [1]–[9]—see [10] for
a more exhaustive discussion of previous FPAA designs. By
building larger, more ﬂexible FPAAs, reconﬁgurable analog
devices will become more analogous to today’s high-density
FPGA architectures. This will enable a very useful rapid proto-
typing system for analog circuit development.
Recent advances in analog ﬂoating-gate technologies have
shown it to be a viable alternative to traditional FPAA designs
[15]. As shown in Fig. 1, analog ﬂoating-gate circuits have
shown tremendous gains in efﬁciency (a factor of as much as
10000) compared with custom digital approaches for the same
applications. A new FPAA design, dubbed the Reconﬁgurable
Analog Signal Processor (RASP) 1.5, has been fabricated that
is based on ﬂoating-gate technology (Fig. 2).
1057-7122/$20.00 © 2005 IEEEHALL et al.: LARGE-SCALE FPAAs FOR ANALOG SIGNAL PROCESSING 2299
Fig. 1. Data from [11] showing the power consumption trends in DSP
microprocessors along with data taken from a recent analog, ﬂoating-gate
integrated chip developed by the CADSP team [12]–[14].
Fig. 2. Die photo for the RASP 1.5 FPAA. It is used to characterize the
large-scale FPAA architecture explored in this paper.
This paper proceeds with a discussion of the challenges of
building reconﬁgurable analog devices in Section II. The archi-
tecture for our large-scale FPAAs is described in Section III. In
Section IV, the fabricated FPAA is discussed, and experimental
data is shown that characterizes some of the low-level compo-
nentsofthisdevice.InSectionV,theFPAAsystemperformance
is analyzedbylookingatresultsfrom experimentalsystemsthat
have been prototyped on the FPAA.
II. RAPID PROTOTYPING OF ANALOG SYSTEMS
The traditional analog IC design process can be lengthy,
lasting for over a year if multiple iterations of a design must be
fabricated. Thus, the beneﬁts of rapid prototyping for analog
circuits would be signiﬁcant in the design and testing of analog
systems. FPAAs provide a viable platform for rapid prototyping
of analog systems, and in design and function, they are the
analog equivalent to digital reconﬁgurable devices such as
programmable logic devices (PLDs) and ﬁeld-programmable
gate arrays (FPGAs).
Fig. 3. (a) Simple two-transistor current mirror illustrates the challenges
of designing reconﬁgurable ICs with ﬁne-grain building blocks. (b) Parasitic
capacitance that results from using switches to form the current mirror will
reduce the bandwidth of the system.
Traditional FPAAs resemble the early PLDs in that they are
focused on small systems such as low-order ﬁltering, ampliﬁca-
tion, and signal conditioning. However, the class of large-scale
FPAAs explored in this paper are more analogous to modern
FPGAs. These FPAAs are much larger devices with the func-
tionality needed to implement high-level system blocks such as
programmable high-order ﬁltering and fourier processing in ad-
dition to having a large number of basic analog blocks [e.g.,
operational transconductance ampliﬁers (OTAs), transistor ele-
ments, capacitors, etc.].
Rapid prototyping of analog systems is not completely
analogous to its digital counterpart. Developing robust, pro-
grammable analog circuits presents a number of challenges not
found in the digital world. In particular, the noise sensitivity
(and effects of the switch network on the results of the compu-
tation) and the design space to which programmable devices
are applicable are more critical factors in designing FPAAs.
A. Noise Sensitivity
Analog circuits tend to be more sensitive to noise than digital
designs. Because of the quantization and resulting representa-
tion of ones and zeros as discrete voltages, digital designs can
tolerate a relatively large amount of noise in the system without
changing the precision of the result. Problems arise only when
noise levels are high enough to move a signal from a logical one
to a logical zero or vice versa. In the analog domain, however,
values are represented as continuous voltages or currents. Any
noise in the system will directly affect the precision of the re-
sult. For reconﬁgurable analog systems that rely on networks of
switches to set the internal signal paths, this means that the par-
asitics of the switches in a signal’s path can affect the result and
are a critical factor in the performance of the FPAA.
Adding switches in the signal path can have several effects
including the addition of parasitic capacitance, resistance, and
transistor leakage currents to the path. Increased capacitance
and resistance on a signal line will lower the bandwidth of the
system. For example, a simple two-transistor current mirror is
shown in Fig. 3(a). The same circuit is shown in Fig. 3(b) with
switches added to the signal paths as they would need to be
if the current mirror were synthesized on the FPAA using the
MOSFET transistors in the computational analog block (CAB).
In this case, there should not be any current ﬂow between the
gate nodes, so the voltage should remain equal on the two gate
nodes even with the switches in the signal path. Other circuits2300 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 52, NO. 11, NOVEMBER 2005
that have switches in a signal path with current ﬂowing through
them will have a voltage drop across the transistor that can vary
nonlinearly. As the number of switches in a given design in-
creases, the performance and functionality will degrade signiﬁ-
cantly.
B. Design Space
Anotherdifferencebetweenreconﬁgurableanaloganddigital
devicesisthedesignspacethateachmustencompass.Function-
ality in the digital domain can be reduced to a basic set of com-
binational and sequential primitives. For example, a NAND gate
can be conﬁgured to implement any of the other Boolean logic
gates. Thus, with a sufﬁciently large number of NAND gates,
anycombinationallogicfunctioncanbeachieved.Similiarly,an
asynchronous read-only memory (ROM) primitive can be used
to implement any combinational function. For sequential func-
tions, any basic storage element (e.g., ﬂip-ﬂop or latch) can be
used to provide the necessary memory. Most modern FPGAs
use asynchronous ROMs to synthesize the combinational logic
and D-type ﬂip-ﬂops for implementing the memory/sequential
logic. Thus, by replicating these two basic primitives thousands
of times across a chip (and a sufﬁcient routing network), an
FPGA can be created that synthesizes a very large number of
different digital systems. It is tempting to think that one might
be able to do the same thing in the analog domain. However,
there has not been a sufﬁciently generic set of medium-grained
buildingblocks(onthesameorderofcomplexityasﬂip-ﬂopsor
asynchronous ROMs) proposed for synthesizing a wide-range
of analog circuits. To get the desired generality, one must use
ﬁne-grain building blocks, such as transistors, resistors, diodes,
and capacitors. Indeed, a large number of analog systems can
be built with these basic blocks; even digital systems could be
synthesizedwithsuchadevice.However,theseprimitivesareso
ﬁne-grained that it would require such a large number of com-
ponents—and thus a large number of switches—to implement
a design that the switch parasitics would signiﬁcantly degrade
the performance. For example, the circuit diagram for a basic
9-transistor OTA is shown in Fig. 4(a). In Fig. 4(b), the same
OTAisshownwiththeswitchesnecessarytosynthesizethiscir-
cuitonaﬁne-grainedFPAAwithtransistorsonly.TheFPAAde-
sign requires at least 27 switches, in addition to the nine transis-
tors, to implement the OTA. The switches will drastically affect
the performance and functionality of the OTA and may cause
the circuit to break. To mitigate these effects, coarser-grained
blocks must be used. The task then is to do so while still main-
taining sufﬁcient ﬂexibility, functionality, and generality.
Using coarse-grain blocks can be appealing given their in-
crease in performance and robustness over ﬁne-grain blocks.
However, if the basic building blocks in an FPAA are of too
high a level, then the ﬂexibility is greatly diminished. To be as
ﬂexible as possible, an FPAA needs to have a wide range of
ﬁne-grained, medium-grained, and coarse-grained components.
This means that there will often be more than one way of syn-
thesizing the same system on the FPAA. This provides the most
ﬂexibility to end-users, because they can vary the levels of per-
formance, utilization, ﬂexibility, and complexity.
Fig. 4. (a) Circuit diagram of a basic 9-transistor OTA. (b) Circuit diagram of
the same OTA with the switches needed as implemented on a ﬁne-grain FPAA
with only transistors. The addition of the 27 switches will dramatically reduce
the performance and functionality of this circuit.
III. LARGE-SCALE FPAAS
Previous FPAAs [2], [5], [6], [16] are somewhat limited due
to their small size and lack of generality. By addressing these
problems, we hope to extend the usefulness and acceptance of
FPAAs. In addition, large-scale FPAA designs must address the
complex design space that analog designs entail (including a
wide-range of linear and nonlinear functions) while keeping
switch parasitics minimized as discussed in the previous sec-
tion.
The switches used in FPAAs are very important, because
the signals are affected by any nonideal characteristics. An
ideal switch switch has zero impedance when ON and inﬁnite
impedance when OFF. For practical reasons, the switch should
also be small and easily controlled. The switches used in our
FPAA are based on ﬂoating-gate transistors and can be used to
approximate the ideal switch. Additionally, the ﬂoating-gate
switches can be programmed to states between ON and OFF,
synthesizing a ﬁnite resistance. Thus, switches can be used as a
resistive circuit element within the design [15].HALL et al.: LARGE-SCALE FPAAs FOR ANALOG SIGNAL PROCESSING 2301
Fig. 5. Layout, cross section, and circuit diagram of the ﬂoating-gate pFET in
a standard double-poly, n-well MOSIS process: The cross section corresponds
to the horizontal line slicing through the layout view. The pFET transistor is
the standard pFET transistor in the n-well process. The gate input capacitively
couples to the ﬂoating-gate by either a poly–poly capacitor, a diffused linear
capacitor, or a MOS capacitor, as seen in the circuit diagram (not explicitly
shown in the other two ﬁgures). Between V and the ﬂoating-gate is our
symbol for a tunneling junction-a capacitor with an added arrow designating
the charge ﬂow.
A. Building Compact Floating-Gate Switches
Theﬂoating-gatetransistorsusedintheseFPAAsarestandard
pFET devices whose gate terminals are not connected to signals
except through capacitors (e.g., no dc path to a ﬁxed potential)
[17]. Fig. 5 shows the layout, cross-section, and circuit symbol
for the ﬂoating-gate pFET device. Because the gate terminal is
well insulated from external signals, it can maintain a perma-
nent charge, and thus, it is an analog memory cell similar to
an EEPROM cell. With a ﬂoating gate, the current through the
pFET channel is dependent on the charge of the ﬂoating-gate
node. By using hot-electron injection to decrease the charge
on the ﬂoating-gate node and electron tunneling to increase the
charge on the ﬂoating-gate node, the current ﬂow through the
pFET channel can be accurately controlled [17], [18].
To increase the quality of a switch, the ﬂoating-gate transis-
tors are programmed to the far extremes of their range. When
switchesarebeingprogrammed OFF,currentsinthelowpicoam-
pere range must be measured. These measurements are near
the limits of standard laboratory equipment; therefore, to ex-
tend the viable programming range, current measurements are
taken at a larger drain-to-source voltage. Typically, is set
to the supply voltage, , and an increase in is achieved
by increasing . As shown in Fig. 6, measuring the currents
with V, allows the – curves to be visible to the
programming infrastructure 1 V below the point visible when
V [19].
For simplicity, the voltages on the gate capacitors of all the
switches are set to a constant potential. This means that the
voltage driving the gate capacitors will be the same for both ON
and OFF switches. To determine the appropriate gate voltage for
run mode, the relative quality of ON and OFF switches must be
balanced. From Fig. 6, it is clear that the OFF switches do not
Fig. 6. Floating-gate switches can be programmed within a wide range.
Here, examples of an ON, OFF, and midposition device are shown. During
programming, currents are measured with V =3 :3 V for large currents and
V =6 :5 V for small currents. This effectively extends the programming
range of the device.
pose a problem, since any gate voltage selected at or above 0.3
V should provide a sufﬁciently high impedance. However, the
ON switch exhibits a decrease in quality as the gate voltage is
increased to . Thus, an operating gate voltage of 0.3 V is
deemed optimal for the current programming scheme.
B. Switch as a Computational Element
When used as a switch, the ﬂoating gate should be as trans-
parent a part of the circuit as possible. However, Fig. 6 shows
that the ﬂoating-gate transistor can also be used as an in-cir-
cuit element [20]. By adjusting the charge on the ﬂoating-gate
node between the extremes used for ON and OFF, the impedance
of the switch can be varied over several orders of magnitude.
Thus, a variable nonlinear resistor can be synthesized by the
ﬂoating-gate switch.
Using the ﬂoating-gate switches as in-circuit elements allows
for a very compact architecture. The physical area needed for
the CABs is reduced greatly, because resistors, which consume
relatively large amounts of space on CMOS processes, are not
needed as separate components. Also, by reducing the number
ofindividualcircuitelements,signalroutingissimpliﬁed,while
retaining functionality.
C. Floating-Gate Transistors Within Computational Logic
Current FPAA designs rely on switches as the primary or
sole programmable element on the chip. Biases, multiplier co-
efﬁcients, resistances, and similar elements are set via off-chip
components or with low-resolution capacitor banks or current-
mirror banks. Thus, the ability to modify or program the ac-
tual analog computational logic is severely limited. By using
ﬂoating-gate transistors within the computational logic, circuit
characteristics can be directly modiﬁed.
In the FPAA explored here, ﬂoating-gate transistors are used
within theCABsto set biasvoltages for theOTAs[see Fig.9(a),
shown later], adjust the corner frequencies on the capacitively
coupled current conveyors , and set muliplier coefﬁcients2302 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 52, NO. 11, NOVEMBER 2005
Fig.7. (a)Overallblockdiagram foralarge-scaleFPAA.Theswitchinginterconnectsarefullyconnectablecrossbarnetworksbuiltusingﬂoating-gatetransistors.
(b) There are two slightly different CABs on this FPAA. The large or regular CAB contains a four-by-four matrix multiplier, three wide-range OTAs, three
ﬁxed-value capacitors, a capacitively coupled current conveyor (C ) SOS, a peak detector, and two FET transistors. The small CAB is the same except it does not
include the four-by-four matrix multiplier. This design includes large CABs at the top and bottom of each column and small CABs in-between.
in the vector-matrix multipliers. In this manner, the ﬂoating-
gate transistors allow the characteristics of the computational
elements to be programmed on chip while still maintaining a
compact CAB. Thus, by allowing both the switch networks and
the computational logic to be programmable, the ﬂexibility and
usability of these FPAAs are greatly enhanced over previous
designs.
D. Programmability
By using ﬂoating-gate devices as the only programmable
element on the chip, conﬁguring the chip is greatly simpliﬁed.
Additionally, all of the ﬂoating-gate transistors are clustered
together to aid in the programming logic and signal routing.
Decoders on the periphery of the circuit are connected to
the drain, source, and gate (through a capacitor) terminals of
the ﬂoating-gate matrix. During programming mode, these
decoders allow each ﬂoating-gate transistor to be individually
programmed using hot-electron injection [18].
Part of the previous work has been the development of a sys-
tematic method for programming arrays of ﬂoating-gate tran-
sistors [18], [20], [21]. A microprocessor-based board has been
built to interface a PC to these analog ﬂoating-gate arrays for
thepurposesofprogrammingand testing.WithaPCcontrolling
the programming of these devices, the details of using hot-elec-
tron injection and tunneling to program individual ﬂoating-gate
switches have been abstracted away from the end-user. The pro-
gramming algorithms have been optimized for accuracy and
speed, while giving the end-user an easy-to-use interface for
conﬁguring arrays of ﬂoating-gate devices.
E. Switch Networks
As shown in Fig. 7(a), the routing architecture of our large-
scale FPAAs is a combination of global and local switch net-
works. Each CAB has an associated local switch network for
making connections within a single CAB. The switches’ source
lines are routed along the rows and connect the inputs and out-
puts of each CAB to the switch network. The drain lines of the
switches are connected along the columns. By turning a switch
on, a single row (source) can be connected through the switch
to a single column (drain).
The size of the switch network is dependent on the number
of I/O lines in each CAB. For the design shown in Fig. 7, there
are two types of CABs. For the larger CABs, the local switch
networks are comprised of a 10 42 matrix of switches, and
for the smaller CABs, the local switch network is a 10 32
matrix of switches. Each local switch network is integrated into
a matching global routing switch network. The global routing
switchnetworkallowslocalsignalsfromaCABtobeconnected
to the global routing busses and be routed off the chip or to
another CAB. There are also 10 8 switch networks at each
junction of the horizontal and vertical global routing busses.
The performance of circuit designs implemented on the
FPAA are dependent on the routing patterns used and the
number of switches needed in a given path. Frequency analysis
of ﬂoating-gate circuits and the effects of switch networks has
been discussed in [22] and placement and routing algorithms
for large-scale FPAAs have been discussed in [32].
F. Computational Analog Blocks
The computational logic is organized in a compact CAB pro-
viding a naturally scalable architecture. CABs are tiled across
the chip in a regular mesh-type architecture with busses and
local interconnects in-between as shown in Fig. 7(a).
Many example CABs can be imagined using this technology.
Fig. 7(b) shows one example CAB, whose functionality is
enhanced by a mixture of ﬁne-grained, medium-grained, and
coarse-grained computational blocks similar to many modernHALL et al.: LARGE-SCALE FPAAs FOR ANALOG SIGNAL PROCESSING 2303
FPGA designs. The computational blocks were carefully se-
lected to provide a sufﬁciently ﬂexible, generic architecture
while optimizing certain frequently used signal processing
blocks. For generality, three OTAs are included in each CAB.
OTAs have already been shown to be effective at implementing
a large class of systems including ampliﬁcation, integration,
ﬁltering, multiplication, exponentiation, modulation, and other
linear and nonlinear functions [23]–[26]. In addition, the two
FET devices provide the ability to perform logarithmic and
exponential functions as well as convert back and forth between
current and voltage. The three capacitors are ﬁxed in value to
minimize the size of the CAB and are primarily used on the
outputs of the OTAs; however, they will be available for any
purpose. The variable capacitor and/or current mirror banks
found in some designs are not needed here, because the use of
ﬂoating-gate transistors in the OTAs will give the user sufﬁcient
control in programming the transconductance of the ampliﬁers
[24], [27]. Eliminating the capacitor banks creates a large
savings in the area required for each CAB.
The high-level computational blocks used in this design are a
second-order section (SOS) bandpass ﬁlter module comprised
of two capacitively coupled current conveyors and the
4 4 vector-matrix multiplier block. In general, the SOS
module provides a straightforward method of subbanding an in-
coming signal. This allows Fourier analysis analagous to per-
forming a fast Fourier transform (FFT) in the digital domain.
The vector-matrix multiplier block allows the user to perform a
matrix transformation on the incoming signals. Together these
blocks can be used like a Fourier processor [18], [28]. In addi-
tion, a peak detector is added to each CAB. The peak detector
allows the amplitude to be extracted from the incoming wave-
form and is useful for doing static or dynamic gain adjustments
on individual subbands of the incoming signal.
ThearchitectureillustratedinFig.7(a)isnonhomogeneousin
thattherearetwodifferentCABstiledacrossthechip.Thesmall
CAB is identical to the large CAB except it does not include the
vector-matrix multiplier module. Since the vector-matrix multi-
plier takes four inputs and each input will often be derived in a
separate CAB (from a separate subband created by the SOS
module), designs will typically only utilize one vector-matrix
multiplier for every four CABs. Thus, FPAAs that have 50–100
CABs can be made more compact by removing the vector-ma-
trix multiplier from all the CABs except those on the top and
bottom rows (assuming the FPAA is more or less square in de-
sign). Alternatively, thevector-matrixmultipliers insome of the
CABs could be replaced with other specialized circuits to in-
crease functionality and performance for a targeted application.
IV. RASP 1.5 FPAA
Several FPAAs have been fabricated in 0.5- m, standard
CMOS process to characterize the switches, computational
logic, and programming infrastructure. The ﬁrst chip fabricated
was the RASP, and it is discussed in [15], [19], [27]. A second
version of RASP, dubbed RASP 1.5, has been fabricated with
a number of small circuit and architectural improvements over
the previous chip. Experimental results that characterize the
switch network and demonstrate system-level functionality
Fig. 8. Switch resistances for a ﬂoating-gate pFET, standard pFET, and a
standard two-transistor T-gate . The ﬂoating-gate switch has been programmed
to an extremely on position such that the high-impedance region (at 1.5 V for
the standard pFET shown) has effectively been shifted below the power rail.
This results in a relatively ﬂat resistance similar to the larger T-gate.
are discussed in the next two sections. RASP 1.5 contains two
CABs with a ﬂoating-gate crossbar switch network connecting
them. Both CABs are identical to the large CAB illustrated in
Fig. 7(b).
As discussed earlier, the resistance and capacitance of the
ﬂoating-gate switch are important characteristics. The ON
switch resistance is plotted in Fig. 8. For reference, this ﬁgure
also shows the resistance of a standard pFET (with an SRAM
memory bit setting the gate) and transmission gate (T-gate)
(both an nFET and a pFET passing the signal). When pro-
grammed to a point that is not extremely on, the ﬂoating-gate
switch exhibits a resistance that is very similar to the standard
pFET shown here (as seen in [27]. However, by injecting the
ﬂoating-gate switch further, the voltage on the isolated gate
node is pushed lower and thus the resistance curve shifts to
the left. This ﬁgure shows that by programming the switch far
enough, the resistance through the switch can maintain a more
consistent level through the operating range (power rails) of
the switch. This allows a single ﬂoating-gate pFET to exhibit
a resistive characteristic that is similar to the resistance of a
standard T-gate with two transistors. As shown, the resistance
of the ﬂoating-gate switches is approximately 10 k , which is
about what is expected for relatively small pFETs.
The OFF resistance is harder to measure given the limitations
of standard test equipment. Even at a V, current
through the OFF switches is below the measureable range of
standard picoammeters. Given this, the OFF resistance should
be in the gigaohm range and in the worst case hundreds of
megaohms. Likewise, the parasitic capacitance of the switches
isdifﬁculttomeasurewhentheyareembeddedintheswitchnet-
work and accessible only through the programming infrastruc-
ture. A theoritical estimate based on the layout and fabrication
parametersyieldsavalueof1fFforeachswitchoneachcolumn
and row. Thus, for the RASP 1.5, each column is estimated to
contribute 96 fF of parasitic capacitance and 46 fF for each row.2304 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 52, NO. 11, NOVEMBER 2005
Fig. 9. (a) Source-follower conﬁgured using a ﬂoating-gate current source.
By programming the ﬂoating gate charge, the current is set in the current mirror
(the other half of the current mirror is internal to the wide-range OTA). Thus,
the effective conductance can be modiﬁed for each of the OTAs on chip. (b)
Using the switch matrix, an OTA located in one of the CABs is connected in a
source-follower conﬁguration, and two external pins are routed to the OTA as
the input and output signals. The programmable biases illustrated in (a) are not
shown here for simplicity, but each OTA has a current mirror and ﬂoating-gate
current source that sets its bias.
V. SYSTEM RESULTS
A number of different analog systems have been synthesized
on the RASP 1.5. These systems vary from simple one and two
element systems to more complex systems with as many as
seven on-chip components. These systems also use a range of
different CAB components including ﬁne-grained (transistors
and capacitors), medium-grained (OTAs), and coarse-grained
( SOSandpeakdetector).Ineachoftheseexamples,ﬂoating-
gate transistors are used as current sources to set biases. De-
pending on the circuit, these programmable biases are shown to
control ﬁlter corner frequencies, Q-peaks, and time constants.
A. Low-Order Filtering With OTAs
Aﬁrst-orderﬁltercanbeimplementedintheRASP1.5FPAA
usingan OTA inoneofitsCABs. Fig.9showshowthecircuitis
mapped onto the FPAA using ﬁve ﬂoating-gate switches. Once
the switch network is conﬁgured, the biasing ﬂoating-gate tran-
sistor is programmed to vary the corner frequency of this ﬁrst-
order ﬁlter. The frequency response is shown for several pro-
grammedcornerfrequenciesinFig.10(a).TheplotinFig.10(b)
shows the correlation between programmed bias current and
measured corner frequency. By ﬁtting a curve to this data, the
ability to predict the necessary bias current for a desired corner
frequency can be achieved. This is important, because the user
will typically want to specify the system parameters in terms
of corner frequency, Q-peak, time constants, offsets, etc. and
then let the programming interface make the translation to the
appropriate bias currents to generate these paramete these cur-
rents while programming the ﬂoating-gate transistors. Experi-
mental results from Fig. 6 show a measurement threshold of 1
pA using present measurement techniques. An important con-
sideration here is the relative sizing of the transistors that set the
bias rs.
In Fig. 11, a SOS ﬁlter is shown along side the FPAA imple-
mentation. The ﬁxed-value capacitors and OTAs from a single
CAB are used to synthesize this circuit. Using the ﬂoating-gate
Fig. 10. (a) Frequency response of the source-follower circuit for several
bias currents. An internal ﬂoating-gate transistor is used as a current source
to set the OTAs bias. Injecting the ﬂoating-gate device increases the current
and thus the bandwidth of this ﬁrst order ﬁlter. (b) Plot shows the correlation
between bias current and programmed corner frequency. This data can be used
in future experiments to select the appropriate bias current for the desired
corner frequency.
programmble biases, the two OTAs in a source–follower con-
ﬁguration were biased to the same level and the the third OTAs
bias current was increased to adjust the Q-peak of the system.
The frequency response for this circuit is shown in Fig. 11(c).
As expected, the Q-peak increases as the third bias current (e.g.,
conductance) increases.
For second-order functions such as the SOS and diff2 cir-
cuit, reasonable Q-peaks and ﬁlter bandwidths require small
bias currents (in the picoampere to femtoampere range). While
the ﬂoating-gate transistors can set bias currents this low, the
constraint becomes the ability to accurately measure these cur-
rents while programming the ﬂoating-gate transistors. Experi-
mental results from Fig. 6 show a measurement threshold of 1
pA using present measurement techniques. An important con-
sideration here is the relative sizing of the transistors that set the
biascurrents.Theﬂoating-gatetransistorshowninFig.9(a)setsHALL et al.: LARGE-SCALE FPAAs FOR ANALOG SIGNAL PROCESSING 2305
Fig.11. (a)SOSﬁltercanbeimplementedwithtwoOTAsinasource-follower
conﬁgurationandathirdOTAthatcreatespostivefeedback.(b)Usingtheswitch
matrix, two OTAs within the CABs are connected in a SOS conﬁguration. The
programmable biases shown in Fig. 9(a) are not included here for simplicity,
but each OTA has a current mirror and ﬂoating-gate current source that sets its
bias. (c) The experimental frequency response of a Access is not allowed to one
of your selected collections: Rates SOS ﬁlter is shown here. The Q parameter is
adjusted by increasing the bias current of the positive feedback ampliﬁer via a
ﬂoating-gate current source.
the current through the nMOS current mirror (the other half of
the current mirror is internal to the OTA module). To set small
biascurrents,itispreferrabletohavethenFETandﬂoating-gate
transistor sized larger than the current mirror nFET internal to
the OTA. In this conﬁguration, the current mirror functions as
a current divider, and thus, very low bias currents can be set by
programming the ﬂoating-gate transistor to generate currents in
the picoampere range.
The OTAs on the RASP 1.5 FPAA are standard nine-tran-
sistor wide-range OTAs with and a bias tran-
sistor W/L of 8.1/1.8. Although the OTAs used in this example
are fairly generic, the FPAA fabric is designed to be highly ﬂex-
ible, and the CABs can be enhanced with any OTA ﬂavor de-
sired. Thus, the overall architecture can remain ﬁxed, while the
speciﬁc components and their respective performance charac-
teristics (signal-to-noise ratio, dynamic range, distortion, input
linear range, etc.) can be modiﬁed for different target markets.
Additionally, the use of ﬂoating-gate transistors to set the bias
Fig. 12. (a) Canonical prototype of a third-order Butterworth
double-resistance terminated LC ﬁlter. (b) The Gm–C implementation of the
same ﬁlter. This form of the ﬁlter can be realized on the RASP 1.5 FPAA. (c)
The experimental frequency response of a third-order Gm–C ﬁlter is shown
here. The corner frequency is adjusted by programming the bias currents of
the four OTAs.
currents allows a large degree of freedom in adjusting the cir-
cuitcharacteristcs. Floating-gatetransistors similar tothose that
set the bias current in the OTAs have been shown to be pro-
grammable over at least three and half orders of magnitude [29]
andmorerecenteffortsareyieldingprogrammabilityoverseven
orders of magnitude [30].
B. Third-Order – Ladder Filter
The availability of OTAs and grounded capacitors makes the
RASPidealforimplementing – ﬁlters.Oneway torealize
a particular ﬁlter is by modeling it with resistors, inductors, and
capacitors, and then synthesize the design using – ﬁlters.
In this example, a third-order Butterworth ﬁlter is implemented.
The canonical prototype of the ﬁlter, a double-resistance ter-
minated LC ﬁlter, is shown in Fig. 12(a). By using the signal
simulation method outlined in [31], the – ﬁlter shown in
Fig. 12(b) is generated. In order to maintain a maximally ﬂat re-
sponse, the following must hold: . Accordingly,
the bias current of OTA-3 was set to half of the other OTA bias2306 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 52, NO. 11, NOVEMBER 2005
Fig. 13. The C SOS block is comprised of two C circuits with a buffer in between. By spreading the corner frequencies of one of the C circuits to be far
apart, the frequency response of the other C can be measured. This method is used here to generate the frequency response plots for the ﬁrst (a) and the second
(b) C circuits. Then, the frequency response for the SOS is generated by programming both C to the same corner frequencies as shown in (c).
currents. A range of bias currents was used to create the fre-
quency response shown in Fig. 12(c). As expected, the corner
frequency of the ﬁlter is proportional to the bias currents of the
OTAs. The lower corners were obtained by using a bias current
intherangeofhundredsofpico-amps,whilethehighestcorners
required currents of up to 1 A.
C. Coarse-Grain CAB Components
As mentioned earlier, the CABs on this FPAA have several
special-purpose components that have been designed to opti-
mize speciﬁc functions. In particular, these CABs include pro-
grammable peak detectors and programmable bandpass ﬁlter
modules ( SOS circuits).
There is a wide range of systems that can be implemented
and conﬁgured on FPAAs with many of these CABs on them.
In particular, differentiators, cascaded SOSs, bandpass ﬁlters,
matrix transforms (including DCTs and wavelet transforms),
and frequency decomposition are all well suited for this archi-
tecture. In the audio arena alone, designs could be prototyped
to implement forms of noise suppression, audio enhancement,
feature extraction, auditory modeling, and simple audio array
processing. Other potential interest areas include communica-
tions signal conditioning (modulation, mixing, etc.), transform
coding, and neural networks (with external training). Many of
these systems rely on efﬁcient subband processing; therefore,
each CAB has been designed with a SOS bandpass ﬁlter
module to optimize this operation.
The SOS module is comprised of two modules cas-
caded with a buffer in-between them. Either module can be
usedalonebyspreadingapartthecornerfrequenciesoftheother
module. To characterize this module, frequency response plots
of each of the individual modules are shown in Fig. 13(a)
and (b). The bandwidth and Q-peak of the modules are quite
different. This is due to the difference in output capacitance of
each module. The output of the ﬁrst is tied to the input of
a buffer, which results in a relatively small capacitance. How-
ever,theoutputof thesecond is tiedintotheswitch network.
Therefore, the output load capacitance for this device will be
much higher due to the parasitics of the switches and the capac-
itance of the next circuit in the path. In these experiments, the
next stage was a relatively large buffer in an output pad.
Fig. 14. Circuit diagram for a typical subband system. The incoming signal
is bandpass ﬁltered and then the magnitude of the subband is output from the
peak detector. This is analagous to taking a discrete Fourier transform. (b)
Experiment data from the FPAA for the system in part (a). The input waveform
is an amplitude modulated signal with 1.8 kHz and 10.0 kHz components. The
output of the peak detector is shown with and without an integrating capacitor
added to the output stage.
Whenbothofthecascaded aresettothesamecornerfre-
quencies, the output of the module shows the desired second-
order roll-off as shown in Fig. 13(c). In all of these plots, the
corner frequencies are shown to be programmable over a wide
range of frequencies. The bias current to corner frequency cor-
relation is different for each of the cascaded devices. However,
all of the bias currents for these plots were within the range of
25 pA to 200 nA.
The coarse-grain components are most useful when they can
be combined to form a larger system. In Fig. 14(a), a circuit is
shownthatusesa SOSblock,anOTA,andapeakdetectorin
series. This conﬁguration is very powerful when it is replicated
64, 128, or more times on the FPAA with the center frequen-
cies of the bandpass ﬁlters varying over the desired frequency
range. The outputs of the different subbands are analagous toHALL et al.: LARGE-SCALE FPAAs FOR ANALOG SIGNAL PROCESSING 2307
the magnitudes of the discrete Fourier transform. As a test of
this system, data was taken from RASP 1.5 for a single sub-
band. As shown in Fig. 14(b), the input is an amplitude-modu-
lated signal with 1.8 and 10.0 kHz frequency components. The
SOS module is biased to have a center frequency near 1.8
kHz, and the OTA is conﬁgured to be a noninverting buffer. The
output of the system is shown in Fig. 14(b). Also, the output of
the system is shown after an external 2.2 capacitor has been
added at the output of the FPAA. This change has the effect of
smoothing (i.e., low-pass ﬁltering) the output, thus creating a
longer effective time constant for the system.
VI. CONCLUSION
Large-scale FPAAs based on ﬂoating-gate technologies pro-
vide the necessary levels of programmability and functionality
to implement complex signal processing systems. The ﬂoating-
gate transistors are shown to provide a compact switch that ex-
hibits relatively ﬂat resistance characteristics across the full op-
erating voltage and can be programmed to be an active circuit
element (variable resistor). FPAAs based on ﬂoating-gate cir-
cuits have been built and characterized, and system-level re-
sults have been shown. Systems implemented on these FPAAs
are demonstrated to be programmable over a wide range of fre-
quencies, Q-peaks, bandwidths, and/or time constants. With or-
ders of magnitude power consumption savings over traditional
digital approaches, this reconﬁgurable analog technology offers
an attractive alternative for implementing advanced signal pro-
cessing systems in low-power embedded systems.
REFERENCES
[1] M. A. Sivilotti, “Wiring Considerations in Analog VLSI Systems,
With Application to Field-Programmable Networks (VLSI),” Ph.D.,
California Institute of Technology, Pasadena, CA, 1991.
[2] K. Lee and P. Gulak, “A transconductor-based ﬁeld-programmable
analog arry,” in Dig. Tech. Papers IEEE Int. Solid-Sate Conf., Feb.
1995, pp. 198–199.
[3] ,“ACMOSﬁeld-programmableanalogarray,”inDig.Tech.Papers
IEEE Int. Solid-Sate Conf., Feb. 1991, pp. 186–188.
[4] S. Chang, B. Hayes-Gill, and C. Paul, “Multi-function block for a
switched current ﬁeld programmable analog array,” in Proc. Midwest
Symp. Circuits and Systems, vol. 1, Aug. 1996, pp. 158–161.
[5] D. Anderson, C. Marcjan, D. Bersch, H. Anderson, P. Hu, O. Palusinski,
D. Gettman, I. Macbeth, and A. Bratt, “A ﬁeld programmable analog
array and its application,” in Proc. IEEE Custom Integrated Circuits
Conf., May 1997, pp. 555–558.
[6] “ispPACOverview,”LatticeSemiconductorCorporation,Hillsboro,OR,
1999.
[7] X. Quan, S. Embabi, and E. Sanchez-Sinencio, “A current-mode based
ﬁeld programmable analog array architecture for signal processing
applications,” in Proc. IEEE Custom Integrated Circuits Conf., Santa
Clara, CA, May 1998, pp. 277–280.
[8] “Totally re-conﬁgurable analog circuit-TRAC,” Fast Analog Solutions,
Ltd, Oldham, U.K., 1999.
[9] C. A. Looby and C. Lyden, “A CMOS continuous-time ﬁeld pro-
grammable analog array,” in Proc. 5th Int. ACM Symp. Field-Pro-
grammable Gate Arrays, 1997, pp. 137–141.
[10] T. S. Hall, C. M. Twigg, P. Hasler, and D. V. Anderson, “Developing
large-scaleﬁeld-programmableanalogarraysforrapidprototyping,”Int.
J. Embed. Syst., 2004.
[11] G. Franz, “Digital signal processor trends,” IEEE Micro, vol. 20, no. 6,
pp. 52–59, Nov.–Dec. 2000.
[12] R.Ellis,H.Yoo,D.Graham,P.Hasler,andD.Anderson,“Acontinuous-
time speech enhancement front-end for microphone inputs,” in Proc.
IEEE Int. Symp. Circuits and Systems, vol. 2, Phoenix, AZ, May 2002,
pp. II.728–II.731.
[13] P. Hasler, P. Smith, R. Ellis, D. Graham, and D. V. Anderson, “Biolog-
ically inspired auditory sensing system interfaces on a chip,” in Proc.
IEEE Sensors Conf., Orlando, FL, Jun. 2002, invited Paper.
[14] P. D. Smith, M. Kucic, R. Ellis, P. Hasler, and D. V. Anderson, “Mel-
frequency cepstrum encoding in analog ﬂoating-gate circuitry,” in Proc.
Int. Symp. Circuits and Systems, vol. 4, 2002, pp. 671–674.
[15] T. S. Hall, P. Hasler, and D. V. Anderson, “Field-programmable analog
arrays: A ﬂoating-gate approach,” in Proc. 12th Int. Conf. Field Pro-
grammable Logic and Applications, Montpellier, France, Sep. 2002, pp.
424–433.
[16] H. W. Klein, “The EPAC architecture: An expert cell approach to ﬁeld
programmable analog circuits,” in Proc. IEEE Midwest Symp. Circuits
and Systems, vol. 1, Aug. 1996, pp. 169–172.
[17] P. Hasler, B. A. Minch, and C. Diorio, “Adaptive circuits using pFET
ﬂoating-gate devices,” in Proc. 20th Anniversary Conference on Adv-
naced Research in VLSI, Atlanta, GA, Mar. 1999, pp. 215–229.
[18] M. Kucic, A. Low, P. Hasler, and J. Neff, “A programmable contin-
uous-time ﬂoating-gate Fourier processor,” IEEE Trans. Circuits Syst.
II, Analog Digit. Signal Process., vol. 48, no. 1, pp. 90–99, Jan. 2001.
[19] T. S. Hall, C. M. Twigg, P. Hasler, and D. V. Anderson, “Developing
large-scale ﬁeld-programmable analog arrays,” in Proc. 18th Int. Par-
allel and Distributed Processing Symp., Santa Fe, NM, Apr. 2004.
[20] M. Kucic,P. Hasler,J.Dugger,andD.V.Anderson,“Programmableand
adaptive analog ﬁlters using arrays of ﬂoating-gate circuits,” in Proc.
Conf. Advanced Research in VLSI, Mar. 2001, pp. 148–162.
[21] P. Smith, M. Kucic, and P. Hasler, “Accurate programming of analog
ﬂoating-gatearrays,”inProc.IEEEInt.Symp.CircuitsandSystems,vol.
5, Phoenix, AZ, May 2002, pp. V.489–V.492.
[22] J.D.Gray,C.M.Twigg,D.N.Abramson,andP.Hasler,“Characteristics
and programming of ﬂoating-gate pFET switches in an FPAA crossbar
network,” in Proc. IEEE Int. Symp. Circuits and Systems, vol. 1, May
2005, pp. 468–471.
[23] B. Ray, P. P. Chaudhuri, and P. K. Nandi, “Design of OTA based ﬁeld
programmable analog array,” in Proc. 13th Int. Conf. VLSI Design, Jan.
2000, pp. 494–498.
[24] B. Pankiewicz, M. Wojcikowski, S. Szczepanski, and Y. Sun, “A ﬁeld
programmable analog array for cmos continuous-time ota-c ﬁlter appli-
cations,” IEEE J. Solid-State Circuits, vol. 37, pp. 125–136, Feb. 2002.
[25] E. Sanchez-Sinencio, J. Ramirez-Angulo, B. Linares-Barranco, and A.
Rodriguez-Vazquez, “OTA-based nonlinear function approximations,”
Proc. IEEE Int. Symp. Circuits and Systems, vol. 1, pp. 96–99, May
1989.
[26] (1999) Totally re-conﬁgurable analog circuit-TRAC. Fast Analog Solu-
tionsLtd.,Manchester,U.K..[Online].Available:http://www.zetex.com
[27] T. S. Hall, C. M. Twigg, P. Hasler, and D. V. Anderson, “Application
performance of elements in a ﬂoating-gate FPAA,” in Proc. 2004 IEEE
Int. Symp. Circuits and Systems, May 2004, pp. II.589–II.592.
[28] P. Hasler, B. A. Minch, and C. Diorio, “An autozeroing ﬂoating-gate
ampliﬁer,” IEEE Trans. Circuits Syst. II, Analog Digit. Signal Process.,
vol. 48, no. 1, pp. 74–82, Jan. 2001.
[29] A. Bandyopadhyay, G. J. Serrano, and P. Hasler, “Programming analog
computational memory elements to 0.2% accuracy over 3.5 decades
using a predictive method,” in Proc. IEEE Int. Symp. Circuits and Sys-
tems, May 2005, pp. 2148–2151.
[30] P. D. Smith and P. Hasler, “A kappa projection algorithm (KPA) for pro-
gramming to femtoampere currents in standard CMOS ﬂoating-gate el-
ements,” in Proc. Midwest Symp. Circuits and Systems, Cincinnati, OH,
2005.
[31] Y. E. Sun, Design of High Frequency Integrated Analogue Fil-
ters. London, U.K.: The Institution of Electrical Engineers, 2002.
[32] F.Baskaya,S.Reddy,S.K.Lim,T.Hall,andD.Anderson,“Mappingal-
gorithm for large-scale ﬁeld programmable analog array,” in Proc. 2005
ACM Int. Symp. Physical Design, San Fransisco, CA, Apr. 2005.
Tyson S. Hall, photograph and biography not available at the time of publica-
tion.
Christopher M. Twigg, photograph and biography not available at the time of
publication.
Jordan D. Gray, photograph and biography not available at the time of publi-
cation.
DavidV. Anderson, photograph and biography not availableat the time of pub-
lication.