ACKNOWLEDGEMENT........................... by The Pennsylvania State University CiteSeerX Archives
FIELD{PROGRAMMABLE ANALOG ARRAYS:
A FLOATING{GATE APPROACH
A Dissertation
Presented to
The Academic Faculty
by
Tyson S. Hall
In Partial Fulllment
of the Requirements for the Degree
Doctor of Philosophy
School of Electrical and Computer Engineering
Georgia Institute of Technology
July 12, 2004TABLE OF CONTENTS
LIST OF TABLES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv
LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
SUMMARY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
ACKNOWLEDGEMENT . . . . . . . . . . . . . . . . . . . . . . . . . . . viii
CHAPTER 1 RAPID PROTOTYPING OF ANALOG SYSTEMS . 1
1.1 Low{Power Signal Processing . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Cooperative Analog/Digital Signal Processing . . . . . . . . . . . . . 4
CHAPTER 2 HISTORY AND MOTIVATION . . . . . . . . . . . . . . 9
2.1 Background on FPAAs . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.2 Discrete{time FPAAs . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.3 Continuous{time FPAAs . . . . . . . . . . . . . . . . . . . . . . . . 12
2.3.1 Computational Granularity and Capability . . . . . . . . . . 13
2.3.2 Interconnect Structure . . . . . . . . . . . . . . . . . . . . . 16
2.3.3 Complimentary Research . . . . . . . . . . . . . . . . . . . . 18
2.3.4 Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.3.5 Application Focus . . . . . . . . . . . . . . . . . . . . . . . . 21
CHAPTER 3 BUILDING A LARGE{SCALE FPAA . . . . . . . . . . 22
3.1 Noise Sensitivity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
3.2 Switch Networks and Interconnect Design . . . . . . . . . . . . . . . 25
3.3 Design Space and Computational Analog Block Design . . . . . . . . 28
3.4 Floating{gate Technology in Programmable Analog Circuits . . . . . 32
3.4.1 Floating{gate Switches . . . . . . . . . . . . . . . . . . . . . 34
3.4.2 Switch as Computational Element . . . . . . . . . . . . . . . 35
3.4.3 Programmability . . . . . . . . . . . . . . . . . . . . . . . . . 36
3.5 Selecting a Switch . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
CHAPTER 4 RASP 1.0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
4.1 Computational Analog Blocks . . . . . . . . . . . . . . . . . . . . . 42
4.2 Analog Circuit Components . . . . . . . . . . . . . . . . . . . . . . . 43
4.2.1 Basic Analog Elements . . . . . . . . . . . . . . . . . . . . . 44
4.2.2 Matrix{Vector Multiplication . . . . . . . . . . . . . . . . . . 45
4.2.3 Filtering and Fourier Processing . . . . . . . . . . . . . . . . 47
4.3 Switch Matrix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
4.3.1 Floating{gate Switches . . . . . . . . . . . . . . . . . . . . . 49
4.4 Circuit Characterization . . . . . . . . . . . . . . . . . . . . . . . . . 51
4.5 System Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
ii4.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
CHAPTER 5 RASP 1.5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
5.1 RASP 1.5 Architecture . . . . . . . . . . . . . . . . . . . . . . . . . 60
5.1.1 RASP 1.5 vs. RASP 1 . . . . . . . . . . . . . . . . . . . . . 60
5.1.2 Switch Networks . . . . . . . . . . . . . . . . . . . . . . . . . 63
5.2 Computational Analog Blocks . . . . . . . . . . . . . . . . . . . . . 64
5.3 Testbed FPAA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
5.4 System Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
5.4.1 Low{Order Filtering with OTAs . . . . . . . . . . . . . . . . 69
5.4.2 Third{Order Gm-C Ladder Filter . . . . . . . . . . . . . . . 73
5.4.3 Coarse{Grain CAB Components . . . . . . . . . . . . . . . . 74
CHAPTER 6 MIXED{SIGNAL PROTOTYPING PLATFORM . . . 80
6.1 Development System . . . . . . . . . . . . . . . . . . . . . . . . . . 80
6.1.1 FPGA Development Board . . . . . . . . . . . . . . . . . . . 81
6.1.2 FPAA Daughter Card . . . . . . . . . . . . . . . . . . . . . . 82
6.2 Programming Software . . . . . . . . . . . . . . . . . . . . . . . . . 83
6.2.1 Graphical Circuit Specication . . . . . . . . . . . . . . . . . 83
6.2.2 High{level Matlab Code . . . . . . . . . . . . . . . . . . . . . 87
6.2.3 Low{level C Code . . . . . . . . . . . . . . . . . . . . . . . . 87
CHAPTER 7 FPAAS: A ROADMAP TO THE FUTURE . . . . . . . 89
7.1 Next{generation RASP . . . . . . . . . . . . . . . . . . . . . . . . . 89
7.1.1 Switch Networks . . . . . . . . . . . . . . . . . . . . . . . . . 90
7.1.2 Computational Analog Blocks . . . . . . . . . . . . . . . . . 91
7.2 Future FPAAs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
7.3 Testbed Applications for Large{Scale FPAAs . . . . . . . . . . . . . 97
7.3.1 Auditory Feature Extraction . . . . . . . . . . . . . . . . . . 97
7.3.2 Neuromorphic Modeling . . . . . . . . . . . . . . . . . . . . . 100
7.4 Future Research . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
7.4.1 Enabling Technologies . . . . . . . . . . . . . . . . . . . . . . 103
7.4.2 Computational Elements . . . . . . . . . . . . . . . . . . . . 104
7.4.3 CAD Tool Flow for Analog Systems . . . . . . . . . . . . . . 104
7.5 Commercialization . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
7.6 Original Contributions . . . . . . . . . . . . . . . . . . . . . . . . . 106
7.7 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
REFERENCES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
iiiLIST OF TABLES
Table 1 Summary of Signal Processing Functionality . . . . . . . . . . . . . 5
Table 2 Summary of FPAA Granularity . . . . . . . . . . . . . . . . . . . . 11
ivLIST OF FIGURES
Figure 1 Analog design cycle: Traditional vs. FPAA{enabled . . . . . . . . . 2
Figure 2 FPAA to FPGA analogy . . . . . . . . . . . . . . . . . . . . . . . . 3
Figure 3 Gene's law . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
Figure 4 Moving the analog/digital boundary . . . . . . . . . . . . . . . . . 6
Figure 5 Switched{capacitor designs . . . . . . . . . . . . . . . . . . . . . . 10
Figure 6 Summary of interconnect designs . . . . . . . . . . . . . . . . . . . 15
Figure 7 Current mirror circuit diagrams: with and without switches . . . . 23
Figure 8 Inverter circuit diagrams: with and without switches . . . . . . . . 24
Figure 9 Eects of switches in an FPAA{implemented digital inverter . . . . 25
Figure 10 Switch network designs: Full crossbar . . . . . . . . . . . . . . . . . 26
Figure 11 Switch network designs: Full binary{tree . . . . . . . . . . . . . . . 27
Figure 12 Operational transconductance amplier circuit diagram: with and
without switches . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
Figure 13 Layout, cross section, and circuit diagram of a oating{gate pFET 32
Figure 14 Traditional programmability achieved switchable arrays of elements 33
Figure 15 Drain curves for a single oating{gate transistor programmed over
a range of currents. . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
Figure 16 Array isolation technique used during programming . . . . . . . . . 36
Figure 17 Resistance plot for a simple pFET switch . . . . . . . . . . . . . . 37
Figure 18 Resistance for a T{gate switch . . . . . . . . . . . . . . . . . . . . 38
Figure 19 Resistance for a oating{gate switch . . . . . . . . . . . . . . . . . 40
Figure 20 Die photo of the RASP 1.0 FPAA . . . . . . . . . . . . . . . . . . 42
Figure 21 Block diagram of a generic FPAA chip . . . . . . . . . . . . . . . . 43
Figure 22 Computational Analog Block (CAB) components in the RASP 1.0
FPAA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
Figure 23 Current summing operation . . . . . . . . . . . . . . . . . . . . . . 45
Figure 24 Multiplication operation . . . . . . . . . . . . . . . . . . . . . . . . 45
Figure 25 Vector{matrix multiplication operation . . . . . . . . . . . . . . . . 46
Figure 26 C4: RASP 1.0 bandpass lter circuit . . . . . . . . . . . . . . . . . 47
Figure 27 Programming infrastructure . . . . . . . . . . . . . . . . . . . . . . 48
Figure 28 Floating{gate switch programmed across range from o to on . . . 50
Figure 29 Switch characteristics of oating{gate transistors on RASP 1.0 . . . 52
Figure 30 Floating{gate transistor programming accuracy . . . . . . . . . . . 53
Figure 31 Integrator circuit diagrams . . . . . . . . . . . . . . . . . . . . . . 54
Figure 32 Frequency response of the integrator circuit as implemented on RASP 1.0 55
Figure 33 Second{order section circuit diagram . . . . . . . . . . . . . . . . . 57
Figure 34 Simulated frequency response of the second{order section as imple-
mented on RASP 1.0 . . . . . . . . . . . . . . . . . . . . . . . . . . 58
Figure 35 Frequency response of the second{order section circuits as imple-
mented on RASP 1.0 . . . . . . . . . . . . . . . . . . . . . . . . . . 58
Figure 36 Simulation of frequency decomposition as implemented on RASP 1.0 59
vFigure 37 Top{level VLSI layout of the RASP 1.5 FPAA . . . . . . . . . . . . 61
Figure 38 Die photo of the RASP 1.5 FPAA . . . . . . . . . . . . . . . . . . 62
Figure 39 Routing diagram of RASP 1.5 . . . . . . . . . . . . . . . . . . . . . 63
Figure 40 CAB components on RASP 1.5 . . . . . . . . . . . . . . . . . . . . 65
Figure 41 Block diagram of the C4 second{order section on RASP 1.5 . . . . 66
Figure 42 C4 circuit used in the bandpass lter module on RASP 1.5 . . . . . 67
Figure 43 Switch resistance on RASP 1.5 as compared to standard pFET and
T{gate switches . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
Figure 44 Integrator circuit diagrams . . . . . . . . . . . . . . . . . . . . . . 69
Figure 45 Frequency response of the integrator circuit as implemented on RASP 1.5 70
Figure 46 Correlation of integrator's bias current with its programmed corner
frequency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
Figure 47 Second{order section circuit diagrams . . . . . . . . . . . . . . . . 72
Figure 48 Frequency response of the second{order section circuit as imple-
mented on RASP 1.5 . . . . . . . . . . . . . . . . . . . . . . . . . . 73
Figure 49 Third{order ladder lter circuit diagrams . . . . . . . . . . . . . . 74
Figure 50 Frequency response of the third{order ladder lter circuit . . . . . 75
Figure 51 Output of the peak detector component for dierent time constants 76
Figure 52 Frequency response of the C4 second{order section circuit as imple-
mented on RASP 1.5 . . . . . . . . . . . . . . . . . . . . . . . . . . 77
Figure 53 Circuit diagram of a typical subband system . . . . . . . . . . . . . 78
Figure 54 Experimental waveforms from the subband system . . . . . . . . . 79
Figure 55 Picture of the mixed{signal prototyping platform . . . . . . . . . . 81
Figure 56 Block diagram of the mixed{signal prototyping system . . . . . . . 82
Figure 57 Screen capture of the graphical conguration software . . . . . . . 84
Figure 58 Screen capture of the graphical conguration software with a second{
order section circuit being entered . . . . . . . . . . . . . . . . . . 86
Figure 59 Sample netlist output from the graphical FPAA conguration software 87
Figure 60 Top{level block diagram of the RASP 2.5 FPAA . . . . . . . . . . 90
Figure 61 Block diagram of the routing and CAB architectures of RASP 2.5 . 91
Figure 62 Top{level VLSI layout of RASP 2.5 . . . . . . . . . . . . . . . . . . 92
Figure 63 VLSI layout of the small CAB on RASP 2.5 . . . . . . . . . . . . . 93
Figure 64 VLSI layout of the full CAB on RASP 2.5 . . . . . . . . . . . . . . 94
Figure 65 Routing architecture of FPAAs with hundreds of CABs . . . . . . . 95
Figure 66 megaCAB architecture . . . . . . . . . . . . . . . . . . . . . . . . . 96
Figure 67 Feature extraction system diagram . . . . . . . . . . . . . . . . . . 98
Figure 68 First{order dierence equation implementation . . . . . . . . . . . 98
Figure 69 FPAA synthesis of the feature extraction algorithm . . . . . . . . . 99
Figure 70 Generalized block diagrams for typical audio processing synthesis . 100
Figure 71 Block diagrams for example integrate{and{re neuron models . . . 101
Figure 72 Block diagrams for example cochlea models . . . . . . . . . . . . . 102
Figure 73 Ideal computer{aided design (CAD) tool ow for large{scale FPAAs 105
viSUMMARY
Field{programmable analog arrays (FPAAs) provide a method for rapidly pro-
totyping analog systems. Currently available commercial and academic FPAAs are
typically based on operational ampliers (or other similar analog primitives) with only
a few computational elements per chip. While their specic architectures vary, their
small sizes and often restrictive interconnect designs leave current FPAAs limited
in functionality, exibility, and usefulness. Recent advances in the area of oating{
gate transistors have led to an analog technology that is very small, accurately pro-
grammable, and extremely low in power consumption. By leveraging the advantages
of oating{gate devices, a large{scale FPAA is designed that dramatically advances
the current state of the art in terms of size, functionality, and exibility. A large{
scale FPAA is used as part of a mixed{signal prototyping platform to demonstrate
the viability and benets of cooperative analog/digital signal processing. This work
serves as a roadmap for future FPAA research. While current FPAAs can be com-
pared with the small, relatively limited, digital, programmable logic devices (PLDs)
of the 1970s and 1980s, the oating{gate FPAAs introduced here are the rst step
in enabling FPAAs to support large{scale, full{system prototyping of analog designs
similar to modern FPGAs.
viiACKNOWLEDGEMENT
Research is rarely pursued in a vacuum. I am grateful to each and every friend
for their support, encouragement, assistance, and inuence. My advisor, Dr. David
Anderson, has been there every step of the way. I am most thankful for his thoughtful
guidance, understanding, and friendship. My co{advisor, Dr. Paul Hasler, has been
a continual source of knowledge and encouragement. If it were not for Paul, I would
still be ignorant of oating{gate transistors and even FPAAs.
I count it a privilege to know each one of the many graduate students in the
Cooperative Analog/Digital Signal Processing (CADSP) group. In particular, I rec-
ognize David Graham and Paul Smith for their assistance during my early days of
VLSI layout. I am also grateful for the talent and skills that Chris Twigg, Jordan
Gray, and Dave Abramson have brought to the FPAA research team. We have spent
many long hours together working on layout, testing FPAA chips, and discussing all
things FPAA. I also appreciate the time and eort that Sourabh Ravindran has spent
working on analog implementations of his feature extraction algorithms. Sourabh is
one of my longest and best friends in the CADSP group.
If I did not have such a great team of editors, reading this document would have
been much more tedious. In particular, Julie Hall (cousin), Vicki Hall (mother),
Burton Hall (father), Barry Hall (brother), and Dr. Roger Hall (uncle) all graciously
read rough drafts of this thesis and/or gave editorial assistance despite their varying
levels of interest in the topic.
I thank my friends and family who have kept me in their thoughts and prayers
during my tenure at Georgia Tech. To the Creator of all that is and was and is to
come, I am most grateful. As my knowledge increases, my awe of His handiwork
abounds.
viiiCHAPTER 1
RAPID PROTOTYPING OF ANALOG SYSTEMS
The process of designing, fabricating, and testing an analog chip requires certain
expertise and is often long and expensive. As shown in Fig. 1, the process is not
unlike designing digital ASICs (application{specic integrated circuits), except that
there are fewer tools and libraries available to the designer. The traditional analog
design cycle often requires several iterations of the fabrication process, which with
the simulation, VLSI layout, and testing phases can easily consume a year or more for
typical IC designs. However, the use of a recongurable analog chip, dubbed a eld{
programmable analog array (FPAA), would dramatically reduce the design cycle by
removing the fabrication stage from the iterative process. Thus, many designs may
be tested and modied within a single day.
Like eld{programmable gate arrays (FPGAs), FPAAs are not optimal for all
solutions. They are, however, very useful for many situations, and a solution can be
found for many problems not requiring full functionality. Relative to custom{designed
analog circuits, a design implemented on an FPAA results in higher parasitics as well
as increased die area for a given design; therefore, the design always possesses some
ineciencies (i.e., lower bandwidth and higher consumed power). On the other hand,
since analog circuit design is often time{consuming, these adverse characteristics are
well balanced by markedly decreased time to market.
FPAAs have been of interest for some time, but historically, these devices have had
very few programmable elements and limited interconnect capabilities, making them
limited in their usefulness and versatility. The next{generation FPAA needs to correct
these problems in order to extend the usefulness and acceptance of FPAAs. As shown
in Fig. 2, traditional FPAAs resemble the early PLDs in that they are focused on small
systems such as low{order ltering, amplication, and signal conditioning. However,
1Concept Simulation Testing VLSI Layout Fabrication
Concept Simulation VLSI Layout Fabrication Testing
x 20
(3 months)
x 3
FPAA−based Rapid Prototyping Design Cycle:
Traditional Analog Design Cycle:
Figure 1. This gure illustrates the advantages of designing analog ICs using an FPAA{based rapid
prototyping technology as opposed to the traditional design cycle of VLSI layout and fabrication.
The traditional analog design cycle often requires 3 or more iterations of the fabrication process
which extends the development process to over a year. With an FPAA{based system, designs can
be synthesized, tested, and modied 20 or more times within a matter of days instead of years.
the class of large{scale FPAAs that we are exploring here are more analogous to
modern FPGAs. These FPAAs are much larger devices with the functionality needed
to implement high{level system blocks such as programmable high{order ltering,
fourier processing, and signal analysis in addition to having a large number of ne{
and medium{grain, programmable analog blocks (e.g., operational transconductance
ampliers (OTAs), transistor elements, capacitors, etc.).
1.1 Low{Power Signal Processing
The future of FPAAs lies in their ability to speed the implementation of advanced,
low{power signal processing systems. In this thesis, an FPAA architecture is pre-
sented for achieving exible, large{scale FPAAs targeted at mainstream signal pro-
cessing systems. These FPAAs are intended to impact analog signal processing in
two ways: rst, they perform the function of all rapid prototyping devices in reduc-
ing development time. Second, they are a platform for implementing advanced signal
processing functions|usually realized only in digital systems|with analog circuits.
2Ctl
Reg
M
e
m
M
e
m
Ctl
P
C
A
L
U
4
CPU Datapath (Hard or Soft Core)
0 0 0
0 0 1
1 0 0
0 1
1 0 0
1
0
1
0 0
1
Memory
z−1 z−1 z−1
B
2
B
3
B
4
B
1
DSP / Filter Blocks
PLD FPGA
+
− +
−
B
1
B
2
B
3
out I
in V
Sub−band Processing
3 Out
W W W W
V3
33 32 31 30
−
V3
+
Out0
W W W W
V
00 01 02 03
−
0
V0
+
+
− +
−
+
− +
−
+
− +
−
+
− +
−
+
− +
−
FPAA
Programmable
Filtering
Vector−Matrix Multiplication
Large−scale FPAA
(a) (b)
Figure 2. (a) Digital PLDs can be used to implement small, carefully dened pieces of a complex
system, while FPGAs can be used to implement entire systems including processor datapaths, com-
plex DSP functions, and more. Modern FPGAs can be 100 - 10,000 times larger and more complex
than the PLDs of the 1970s and 1980s. (b) Analagously, traditional FPAAs resemble the early
PLDs in that they are focused on small systems such as low{order ltering, amplication, and signal
conditioning. However, the FPAAs based on oating{gate devices presented here are much larger
devices with the functionality needed to implement high{level system blocks such as programmable
high{order ltering and fourier processing in addition to having a large number of programmable
op{amp and transistor elements.
The primary benet of implementing analog signal processing systems is the po-
tential for large savings in power consumption. For digital signal processing (DSP)
microprocessors, Gene's law postulates that the power consumption, as measured in
mW/MIPS, is halved about every 18 months, as shown in Fig. 3 [27]. These advances
largely follow Moore's law, and they are achieved by using decreased feature size, in-
telligent clock gating, and other renements. Unfortunately, a problem looms on the
horizon; the power consumption of the analog{to{digital (A/D) converter does not
follow Gene's law and will soon dominate the total power budget of digital systems.
While A/D{converter resolution has been increasing at roughly 1.5 bits every ve
years, the power performance has remained the same, and soon, physical limits will
further slow progress.
For analog systems to be desirable to the largely DSP{oriented commmunity,
they not only need to have a signicant advantage in terms of size and power, but
31980 1990 2000 2010 2020 2030
Year
Programmable
Analog Power
Savings
>20 Year Leap
in Technology
0.1mW
10mW
100W
1W
Gene's Law
DSP Power
CADSP Power
Gene's Law
DSP Power
CADSP Power
10nW
1mW
Figure 3. Data from [27] showing the power consumption trends in DSP microprocessors along with
data taken from a recent analog, oating{gate integrated chip developed by the CADSP team [21,
45, 76].
they must also be relatively easy to use and easily integrated into a larger digi-
tal system. In addition, they must be shown to be accurately programmable and
eective at implementing many of the key systems found within DSP. As shown
in Table 1, the functionality desired for any technology focused on signal process-
ing includes monolithic lters, linear and nonlinear scalar functions, vector{matrix
operations (i.e., transforms, distance metrics, winner-take-all, principle component
analysis, etc.), linear{phase lters, adaptation, and tap delay lines for FIR systems.
1.2 Cooperative Analog/Digital Signal Processing
This thesis is part of a larger eort within the Cooperative Analog/Digital Signal
Processing (CADSP) group at the Georgia Institute of Technology. The goal of the
CADSP group is to investigate the partitioning of signal processing systems between
4Trad. Large{scale
Functionality DSP P Analog FPAA
Programmable  { 
Monolithic Filters   
Linear Scalar Operations   
Nonlinear Scalar Operations   
Vector{Matrix Operations   
Linear{phase Filters  { 
Adaptivity  { 
Tap Delay Lines   
Key:
{ = No or very limited support
 = Possible
 = Ecient, well{suited to technology
Table 1. Summary of Signal Processing Functionality
the analog and digital domains. Most current signal processing systems that gen-
erate digital output place the ADC as close to the analog input signal as possible
to take advantage of the computational exibility available in digital processors (see
Fig. 4). However, the development of large{scale FPAAs|and the CAD tools needed
for their ease of use|would allow engineers the option of performing some of the
computations in recongurable analog hardware prior to the analog{to{digital (A/D)
converter, resulting in both a simpler A/D converter and a substantially reduced
computational load on the digital processors that follow. By leveraging the power
eciencies mentioned in the previous section, some analog signal processing systems
have been shown to achieve as much as ve orders of magnitude over typical DSP mi-
croprocessor implementations [5, 21, 76]. As illustrated in Fig. 3, this corresponds to
a 20 year leap forward on the power curve predicted by Gene's law [37]. Additionally,
the output of such an analog system can be higher{level information, such as Fourier
coecients or phonemes of speech. This information can potentially be converted
into the digital domain with a much lower resolution and/or conversion speed than
would be needed in the traditional system where a literal sampling of the incoming
5￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
A/D
Converter
DSP
Processor
DSP
Processor
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
A/D
Specialized A/D
ASP
IC
Real−World
(analog)
Computer
(digital)
Figure 4. In most traditional digital signal processing systems, the analog{to{digital converter
(ADC) is placed as close to the real{world, analog inputs as possible. However, signicant power
savings can be achieved by moving some of the signal processing functionality into the analog domain
(in front of the ADC). Conceptually, this analog signal processing can be combined with the ADC
to form a \specialized ADC" as shown here.
signal is needed [44]. Thus, a simpler and much smaller A/D converter can be used.
As an example of the advantages of moving the A/D boundary, one can consider a
software radio system. In the traditional implementation, the analog signals coming
from multiple (e.g., four to eight) antennas must each be converted to the digital
domain. The signals coming from the antennas are still modulated by the carrier
frequency, and thus, multiple high{speed A/D converters must be used. In a CADSP
system, however, the demodulation and beam{forming functions can be implemented
in oating{gate analog circuitry [40, 44, 49, 62]. The output from the analog domain
is then a single, baseband signal that can be converted using a much slower A/D
converter.
To realize the power savings of analog systems in advanced signal processing
systems, the CADSP group focuses on the use of oating{gate devices as the en-
abling technology. Recent advances in oating{gate circuit technology have yielded
promising results in the implementation of complex analog signal processing systems.
6Floating{gate circuits have already been demonstrated in the areas of noise suppres-
sion, speech processing, image processing, and adaptive systems [6, 19, 38, 22, 45, 76,
80]. Work has also progressed on creating fast, accurate methods of programming
large arrays of oating{gate transistors [75]. This is an important factor in determin-
ing the viability of using oating{gate devices in large, complex systems, particularly
in mass{production environments.
The CADSP group brings together researchers from three areas of electrical en-
gineering: analog circuit design, digital signal processing, and computer/systems en-
gineering. Within the group, the analog circuit designers focus on the development,
characterization, and testing of low{level oating{gate circuits as the core technology
of CADSP systems. The digital signal processing researchers focus on the algorith-
mic translation of digital signal processing systems into the analog domain. They are
particularly interested in capitalizing on the large{scale system parallelization that
is possible within the analog domain, as well as the real{time, continuous{ow na-
ture of analog systems and its aect on traditional algorithms. Finally, the computer
engineering researchers focus their attention on the system{level development of the
hardware systems and computer{aided design (CAD) tool ows necessary to realize
practical CADSP systems.
This thesis falls within the computer engineering sub-area of the CADSP group.
As a practical matter, the investigation of a recongurable system for rapidly pro-
totyping analog and digital systems is extremely important in moving the CADSP
eort forward. To date, there are no large{scale FPAAs available that will allow
designers to test complex analog and mixed{signal systems. Furthermore, given the
programmable nature of oating{gate devices, the development of FPAAs based on
this core technology is a natural and important step toward the realization of a widely
accessible method of creating analog and mixed{signal designs. FPAAs will provide
a platform for exploring cooperative analog/digital signal processing systems that
7optimally balance the low{power, real{time computational nature of analog circuits
with the exibility and robustness of digital systems.
8CHAPTER 2
HISTORY AND MOTIVATION
Recongurable hardware has long been of interest to circuit designers and engineers.
In the digital domain, programmable logic devices (PLDs) have made a large impact
on the development of custom digital chips by enabling a designer to try custom
designs on easily recongurable hardware. Since their conception in the late 1960s
and early 1970s, PLDs have evolved into high{density eld{programmable gate arrays
(FPGAs) [9, 13, 79]. Modern FPGAs are widely used in the laboratory for rapidly
prototyping digital hardware, as well as in production goods to decrease time-to-
market and to allow products to be easily upgraded after being deployed.
In the analog domain, however, progress has been much slower. While early ana-
log integrated circuits (ICs) were often tunable with adjustable biases, truly recon-
gurable analog circuitry in the form of eld{programmable analog arrays (FPAAs)
did not emerge until the late 1980s [35, 74], and commercial oerings did not reach
the market until 1996 [60].
2.1 Background on FPAAs
FPAAs can be broadly classied into two categories: continuous{time devices and
discrete{time devices [33]. There are academic and commercial examples of both
categories, as well as advantages unique to each design methodology. This section will
focus more on continuous{time FPAAs, because the large{scale FPAA architecture
developed herein is a continuous{time FPAA; however, previous work on discrete{
time FPAAs will be summarized for comparison sake. Previous FPAAs have varied
greatly in terms of computational granularity and capability, interconnect structure,
performance, and application focus.
Fundamentally, FPAAs include two functions: routing and computation. (The
9V in V out
S1 S2 S4
C 2C 4C
Sn
n 2 C
Figure 5. Most discrete{time FPAAs use switched{capacitor designs. As the switch momentarily
rests on the Vin terminal, the input voltage charges the capacitor. Then the switch moves to the Vout
terminal where it is discharged through the next phase of circuitry. The programmability within
switched{capacitor designs is usually achieved using an array of appropriately sized capacitors.
Thus, the eective capacitance at each switch can be varied by setting the n digital memory cells
controlling switches S1 to Sn. In essense, this capacitor bank is equivalent to having a Digital{to{
Analog Converter (DAC) being included in each computational block.
next section will show that some designs combine these two functions into a single
unit; however, both elements are still present whether explicitly or implicitly.) The
routing elements are typically networks of switches connected together by signal lines
with the network architecture and switch types varying dramatically across dierent
FPAAs. The switch networks then connect to the computational elements of the
system. If there is more than one type of computational element, the computational
elements are usually grouped together to form a computational analog block (CAB)
that is analagous to the computational logic blocks found on FPGAs. If there is only
one type of computational element, the element and the CAB are one and the same;
however, for convenience, the computational elements will still be referred to as CABs
here.
2.2 Discrete{time FPAAs
Discrete{time FPAAs are typically switched{capacitor designs. For these circuits, the
incoming voltage is sampled by opening and closing a switch that connects the input to
an initial capacitor (Fig. 5). The switch and capacitor form a type of analog register,
and the system's signal path is partitioned by these registers. The basic computational
10Typical
Computational
Elements Advantages Disadvantages
Fine Transistors Small
Simple CAB design
Generic building blocks
Large # of switches
Large parasitics
Medium Op-amp
OTA
Current conveyor
Semi-generic building blocks
Moderate CAB design
Large variety of CAB /
interconnect designs
Limited size
Severe functionality /
performance trade{o
Coarse Fourier Processor
\Expert cell"
Higher performance
Easier user interface
Limited exibility
Limited functionality
Table 2. Summary of FPAA Granularity: The granularity of the computational logic used in an
FPAA impacts the size, performance, exibility, and functionality of the device.
elements are usually operational ampliers and analog registers, which synthesize a
linear resistor whose value is determined by the switching rate and capacitor value.
The synthesis of linear variable resistors gives switched{capacitor FPAAs greater
exibility than traditional continuous{time FPAAs; however, they can also be harder
to design prociently, because the switches and capacitors can introduce noise and
nonlinearities into the system that must be overcome [60]. In addition, these designs
have a limited bandwidth based on the sampling rate, are complicated by the need
for continuous{time anti{aliasing and reconstruction lters at the input and output,
and can be large if programmable capacitor arrays are included [35, 60].
In the late 1990s, several switched{capacitor FPAAs were introduced by both aca-
demic and commercial entities. In the academic arena, basic computational elements
vary from the simple operational amplier [20, 48] to the more complex blocks, such
as a lossless integrator and lossy integrator connected in a loop [52]. These devices
also can have programmable capacitor and/or programmable resistor arrays, which
add programmability [20, 35]. In the commercial arena, Motorola was one of the
rst companies to bring a general{purpose FPAA to market with their MPAA020
and MPA1000 series [4, 10, 60]. Since then, a spin{o company named Anadigm has
11marketed these switched{capacitor FPAAs [2]. The newest Anadigm devices have
CABs with two dierential operational ampliers, programmable capacitor banks, a
successive approximation register (for implementing an A/D converter), and a high{
speed comparator [3]. However, even the latest devices are relatively limited with
only four CABs per chip and are targeted at basic signal conditioning and ltering
applications.
Switched{capacitor designs are not the only discrete{time FPAAs. Switched{
current circuits can also be used to build an FPAA. The advantages of this technique
include not requiring operational ampliers, capability of fabrication on standard
digital CMOS processes, and elimination of distortion on the signals due to para-
sitic resistances. To their detriment, these designs can produce less accuracy than
switched{capacitor circuits, and since the signals are all currents, a given output stage
can drive only one input stage [12].
2.3 Continuous{time FPAAs
Continuous{time FPAAs typically use an array of xed components (often opera-
tional ampliers and/or transistors) that are interconnected by a switching matrix.
The switches are usually controlled by digital registers, which can be loaded by an ex-
ternal controller, thus allowing the FPAA to be congured to implement a number of
dierent designs. This type of FPAA is advantageous because potential sampling arti-
facts are avoided; anti{aliasing lters are not needed; common, relatively easy design
processes can be used (e.g., standard CMOS processes); and large signal bandwidths
can be supported with predictable performance [60]. However, the switching net-
works introduce parasitic impedances into the signal path that limit the bandwidth
and add noise to the system. Some of the literature has focused on minimizing the
number of switches in the signal path, but this can severely limit the exibility of the
FPAA [23, 54, 57, 66].
122.3.1 Computational Granularity and Capability
The granularity of the computational logic that forms the basis of the FPAA's de-
sign is an important design characteristic. As summarized in Table 2, the nest{
grain architectures typically use transistors as the core computational cell. While
these designs oer the most exibility and generality, synthesizing a suciently com-
plex system requires a large number of transistors to be wired together. Thus, a
large number of switches are introduced into the signal path. The switch parasitics
and nite resistance increases the noise within the system and limits the perfor-
mance/bandwidth [23, 47, 66]. Fine{grain FPAAs have been primarily relegated to
research in evolvable hardware [46, 71, 77], where the lowest{level building blocks are
desirable for generating unique designs using non-traditional design methodologies.
Systems that are designed using genetic algorithms are not as negatively aected by
the parasitics and non-ideal resistances of switches, since these parameters are taken
into account and even exploited throughout the evolutionary design process.
On the coarse{grain extreme, one nds FPAAs such as IMP's EPAC devices,
which contain an \expert cell" as the core computational block [47]. In the IMP50E10
device, this cell is a very high{level block with limited interconnects that is aimed
directly at signal conditioning applications. The logic within the cell can be cong-
ured to function as an amplier with an optional low{pass lter or as a comparator
with optional hysteresis. There is also a dedicated D/A converter for dening the
reference point for the comparator. These coarse{grain designs sacrice exibility
and generality in favor of increased, more predictable performance [47].
The majority of FPAAs fall in{between these two extremes. A number of FPAAs
use an operational transconductance amplier (OTA) as the basic computational
element [25, 63, 64, 66, 69, 70]. OTAs work well as the core computational cell,
because their transconductance can be programmed either by varying the analog bias
voltage or by changing the gain of the output current mirrors [1, 64]. In addition, it has
13been shown that OTAs can implement a wide range of linear and nonlinear circuits.
Several FPAA designs have focused on synthesizing linear circuits and use OTAs to
implement amplication, integration, and ltering functions [63, 64, 66]. Ray et al.
have proposed a more generalized scheme in which linear circuits are synthesized using
an OTA{based lossless integrator and an OTA{based lossy integrator as the basic
functional blocks. They also use an OTA{based multiplier and OTA{based integrator
as the basic functional block for synthesizing nonlinear circuits such as amplitude and
frequency modulation [69]. Sanchez{Sinencio et al. have used OTAs to implement
nonlinear functions such as multiplication, division, square root, exponentiation, and
piece{wise linear operations [70].
Similiarly, several FPAA designs have been proposed using a current{conveyor
structure as the basic building block. Current conveyors are similar to OTAs; how-
ever, when used as an amplier they exhibit a constant bandwidth that is independent
of gain. They also do not require compensation circuitry to insure stability, and thus,
they can operate at higher frequencies [34]. Gaudet et al. have proposed a current{
conveyor based FPAA in which each CAB contains a second{generation current{
conveyor, two programmable capacitors, and two programmable resistors (transcon-
ductors) [34]. This CAB is shown to implement amplication and rst{order ltering
functions, as well as log and anti{log functions with the addition of switchable diodes.
Premont et al. also describe an FPAA based on current{conveyors [67]. The core cell
includes tuneable resistors and a current{conveyor. It has been demonstrated that
this cell can be congured to implement a tuneable capacitor, and thus is suitable for
amplication and ltering functions.
Other medium{grain computational blocks have been used in FPAAs as well.
Pierzchala et al. used an OTA{based amplier/integrator cell that does not require
switches in the signal path [66]. Quan et al. proposed a current{mode FPAA that
uses a cascode current{mode integrator as the basic building block [68]. This core cell
14CAB
CAB
CAB
CAB CAB
CAB CAB
CAB
(a) (b)
CAB
CAB
CAB
CAB
CAB
CAB CAB
CAB
CAB
CAB CAB
CAB
Sw
Sw
Sw
S
w
S
w
S
w
S
w
(c) (d)
Figure 6. A number of dierent interconnect schemes have been used in FPAAs including (a) local
connections, (b) global connections, (c) cross{bar networks, and (d) fat{tree interconnects.
can implement amplication, integration, and attenuation with a minimum number
of switches in the signal path [23]. In a more specialized chip design for prototyping
machine learning algorithms, Bridges et al. used a mix of components including tran-
sistors, a dierential pair, a current mirror, a bias current source, and congurable
capacitors [11]. They used these blocks to synthesize single learning primitives such
as correlational learning, regression learning, and clustering.
152.3.2 Interconnect Structure
Aside from CAB components, a number of dierent interconnect structures have also
been proposed for FPAAs (see Fig. 6). The most common interconnect switch is a
MOS transistor controlled by a digital memory [4, 12, 34, 54, 64, 68]. Unfortunately,
switch resistance in these devices can range from 1000 to 5000 
 making them a
limiting factor in designing large, complex FPAAs [20]. Lee and Gulak used this type
of pass{transistor switch in [54, 57]; however, the parasitic eects of these switches
greatly limited the performance and capability of their FPAA [58]. As a result, they
replaced the pass transistors with four{transistor transconductors, which increased
the performance and functionality of their FPAA [55]. The new design reduced the
parasitic eects, increased the linearity, and increased the noise immunity. In addi-
tion, a transconductor switch exhibits a linear resistance, thus each switch can also
be used as a variable resistor by driving the gate voltage with a multi-valued memory
(or another internal or external signal). However, the large transistors needed for
low{frequency operation and the addition of a multi-valued memory for each switch
greatly increases the area required for the interconnects [58].
Other switch designs have been proposed as well. Premont et al. used a current
conveyor as the switching element [67]. This was particularly novel, because they used
the current conveyor for both the switching element and the active computational
element. In an eort to provide a radiation{tolerant FPAA for space applications,
Edwards et al. proposed the use of metal{to{metal antifuses for the switches [20].
The antifuse design they used also has the benet of a relatively low resistance (in
the 15 to 25 
 range).
In addition to using dierent switches, interconnect schemes have also varied in
overall architecture. In the Premont et al. FPAA previously mentioned [67], the use
of a current conveyor for both the switching element and active computational ele-
ment results in CABs and switching interconnects that are indistinguishable from one
16another. The overall architecture becomes a homogeneous mesh of logic with a min-
imum number of switches introduced into the signal path. Various other approaches
have also been tried in an eort to minimize the number of switches. Quan et al.
proposed the use of local interconnects. In their architecture, each CAB can be con-
nected to its eight neighbors and itself [23]. This would seem to be a severe limitation
on the exibility of this FPAA; however, they focus on the large number of ana-
log circuits with mostly local interconnections [68]. A similar idea was proposed by
Becker et al. [7]. They used digitally tunable transconductors as the computational
element. In this design, CABs were arranged in a hexagonal pattern; each CAB con-
sists of seven programmable transconductors|six for connecting to each neighboring
CAB and one for local feedback. This FPAA is designed for implementing Gm{C
lters (with the required capacitance being provided by the parasitic impedance of
the transconductors). Again, the lack of dedicated routing hardware is a limiting fac-
tor in this design, but if the target applications are constrained to be Gm{C lters,
it is satisfactory. Pierzchala et al. tried an even more limiting architecture in which
no electronic switches were included in the signal paths [66]. While these designs
may provide benets in bandwidth and signal{to{noise ratio (SNR), they lack the
exibility and generality needed in a truly general{purpose FPAA.
In another design, Pierzchala et al. introduced an interconnect scheme with both
local and global signal paths [65]. This conguration provided local routing paths for
a cell's four neighbors (north, south, east and west), as well as connections to global
busses that run horizontally, vertically, and diagonally. This two{tiered hierarchy in-
creases the routing exibility within the FPAA. An even more exible interconnection
network is the crossbar switch [69]. The crossbar switch oers a nonblocking, fully
connectable architecture; however, for a large number of inputs and outputs its size
can be too big (O(N2) growth rate) [54]. Lee and Gulak tried to solve this problem
by using an area{universal fat{tree network [57]. They used a heirarchical fat{tree
17network of small crossbar switches where the CABs were connected as the leaves of
the tree. In an additional eort to minimize the size required by the switch networks,
the number of connections was constrained [54]. Unfortunately, their prototype was
too small to fully test this interconnect concept.
2.3.3 Complimentary Research
There are several areas of research progressing in elds complimentary to FPAAs. On
the hardware side, there are dierent attempts at combining programmable digital
and analog hardware. Chow et al. have proposed a single{chip eld{programmable
mixed{analog{digital array (FPMA) for prototyping mixed{signal systems [14]. This
eort combines an array of standard digital FPGA cells with a programmable array
of operational ampliers, capacitors, resistors, and diodes [15]. The recongurable
analog hardware is limited in nature and is not the primary emphasis in this case.
Instead, the focus of this work is on congurable A/D and D/A converters that reside
between the blocks. Lee has also done some work in this area; however, his focus is
also on the design of recongurable A/D and D/A converters and not the design of
complex analog arrays [56].
Faura took a slightly dierent approach to the FPMA concept by combining digital
FPGA logic, programmable analog logic, and a digital microcontroller on the same
integrated circuit (IC) [26]. Once again, the recongurable analog hardware was not
very complex. In this case, the analog circuitry was used to perform limited signal
conditioning on the incoming signals, and as such, it was designed using a very large{
grained architecture that limited the degree of programmability.
Dudek and Hicks took yet another approach to reprogrammable hardware with
their analog microprocessor [16]. Their system was modeled after a standard digital
microcontroller with digital memory and digital fetch and decode control logic; how-
ever, the actual datapath was analog. They used a sampled{data approach similar
to switched{capacitor systems to temporarily store values in analog registers between
18instructions [17]. While this research may not be directly relevant to FPAAs now,
it does hold promise for the development of future FPAAs that contain advanced
intellectual property (IP) modules, such as analog microprocessors similar to modern
digital FPGAs.
On the software side, several dierent eorts are focused on various aspects of
the CAD tool ow necessary for the rapid prototyping of analog and mixed{signal
systems. Within the simulation and synthesis processes, it is important to have an
accurate and ecient method of modeling the behavior of the generated system.
Various models have been suggested including those by Enright and Mack in [24] and
Long in [59]. Additionally, work has been done in the synthesis of analog circuits from
user input in the form of an analog high{level description language (AHDL). Several
AHDLs exist including Cadence's SpectreHDL. In one eort, Binns et al. introduced
a high{level, top{down methodology for designing analog systems that are described
in SpectreHDL [8]. However, with the creation of an IEEE standard AHDL that
has been dubbed VHDL{AMS (VHSIC hardware description language { analog and
mixed{signal), proprietary AHDLs will probably fade in popularity. Thus, the eorts
of Ganesan and Vemuri in building a VHDL{AMS synthesis tool [29, 30] seem more
relevant to the tool ow of future FPAAs.
Ganesan and Vemuri are developing an automated CAD tool ow for analog cir-
cuits that they call the VHDL{AMS Synthesis Environment (VASE) [29]. VASE is
built on their previous work on a performance{oriented router for FPAAs [28]. The
router was designed for and tested using Motorola's MPAA020 FPAAs (now sold by
Anadigm [2]). As was previously noted, these devices were based on a small num-
ber of operational ampliers and switched{capacitor technology. The design of a
large{scale, functionally rich FPAA will provide researchers in the CAD tool ow
arena with a more complex and more interesting platform to test and develop their
19automated software applications. In a similar eort, Ganesan and Vemuri have pub-
lished work focusing on the partitioning of systems between the analog and digital
domains [32, 33]. Again, a large{scale, mixed{signal prototyping platform will benet
this area of research by providing a viable platform to investigate the advantages and
diculties involved in the analog{digital partitioning of systems.
2.3.4 Performance
The performance characteristics of FPAAs in regards to bandwidth, noise immunity,
and signal{to{noise ratio (SNR) are important in order for FPAAs to be widely
accepted by circuit designers. As with many new technologies, the initial eort on
FPAAs has focused more on their functional development and proof{of{concept rather
than the rigid performance parameters needed for marketable products. Thus, the
single biggest unknown in designing large{scale FPAAs is the resulting performance.
Often, the initial reaction to FPAAs is that the noise introduced by device mismatch
and the parasitic eects of large numbers of switches will cripple the performance of
any large, highly exible FPAA.
Floating{gate devices alleviate some of these problems. Specically, device mis-
match can be compensated for by individually programming the oating{gate tran-
sistors to match in output behavior. In addition, the programmable nature of the
switches will allow them to be used as variable resistors within the system, and thus,
use the non-ideal nature of the switch as an active circuit element [36]. However,
this does not completely solve the problem of switch parasitics, because certainly, a
number of \unwanted" switches will ultimately be required in a given circuit. The
eect of these oating{gate switches has been studied here and results are shown in
Chapters 4 and 5. The user and/or automated compilers can use this information to
optimize a given system in terms of switch routing so as to minimize their eect and
meet the required performance constraints.
Previous FPAAs have also been limited in bandwidth based on their underlying
20technology. One of the rst commerical FPAAs, the EPAC device, was limited to
125 kHz at the input [47]. In general, switched{capacitor designs are typically found
to have maximum frequencies in the kilohertz to low{megahertz range [35]. This is
similar performance to that shown by continuous{time FPAAs with local routing [23,
68]. One of the fastest designs on record is a small FPAA with a simple CAB based on
the second{generation current{conveyor. An amplier implemented on this FPAA is
shown to have a 3-dB frequency of 11 MHz, which is on the order of the performance
needed for video applications [34]. While scaling up this FPAA to a meaningful size
would inevitably diminish it's bandwidth, it does oer hope that future FPAAs will
be able to balance exibility, functionality, and performance.
2.3.5 Application Focus
A general{purpose, commercially viable FPAA similar to commercial FPGAs remains
elusive. Many FPAA designs have sacriced size and generality in favor of better
performance for a constrained set of circuit designs. FPAAs have been proposed for
evolvable hardware [46, 71, 77], neural networks [54, 57], signal conditioning [47],
programmable lters [7, 23, 68], fuzzy logic [65], machine learning algorithms [11],
and high{frequency applications [34]. Other FPAA designs have attempted to focus
on a broader class of systems including both linear and nonlinear elements [10, 69].
However, these eorts have failed to produce a suitably generic, user{friendly FPAA.
In addition, all of the FPAAs to date have been very small. The number of CABs on
a given device remains under 50 with many of the devices having less than 10 CABs.
While several companies currently sell FPAA devices, the market remains relatively
immature, and no single device or technology has received wide{spread acceptance.
21CHAPTER 3
BUILDING A LARGE{SCALE FPAA
As shown in Chapter 2, traditional FPAAs resemble the early PLDs in that they are
focused on small systems often having only a handful of computational logic blocks
and limited programmability. Likewise, an analogy can be made between modern
FPGAs and the large{scale FPAAs introduced in this thesis. However, while their
overall architectures may be similar, recongurable analog systems are not completely
analogous to their digital counterparts. Developing robust, programmable analog cir-
cuits presents a number of challenges not found in the digital world. In particular, the
noise sensitivity (and eects of the switch network on the results of the computation)
and the design space to which programmable devices are applicable are more critical
factors in designing FPAAs.
3.1 Noise Sensitivity
Analog circuits tend to be more sensitive to noise than digital designs. Because of
the quantization and resulting representation of ones and zeros as discrete voltages,
digital designs can tolerate a relatively large amount of noise in the system without
changing the precision of the result. Problems arise only when noise levels are high
enough to move a signal from a logical one to a logical zero or vice versa. In the analog
domain, however, values are represented as continuous voltages or currents. Any noise
in the system will directly aect the precision of the result. For recongurable analog
systems that must rely on networks of switches to set the internal signal paths, this
means that the parasitics of the switches in a signal's path can aect the result and
are a critical factor in the performance of the FPAA.
Adding switches in the signal path can have several eects including the addition
22Iin Iout Iin Iout
(a) (b)
Figure 7. (a) A simple, two{transistor current mirror illustrates the challenges of design recong-
urable ICs with ne{grain building blocks. (b) The parasitic capacitance that results from using
switches to form the current mirror will reduce the bandwidth of the system.
of parasitic capacitance, resistance, and transistor leakage currents to the path. In-
creased capacitance and resistance on a signal line will lower the bandwidth of the
system. For example, a simple two{transistor current mirror is shown in Fig. 7a. The
same circuit is shown in Fig. 7b with switches being added to the signal paths as
they would need to be if the current mirror were synthesized on the FPAA using the
MOSFET transistors in the CAB. In this case, there should not be any current ow
between the gate nodes, so the voltage should remain equal on the two gate nodes
even with the switches in the signal path. However, the parasitic capacitance will
decrease the bandwidth of the current mirror.
As shown in Fig. 8, other circuits that have switches in a signal path with current
owing through them will have a voltage drop across the transistor. In the case of a
digital inverter (also considered a high{gain amplier), adding switches to the sources
and drains of the nFET and pFET transistors will result in an output with maximum
and minimum voltages that are slightly inside the power rails (see Fig. 9). Since the
switches in the FPAAs designed as a part of this thesis are pFET devices, one can
expect this eect to be much worse for low voltages (i.e., ground) than voltages near
the 3.3 V power rail. In this case, the parasitics also increase the rise and fall times,
23In Out Out In
(a) (b)
Figure 8. (a) A MOSFET inverter (or high{gain amplier) circuit diagram is shown here without
the switches as it would be designed in a custom CMOS chip. (b) Synthesizing the inverter on
an FPAA using switches to connect transistors from the computational logic blocks results in this
circuit diagram. The switches will add parasitic capacitance as well as create a voltage drop across
the transistor for those switches that have current owing through them.
240 0.5 1 1.5 2 2.5 3
0
0.5
1
1.5
2
2.5
3
V
in
V
o
u
t
Switch Partially  on
Switch Fully  on
Effects of Switches
in the Signal Path 
Figure 9. Here, the input to output voltage relationship is shown for an inverter that is synthesized
in a testbed FPAA. The inverter is synthesized using an nFET and pFET in one of the CABs on
the chip. The switches used to make the appropriate connections are shown in Fig. 8b. The eects
of the switches in the signal path are evident by the uncharacteristic linear decay as the output
approaches ground. Also, it is clear that when the input is High, the output is not equal to 0 V
as one would expect. The switches in the signal path connecting ground to the inverter cause the
output to settle to a voltage greater than 0 V due to the non{zero resistance of the switches.
and thus, descrease the maximum frequency for the system.
3.2 Switch Networks and Interconnect Design
The resistive/voltage drop eects of the switches will certainly increase as more
switches are added to the signal path. However, there is another subtlety involved.
The parasitic capacitance and resistance added by the switches at any given switch
terminal is actually the sum of all the parasitics of all the switches in a row or column.
The summation of parasitics is due to the fact that all the sources for a given row of
switches are tied together and that all the drains for a given column of switches are
tied together. Therefore, the parasitic contribution of the switches will be present
25Out In
Figure 10. A full crossbar switch provides the most interconnectivity between the components and
the input/output (I/O) pins. This architecture also leads to the fewest number of switches in series
in a given signal path. However, it results in the worst parasitic eects because of the large number
of switches tied to each row and column.
regardless of the state (on or o) of each switch. Since the primary eect of parasitics
is lowering the bandwidth, an important architectural issue is present, especially if
the goal is high{speed design. As FPAAs scale up in size, designers will not be able
to use large crossbar switches to make global connections without seriously limiting
the bandwidth of large{scale FPAAs.
Instead, hierarchical interconnects will be necessary. By limiting the number
of switch connections at each tier of routing, the maximum bandwidth can be set.
However, as more tiers are added to the heirarchy, the number of switches that a signal
must travel through will increase, thus increasing the resistive drop in voltage due to
the routing network. Extreme examples of these two routing paradigms (crossbar vs.
hierarchical) are shown in Fig. 10 and Fig. 11. In Fig. 10, a full crossbar switch is
shown to illustrate the large parasitic contributions of the switches that result from
26In Out
Figure 11. A full binary tree provides the least parasitic eects; however, it requires the most
switches to be present in series in a given signal path, and has limited interconnectivity between the
components and I/O pins.
this type of routing design. The transistors that contribute parasitics to the path
from In to Out are highlighted. Figure 11 shows the other extreme, a binary tree. In
this case, the parasitics at each tier of the hierarchy are minimized, but the number of
switches through which a signal must pass increases rapidly with the size of the FPAA.
Connectivity is also more limited in a binary tree style architecture. Large{scale
FPAA designs must balance the need for bandwidth, signal degradation (number of
switches in the signal path), and interconnectivity. The optimal point for this trade{
o will be a function of the signal mode of the computational logic (current{mode,
voltage{mode, mixed{mode), the size of the overall FPAA, the process technology
used (and thus the level of parasitics present), and the bandwidth requirements of
the end{user.
A compromise between the extreme routing designs is necessary. One such so-
lution uses crossbar switch networks in the local CAB routing to provide the most
connectivity and exibility in connecting the components together. Global routing
will then occur through a moderate number of tiers. For example, CABs can be clus-
tered into megaCABs (groups of four to eight CABs), where limited routing connects
27CABs within each megaCAB together. These megaCABs can then be tiled across
the FPAA in a mesh{style architecture with horizontal and vertical routing between
the megaCAB blocks. By tiering the global routing interconnects, the routing area
growth is slowed from O(N2) to O(logN).
3.3 Design Space and Computational Analog Block Design
Another dierence between recongurable analog and digital devices is the design
space that each must encompass. Functionality in the digital domain can be reduced
to a basic set of combinational and sequential primitives. For example, a NAND gate
can be congured to implement any of the other Boolean logic gates. Thus, with a
suciently large number of NAND gates, any combinational logic function can be
achieved. Similiarly, an asynchronous read{only memory (ROM) primitive can be
used to implement any combinational function. For the sequential functions, any
basic storage element (e.g., ip-op or latch) can be used to provide the necesary
memory. Most modern FPGAs use asynchronous ROMs to synthesize the combi-
national logic and D{type ip{ops for implementing the memory/sequential logic.
Thus, by replicating these two basic primitives thousands of times across a chip (and
a sucient routing network), an FPGA can be created that synthesizes a very large
number of dierent digital systems. In the analog domain, it is tempting to think
that one might be able to do the same thing with transistors, resistors, diodes, and
capacitors as the basic building blocks. Indeed, a large number of analog systems can
be built with these basic blocks; even digital systems could be synthesized with such
a device. However, these primitives are so ne{grained that it would require such
a large number of components|and thus a large number of switches|to implement
a design that the switch parasitics would degrade the performance. For example,
the circuit diagram for a basic 9{transistor operational transconductance amplier
(OTA) is shown in Fig. 12a. In part (b) of this gure, the same OTA is shown with
28in− V in+ V
bias V
out V
(a)
out V
in− V in+ V
bias V
(b)
Figure 12. (a) The circuit diagram of a basic 9{transistor operational transconductance amplier
(OTA). (b) The circuit diagram of the same OTA with the switches needed if it is implemented on
a ne{grain FPAA with only transistors. The addition of the 27 switches will dramatically reduce
the performance and functionality of this circuit.
29the switches necessary to synthesize this circuit on a ne{grained FPAA with tran-
sistors only. The FPAA design requires at least 27 switches, in addition to the nine
transistors, to implement the OTA. The switches will drastically aect the perfor-
mance and functionality of the OTA and may cause the circuit to break. To mitigate
these eects, coarser{grained blocks must be used. The task then is to do so while
still maintaining sucient exibility, functionality, and generality.
Using coarse{grain blocks can be appealing given their increase in performance
and robustness over ne{grain blocks. However, if the basic building blocks in an
FPAA are of too high a level, then the exibility is greatly diminished. To be as
exible as possible, an FPAA needs to have a wide range of ne{grained, medium{
grained, and coarse{grained components. This means that there will often be more
than one way of synthesizing the same system on the FPAA. This provides the most
exibility to end{users, because they can vary the levels of performance, utilization,
exibility, and complexity. For example, a CAB could contain a high{level cell for
bandpass ltering, several OTAs, and several pFET and nFET transistors. If there
are a sucient number of CABs, then a bandpass lter could be implemented in at
least three ways:
1. Specialized bandpass lter block: The specialized cell in one of the CABs could be
used directly. This is the simplest implementation method, and it will provide
the highest performing design. However, since the full circuit is set in silicon,
this will provide the least exibility in terms of specifying the lter parameters
and circuit topology.
2. OTA{level Design: The OTAs located in CABs could be connected via the
switch network to implement a number of dierent circuit topologies. This
design method will result in lower performance than the rst method, but it
gives the end{user a lot more exibility and input into the lter specications.
Here, the only xed parameter is the type of OTA that is used in the CABs.
303. Transistor{level Design: The transistors located in the CABs could be used to
synthesize dierent operational ampliers or other medium{sized blocks, which
could then be wired together as in method two to form a bandpass lter module.
Obviously, this would result in a large number of switches in the signal paths,
and thus, it would use alot of the routing infrastructure and would be plagued
with large parasitics. Because of the large number of switches used in this
design, it would result in the lowest performance (assuming basic functionality
can be achieved at all), but it would result in the highest degree of freedom in
specifying the circuit topology and lter parameters.
The perfect CAB is elusive indeed. However, careful analysis of common circuit
topologies can lead one to a group of components that strike a reasonable balance
between exibility, performance, and generality. Also, one should not limit designs
to homogeneous meshes of CABs. In reality, the overall architecture presented in this
thesis is designed to be a framework that can be used with many dierent CABs,
while not requiring changes to the overall infrastructure. It is forseen that a number
of dierent CABs will be tried and used in various circuit genres. For example, one
can imagine FPAAs targeted at problems in audio processing, image processing, neu-
romorphic signal processing, telecommunications, etc. These dierent FPAAs need
not largely dier. Instead, a reasonably general{purpose CAB can be tiled across the
FPAA with special{purpose CABs targeted at each genre interspersed among them.
This type of architecture is very similar to modern FPGAs, where a mesh of general{
purpose logic blocks (i.e., asynchronous ROMs and D ip{ops) is interspersed with
specialized blocks, such as dense memory blocks, hardware multipliers, digital lters,
and even entire processor cores.
31p+
n-well n-well
n+
p-substrate
MOS Tunneling
Capacitor
SiO2
Floating Gate
Transistor
Input Capacitor
Vin Vtun
Vfg
Vs Vd
poly2 cap
SiO2
Floating Gate
Metal 1 Layer
(Floating Gate)
Figure 13. Layout, cross section, and circuit diagram of the oating{gate pFET in a standard
double{poly, n-well MOSIS process: The cross section corresponds to the horizontal line slicing
through the layout view. The pFET transistor is the standard pFET transistor in the n-well process.
The gate input capacitively couples to the oating{gate by either a poly{poly capacitor, a diused
linear capacitor, or a MOS capacitor, as seen in the circuit diagram (not explicitly shown in the
other two gures). Between Vtun and the oating{gate is our symbol for a tunneling junction|a
capacitor with an added arrow designating the charge ow.
3.4 Floating{gate Technology in Programmable Analog Cir-
cuits
Previous FPAAs have suered from their small size and lack of functionality/generality.
Next{generation FPAAs need to correct these problems in order to extend the useful-
ness and acceptance of FPAAs. Ideally, one would like a small, easily programmable
element that can be congured to act as an ideal switch, variable resistor, and con-
gurable computational element. While such a device is indeed ideal, oating{gate
transistors do oer some of these qualities. Previously, we have shown that the
oating{gate transistor can be used as a (nonideal) switch, variable resistor, and
programmable element within larger computational blocks (e.g., analog multiplier,
programmable lter, programmable OTAs, etc.) [36].
In addition, the small size of the oating{gate structure will allow larger, more
32S2 S2 S4 S4
Iin 2Iin 4Iin
Iin
V bias
S4 S4 S1
Effective W/L = 1 Effective W/L = 2 Effective W/L = 4
Figure 14. Standard designs often achieve circuit programmability by embedding switchable arrays
of elements (such as transistors or capacitors) within the logic cells. Here, a conceptual version
of [64]'s programmable current mirror is shown. In their case, 5 bits were used to set the switches.
This required 64 MOS transistors, 31 digitally controlled switches, and ve memory cells to hold the
conguration of the switches. Using oating{gate technology, this entire structure can be replaced
with two programmable oating{gate transistors.
functional FPAAs to be built using this technology. One example of the capabil-
ity/area improvement that can be achieved with oating{gate transistors is the pro-
grammable current mirror. Pankiewicz et al. have presented one of the most recent
FPAA designs. Their FPAA is based on OTAs in which the current mirrors on the
dierential outputs can be programmed. They use a bank of current mirrors similar
to the simplied form shown in Fig. 14. Each current mirror requires 64 MOS transis-
tors, 31 digitally controlled switches, and ve memory cells to hold the conguration
of the switches. The entire structure can be replaced with two MOS transistors and
a programmable oating{gate transistor. The area savings in this case are consider-
able. Furthermore, the resolution of the bank of current mirrors is set at ve bits;
whereas, the oating{gate current mirror's resolution can be varied based on the need
of a given application with a maximum resolution of approximately 10 bits [72].
The oating{gate transistors used in these FPAAs are standard pFET devices
whose gate terminals are not connected to signals except through capacitors (e.g.,
no DC path to a xed potential) [42]. Because the gate terminal is well insulated
from external signals, it can maintain a permanent charge, and thus, it is an analog
33memory cell similar to an EEPROM cell. With a oating gate, the current through
the pFET channel is dependent on the charge of the oating{gate node. By using
hot{electron injection to decrease the charge on the oating{gate node and electron
tunneling to increase the charge on the oating{gate node, the current ow through
the pFET channel can be accurately controlled [42, 51].
3.4.1 Floating{gate Switches
Using a oating{gate transistor as a switch requires that the device be turned on or
turned o. Ideally, the on state corresponds to the free ow of current through the
device or equivalently, zero impedance between the source and the drain. Likewise,
the o state is ideally characterized by zero current owing through the device {
an innite impendance between the source and the drain nodes. A oating{gate
transistor, however, will not act as a perfect switch. The on state will be characterized
by an impedance greater than zero, and the o state will have an impedance less
than innity. Therefore, the quality of a oating{gate transistor as a switch will be
determined by measuring the on and o impedances.
The quality of the switches is an important factor in determining the nal archi-
tecture. The main concern is that routing a signal through multiple switches could
degrade data as the cummulative impedance of the switches becomes prohibitive.
The impedance of the oating{gate transistor is a function of the charge on the
oating{gate node allowing it to be set using hot{electron injection and electron
tunneling. Figure 15 shows the relative I{V curves for a oating{gate transistor as
it is programmed from o to on. Ideally, each transistor would be programmed to
the extreme ends of the graph, but programming oating{gate transistors is a time{
comsuming task. The desired quality of the switch will have to be chosen with regard
to the time it will take to program the device.
Choosing a reasonable time scale for programming leads to a compromise in the
quality of the switch. Also, note that the operating voltage of the gate node is not
340 0.5 1 1.5 2 2.5 3
10
−10
10
−9
10
−8
10
−7
10
−6
10
−5
D
r
a
i
n
 
c
u
r
r
e
n
t
 
(
A
)
Gate Voltage (V)
Figure 15. This is a current{voltage plot of a single oating{gate transistor programmed to dierent
levels of oating{gate charge. The oating{gate transistors can be programmed over a wide range
of currents.
xed. Due to the parasitic capacitances between the gate and drain/source nodes,
the gate voltage|and thus the switch's impedance|will vary as a function of the
signal current. This variation must be minimized.
3.4.2 Switch as Computational Element
When used as a switch, the oating{gate should be as transparent a part of the circuit
as possible. However, Fig. 15 shows that the oating{gate transistor can be used as
an in{circuit element [41, 50]. By adjusting the charge on the oating{gate node
between the extremes used for on and o, the impedance of the switch can be varied
over several orders of magnitude. Thus, a variable linear resistor can be synthesized
by the oating{gate switch.
Using the oating{gate switches as in{circuit elements allows for a very compact
architecture. The physical area needed for the CABs is greatly reduced, because
35R0
R2
R1
C0 C3 C1 C2
Drain Control
Voltage
Gate Control
Voltage
Figure 16. By selectively setting the gate and drain voltages of the columns and rows, a sin-
gle oating{gate transistor can be programmed using hot{electron injection without aecting the
neighboring devices.
resistors, which consume relatively large amounts of space on CMOS processes, are
not needed as separate components. Also, by reducing the number of individual
circuit elements, signal routing is simplied without losing functionality.
3.4.3 Programmability
The use of oating{gate devices as the only programmable element on the chip,
greatly simplies chip conguration. Additionally, all of the oating{gate transistors
are clustered together to aid in the programming logic and signal routing. Decoders
on the periphery of the circuit are connected to the drain terminals, source terminals,
and gate capacitors of the oating{gate matrix. In programming mode, these decoders
allow each oating{gate transistor to be individually programmed using hot{electron
360 0.5 1 1.5 2 2.5 3
10
3
10
4
10
5
10
6
V
S (V)
R
e
s
i
s
t
a
n
c
e
 
(
W
)
Figure 17. This is a typical resistance plot for a pFET switch in the on conguration. With an
exponentially increasing resistance as the DC bias approaches 0 V, the pFET switch clearly does
not pass low voltage signals very well.
injection (see Fig. 16) [51].
Part of the previous work has been the development of a systematic method for
programming arrays of oating{gate transistors [50, 51, 75]. A microprocessor{based
board has been built to interface a PC to these analog oating{gate arrays for the
purposes of programming and testing. With a PC controlling the programming of
these devices, the details of using hot{electron injection and tunneling to program
individual oating{gate switches have been abstracted away from the end{user. The
programming algorithms have been optimized for accuracy and speed, while giving
the end{user an easy{to{use interface for conguring arrays of oating{gate devices.
370 0.5 1 1.5 2 2.5 3
10
3
10
4
10
5
10
6
V
S (V)
R
e
s
i
s
t
a
n
c
e
 
(
W
)
Figure 18. This is a typical resistance plot for a T{gate switch in the on conguration. With
an additional nFET transistor, the T{gate switch exhibits a much lower resistance for low input
voltages than the lone pFET transistor switch. This benet, however, comes at the cost of more
area and more parasitic capacitance added to the signal line.
3.5 Selecting a Switch
As discussed in Section 2.3.2, a number of dierent switches that have been tried in
FPAAs. A switch must be selected based on bandwidth, area, and switch quality (on
and o impedances). Typically, the switch is turned on or o with a digital memory
element, such as an SRAM cell. This scheme is popular in modern FPGAs, as it is
well understood and relatively quick to congure.
The simplest switch is a single pFET transistor; however, pFET transistors have
an on resistance that exponentially increases as DC bias voltage drops. A typical
resistance plot for a relatively small pFET device is shown in Fig. 17. As this gure
illustrates, the on pFET device will not pass low voltage signals.
A larger switch can be formed by adding an nFET transistor to the pFET and
38forming a T{gate structure. This allows low{voltage signals to be passed equally as
well as those closer to the positive power rail. As shown in Fig. 18, the resistance
through a T{gate switch is more constant than the lone pFET transistor. The peak
in this resistance plot near 2 V can be changed by carefully sizing the pFET and
nFET that are used in the T{gate switch. While the switch resistance is much better
than the pFET transistor switch, the disadvantages of the T{gate structure include
increased area and lower bandwidth due to the increased parasitic capacitance. The
T{gate switch requires an additional nFET transistor and an inverter for the select
lines (two more transistors). Thus, it is at least three times larger then the simple
pFET switch. Also, the extra nFET transistor in the signal path means at least dou-
bling the parasitic capacitance and resistance. Increasing the parasitics on one or two
switches may not seem signicant; however, as mentioned previously, the parasitics
are summed across the rows and columns of the switch networks. Even a moder-
ately sized FPAA could have 50{100 switches along a single dimension. Doubling
the parasitics on each switch in the network would result in a signicant decrease in
bandwidth.
A perfect switch would have the small size, minimal parasitic eects, and simplicity
of the pFET switch and the lower, more constant resistance of the T{gate switch.
The oating{gate switch comes close to this ideal. As shown in Fig. 19, the resistance
of the oating{gate switch is much more constant across the full voltage swing than
the pFET transistor. By injecting the oating{gate node to an extremely on point,
the eective voltage on the gate of the transistor is allowed to drop below 0 V. This
results in a shifting of the resistance curve until the exponential region is pushed below
0 V as well. The primary trade{o for this increased performance is conguration
speed. Programming oating{gate switches to this extreme position currently takes
nearly one to two minutes. The current programming scheme, however, is externally
controlled. By moving the programming scheme onto the chip and optimizing the logic
390 0.5 1 1.5 2 2.5 3
10
3
10
4
10
5
10
6
V
S (V)
R
e
s
i
s
t
a
n
c
e
 
(
W
)
Figure 19. This is a typical resistance plot for a oating{gate switch that has been programmed
into the extreme on position. By injecting the oating{gate node, the eective voltage on the gate
of the switch is allowed to drop below 0 V. This results in a shifting of the resistance curve until the
exponential region seen in Fig. 17 is pushed below 0 V as well.
for programming switches, a signicant increase in speed can be achieved. In addition,
work is progressing within the CADSP group on parallelizing the programming control
logic so that an entire row or column of switches can be programmed at once. The
potential decrease in programming time with this scheme is quite dramatic.
40CHAPTER 4
RASP 1.0
Large signal processing systems will require a chip with many (100 or more) computa-
tional analog blocks (CABs) on it. However, before such a chip could be successfully
designed, a number of implementation details were tested on a smaller scale. Speci-
cally, two testbed FPAAs have been designed, fabricated, and tested as a part of this
thesis. Recongurable Analog Signal Processor (RASP) 1.0 was primarily used to
characterize the switch networks and test small systems, and it will be discussed in
detail in this chapter. RASP 1.5 was similar; however, a number of layout and archi-
tectural improvements were made that increased the quality of switches and allowed
larger testbed systems to be synthesized. Chapter 5 contains a detailed discussion of
RASP 1.5.
RASP 1.0 was fabricated in a 0.5-micron, standard CMOS process. A die photo
of this chip is shown in Fig. 20. This FPAA contains two CABs connected by a
oating{gate crossbar switch network [36]. The CAB design included a bandpass
lter module, 4 x 4 vector{matrix multiplier, and three wide{range OTAs. RASP 1.0
is designed as a testbed integrated circuit (IC) to study the oating{gate switches and
the interaction between the CAB components and the switch matrix. A two CAB
system should be of a sucient size to test the concept of oating{gate FPAAs.
RASP 1.0 uses oating{gate transistors as the sole programmable element within
the FPAA. Floating{gate analog circuits are used to implement advanced signal pro-
cessing functions and are very useful for processing analog signals prior to A/D con-
version. The architecture introduced here is extensible to larger systems and will
allow systems that go beyond simple programmable ampliers and lters to include
programmable and adaptive lters, multipliers, winner{take{all circuits, matrix{array
signal operations, frequency decomposition, and subband processing.
41Figure 20. This is a die photo of the RASP 1.0 chip. It contains two CABs with a single switch
network to connect the CABs together.
4.1 Computational Analog Blocks
The computational logic is organized into a compact CAB providing a naturally
scalable architecture. CABs can be tiled across the chip with busses and local inter-
connects in{between as shown in Fig. 21.
Each CAB is comprised of components critical to signal processing applications,
including a 4 x 4 matrix{vector multiplier, three wide{range OTAs, and a transistor{
only version of the autozeroing oating{gate amplier (AFGA) or capacitively coupled
current conveyor (C4) [39]. The CAB architecture is shown in Fig. 22.
42CAB CAB CAB CAB
....
....
CAB CAB CAB CAB
....
....
CAB CAB CAB CAB
....
....
CAB CAB CAB CAB
....
....
CAB
Analog
Circuits
Switch
Matrix
Switch
Matrix
Switch
Matrix
(a) (b)
Figure 21. Block diagram of our generic FPAA chip. (a) Architecture diagram of analog pro-
grammable computing blocks. Future FPAAs will have 100 or more Computational Analog Blocks
(CAB) each with 23 ports to the mesh switching matrix. (b) Signal ow in a single CAB.
4.2 Analog Circuit Components
Selecting the types of analog components to include in a general{purpose FPAA is
a dicult task. To be as universal as possible, one must consider adding a num-
ber of basic linear and nonlinear functions, including integration, summation, gain
amplication, logarithmic and exponential operations, and more [31]. Because these
elements are so basic, constructing larger systems can become very complex due to
the required routing resources. Also, as discussed earlier, as the number of switches
involved in a circuit increases, the cummulative eects of the switches on the circuit
may seriously degrade the performance and/or results. To mitigate these challenges,
RASP 1.0 was constrained to be a signal{processing FPAA with specic functions
such as adaptive ltering and Fourier (frequency domain) processing in mind. A
limited number of basic elements were also included for completeness; however, the
focus was placed on selecting an appropriate mix of higher{level components that
could facilitate the prototyping of a wide range of problems.
43Computational Analog Block
Computational Analog Block
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
+
−
WR_OTA
C4 (AFGA)
In Out
+
−
WR_OTA
+
−
WR_OTA
+
−
WR_OTA
C4 (AFGA)
In Out
+
−
WR_OTA
+
−
WR_OTA
Figure 22. Each Computational Analog Block (CAB) has a four{by{four matrix multiplier, three
wide{range operational transconductance ampliers (OTAs), and a capcatively coupled current con-
veyor (C4). The input and output signals shown in this gure are routed to the rows of the switch
matrix.
4.2.1 Basic Analog Elements
The basic analog functions such as summation, integration, and gain amplication
can be included in the FPAA with only a few analog components. In the case of
summation, only the switch matrix is needed. Figure 23 shows that if the input
signals are currents, summation is achieved by simply connecting the input signals
together (Kircho's current law).
By adding several congurable OTAs to each CAB, one can congure the FPAA to
44I1 I2 I3 Ik In
Iout
Iout =
PN
k=1 Ik
Figure 23. The output current of a node is equal to the sum of the input currents (Kircho's
current law).
Vdd Vdd
Vtun Vtun Vdibl
Vin
+
Iout
- Vin
Iout = Ibias+W

V
+
in   V
+
in

Figure 24. Multiplication of a signed (dierential) input by a stored weight can be achieved using
two oating{gate transistors. Since the weight is stored as the charge of the oating{gate node it is
both programmble and adaptable.
perform integration, dierentiation, gain amplication and more. Also, by including
op{amps in the computational logic, RASP 1.0 can compete with current commercial
oerings that are based solely on programmable op{amps.
4.2.2 Matrix{Vector Multiplication
Multiplication is an important element in many signal processing applications. Fig-
ure 24 shows a basic multiplier using two oating{gate transistors. The dierential
voltage input allows signed numbers to be represented. The multiplier's functionality
can be extended by cascading them together to form a matrix{vector multiplier. Each
CAB on RASP 1.0 has a 4 x 4 matrix multiplier in which four signed (dierential)
inputs are mutliplied by a 4 x 4 matrix of programmable weights. The functionality
of this matrix{vector multiplier is shown in Fig. 25. Of course, by setting the appro-
priate weights to zero, matrices smaller than 4 x 4 can be multiplied by the input
vector.
45W W W
Out
W
V
V
3
3
3
33 32 31 30
+
−
W W W
Out
W
V
V
0
00 01 02 03
0
0
+
−
h
Out0 Out1 Out2 Out3
i
=
h
(V
+
0   V
 
0 ) (V
+
1   V
 
1 ) (V
+
2   V
 
2 ) (V
+
3   V
 
3 )
i
2
6 6 6
4
W00 W10 W20 W30
W01 W11 W21 W31
W02 W12 W22 W32
W03 W13 W23 W33
3
7 7 7
5
Figure 25. Matrix{vector multiplication is achieved by cascading the oating{gate multiplier shown
in Fig. 24 and summing the outputs along each row. The multiplier outputs are current values, so
they can be summed by tying the signal lines together. The weight matrix is stored as the charges
on the oating{gate nodes of the transistors that form the multipliers.
46Vdibl
Vhigh
Vlow
C1
C2
Cw
Figure 26. This circuit is a capacitively coupled current conveyor (C4). It is an element in the CAB
and performs a bandpass lter operation. The corner frequencies of the lter can be programmed
using the oating{gate current sources that connect through a current mirror to the biases Vlow and
Vhigh.
4.2.3 Filtering and Fourier Processing
In the near future, FPAA devices based on RASP 1.0's oating{gate architecture will
have as many as 100 or more CABs on a single chip. At this level of complexity, a
number of interesting signal processing systems can be implemented using a Fourier
processor. An analog Fourier processor decomposes an incoming signal into its fre-
quency components (sub{bands). Each sub{band is then operated on before recon-
structing the output signal by summing the sub{bands together. This is analagous to
taking a Fourier transform, operating on each frequency component, and then taking
an inverse Fourier transform to generate the time{domain output signal.
In RASP 1.0, the frequency decomposition is done by using C4 circuits to bandpass
lter the incoming signal [43, 51]. A C4 circuit is built using two oating{gate tran-
sistors and several capacitors as shown in Fig. 26. This method allows the bandpass
47DECODER Gate Pin
Column
S
S
S
S S S S
S
S
S S S
Input Signals / Circuitry
Figure 27. The switching matrix is built with oating{gate transistors. The charge of the oating{
gate node can be adjusted to allow current to ow through the channel (on) or to restrict current ow
(o). When in programming mode, T{gates connect the oating{gate transistors to the decoders,
and when in operational mode, the decoders are unconnected and the output signals are switched
on.
lters to be programmably placed at the desired frequencies (some applications prefer
linear spacing while others prefer logarithmic spacing of the subbands). When used
in combination with the oating{gate multiplier, a wide range of lters, including
adaptive lters, can be achieved [42].
4.3 Switch Matrix
RASP 1.0 has one 16 x 64 crossbar switching matrix that provides local interconnects
between the two CABs and connections to the external input/output signal lines.
The switching matrix uses oating{gate transistors as the switches (Fig. 27).
The digital decoders on the outside of the switch matrix provide access to the
individual oating{gate transistors for programming. After programming is complete,
the decoders are disconnected (using T{gate switches), and external bus lines are
48connected to start the operation of the chip.
The source lines of the 16 oating{gate switches in each row are connected to-
gether; likewise, the drain lines of the 64 oating{gate switches in each column are
tied together. Programming a switch to on allows current to ow in the source{drain
channel and connects a row with a column. Programming a switch to o restricts
the current ow in the channel creating a very high impedance between the row and
column.
The input/output signals from each CAB are connected to the source nodes of the
oating{gate switches. The drain nodes (columns) are either connected to external
busses or are used for internal connections only. By doing this, any input/output
signal from the CABs can be connected to an external bus by turning on one oating{
gate switch. Similarly, it can be connected to another signal in the same CAB or to
a signal in an adjacent CAB by turning on two switches in the same column.
4.3.1 Floating{gate Switches
As mentioned in Chapter 3, using a oating{gate transistor as a switch requires
that the device be turned on or turned o. Ideally, the on state corresponds to the
free ow of current through the device or equivalently, zero impedance between the
source and the drain. Likewise, the o state is characterized by zero current owing
through the device { an innite impedance between the source and the drain nodes.
A oating{gate transistor, however, does not act as a perfect switch. The on state is
characterized by an impedance greater than zero, and the o state has an impedance
less than innity. Therefore, the quality of a oating{gate transistor as a switch is
determined by measuring the on and o impedances.
The oating{gate switch network has been characterized in [37]. The switches
were found to exhibit similar characteristics to standard pFET switches with an on
resistance as low as 11 k
 and an o resistance in the low gigaohm range. They
have also been shown to be accurately programmable and capable of implementing a
490 0.5 1 1.5 2 2.5 3 3.5
10
−11
10
−10
10
−9
10
−8
10
−7
10
−6
10
−5
10
−4
V
G with respect to V
D
I
D
 
(
l
o
g
 
A
) V
DD = 3.3 V
V
DD = 6.5 V
"off"
"on"
operating V
G
Figure 28. Floating{gate switches can be programmed within a wide range. Here, examples of
an on, o, and mid{position device are shown. During programming, currents are measured with
VDD = 3:3 V for large currents and VDD = 6:5 V for small currents. This eectively extends the
programming range of the device.
variable resistance. As shown in Fig. 28, the oating{gate switch can be programmed
between the on and o extremes.
To increase the quality of a switch, the oating{gate transistors are programmed to
the far extremes of their range. In this case, one of the limiting factors is the ability of
the measurement equipment to measure the very small currents present as the switch
is programmed o. To extend the viable programming range, current measurements
are taken at larger VDD's as shown in Fig. 28. Measuring the currents with VDD =
6:5 V , allows the I{V curves to be visible to the programming infrastructure 1 V
below the point visible when VDD = 3:3 V .
In the operating mode of this FPAA, the voltage on the gate capacitor for all
switches is the same. From Fig. 28, it is clear that the o switches do not pose a
problem, since any gate voltage selected at or above 0.3 V should provide a suciently
50high impedance. However, the on switch exhibits a decrease in quality as the gate
voltage is increased to VDD. Thus, an operating gate voltage of 0.3 V is deemed
optimal for the current programming scheme.
4.4 Circuit Characterization
The resistance of the oating{gate switch in the on position proved similar to standard
pFET switches with the measured on{resistance starting at 11 k
. As shown in
Fig. 29a, the architecture of our FPAA required that the resistance be measured
through two devices in series. During these experiments, both of the transistors
were programmed to the same position; the measured resistances shown here have
been divided by two to report the resistance for a single transistor. The resistance
of the oating{gate switch in the o position was not directly measured due to the
extremely small currents. Instead, the saturation current of the device was determined
(Fig. 29b). This data was acquired with VDD = 7:5 V since this was the rst reliable
measurement voltage. The resistance can then be calculated to be R =
UT
ISAT =
25:8 mV
145:4 pA = 177:5 M
. At an operating VDD = 3:3 V , the o{resistance is higher. At
this level, the worst case bias current is 70 pA, which equates to an o resistance in
the 1 G
 range.
Figures 29c and 29d show the current and resistance of a switch as it is pro-
grammed mid{way between the on and o positions. As can be seen by the current
measurements, these devices have been injected close to the o position. While this
resistance is non-linear over the full operational range, linearity can be achieved given
a sucient constraint on the input.
Using the programming method described in [73], oating{gate switches in the
FPAA can be accurately programmed. Figure 30 shows a switch that was pro-
grammed to an arbitrary current of 8 nA at an operating gate voltage of 0.3 V.
Thus, oating{gate switches can be used to accurately set a current level within the
510 0.5 1 1.5 2 2.5 3
10
15
20
25
30
35
40
45
50
55
60
V
S
R
e
s
i
s
t
a
n
c
e
 
(
k
W
)
V S
V D − 0.05 =V S
0.3V
0.275V
0 0.5 1 1.5 2 2.5 3
10
−11
10
−10
10
−9
V
SD
I
D
 
(
l
o
g
 
A
)
I
SAT = 145.4 pA
R = 
U
T
I
SAT
 = 
25.8 mV
145.4 pA
 = 177.5 MW
(a) (b)
−3 −2 −1 0 1 2 3
−30
−20
−10
0
10
20
30
40
50
V
D − V
S
I
D
 
(
m
A
)
Injection 
−3 −2 −1 0 1 2 3
0
100
200
300
400
500
600
700
V
S − V
D
R
e
s
i
s
t
a
n
c
e
 
(
k
W
)
Injection
(c) (d)
Figure 29. Switch Characteristics of Floating{gate Transistors: (a) Resistance for an on switch
was measured using two oating{gate devices in series, both programmed to the on position. Here,
the experimental resistance has already been divided by two to represent a single oating{gate
switch. (b) Currents for an o switch are too low for reliable picoammeter measurements. So, a
conservative o resistance was determined by measuring the saturation current of a oating{gate
transistor programmed to the o position while VDD = 7:5 V . (c) The oating{gate switch can
also be programmed mid{position (between on and o) to synthesize a variable resistance. Here, a
sampling of the dierential currents achievable with this programming scheme are shown. (d) The
resistance plots correspond to the same injection levels as the dierential currents shown in (c).
520 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
10
−11
10
−10
10
−9
10
−8
10
−7
V
G
I
D
 
(
l
o
g
 
A
)
Injection 
Operating V
G
Figure 30. This is a oating{gate switch programmed to 8 nA at an operating gate voltage of 0.3 V.
Thus, current sources can be accurately dened within the switching network itself.
system (i.e., these devices can be used to implement a current source).
4.5 System Results
As an initial example of the testbed system, a rst{order lter is implemented using
an OTA in one of the CABs. Figure 31 shows how the circuit is mapped onto the
FPAA using ve oating{gate switches. Once the switch network is congured, the
biasing oating{gate transistor is programmed to vary the corner frequency of this
rst{order lter. The frequency response is shown for several programmed corner
frequencies in Fig. 32. The moderate gain in the lower frequencies is due to the
switches in the feedback loop of the OTA. Ideally, the output node and the negative
input node would be directly connected. However, in the FPAA, this path must be
routed via the switch network, which means that a minimum of two oating{gate
switches will be in the feedback loop. The gain can be minimized by injecting the
53+
−
WR_OTA
V fg
+
−
WR_OTA
(a) (b)
Figure 31. (a) The source{follower congured using a oating{gate current source. By programming
the oating gate charge, the current is set in the current mirror (the other half of the current mirror
is internal to the wide{range OTA) is xed. Thus, the eective conductance can be modied for each
of the OTAs on the chip. (b) Using the switch matrix, an OTA located in one of the Computational
Analog Blocks (CABs) is connected in a source{follower conguration, and two external pins are
routed to the OTA as the input and output signals. The programmable biases illustrated in (a) are
not shown here for simplicity, but each OTA has a current mirror and oating{gate current source
that sets its bias.
oating{gates of these switches to a lower charge, or if gain is desired for a given
application, then it can be set by programming these switches to a higher charge.
In Fig. 33, a second{order section lter is shown along side the FPAA imple-
mentation. Once again, explicit capacitors are eliminated since the switch parasitics
provide the necessary capacitance. Using the oating{gate programmble biases, the
two OTAs in a source{follower conguration were biased to the same level and the
the third OTA's bias current was increased to adjust the Q{peak of the system. The
simulated frequency response for this circuit is shown in Fig. 34. As expected, the
Q{peak increases as the bias current (e.g., conductance) increases. The experimental
frequency response shown in Fig. 35, also shows a programmable Q{peak; however,
the peak is not very high. This is most likely an eect of the switches in the signal
paths. Improvements were made to the OTA and switch components in RASP 1.5,
and a much higher Q{peak resulted (see Chapter 5).
For second{order functions such as the second{order section and the di2 circuit
54.1 1 10
−18
−16
−14
−12
−10
−8
−6
−4
−2
0
2
Frequency (log Hz)
G
a
i
n
 
(
d
B
)
f
c = 1.2 kHz
f
c = 2.2 kHz
f
c = 3.2 kHz
Figure 32. Here, the frequency response of the source{follower circuit is shown for several bias
currents. An internal oating{gate transistor is used as a current source to set the OTA's bias.
Injecting the oating{gate device, increases the current and thus the bandwidth of this rst order
lter.
introduced by Mead in [61], reasonable Q{peaks and lter bandwidths require small
bias currents (in the picoamp to femtoamp range). While the oating{gate transistors
can set bias currents this low, the constraint becomes the ability to accurately measure
these currents while programming the oating{gate transistors. Experimental results
from Fig. 28 show a measurement threshold of 1 pA using present measurement
techniques. An important consideration here is the relative size of the transistors
that set the bias currents. The oating{gate transistor shown in Fig. 31a sets the
current through the nMOS current mirror (the other half of the current mirror is
internal to the OTA module). To set small bias currents, it is preferable to have the
nFET and oating{gate transistor sized larger than the current mirror nFET, which
is internal to the OTA. In this conguration, the current mirror functions as a current
divider, and thus, very low bias currents can be set by programming the oating{gate
55transistor to generate currents in the picoamp range.
Based on these testbed systems, one can start to imagine a wide class of systems
that can be implemented and congured on FPAAs with many CABs on them. In
particular, dierentiators, cascaded second{order sections, bandpass lters, matrix
transforms (including DCTs and wavelet transforms), and frequency decomposition
are all well{suited for this architecture. In the audio arena alone, designs could
be prototyped to implement forms of noise suppression, audio enhancement, feature
extraction, auditory modeling, and simple audio array processing. Other potential
interest areas include communications signal conditioning (modulation, mixing, etc.),
transform coding, and neural networks (with external training). Most of these systems
rely on ecient sub{band processing; therefore, each CAB has been designed with
a C4 bandpass lter to optimize this operation. As shown in Fig. 36, the center
frequency of the C4 lter can be moved over a large range of frequencies.
4.6 Summary
FPAAs based on oating{gate technologies are an emerging design concept that will
increase the current state of the art in the analog and mixed{signal prototyping. In
particular, these FPAAs are well{suited to facilitate the design of low{power sig-
nal processing systems based on analog oating{gate devices. In this chapter, a
novel FPAA architecture was presented that utilizes oating{gate transistors as pro-
grammable switches, in{circuit active elements, and the congurable device within
the computational analog blocks. Several systems were implemented on a testbed
FPAA, and they were shown to have a high degree of programmability via on{chip,
oating{gate current sources that set the bias currents. Thus, the bandwidth and Q
parameters of various OTA lter designs can be easily modied on{chip without the
need for external biases.
56+
−
WR_OTA +
−
WR_OTA
+
−
WR_OTA
(a)
+
−
WR_OTA
+
−
WR_OTA
+
−
WR_OTA
(b)
Figure 33. (a) A second{order section lter can be implemented with two OTAs in a source{follower
conguration and a third OTA that creates postive feedback. (b) Using the switch matrix, two OTAs
within the CABs are connected in a second{order section conguration. The programmable biases
shown in Fig. 31(a) are not included here for simplicity, but each OTA has a current mirror and
oating{gate current source that sets its bias.
57.1 1 10
−50
−40
−30
−20
−10
0
10
20
Input Frequency (kHz)
G
a
i
n
 
(
d
B
)
Figure 34. The simulated frequency response of a second{order section lter is shown here. The
Q parameter is adjusted by increasing the bias current of the positive feedback amplier via a
oating{gate current source.
.1 1 10
−6
−5
−4
−3
−2
−1
0
Input Frequency (kHz)
G
a
i
n
 
(
d
B
)
Figure 35. The experimental frequency response of a second{order section lter is shown here.
The Q parameter is adjusted by increasing the bias current of the positive feedback amplier via a
oating{gate current source.
580.001 0.01 0.1 1 10 100 1000
−80
−60
−40
−20
0
20
40
Frequency (kHz)
O
u
t
p
u
t
 
M
a
g
n
i
t
u
d
e
 
(
d
B
)
Figure 36. Frequency decomposition (sub{band processing) can be achieved on the test{bed FPAA
by using the C4 bandpass lter block in each CAB. In this simulation, the center frequency of the
C4 is shown to be programmable over a wide range of frequencies.
59CHAPTER 5
RASP 1.5
The RASP 1.0 FPAA allowed the switches in the switch network to be characterized
and some small systems to be implemented on it. However, a number of circuit{
level changes were needed to realize the full functionality of the RASP architecture.
RASP 1.5 is an FPAA that is similar in size to RASP 1.0, but it has a number of small
circuit and architectural improvements that allow the system{level functionality of
the computational logic to be further tested.
5.1 RASP 1.5 Architecture
The RASP 1.5 FPAA is fabricated in AMI's 0.5-micron CMOS process. A picture of
the top{level layout is shown in Fig. 37. A die photo of RASP 1.5 is shown in Fig. 38.
5.1.1 RASP 1.5 vs. RASP 1
The improvements to RASP 1.5 can be categorized as programming infrastructure,
circuit{level, and architectural changes. In the programming infrastructure, advance-
ments were made to bring more of the programming logic on the chip. During pro-
gramming mode, the control logic on RASP 1.5 automatically sets the gate and source
voltages on the non{selected rows to turn all switches o except the one currently
selected.
At the circuit level, a number of the transistors were resized. The switches in the
switch network were made bigger to reduce the resistance through the devices. The
T{gate structures in the programming control logic were also made much bigger to
allow more current to ow through them. This is important when turning on multiple
switches in a column, since the currents through the multiple switches will sum on
the column lines. If the T{gates are not large enough, then the current through the
switches will overload the T{gate causing a voltage drop across the T{gate. This
60Figure 37. This is a picture of the top{level layout for the RASP 1.5 FPAA. It has two CABs and
a single switch network connecting the CABs.
raises the drain voltage on the switches and slows or even stops injection. Additional
changes included modifying transistor sizing in the CAB components to improve the
performance of the computational logic elements. In particular, current dividers were
added on all of the biases so that the currents needing to be sourced by oating{
gate current sources would be in an optimal programming range for the oating{gate
transistors ( 100 pA { 10 uA).
As a part of the architectural changes, two levels of I/O hierarchy are introduced.
On RASP 1, signals coming into or going o of the FPAA were only allowed to do so
61Figure 38. This is a die photo for the RASP 1.5 FPAA. It has two CABs and a single switch
network connecting the CABs together.
only through source lines. This scheme requires a minimum of two switches to get an
input or output from the I/O pad to a component input or output. On RASP 1.5,
however, two levels of I/O signals are included: Level 1 and Level 2. Level 2 I/O is
routed through the source lines just like RASP 1.0. Level 1 I/O, however, is routed
into the columns (drain lines). Signals entering RASP 1.5 on Level 1 I/O can be
routed to the input of any component with a single switch. Adding I/O pads to the
columns dramatically increases the capacitance on those drain lines, which results in
lower bandwidths on those nodes. Therefore, only a limited number of drain lines
are used as Level 1 I/O lines. The majority of the columns are reserved for internal
routing.
The second part of the architectural changes involved increasing the size and
functionality of the CABs. First, three xed{value capacitors, two transistors (one
6240 Columns
9
2
 
R
o
w
s
S
o
u
r
c
e
/
G
a
t
e
 
C
o
n
t
r
o
l
 
L
o
g
i
c
Drain Control Logic
Figure 39. The RASP 1.5 has a single fully connectable crossbar switch that has 40 x 92 switches
in it. The gate and source nodes from the transistors run horizontally into the programming control
logic and row decoders on the side of the chip. The drain lines for the columns and run vertically
into the drain control lines and column decoders.
nFET and one pFET), a peak detector, and a min detector were added. Additionally,
the C4 bandpass lter block was replaced with a C4{based second{order section (SOS)
bandpass lter module (see Section 5.2).
5.1.2 Switch Networks
As shown in Fig. 39, the routing infrastructure on RASP 1.5 is achieved with a single,
fully connectable crossbar switch. The switch network has 40 columns (drains) and
92 rows (sources). The inputs and outputs from all of the components in the two
CABs connect to the rows of the switch network. I/O lines that go o the FPAA chip
are divided into two levels: Level 1 and Level 2 I/O. Level 1 I/O pins are connected
63directly to the columns allowing them to be connected to any input or output on
the computational logic through a single switch. There are six Level 1 I/O pins on
RASP 1.5 and two of them are dedicated outputs since they are routed through an
output buer in their I/O pads. There are 12 Level 2 I/O pins. They connect to the
switch network via extra rows that are not used for the computational logic. Level 2
I/O pins require at least two switches to connect them to any input or output of the
computational logic.
5.2 Computational Analog Blocks
The computational logic is organized in a compact computational analog block (CAB)
providing a naturally scalable architecture. CABs are tiled across the chip in a regular
mesh{type architecture with busses and local interconnects in between. In RASP 1.5,
there are two CABs with a single switch network connecting them together.
Many potential CABs can be imagined using this technology. Figure 40 shows one
example CAB, whose functionality is enhanced by a mixture of ne{grained, medium{
grained, and coarse{grained computational blocks similar to many modern FPGA
designs. The computational blocks were carefully selected to provide a suciently
exible, generic architecture while optimizing certain frequently used signal processing
blocks. For generality, three OTAs are included in each CAB. OTAs have already been
shown to be eective at implementing a large class of systems, including amplication,
integration, ltering, multiplication, exponentiation, modulation, and other linear
and non-linear functions [25, 64, 69, 70]. In addition, the two FET devices provide
the ability to perform logarithmic and exponential functions, as well as convert from
current to voltage and vice versa. The three capacitors are xed in value in order to
minimize the size of the CAB and are primarily used on the outputs of the OTAs;
however, they will be available for any purpose. The use of oating{gate transistors in
the OTAs will give the user sucient control in programming the transconductance of
64Analog Circuit Components
+
−
WR_OTA
+
−
WR_OTA
+
−
WR_OTA
In Out
C   (SOS) 4
min
max
Figure 40. The CAB on RASP 1.5 contains a four-by-four matrix multiplier, three wide{range
operational transconductance ampliers (OTAs), three xed{value capacitors, a capacitively coupled
current conveyor (C4) second{order section (SOS), a peak detector, a min detector, and two FET
transistors.
the ampliers, thereby eliminating the need for the variable capacitor and/or current
mirror banks found in some designs [37, 64]. Removing the capacitor banks creates a
large savings in the area required for each CAB.
The high{level computational blocks used in this design are an SOS bandpass
lter module comprised of two C4 and the 4 x 4 vector{matrix multiplier block.
In general, the C4 SOS module provides a straightforward method of subbanding
an incoming signal. This allows Fourier analysis analagous to performing a Fourier
transform. The vector{matrix multiplier block allows the user to perform a matrix
transformation on the incoming signals. Together these blocks can be used like a
Fourier processor [43, 51]. In addition, a peak detector and min detector is added to
65Figure 41. This is a block diagram of the second{order section (SOS) bandpass lter module used
in RASP 1.5's CAB. The C4 SOS is comprised of two C4 circuits (as shown in Fig. 42) with a buer
in between them. The corner frequencies for both the C4's are independently controlled with the
oating{gate current sources located in the CAB.
each CAB.
The bandpass lter module is dierent from that of RASP 1.0. As shown in
Fig. 41, the bandpass lter is an SOS that is comprised of two C4 circuits with a
buer in between them. Both C4 blocks have the circuit topology shown in Fig. 42.
As shown in the gure, the C4 circuit automatically sets the Vdibl gate voltage needed
in RASP 1.0 (see Fig. 26). This increases the linear range of the module while
maintaining relative ease of control. The corner frequencies for both of the C4 blocks
are controlled with the oating{gate current sources located in the CABs. This allows
the maximum exbility and control.
5.3 Testbed FPAA
RASP 1.5 contains two CABs with a oating{gate crossbar switch network connecting
them. Both CABs are identical to the large CAB illustrated in Fig. 40.
As discussed earlier, the resistance and capacitance of the oating{gate switch
are important characteristics. The on switch resistance is plotted in Fig. 43. For
reference, this gure also shows the resistance of a standard pFET (with an SRAM
memory bit setting the gate) and T-gate (both an nFET and a pFET passing the
signal). When programmed to a point that is not extremely on, the oating{gate
66Vlow
Vlow
Cw
C1
Vhigh Vhigh
Figure 42. This is the version of the capacitively coupled current conveyor (C4) used in the second{
order section in RASP 1.5's CAB. It does not use an explicit C2 capacitor in the middle node. Also,
this diagram includes an \autodibl" circuit that automatically sets the Vdibl gate voltage based on
the selected corner frequencies. This increases the linear range of the module while maintaining
relative ease of control.
switch exhibits a resistance that is very similar to the standard pFET shown here
(as seen in [37]. However, by injecting the oating{gate switch further, the voltage
on the isolated gate node is pushed lower and thus the resistance curve shifts to the
left. This gure shows that by programming the switch far enough, the resistance
through the switch can maintain a more consistent level through the operating range
(power rails) of the switch. This allows a single oating{gate pFET to exhibit a
resistive characteristic that is similar to the resistance of a standard T-gate with two
transistors. As shown, the resistance of the oating{gate switches is approximately
10 K
, which is about what is expected for relatively small (3.2m x 1.8m) pFETs.
The o resistance is harder to measure given the limitations of standard test
equipment. Even at VDS = 3:3V , current through the o switches is below the
measureable range of standard picoammeters. Given this, the o resistance should
be in the gigaohm range, and in the worst case, hundreds of megaohms. Likewise, the
parasitic capacitance of the switches is dicult to measure when they are embedded
670 0.5 1 1.5 2 2.5 3
10
3
10
4
10
5
10
6
V
S (V)
R
e
s
i
s
t
a
n
c
e
 
(
W
)
pFET
TG
FG pFET
Figure 43. Here the switch resistances for a oating{gate pFET, standard pFET controlled by an
SRAM memory cell, and a standard two{transistor T{gate are shown. The oating{gate switch has
been programmed to an extremely on position such that the high{impedance region (at 1.5 V for the
standard pFET shown) has eectively been shifted below the power rail. This results in a relatively
at resistance similar to the larger T{gate.
in the switch network and accessible only through the programming infrastructure.
A theoretical estimate based on the layout and fabrication parameters yields a value
of 1 fF for each switch on each column and row. Thus, for RASP 1.5, each column
is estimated to contribute 96 fF of parasitic capacitance and 46 fF for each row.
5.4 System Results
A number of dierent analog testbed systems have been synthesized on RASP 1.5.
These systems vary from simple one{ and two{element systems to more complex
systems with as many as seven on{chip components. These systems also use a range
of dierent CAB components, including ne{grained (transistors and capacitors),
medium{grained (OTAs), and coarse{grained (C4 SOS and peak detector). In each
68+
−
WR_OTA
V fg
+
−
WR_OTA
(a) (b)
Figure 44. (a) The source{follower is congured using a oating{gate current source. By program-
ming the oating gate charge, the current is set in the current mirror (the other half of the current
mirror is internal to the wide{range OTA). Thus, the eective conductance can be modied for each
of the OTAs on the chip. (b) Using the switch matrix, an OTA located in one of the Computational
Analog Blocks (CABs) is connected in a source{follower conguration, and two external pins are
routed to the OTA as the input and output signals. The programmable biases illustrated in (a) are
not shown here for simplicity, but each OTA has a current mirror and oating{gate current source
that sets its bias.
of these examples, oating{gate transistors are used as current sources to set biases.
Depending on the circuit, these programmable biases are shown to control lter corner
frequencies, Q{peaks, and time constants.
5.4.1 Low{Order Filtering with OTAs
As an initial example of the testbed system, a rst{order lter is implemented using an
OTA in one of the CABs. Figure 44 shows how the circuit is mapped onto the FPAA
using ve oating{gate switches. Once the switch network is congured, the biasing
oating{gate transistor is programmed to vary the corner frequency of this rst{order
lter. The frequency response is shown for several programmed corner frequencies in
Fig. 45. The plot in Fig. 46 shows the correlation between programmed bias current
and measured corner frequency. By tting a curve to this data, it is possible to predict
the necessary bias current for a desired corner frequency. This is very important for
the usability of FPAAs in general. The end{user will typically want to specify the
6910
2
10
3
10
4
10
5 −40
−35
−30
−25
−20
−15
−10
−5
0
Frequency
G
a
i
n
 
(
d
B
)
Figure 45. Here, the frequency response of the source{follower circuit is shown for several bias
currents. An internal oating{gate transistor is used as a current source to set the OTA's bias.
Injecting the oating{gate device, increases the current and thus the bandwidth of this rst order
lter.
system parameters in terms of corner frequency, Q-peak, time constants, osets, etc.,
and then let the programming interface make the translation to the appropriate bias
currents necessary to generate these parameters.
In Fig. 47, an SOS lter is shown alongside the FPAA implementation. Once
again, explicit capacitors are eliminated since the switch parasitics provide the nec-
essary capacitance. Using the oating{gate programmble biases, the two OTAs in a
source{follower conguration were biased to the same level, and the third OTA's bias
current was increased to adjust the Q{peak of the system. The frequency response
for this circuit is shown in Fig. 48. As expected, the Q{peak increases as the bias
current (i.e., conductance) increases.
For second{order functions, such as the second{order section and di2 circuit, rea-
sonable Q{peaks and lter bandwidths require small bias currents (in the picoampere
7010
3
10
4
10
5
10
−6
10
−5
Corner frequency (kHz)
P
r
o
g
r
a
m
m
e
d
 
b
i
a
s
 
c
u
r
r
e
n
t
 
(
A
)
Figure 46. This plot shows the correlation between bias current and programmed corner frequency.
This data can be used in future experiments to select the appropriate bias current for the desired
corner frequency.
to femtoampere range). While the oating{gate transistors can set bias currents this
low, the constraint becomes the ability to accurately measure these currents while
programming the oating{gate transistors. Experimental results from Fig. 28 show a
measurement threshold of 1 pA using present measurement techniques. An important
consideration here is the relative sizing of the transistors that set the bias currents.
The oating{gate transistor shown in Fig. 44a sets the current through the nMOS
current mirror (the other half of the current mirror is internal to the OTA module).
To set small bias currents, it is preferrable to have the nFET and oating{gate tran-
sistor sized larger than the current mirror nFET, which is internal to the OTA. In
this conguration, the current mirror functions as a current divider, and thus, very
low bias currents can be set by programming the oating{gate transistor to generate
currents in the picoampere range.
71+
−
WR_OTA +
−
WR_OTA
+
−
WR_OTA
(a)
+
−
WR_OTA
+
−
WR_OTA
+
−
WR_OTA
(b)
Figure 47. (a) A second{order section lter can be implemented with two OTAs in a source{follower
conguration and a third OTA that creates postive feedback. (b) Using the switch matrix, two OTAs
within the CABs are connected in a second{order section conguration. The programmable biases
shown in Fig. 44a are not included here for simplicity, but each OTA has a current mirror and
oating{gate current source that sets its bias.
7210
2
10
3
10
4
10
5 −70
−60
−50
−40
−30
−20
−10
0
10
Frequency (Hz)
G
a
i
n
 
(
d
B
)
Figure 48. The experimental frequency response of a second{order section lter is shown here.
The Q parameter is adjusted by increasing the bias current of the positive feedback amplier via a
oating{gate current source.
5.4.2 Third{Order Gm-C Ladder Filter
The availability of OTAs and grounded capacitors makes the RASP FPAA ideal for
implementing Gm-C lters. One way to realize a particular lter is by modeling
it with resistors, inductors, and capacitors, and then synthesizing the design using
Gm-C lters. In this example, a third{order Butterworth lter is implemented. The
canonical prototype of the lter, a double{resistance terminated LC lter, is shown
in Fig. 49a. By using the signal simulation method outlined in [78], the Gm-C lter
shown in Fig. 49b is generated. In order to maintain a maximally at response, the
following must hold: 2gm1 = gm2. Accordingly, the bias current of OTA-3 was set to
half of the other OTA bias currents. A range of bias currents was used to create the
frequency response shown in Fig. 50. As expected, the corner frequency of the lter
is proportional to the bias currents of the OTAs. The lower corners were obtained
731 F 1 F
1 ohm 2 H
1 ohm −
+
(a)
m2
+
− OTA2
g m2 g
+
OTA1
−
m1 g
OTA3
+
−
m2
+
− OTA4
g
C C C
(b)
Figure 49. (a) This is the canonical prototype of a third{order Butterworth double{resistance
terminated LC lter. (b) This is the Gm-C implementation of the same lter. This form of the lter
can be realized on the RASP 1.5 FPAA.
by using a bias current in the range of hundreds of picoamperes, while the highest
corners required currents of up to one microampere.
5.4.3 Coarse{Grain CAB Components
As mentioned earlier, the CABs on this FPAA have several special{purpose com-
ponents that have been designed to optimize specic functions. In particular, these
CABs include programmable peak detectors and programmable bandpass lter mod-
ules (C4 SOS circuits).
The peak detector's operation is shown in Fig. 51. A sine wave is input with a
sudden increase magnitude occurring in the middle of the data window. The output of
the peak detector is shown to follow the peak of the sine wave through this transition.
In addition, the time constant of the decay for this peak detector can be varied using
a oating{gate controlled bias. The output of the peak detector is shown for two
dierent time constants.
There is a wide range of systems that can be implemented and congured on
7410
2
10
3
10
4
10
5
−45
−40
−35
−30
−25
−20
−15
−10
−5
0
Frequency
G
a
i
n
 
(
d
B
)
Figure 50. The experimental frequency response of a third{order Gm-C lter is shown here. The
corner frequency is adjusted by programming the bias currents of the four OTAs.
FPAAs with many of these CABs on them. In particular, dierentiators, cascaded
second{order sections, bandpass lters, matrix transforms (including DCTs and wavelet
transforms), and frequency decomposition are all well suited for this architecture. In
the audio arena alone, designs could be prototyped to implement forms of noise sup-
pression, audio enhancement, feature extraction, auditory modeling, and simple audio
array processing. Other potential interest areas include communications signal con-
ditioning (modulation, mixing, etc.), transform coding, and neural networks (with
external training). Many of these systems rely on ecient subband processing; there-
fore, each CAB has been designed with a C4 SOS bandpass lter module to optimize
this operation.
The C4 SOS module is comprised of two C4 modules cascaded with a buer in{
between them. Either C4 module can be used alone by spreading apart the corner
frequencies of the other module. To characterize this module, frequency response
750 1 2 3 4 5 6 7 8 9 10
2.3
2.4
2.5
2.6
2.7
2.8
2.9
3
3.1
3.2
Time (ms)
V
o
l
t
a
g
e
Small t 
Large t 
Figure 51. The output of the peak detector module is shown to track the peaks of the sine wave
input. The time constant is varied by programming a oating{gate transistor connected as a current
source to the bias of the peak detector module. The outputs of the peak detector shown here are
for two dierent time constants.
plots of each of the individual C4 modules are shown in Fig. 52a and 52b. The
bandwidth and Q-peak of the C4 modules are quite dierent. This is due to the
dierence in output capacitance of each module. The output of the rst C4 is tied to
the input of a buer, which results in a relatively small capacitance. The output of
the second C4, however, is tied into the switch network. Therefore, the output load
capacitance for this device will be much higher due to the parasitics of the switches
and the capacitance of the next circuit in the path. In these experiments, the next
stage was a relatively large buer in an output pad.
When both of the cascaded C4s are set to the same corner frequencies, the output
of the module shows the desired second{order roll{o as shown in Fig. 52c. In all of
these plots, the corner frequencies are shown to be programmable over a wide range
of frequencies. The bias current to corner frequency correlation is dierent for each
7610
2
10
3
10
4
10
5 −50
−40
−30
−20
−10
0
10
Frequency
G
a
i
n
 
(
d
B
)
(a)
10
2
10
3
10
4
10
5 −40
−30
−20
−10
0
10
20
Frequency
G
a
i
n
 
(
d
B
)
(b)
10
2
10
3
10
4
10
5 −50
−40
−30
−20
−10
0
10
Frequency
G
a
i
n
 
(
d
B
)
(c)
Figure 52. The C4 second{order section (SOS) block is comprised of two C4 circuits with a buer in
between. By spreading the corner frequencies of one of the C4 circuits to be far apart, the frequency
response of the other C4 can be measured. This method is used here to generate the frequency
response plots for the rst (a) and the second (b) C4 circuits. Then the frequency response for the
SOS is generated by programming both C4 circuits to the same corner frequencies as shown in (c).
77peak
detector
In Out
C   (SOS) 4
+
−
WR_OTA
Figure 53. This is the circuit diagram for a typical subband system. The incoming signal is
bandpass ltered and then the magnitude of the subband is output from the peak detector. This is
analagous to taking a discrete fourier transform.
of the cascaded devices. However, all of the bias currents for these plots were within
the range of 25 pA to 200 nA.
The coarse{grain components are most useful when they can be combined to form
a larger system. In Fig. 53, a circuit is shown that uses a C4 SOS block, an OTA, and
a peak detector in series. This conguration is very powerful when it is replicated 64,
128, or more times on the FPAA with the center frequencies of the bandpass lters
varying over the desired frequency range. The outputs of the dierent subbands are
analagous to the magnitudes of the discrete Fourier transform. As a test of this
system, data was taken from RASP 1.5 for a single subband. As shown in Fig. 54,
the input is an amplitude{modulated signal with 1.8 KHz and 10.0 KHz frequency
components. The C4 SOS module is biased to have a center frequency near 1.8 KHz,
and the OTA is congured to be a noninverting buer. The output of the system is
shown in Fig. 54. Also, the output of the system is shown after a 2.2 F capacitor
has been added at the output of the peak detector. This change has the eect of
smoothing (i.e., low{pass ltering) the output, thus creating a longer eective time
constant for the system.
780 1 2 3 4 5 6 7 8 9 10
1.2
1.4
1.6
1.8
2
2.2
2.4
2.6
2.8
Time (ms)
V
o
l
t
a
g
e
Raw Peak Detector Output
Peak Detector Output  
with 2.2mF Capacitor
(Peak Detector Input)
AM Modulated Input
C
4 SOS Output
Figure 54. This is the experiment data from the subband system shown in Fig. 53. The input
waveform is an amplitude modulated signal with 1.8 KHz and 10.0 KHz components. The output
of the peak detector is shown with and without an integrating capacitor added to the output stage.
79CHAPTER 6
MIXED{SIGNAL PROTOTYPING PLATFORM
The FPAAs introduced here are designed for use within mixed{signal systems. When
coupled with FPGAs, they can provide a complete mixed{signal rapid prototyping
system. In addition, the same infrastructure that is developed to connect FPGAs and
FPAAs during run{time mode can be used during programming mode to congure
the FPAA.
6.1 Development System
The current development system was originally built for the RASP 1.0 FPAA. How-
ever, it has been easily extended to work with more recent FPAAs (and other custom
oating{gate chips as well). The core of the platform consists of an FPGA develop-
ment board and a daughter card for interfacing to the FPAA (Fig. 55). For simplicity,
an o{the{shelf FPGA board is being used here along with a custom daughter card
developed for the FPAA; however, future versions of this system could easily have
the functionality of both boards integrated into a single mixed{signal recongurable
prototyping board. Work is also progressing on moving much of the programming
functionality shown here onto the FPAA chip [73]. This eort will further simplify
the board{level design of mixed{signal prototyping platforms.
A block diagram of the overall development system is shown in Fig. 56. A program
running on the PC controls the programming at the highest level. The FPGA receives
the high{level commands from the PC and implements the specic protocols to carry
out the comamand. The FPAA daughter card consists of level shifters and D/A
converters to interface from the FPGA to the FPAA, A/D converters for interfacing
from the FPAA to the FPGA, and the appropriate power supplies for running and
programming the FPAA.
80Figure 55. This is a picture of the mixed{signal prototyping platform. It is currently comprised of
an o{the{shelf FPGA development board and a custom daughter card developed for the FPAA.
6.1.1 FPGA Development Board
The FPGA board used is the Nios Development Kit{Stratix Edition (Nios{Stratix)
provided by the Altera Corporation. In addition to the FPGA IC, the Nios{Stratix
board contains a rich array of prototyping devices, including 1 MB of SRAM, 8 MB
of Flash memory, 16 MB of SDRAM, a CompactFlash connector, an Ethernet con-
nector, two serial ports, and two 41{pin general I/O headers for attaching daughter
boards. The FPGA on this board is a Stratix EP1S10, which contains approximately
300,000 general{purpose logic gates, over 920 Kbits of on{chip RAM, and a number of
specialized digital signal processing blocks. This FPGA is large enough to synthesize
a customized soft{core processor (Altera's 32{bit Nios processor), and when needed,
81Software
Environments
Matlab
Custom GUI
Analog
System
Partition
Computer
Programming
Prog.
Control/
C
o
m
m
u
n
i
c
a
t
i
o
n
FPGA
Control/
Data
Data
FPAA
D
/
A
A
/
D
Analog
Inputs
D/A
Digital
System
Partition
Control
P
r
o
g
r
a
m
m
i
n
g
 
 
/
Ethernet
Figure 56. This is a block diagram of the overall mixed{signal development system. The FPGA
receives the high{level commands from the PC and implements the specic programming protocols
to perform the specied operation to the FPAA.
specialized hardware modules that handle timing{critical communication between the
FPAA daughter card and the soft{core processor. The Nios processor controls the
overall programming system and handles the Ethernet communications with the PC.
6.1.2 FPAA Daughter Card
The FPAA daughter card is a custom printed circuit board that has been designed
to interface directly with the I/O headers on the Nios{Stratix FPGA board. The
daughter card has three primary functions:
1. Provide the physical interface between the FPAA and the FPGA
2. Provide supply voltages for the FPAA in both programming and operational
modes
3. Provide current and voltage feedback data to the FPGA for controlling the
programming algorithms and analyzing systems implemented on the FPAA
The interface to the FPGA is complicated by the requirement that the FPAA's
supply voltage must be able to vary during programming. Thus, the digital control
signals coming from the FPGA must be level shifted to be compatible with the supply
82voltage of the FPAA at any given time during the programming process. Open{
collector inverting buers are used for this purpose; the outputs of the inverters are
pulled up to the FPAA's power supply with resistor networks.
The FPAAs discussed in this thesis are designed to operate at very low currents.
Bringing these currents o of the FPAA while maintaining a reasonable SNR is very
dicult. In addition to using standard board design techniques, great care must
be taken when dealing with small currents. For example, current outputs can not
be routed through an I/O header block, because the pin{to{pin resistance for stan-
dard headers is in the tens of megaohms range. This can lead to nanoampere or
even low microampere leakage currents between the pins, which will drastically aect
measurements when the desired currents are at or below this range. Ideally, current{
to{voltage converters will be integrated onto future FPAA ICs and currents will not
be dealt with at the board level. However, on legacy chips, pins must be raised o of
the board and connected directly to picoammeters to achieve reliable measurements.
6.2 Programming Software
There are three dierent levels of software currently used to program the RASP
FPAAs. At the most abstracted level, a graphical schematic{entry environment is
used to specify the desired circuit. Next, a Matlab script is used to initiate pro-
gramming of the individual switches and biasing transistors needed to implement the
circuit. Finally, rmware running on the the soft{core processor (synthesized on the
FPGA) receives the commands from the PC and executes the programming protocols
to set the individual oating{gate transistors at the desired current levels.
6.2.1 Graphical Circuit Specication
A custom Visual Basic program has been written to enable graphical specication of
the desired circuit. Design entry is completed by manually selecting the rows and
columns for each signal path in the desired circuit. As shown in Fig. 57, RASP 1.0 is
83Figure 57. This is a screen capture of the graphical FPAA conguration software. This program
allows the user to enter a circuit diagram graphically and export a text le with netlist for the
entered circuit. This version of the software is customized for the RASP 1.0 FPAA. A circuit is
entered by clicking on the switch symbols (squares) to specify each on switch.
84small enough to allow the entire switch network and CAB components to be displayed
on a single screen. This greatly simplies the program code and design entry. Larger
FPAA designs will require that more complexity be added to the chip view in this
software.
To enter a design, the user clicks on the individual switches (illustrated by square
checkboxes) to turn them on or o. When a switch is turned on, an implicit connection
is made between the row and column of the switch. Multiple switches can be turned
on for any given row or column. Selecting Preview from the File menu will hide the
o switches and explicitly show the column and row connections. As an example,
the second{order section (SOS) circuit introduced in Fig. 47 was entered into this
program. The Preview{mode view of this circuit is shown in Fig. 58.
Bias currents can be specied for each of the CAB components by clicking on the
respective component in the program window. A unique dialog box will appear for
each CAB component that allows the user to specify the appropriate biases for that
component. These biases are not used or displayed elsewhere within this program,
but they are exported in the output netlist le.
Once a circuit design is entered into the graphical conguration program, a circuit
netlist is exported. This netlist contains the row and column indices for each switch
that needs to be turned on and pertinent information for each of the CAB component
bias transistors being used. The netlist le for the SOS circuit illustrated in the
previous gure is shown in Fig. 59. As can be seen in this gure, the switches are
listed without voltage and current information, since switch programming is uniformly
implemented in the programming code. The bias transistors, however, must have
voltage and current values specied. In this case, there are three biases (for the three
OTAs in the circuit) that must be set. The current values for these biases are all
specied at a gate voltage of 0.3 V, since that is the operating gate voltage of the
oating{gate transistors during run mode.
85Figure 58. In Preview mode, the software connects the switches that are turned on and turns the
rest of the switches o. This view allows the user to more easily verify the entered circuit design.
Here, the second{order section (SOS) shown in Fig. 47 has been entered.
86Switch?, Row, Column, Voltage, Current
1, 12, 13, 0, 0
1, 15, 14, 0, 0
1, 18, 14, 0, 0
1, 13, 14, 0, 0
1, 16, 15, 0, 0
1, 19, 15, 0, 0
1, 14, 14, 0, 0
1, 17, 15, 0, 0
1, 20, 14, 0, 0
1, 32, 15, 0, 0
1, 33, 13, 0, 0
0, 8, 16, 0.3, 300e-9
0, 9, 16, 0.3, 300e-9
0, 10, 16, 0.3, 340e-9
Figure 59. The graphical FPAA conguration software outputs a netlist that includes the complete
circuit specication. This sample netlist le is the output for the second{order section circuit as
entered in Fig. 58. The row and column numbers used here are for the RASP 1.0 FPAA. This
netlist can be directly converted into a Matlab script that will congure the FPAA to synthesize the
specied circuit.
6.2.2 High{level Matlab Code
Matlab is correctly used to implement the high{level command and control of the
programming circuitry. During development, the details of the programming algo-
rithms were also implemented in Matlab. However, as settings were solidied, most
of the low{level details of programming were moved to the rmware executing on the
FPGA. At this point, Matlab is used primarily for its ability to easily script the pro-
gramming commands for a given circuit. In addition, Matlab is used during run{time
to retrieve data from the FPGA and computer{controlled measurement equipment.
It is then used to view and analyze the data.
6.2.3 Low{level C Code
The FPGA has a customized Nios soft{core processor synthesized on it. The rmware
running on this processor is written in C and controls the low{level details of pro-
gramming. The Nios processor has SPI peripherals implemented on the FPGA that
communicate with the D/A converters on the FPAA daughter card to control the
voltages needed during programming. The Nios processor also controls the external
picoammeter via a serial port peripheral. Implementing the programming algorithms
87directly in rmware speeds up the programming process by avoiding the overhead
incurred in computer{to{FPGA communications.
88CHAPTER 7
FPAAS: A ROADMAP TO THE FUTURE
Large{scale FPAAs are feasible. While previous FPAAs have suered from their
small size and lack of functionality/generality [4, 47, 53, 58], next{generation FPAAs
based on the architecture introduced in this thesis can overcome these challenges,
thereby extending the usefulness and acceptance of FPAAs. In addition, large{scale
FPAA designs with computational logic included at multiple levels of granularity will
address the complex design space that analog designs entail (including a wide{range
of linear and non{linear functions) while keeping switch parasitics minimized.
The work outlined in this thesis is only the beginning of modern FPAA research.
As a direct result of the research explored in this thesis, a number of dierent research
avenues have emerged. Many of these directions are already being pursued by doctoral
students within the CADSP group, and others will be explored as time and resources
become available. In addition to future research, a start{up company coming out of
the CADSP group is working to commercialize this FPAA technology. Impetus on the
development side of FPAAs in cooperation with the research activities in the CADSP
group will provide an ideal environment for pushing FPAAs from a laboratory testbed
to a robust, o{the{shelf solution.
7.1 Next{generation RASP
The next{generation RASP FPAA architecture has been developed as a part of this
thesis. Chips based on this architecture have been designed with as many as 72 CABs
on them. Several designs are currently being fabricated. One of those FPAAs, dubbed
RASP 2.5, is described in more detail in this section. A top{level block diagram for
RASP 2.5 is shown in Fig. 60.
899
CAB CAB CAB CAB CAB CAB CAB CAB
CAB CAB CAB CAB CAB CAB CAB CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
(small)
CAB
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
L2 I/O
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
L2 I/O
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
L2 I/O
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
L2 I/O
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
L2 I/O
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
L2 I/O
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
L2 I/O
L1
I/O
L1
I/O
L1
I/O
L1
I/O
L1
I/O
L1
I/O I/O
L1
I/O
L1
9−bit Decoder
Programming Decoder
P
r
o
g
r
a
m
m
i
n
g
 
D
e
c
o
d
e
r
9
−
b
i
t
 
D
e
c
o
d
e
r
Vert. Decoder
Horiz. Decoder
Control
7
9
7
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
Figure 60. This is the top{level block diagram for the RASP 2.5 FPAA. It has non{homogeneous
mesh of CABs with 16 full CABs and 40 small CABs for a total of 56 CABs.
7.1.1 Switch Networks
As shown in Fig. 61a, the routing architecture of our large{scale FPAAs is a combi-
nation of global and local switch networks. Each CAB has an associated local switch
network for making connections within a single CAB. The switches' source lines are
routed along the rows and connect the inputs and outputs of each CAB to the switch
network. The drain lines of the switches are connected along the columns. By turning
a switch on, a single row (source) can be connected through the switch to a single
column (drain).
The size of the switch network is dependent upon the number of I/O lines in each
CAB. For the design shown in Fig. 61, there are two types of CABs. For the large
CABs, the local switch networks are comprised of a 10 x 42 matrix of switches; in the
small CABs, the local switch network is a 10 x 32 matrix. Each local switch network is
integrated into a matching global routing switch network. The global routing switch
908
8
10 10
Block
Analog
Computational
Block
Analog
Computational
Block
Analog
Computational
Block
Analog
Computational
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
(Small CAB) (Small CAB)
10 x 8 10 x 8
20 x 46
Crossbar Network
20 x 46
Crossbar Network
20 x 32
Crossbar Network
20 x 32
Crossbar Network
10 x 8 10 x 8
Analog Circuit Components
+
−
WR_OTA
+
−
WR_OTA
+
−
WR_OTA
In Out
C   (SOS) 4
min
max
(a) (b)
Figure 61. (a) This block diagram shows the routing architecture for RASP 2.5. The switching
interconnects are fully connectable crossbar networks built using oating{gate transistors. (b) This
FPAA has two slightly dierent CABs. The large or regular CAB contains a 4 x 4 matrix multiplier,
three wide{range OTAs, three xed{value capacitors, a C4 SOS, a peak detector, and two FET
transistors. The small CAB is the same except it does not include the 4 x 4 matrix multiplier. This
design includes large CABs at the top and bottom of each column and small CABs in between.
network allows local signals from a CAB to be connected to the global routing busses
and be routed o the chip or to another CAB. There are also 10 x 8 switch networks
at each junction of the horizontal and vertical global routing busses.
7.1.2 Computational Analog Blocks
The computational logic in RASP 2.5 is organized in a compact CAB providing a
naturally scalable architecture. CABs are tiled across the chip in a regular mesh{
type architecture with busses and local interconnects in between as shown in Fig. 61a.
Many potential CABs can be imagined using this technology. Figure 61b shows
one example CAB, whose functionality is enhanced by a mixture of ne{grained,
medium{grained, and coarse{grained computational blocks similar to many modern
FPGA designs. The computational blocks were carefully selected to provide a suf-
ciently exible, generic architecture while optimizing certain frequently used signal
processing blocks. For generality, three OTAs are included in each CAB. OTAs have
91Figure 62. This is the top{level layout for the RASP 2.5. It is comprised of 16 large CABs (on the
top and bottom rows) and 40 small CABs for a total of 56 CABs. In TSMC 0.35-micron process, it
covers an area of approximately 9 mm2.
already been shown to be eective at implementing a large class of systems includ-
ing amplication, integration, ltering, multiplication, exponentiation, modulation,
and other linear and non-linear functions [25, 64, 69, 70]. In addition, the two FET
devices provide the ability to perform logarithmic and exponential functions, as well
as convert from current to voltage and vice versa. The three capacitors are xed in
value to minimize the size of the CAB and are primarily used on the outputs of the
OTAs; however, they will be available for any purpose. The variable capacitor and/or
current mirror banks found in some designs are not needed here, because the use of
92Figure 63. This picture is a more detailed look at the small CAB and associated local switch
network.
oating{gate transistors in the OTAs will give the user sucient control in program-
ming the transconductance of the ampliers [37, 64]. Eliminating the capacitor banks
creates a large savings in the area required for each CAB.
The high{level computational blocks used in this design are an SOS bandpass
lter module comprised of two C4 and the 4 x 4 vector{matrix multiplier block. In
general, the C4 SOS module provides a straightforward method of subbanding an
incoming signal. This is analagous to performing a Fourier transform. The vector{
matrix multiplier block allows the user to perform a matrix transformation on the
incoming signals. Together these blocks can be used like a Fourier processor [43, 51].
In addition, a peak detector is added to each CAB.
The architecture illustrated in Fig. 61a is non-homogeneous in that there are two
93Figure 64. This picture is a more detailed look at the full CAB and associated local switch network.
dierent CABs tiled across the chip. The small CAB is identical to the large CAB
except it does not include the vector{matrix multiplier module. Since the vector{
matrix multiplier takes four inputs and each input will often be derived in a separate
CAB (from a separate subband created by the C4 SOS module), designs will typically
utilize only one vector{matrix multiplier for every four CABs. Thus, FPAAs that have
50-100 CABs can be made more compact by removing the vector{matrix multiplier
from all the CABs except those on the top and bottom rows (assuming the FPAA is
more or less square in design).
A sample FPAA based on this architecture with 56 CABs (16 large CABs and 40
94￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
CAB
CAB
CAB
CAB
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
megaCAB
CAB
CAB
CAB
CAB
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
megaCAB
CAB
CAB
CAB
CAB
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
megaCAB
CAB
CAB
CAB
CAB
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
 
￿
 
￿
 
￿
 
￿
 
 
￿
 
￿
 
￿
 
￿
 
!
￿
!
￿
!
￿
!
￿
!
!
￿
!
￿
!
￿
!
￿
!
"
￿
"
￿
"
￿
"
￿
"
"
￿
"
￿
"
￿
"
￿
"
megaCAB
#
￿
#
￿
#
￿
#
#
￿
#
￿
#
￿
#
#
￿
#
￿
#
￿
#
$
￿
$
￿
$
$
￿
$
￿
$
$
￿
$
￿
$
%
￿
%
￿
%
￿
%
%
￿
%
￿
%
￿
%
%
￿
%
￿
%
￿
%
&
￿
&
￿
&
&
￿
&
￿
&
&
￿
&
￿
&
’
￿
’
￿
’
’
￿
’
￿
’
’
￿
’
￿
’
(
￿
(
￿
(
(
￿
(
￿
(
(
￿
(
￿
(
10 x 10 10 x 10
10 x 10 10 x 10
Figure 65. This shows an example of future FPAAs with hundreds of CABs. As FPAAs scale up
in size, more tiers of routing must be added to the heirarchy. Here, global vertical and horizontal
busses (tier 1) connect clusters of CABs, dubbed megaCABs, together.
small CABs) on a single chip is being fabricated in TSMC 0.35-micron; it covers an
area of approximately 9 mm2. A picture of the top{level layout is shown in Fig. 62,
and more detailed pictures of the small and large CABs are shown in Fig. 63.
7.2 Future FPAAs
Commercially viable FPAAs are forseen that have hundreds if not thousands of
CABs based on this same architecture. These designs use a heirarchical routing
structure with more levels than has been shown. An example of a 512 CAB FPAA
is shown in Fig. 65. The CABs are clustered in groups of four with a routing scheme
9510 10
megaCAB
Routing
Global
Routing
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
￿
30 x 41
Crossbar Network
+
−
+
−
+
−
30 x 41
Crossbar Network
+
−
+
−
+
−
30 x 41
Crossbar Network
+
−
+
−
+
−
+
−
+
−
+
−
30 x 46
Crossbar Network
10 x 10
Figure 66. In future FPAAs with hundreds of CABs, CABs are clustered together in megaCABs
that contain four to eight CABs. megaCAB{level routing (tier 2 routing) connects the local switch
networks (tier 3 routing) together.
96similar to RASP 2.5. These CAB clusters have been dubbed megaCABs and an
example is shown in Fig. 66. Within each megaCABs, CABs can be varied slightly
to provide more exibility. In this example, only one of every four CABs contains
a vector{matrix multiplier. The other CABs then have additional transistors and a
signal{by{signal multiplier.
An FPAA with this architecture can synthesize 512{channel subbanding, a 44{
point discrete cosine transform, 256 third{order ladder lters, or many other circuit
combinations. Larger systems can be built as well. As is shown in Section 7.3,
this FPAA can implement a 32{channel auditory feature extraction system, a 128{
channel cochlear model, or a low{power hearing aid algorithm. These examples are
just a sample of the full systems that can be achieved on large{scale FPAAs. In
addition, FPAAs based on this architecture can be eectively used to prototype analog
front{ends for digital systems, broadband processing for high{speed communication
systems (allowing low{speed, baseband A/D converters), real{time image processing,
low{power distributed sensor networks, and data acquisition systems.
7.3 Testbed Applications for Large{Scale FPAAs
To illustrate the viability of large{scale FPAAs, several distinct applications will be
discussed. Their functionality will be shown along with their implementation on
large{scale FPAAs.
7.3.1 Auditory Feature Extraction
Extraction of features from an incoming audio stream is a key component in au-
dio classication and auditory scene analysis systems. The algorithm illustrated in
Fig. 67 has been developed previously and digital implementations exist. However,
for embedded processing environments, this algorithm is computationally intensive.
By implementing most or all of this algorithm in analog hardware, a big savings in
time and power can be achieved.
97Low−pass
Filter
Log
Compression
Transform
Cosine
Discrete
Half−wave
Rectification
Spatial
Derivative
Time
Derivative
Bandpass
Filter
Low−pass
Filter
Log
Compression
Transform
Cosine
Discrete
Half−wave
Rectification
Spatial
Derivative
Time
Derivative
Bandpass
Filter
Low−pass
Filter
Log
Compression
Transform
Cosine
Discrete
Half−wave
Rectification
Spatial
Derivative
Time
Derivative
Bandpass
Filter
Input
D(0) D(1) D(K−1)
Figure 67. This is the block diagram for a digital feature extraction algorithm. Typically, 32{128
channels are needed for this algorithm.
h
I0 I1 I2 I3
i
=
2
6 6 6
4
1  1 0 0
0 1  1 0
0 0 1  1
0 0 0 0
3
7 7 7
5
h
(V0) (V1) (V2) (V3)
i
Figure 68. The vector{matrix multiplication module is used to compute the rst{order dierence
equation for three of the channels.
981 −1
I−V
w/ Log Comp.
In Out
C   (SOS) 4
peak
detector
K VMMs
with
DCT
1 −1
I−V
w/ Log Comp.
In Out
C   (SOS) 4
peak
detector
K VMMs
with
DCT
1 −1
I−V
w/ Log Comp.
In Out
C   (SOS) 4
peak
detector
K VMMs
with
DCT
In Out
C   (SOS) 4
Input
D(0) D(1) D(K−1)
I I I
Figure 69. This is the block diagram for the analog feature extraction algorithm as implemented
on an FPAA. Without the DCT block, a 32{channel system can be synthesized on RASP 2.5. For
a 32{channel system, the DCT blocks will require 64 4 x 4 vector{matrix multiplier blocks. The
future FPAA outlined in Section 7.2 has 128 of these blocks and could implement the entire feature
extraction algorithm shown here.
The analog blocks required to implement this function are shown in Fig. 69. As
shown in this gure, the bandpass lter module and time derivative can be combined
into a C4 SOS bandpass lter module. The spatial derivative can be computed with
a rst{order dierence of the neighboring subband using the vector{matrix multiplier
to compute three subbands (see Fig. 68). Finally, the half{wave rectication and
temporal averaging are achieved with a peak detector programmed to an appropriate
time constant.
The RASP 2.5 FPAA is large enough to implement a 32{channel version of this fea-
ture extraction algorithm with the exception of the discrete cosine transform (DCT).
In this case, the analog/digital boundary would fall between the peak detector and
the DCT as shown by the dashed line in Fig. 69.
To implement the entire algorithm in analog, the future FPAA outlined in Sec-
tion 7.2 would need to be used. With 128 4 x 4 vector{matrix multiplier blocks,
99In Out
C   (SOS) 4
In Out
C   (SOS) 4
In Out
C   (SOS) 4
Matrix
Transformation
Matrix
Transformation
Matrix
Transformation
I1 I0 IN−1
Input
Output
In Out
C   (SOS) 4
peak
detector
In Out
C   (SOS) 4
peak
detector
Gain
Transformation
In Out
C   (SOS) 4
peak
detector
Gain
Transformation
Gain
Transformation
I0 I1 IN−1
Input
Output
(a) (b)
Figure 70. These generalized block digram typify many audio processing functions, including noise
suppression, audio classication, and hearing aids. The various RASP architectures have been
optimized to implement one or both of these system ows.
a 32 x 32 matrix transformation implementing the DCT can be implemented along
with the rst{order dierence equations needed for each channel.
In addition to the feature extraction algorithm outlined above, a number of dier-
ent audio and speech processing functions can be realized on large{scale FPAAs with
architectures similar to the RASP FPAAs. The generalized block diagrams shown in
Fig. 70 are typical of many audio processing functions, including noise suppression,
audio classication, and hearing aids. The various RASP architectures have been
optimized to implement one or both of these ows.
7.3.2 Neuromorphic Modeling
It is natural to use analog circuits to model neural systems [61]. Models of the basic
neural processes|such as axons, synapses, dendrites, action potentials, and whole
neurons|are often not very large when done in isolation. However, modeling full
neuromorphic systems requires hundreds or thousands of these basic blocks. The
size and functionality of the large{scale FPAAs introduced here will enable the rapid
prototyping of neuromorphic systems, such as networks of neurons, the cochlea, and
more.
100bias2 V
bias1 V
in V
1 C
2 C
out V
bias2 V
bias1 V
in V
1 C
2 C
ref V
out V
+
−
WR_OTA
(a) (b)
Figure 71. These circuit diagrams illustrate two slight variations of the self{resetting (or integrated{
and{re) neuron from [61]. Both circuits can be implemented in RASP 2.5; however, the circuit in
part (a) requires one more CAB than the circuit in part (b) because of the six transistors needed.
One basic model of the neuron is Mead's self{resetting neuron [61]. Two slight
variations of this design are shown in Fig. 71. Both of these circuits can be imple-
mented in RASP 2.5; however, the transistor{only version requires an additional CAB
because of the six transistors needed.
Like the variety of neuron models, a number of dierent models exist for building
larger neuromorphic systems. As an example, two models of the cochlea are shown
in Fig. 72. Both of these systems can be synthesized with RASP FPAAs. Typically,
these models will be used with 32 channels. The center frequencies of the C4 SOS
modules are spaced logarithmetically through the human hearing range.
Modeling more complex neuromorphic systems often requires adaptability. It has
been shown previously that oating{gate transistors are well{suited to implementing
adaptive systems []. An adaptive CAB based on this technology could be built that
101In Out
C   (SOS) 4
In Out
C   (SOS) 4
In Out
C   (SOS) 4
In Out
C   (SOS) 4
In Out
C   (SOS) 4
High
Frequency
Low
Frequency
Input
Center Frequencies
Logarithmically Spaced
(a)
Vector−Matrix
Multiplier
Vector−Matrix
Multiplier
Vector−Matrix
Multiplier
Vector−Matrix
Multiplier
Vector−Matrix
Multiplier
In Out
C   (SOS) 4
In Out
C   (SOS) 4
High
Frequency
Low
Frequency
In Out
C   (SOS) 4
In Out
C   (SOS) 4
In Out
C   (SOS) 4
I−V
Converter
I−V
Converter
I−V
Converter
I−V
Converter
I−V
Converter
Input
Logarithmically Spaced
Center Frequencies
(b)
Figure 72. These circuit diagrams illustrate two models of the cochlea. Both of these systems can
be synthesized on the RASP FPAAs. Typically, 32 channels will be used with the center frequencies
of the variations spaced logarithmetically throughout the auditory range.
would readily t within the RASP architecture. This ability to combine general{
purpose computational elements with blocks that can enhance the synthesize of spe-
cic signal processing functions is an important feature of the FPAA architecture and
framework presented in this thesis.
7.4 Future Research
More than just a single device, the FPAA enables a whole new design paradigm. The
FPAA architecture and infrastructure developed as a part of this thesis form a design
methodology that can be applied to an ever{widening scope of problems. The FPAA
framework introduces programmability, recongurability, and reuseability to analog
circuit design. While FPAAs have existed before this thesis, they have not had the
size or functionality to enable true system{level design, and thus, they have failed to
be the enabling technology that takes rapid prototyping of analog systems into the
mainstream.
102Approaching FPAA design from the perspective of design methodology|instead
of just circuit design|has led to a number of dierent avenues of research. These
research directions are a direct result of this FPAA research and can be broadly
classied into three areas: enabling technologies, computational logic, and computer{
aided design (CAD) of analog systems.
7.4.1 Enabling Technologies
Developing large{scale FPAAs has created a need for new and improved methods of
programming. The increased size alone mandates that switches must be programmed
quickly or the large device is handicapped by the amount of time required to pro-
gram it. The vast majority of the oating{gate transistors on large{scale FPAAs
are switches, and programming switches does not typically require a high degree of
accuracy. Thus, a separate programming scheme can be used for switches that is
optimized for speed to the detriment of programming accuracy.
An additional increase in programming speed can be achieved by moving the
programming control logic on{chip. When this is done, the programming logic can
be replicated for each row of the device, and a row{parallel programming scheme can
be implemented. The speed{up that can be realized in this scheme is signicant and
is an ongoing research project in the CADSP group.
The architecture of these large{scale FPAAs requires large switch networks. Even
in a routing{hierarchy approach, the local switch interconnects will need to be a
crossbar switch to enable maximum connectivity. As can be seen in the layout pic-
ture in Fig. 62, the switch networks consume a large area. Making these networks
denser would be a large benet in terms of cost and scalability. Research on the
core oating{gate technology is ongoing. Researchers in the CADSP group are cur-
rently characterizing alternative methods of programming (particularly tunnelling)
oating{gate transistors.
1037.4.2 Computational Elements
A number of dierent CAB designs have been proposed within the CADSP group.
Researchers are currently working on CAB designs that include minor variations to
the RASP CABs, designs optimized for high{speed operation, and designs based on
an emerging logic circuits such as multiple{input translinear elements (MITEs). Some
of these designs will t directly in the RASP architecture with no signicant change to
the routing interconnects or control infrastructure. Other designs, such as the MITE
FPAAs, are new designs that have been developed from the framework introduced
here.
7.4.3 CAD Tool Flow for Analog Systems
When FPAAs are small, manually specifying placement and routing of circuits onto
the FPAAs architecture is not too burdensome. However, as FPAAs increase in size to
include complex CABs, non{homogeneous meshes of computational logic, and hierar-
chical routing interconnects, manual design becomes impractical or even impossible.
Thus, a CAD tool ow needs to be developed for FPAAs. As shown in Fig. 73, the
ideal CAD tool ow would support a high{level entry methodology such as VHDL and
VHDL-AMS for mixed{signal hardware design specication. The CAD tools would
then automatically partition the system into the optimal balance of digital and analog
hardware and issue separate digital and analog netlists. Each netlist would then pro-
ceed through the synthesis, placement, routing, and programming of the respective
recongurable devices (FPGA and FPAA). As outlined in Section 2.3.3, some work
has already progressed on the analog side. Most of this work is targeted at smaller
FPAAs such as Anadigm's FPAAs. It has yet to be shown if these algorithms will
scale to the larger FPAAs discussed here. Researchers in the CADSP group have
begun to look at the challenges of placement and routing for large{scale FPAAs and
are using the RASP 2.5 architecture as a testbed target for their software.
104(schematic, HDL, etc.)
A/D Partitioner
Synthesis Synthesis
Place & Route
FPGA Programmer
Place & Route
FPAA Programmer
Design Entry
Mixed−Signal Logic Compiler
Mixed−Signal Prototyping Platform
FPGA FPAA
Figure 73. This is a block diagram of the ideal tool ow to enable mixed{signal rapid prototyping.
Here, the CAD toolow accepts a high{level, mixed{signal system description (i.e., a VHDL-AMS
description or graphical schematic) and then performs an automated partitioning of the system into
an optimal balance of digital and analog circuitry. The resulting analog and digital netlists are then
passed through separate synthesis, place and route, and programming tool ows that are targeted
for the specic devices on the development system.
7.5 Commercialization
As FPAA research has progressed, academia and industry have shown increased in-
terest in this work. Because of this interest, the FPAA research discussed here is
covered in a pending patent [18]. In particular, GTronix, a start{up company coming
from Georgia Tech's VentureLabs incubator, has received funding to pursue commer-
cialization of an FPAA on the scale of the RASP 2.5. This eort will result in much
development in order to transform the RASP FPAA architecture from a laboratory
testbed into a viable commercial product. It is also anticipated that by moving the
development associated with large{scale VLSI design into the corporate environment,
105students and faculty in the CADSP group will be able to focus on advancing the re-
search.
7.6 Original Contributions
This thesis has detailed the investigation of a novel, large{scale FPAA that is analo-
gous to current commercial FPGAs in its size, usefulness, and exibility. The engi-
neering contribution of this work can be summarized as follows:
1. System{level architecture of a oating{gate FPAA: The system{level
architecture of an FPAA based on oating{gate technology has been presented.
A novel, large{scale FPAA architecture has been specied and two testbed
FPAAs based on this design have been fabricated and tested. Using analysis,
simulation, and experimental data, these FPAAs have been shown to be highly
exible, functional, and generic in nature. The feasibility of large{scale, useful
FPAAs based on this design has been shown.
2. Mixed{signal prototyping platform for signal processing applications:
A mixed{signal prototyping platform has been created. By combining large{
scale FPAAs and FPGAs together into a single system, the rst realistically
viable mixed{signal prototyping platform is demonstrated.
3. Roadmap for future FPAAs: This research has laid the foundation of a new
genre of research in recongurable analog systems. By showing the viability of
large{scale FPAAs, this eort has renewed interest in the research and develop-
ment of recongurable analog systems in both academia and industry. Within
the CADSP group alone, FPAA research has transitioned from a single project
to being a dening thrust of the group. Not only is a core team of researchers
now dedicated to FPAA research, but the design methodologies introduced by
recongurable logic have permeated many of the research projects within the
106group (see Section 7.4). On the industry side, a number of corporations have
shown an interest in the research presented here, and a start{up company has
been created and funded to commericialize an FPAA based on this work (see
Section 7.5).
7.7 Conclusion
Large{scale FPAAs based on oating{gate technologies provide the necessary levels of
programmability and functionality to implement complex signal processing systems.
With orders{of{magnitude savings in power consumption over traditional digital ap-
proaches, this recongurable analog technology oers an attractive alternative for
implementing advanced signal processing systems in low{power embedded systems.
The oating{gate transistors are shown to provide a compact switch that exhibits
relatively at resistance characteristics across the full operating voltage and can be
programmed to be an active circuit element (e.g., variable resistor).
The technologies and architectures explored in this thesis have been demonstrated
in fabricated, testbed FPAAs. Characterization and system{level data have been
experimentally acquired. Systems implemented on these FPAAs have shown that
FPAAs are highly exible, and individual computational elements have proved to be
programmable across a wide range of frequencies, Q-peaks, bandwidths, and time
constants.
Large{scale FPAAs based on the testbed architectures have been designed and
are currently being fabricated. However, more than just a single design, large{scale
FPAAs enable a whole new design paradigm. The FPAA architecture and infras-
tructure developed as a part of this thesis form a design methodology that can be
applied to an ever{widening scope of problems. This FPAA framework introduces
programmability, recongurability, and reuseability to analog circuit design.
107REFERENCES
[1] Adams, W. J., Nedungadi, A., and Geiger, R. L., \Design of a pro-
grammable OTA with multi{decade transconductance adjustment," in Proceed-
ings of the International Symposium on Circuits and Systems, vol. 1, pp. 663{666,
May 1989.
[2] Anadigm, http://www.anadigm.com/Prs 15.asp/, Anadigm Company Fact
Sheet, May 2003.
[3] Anadigm, http://www.anadigm.com/Supp 05.asp/, Anadigm FPAA Family
Overview, Mar. 2003.
[4] Anderson, D., Marcjan, C., Bersch, D., Anderson, H., Hu, P.,
Palusinski, O., Gettman, D., Macbeth, I., and Bratt, A., \A eld pro-
grammable analog array and its application," in Proc. IEEE Custom Integrated
Circuits Conference, May 1997.
[5] Anderson, D. V., Hasler, P., Ellis, R., Yoo, H., Graham, D. W., and
Hans, M., \A low{power system for audio noise suppression: A cooperative
analog{digital signal processing approach," in Proc. 2002 DSP Workshop, (Pine
Mountain, GA), Oct. 2002.
[6] Bandyopadhyay, A. and Hasler, P., \A fully programmable CMOS block
matrix transform imager architecture," in Proceedings of the 2003 IEEE Custom
Integrated Circuits Conference, pp. 189{192, Sept. 2003.
[7] Becker, J. and Manoli, Y., \A continuous{time eld programmable analog
array (FPAA) consisting of digitally recongurable Gm{cells," in Proceedings of
108the 2004 IEEE International Symposium on Circuits and Systems, pp. I.1092{
I.1095, May 2004.
[8] Binns, R. J., Hallam, P., Mack, B., and Massara, R., \High{level design
of analogue circuitry using an analogue hardware description language," in IEE
Colloquium on Mixed{Signal AHDL/VHDL Modelling and Synthesis, pp. 3/1{
3/8, Nov. 1997.
[9] Birkner, J. M. and Chua, H.-T., \Programmable array logic circuit."
U.S. Patent No. 4,124,899.
[10] Bratt, A., \Motorola eld programmable analogue arrays, present hardware
and future trends," in IEE Half{day Colloquium on Evolvable Hardware Systems,
pp. 1/1{1/5, Mar. 1998.
[11] Bridges, S., Figueroa, M., Hsu, D., and Diorio, C., \Field{programmable
learning arrays," in Advances in Neural Information Processing Systems 15
(Becker, S., Thrun, S., and Obermayer, K., eds.), pp. 1155{1162, Cam-
bridge, MA: MIT Press, 2002.
[12] Chang, S., Hayes-Gill, B., and Paul, C., \Multi-function block for a
switched current eld programmable analog array," in 1996 Midwest Symposium
on Circuits and Systems, Aug. 1996.
[13] Chow, P., Seo, S. O., Rose, J., Chung, K., Paez-Monzon, G., and Ra-
hardja, I., \The design of an SRAM{based eld{programmable gate array|
part i: architecture," IEEE Transactions on Very Large Scale Integration Sys-
tems, vol. 7, pp. 191{197, June 1999.
[14] Chow, P., \A eld{programmable mixed{analog{digital array," Master's thesis,
University of Toronto, 1994.
109[15] Chow, P. and Gulak, P. G., \A eld{programmable mixed{analog{digital
array," in Proc. 3rd International ACM Symposium on Field{Programmable Gate
Arrays, pp. 104{109, ACM Press, 1995.
[16] Dudek, P. and Hicks, P. J., \A CMOS general{purpose sampled{data analog
processing element," IEEE Transactions on Circuits and Systems II, vol. 47,
pp. 467{473, May 2000.
[17] Dudek, P. and Hicks, P. J., \A CMOS general{purpose sampled{data ana-
logue microprocessor," in Proceedings of the International Symposium on Circuits
and Systems, vol. 2, pp. 417{420, May 2000.
[18] Duggar, J. D., Hall, T. S., Hasler, P., Anderson, D. V., Smith, P. D.,
Kucic, M. R., and Bandyopadhyay, A., \Floating{gate analog circuit."
U.S. Patent Application No. 20030183871, 2003. Patent Pending.
[19] Dugger, J. and Hasler, P., \A continuously-adapting analog node using
oating-gate synapses," in IEEE Midwest Conference on Circuits and Systems,
(East Lansing, MI), Aug. 2000.
[20] Edwards, R. T., Strohbehn, K., and Jaskulek, S. E., \A eld{
programmable mixed{signal array architecture using antifuse interconnects," in
Proceedings of the International Symposium on Circuits and Systems, vol. 3,
pp. 319{322, May 2000.
[21] Ellis, R., Yoo, H., Graham, D., Hasler, P., and Anderson, D., \A
continuous{time speech enhancement front{end for microphone inputs," in Pro-
ceedings of the IEEE International Symposium on Circuits and Systems, vol. 2,
(Phoenix, AZ), pp. II.728{II.731, May 2002.
110[22] Ellis, R., Yoo, H., and Anderson, D. V., \An analog oating-gate IC
for audio noise suppression," in Proceedings of the International Symposium on
Circuits and Systems, (Phoenix, AZ), May 2002. Invited Paper.
[23] Embabi, S., Quan, X., Oki, N., Manjrekar, A., and Sanchez-Sinencio,
E., \A eld programmable analog signal processing array," in IEEE 39th Midwest
Symposium on Circuits and Systems, vol. 1, pp. 151{154, Aug. 1996.
[24] Enright, D. and Mack, R. J., \A hierarchical technique to model paramet-
ric device variation in non{linear analogue CMOS architectures," in IEE Col-
loquium on Mixed{Signal AHDL/VHDL Modelling and Synthesis, pp. 1/1{1/7,
Nov. 1997.
[25] Fast Analog Solutions Ltd., http://www.zetex.com, Totally re-congurable ana-
log circuit { TRAC, Mar. 1999.
[26] Faura, J., Lacadena, I., Torralba, A., and Insenser, J. M., \Pro-
grammable analog hardware: a case study," in IEEE International Conference
on Electronics, Circuits and Systems, vol. 1, pp. 297{300, Sept. 1998.
[27] Franz, G., \Digital signal processor trends," IEEE Micro, vol. 20, pp. 52{59,
Nov{Dec 2000.
[28] Ganesan, S. and Vemuri, R., \FAAR: A router for eld{programmable analog
arrays," in Proc. 12th International Conference on VLSI Design, pp. 556{563,
Jan. 1999.
[29] Ganesan, S. and Vemuri, R., \A methodology for rapid prototyping of analog
systems," in International Conference on Computer Design, pp. 482{488, Oct.
1999.
111[30] Ganesan, S. and Vemuri, R., \Technology mapping and retargeting for eld{
programmable analog arrays," in Proc. Design, Automation and Test in Europe
Conference and Exhibition, pp. 58{64, Mar. 2000.
[31] Ganesan, S. and Vemuri, R., \Technology mapping and retargeting for eld{
programmable analog arrays," in DATE 2000 Proceedings: Design, Automation
and Test in Europe Conference 2000, Mar. 2000.
[32] Ganesan, S. and Vemuri, R., \Analog{digital partitioning for eld{
programmable mixed signal systems," in Proc. Conference on Advanced Research
in VLSI, pp. 172{185, Mar. 2001.
[33] Ganesan, S. and Vemuri, R., \Behavioral partitioning in the synthesis of
mixed analog{digital systems," in Proc. Design Automation Conference, pp. 133{
138, June 2001.
[34] Gaudet, V. C. and Gulak, P. G., \CMOS implementation of a current
conveyor{based eld{programmable analog array," in Conference Record of the
31st Asilomar Conference on Signals, Systems and Computers, vol. 2, pp. 1156{
1159, Nov. 1997.
[35] Gulak, P. G., \Field{programmable analog arrays: past, present and future
perspectives," in IEEE Region 10 International Conference on Microelectronics
and VLSI, pp. 123{126, Nov. 1995.
[36] Hall, T. S., Hasler, P., and Anderson, D. V., \Field{programmable ana-
log arrays: A oating{gate approach," in Proc. 12th International Conference on
Field Programmable Logic and Applications, (Montpellier, France), pp. 424{433,
Sept. 2002.
[37] Hall, T. S., Twigg, C. M., Hasler, P., and Anderson, D. V., \Appli-
cation performance of elements in a oating{gate FPAA," in Proceedings of the
1122004 IEEE International Symposium on Circuits and Systems, pp. II.589{II.592,
May 2004.
[38] Hasler, P., Bandyopadhyay, A., and Smith, P., \A fully programmable
CMOS block matrix transform imager architecture," in Proceedings of the 2002
IEEE International Symposium on Circuits and Systems, pp. III.337{III.340,
May 2002.
[39] Hasler, P., Kucic, M., and Minch, B. A., \A transistor-only circuit model
of the autozeroing oating-gate amplier," in Midwest Conference on Circuits
and Systems, (Las Cruces, NM), 1999.
[40] Hasler, P. and Lande, T., \Special issue on oating-gate devices, circuits,
and systems," IEEE Journal of Circuits and Systems, vol. 48, Jan. 2001.
[41] Hasler, P. and Minch, B. A., Floating-gate Devices, Circuits, and Systems.
in press, 2002.
[42] Hasler, P., Minch, B. A., and Diorio, C., \Adaptive circuits using pFET
oating-gate devices," in Proceedings of the 20th Anniversary Conference on
Advnaced Research in VLSI, (Atlanta, GA), pp. 215{229, March 1999.
[43] Hasler, P., Minch, B. A., and Diorio, C., \An autozeroing oating-gate
amplier," IEEE Transactions on Circuits and Systems II, vol. 48, pp. 74{82,
Jan. 2001.
[44] Hasler, P. and Anderson, D. V., \Cooperative analog-digital signal process-
ing," in Proceedings of the IEEE International Conference on Acoustics, Speech,
and Signal Processing, (Orlando, FL), May 2002. Invited Paper.
113[45] Hasler, P., Smith, P., Ellis, R., Graham, D., and Anderson, D. V.,
\Biologically inspired auditory sensing system interfaces on a chip," in 2002
IEEE Sensors Conference, (Orlando, FL), June 2002. Invited Paper.
[46] Keymeulen, D., Zebulum, R. S., Jin, Y., and Stoica, A., \Fault{tolerant
evolvable hardware using eld{programmable transistor arrays," IEEE Transac-
tions on Reliability, vol. 49, pp. 305{316, Sept. 2000.
[47] Klein, H. W., \The EPAC architecture: an expert cell approach to eld pro-
grammable analog circuits," in IEEE 39th Midwest Symposium on Circuits and
Systems, vol. 1, pp. 169{172, Aug. 1996.
[48] Koneru, S., Lee, E. K. F., and Chu, C., \A exible 2{d switched{capacitor
FPAA architecture and its mapping algorithm," in 42nd Midwest Symposium on
Circuits and Systems, vol. 1, pp. 296{299, Aug. 1999.
[49] Kucic, M., Hasler, P., Dugger, J., and Anderson, D. V., \Programmable
and adaptive analog lters using arrays of oating-gate circuits," in 2001 Con-
ference on Advanced Research in VLSI (Brunvand, E. and Myers, C., eds.),
pp. 148{162, IEEE Computer Society, March 2001.
[50] Kucic, M., Hasler, P., Dugger, J., and Anderson, D. V., \Programmable
and adaptive analog lters using arrays of oating-gate circuits," in 2001 Con-
ference on Advanced Research in VLSI (Brunvand, E. and Myers, C., eds.),
pp. 148{162, IEEE Computer Society, March 2001.
[51] Kucic, M., Low, A., Hasler, P., and Neff, J., \A programmable
continuous-time oating-gate Fourier processor," IEEE Transactions on Circuits
and Systems II, vol. 48, pp. 90{99, Jan. 2001.
114[52] Kutuk, H. and Kang, S.-M., \A eld{programmable analog array (FPAA)
using switched{capacitor techniques," in Proceedings of the International Sym-
posium on Circuits and Systems, vol. 4, pp. 41{44, May 1996.
[53] Lattice Semiconductor Corporation, Hillsboro, OR, ispPAC Overview, Sept.
1999.
[54] Lee, E. K. F. and Gulak, P. G., \A CMOS eld{programmable analog
array," IEEE Journal of Solid-State Circuits, vol. 26, pp. 1860{1867, Feb. 1991.
[55] Lee, E. K. F., Field{programmable analog arrays on MOS transconductors.
PhD thesis, University of Toronto, 1995.
[56] Lee, E. K. F., \Recongurable pipelined data converter architecture," in IEEE
39th Midwest Symposium on Circuits and Systems, vol. 1, pp. 162{165, Aug.
1996.
[57] Lee, K. and Gulak, P., \A CMOS eld-programmable analog array," in IEEE
International Solid{State Conference Digest of Technical Papers, pp. 186{188,
Feb. 1991.
[58] Lee, K. and Gulak, P., \A transconductor-based eld-programmable analog
array," in IEEE International Solid{State Conference Digest of Technical Papers,
pp. 198{199, Feb. 1995.
[59] Long, D. I., \Behavioural modelling of mixed{signal circuits using PWL wave-
forms," in IEE Colloquium on Mixed{Signal AHDL/VHDL Modelling and Syn-
thesis, pp. 2/1{2/6, Nov. 1997.
[60] Marsh, D., \Programmable analogue ICs challenge spice{and{breadboard de-
signs," in EDN Europe, pp. 30{36, http://www.ednmag.com: Reed Business
Information, Oct. 2001.
115[61] Mead, C., Analog VLSI and Neural Systems. Reading, MA: Addison-Wesley,
1989.
[62] Minch, B. A., Diorio, C., Hasler, P., and Mead, C. A., \Translinear
circuits using subthreshold oating-gate MOS transistors," Analog Integrated
Circuits and Signal Processing, vol. 9, no. 2, pp. 167{179, 1996.
[63] Pankiewicz, B., Wojcikowski, M., Szczepanski, S., and Sun, Y., \A
CMOS eld programmable analog array and its application in continuous{time
OTA-C lter design," in Proceedings of the International Symposium on Circuits
and Systems, vol. 1, pp. 5{8, May 2001.
[64] Pankiewicz, B., Wojcikowski, M., Szczepanski, S., and Sun, Y., \A
eld programmable analog array for CMOS continuous{time OTA-C lter appli-
cations," IEEE Journal of Solid-State Circuits, vol. 37, pp. 125{136, Feb. 2002.
[65] Pierzchala, E., Perkowski, M. A., and Grygiel, S., \A eld pro-
grammable analog array for continuous, fuzzy, and multi{valued logic applica-
tions," in 24th International Symposium on Multiple{Valued Logic, pp. 148{155,
May 1994.
[66] Pierzchala, E., Perkowski, M. A., Halen, P. V., and Schaumann, R.,
\Current{mode amplier/integrator for a eld{programmable analog array," in
IEEE International Solid{State Conference Digest of Technical Papers, pp. 196{
197, Feb. 1995.
[67] Premont, C., Grisel, R., Abouchi, N., and Chante, J.-P., \Current{
conveyor based eld programmable analog array," in IEEE 39th Midwest Sym-
posium on Circuits and Systems, vol. 1, pp. 155{157, Aug. 1996.
116[68] Quan, X., Embabi, S., and Sanchez-Sinencio, E., \A current-mode based
eld programmable analog array architecture for signal processing applica-
tions," in IEEE 1998 Custom Integrated Circuits Conference, (Santa Clara, CA),
pp. 277{280, May 1998.
[69] Ray, B., Chaudhuri, P. P., and Nandi, P. K., \Design of OTA based eld
programmable analog array," in Proc. 13th International Conference on VLSI
Design, pp. 494{498, Jan. 2000.
[70] Sanchez-Sinencio, E., Ramirez-Angulo, J., Linares-Barranco, B.,
and Rodriguez-Vazquez, A., \OTA-based non-linear function approxima-
tions," in ISCAS Proc., vol. 1, pp. 96{99, May 1989.
[71] Santini, C. C., Zebulum, R., Pacheco, M. A. C., Vellasco, M. M. R.,
and Szwarcman, M. H., \Evolution of analog circuits on a programmable
analog multiplexer array," in Proc. IEEE Aerospace Conference, vol. 5, pp. 2301{
2308, Mar. 2001.
[72] Sarpeshkar, R., Ecient precise computation with noisy components: extrap-
olating from an electronic cochlea to the brain. Pasadena, CA: PhD thesis, Cali-
fornia Institute of Technology, 1997.
[73] Serrano, G., Smith, P., Lo, H. J., Chawla, R., Hall, T., Twigg, C.,
and Hasler, P., \Automatic rapid programming of large arrays of oating{gate
elements," in Proceedings of the 2004 IEEE International Symposium on Circuits
and Systems, pp. I.373{I.376, May 2004.
[74] Sivilotti, M. A., Wiring Considerations in Analog VLSI Systems, with Appli-
cation to Field-Programmable Networks (VLSI). PhD thesis, California Institute
of Technology, Pasadena, CA, 1991.
117[75] Smith, P., Kucic, M., and Hasler, P., \Accurate programming of analog
oating{gate arrays," in Proceedings of the 2002 International Symposium on
Circuits and Systems, (Phoenix, AZ), pp. V.489{V.492, May 2002.
[76] Smith, P. D., Kucic, M., Ellis, R., Hasler, P., and Anderson, D. V.,
\Mel{frequency cepstrum encoding in analog oating{gate circuitry," in Proceed-
ings of the International Symposium on Circuits and Systems, vol. 4, pp. 671{674,
2002.
[77] Stoica, A., Zebulum, R., Keymeulen, D., Tawel, R., Daud, T., and
Thakoor, A., \Recongurable VLSI architectures for evolvable hardware: from
experimental eld programmable transistor arrays to evolution-oriented chips,"
IEEE Transactions on Very Large Scale Integration Systems, vol. 9, pp. 227{232,
Feb. 2001.
[78] Sun, Y., ed., Design of high frequency integrated analogue lters. London, UK:
The Institution of Electrical Engineers, 2002.
[79] Wakerly, J. F., Digital Design: Principles and Practices, ch. 5: Combinational
logic design practices, pp. 311{455. New Jersey: Prentice Hall, 3rd ed., 1999.
[80] Yoo, H., Anderson, D. V., and Hasler, P., \Continuous{time audio noise
suppression and a custom low-power IC implementation," in Proceedings of the
IEEE International Conference on Acoustics, Speech, and Signal Processing, (Or-
lando, FL), May 2002. Invited Paper.
118