Abstract-This paper discusses architectural and circuit-level aspects related to hardware realizations of fuzzy controllers. A brief overview on fuzzy inference methods is given focusing on chip implementation. The singleton or zero-order Sugeno's method is chosen since it offers a good tradeoff between hardware simplicity and control efficiency. The CMOS microcontroller described herein processes information in the current-domain, but input-output signals are represented as voltage to ease communications with conventional control circuitry. Programming functionalities are added by combining analog and digital techniques, giving rise to a versatile microcontroller, capable of solving different control problems. After identifying the basic component blocks, the circuits used for their implementation are discussed and compared with other alternatives. This study is illustrated with the experimental results of prototypes integrated in different CMOS technologies.
I. INTRODUCTION
A T present, most information processing is performed digitally, however, interfacing with the real world remains an analog issue. Besides the fact that A/D and D/A interfaces are essential, analog circuitry can be also found as pre and post-processors in many applications [1] . In general, analog implementations are preferable for high frequency and low power, although choosing which part of a chip be analog or digital is a complex issue. This reasoning still applies for the specific case of fuzzy circuits, which are becoming an attractive approach to process control. Fuzzy techniques provide a general framework to handle the control of complex processes which are very difficult to describe with mathematical equations but can be controlled by human experts. The often intuitive and ambiguous human knowledge is easily represented by fuzzy rules and can be included into an automated control strategy [2] - [4] .
The application of fuzzy inference techniques to real-time control problems (robotics, image and speech processing, etc.) calls for hardware realizations tailored to the fuzzy paradigm. Many reported fuzzy microcontrollers are digital because of the advantages deriving from sound design techniques, flexible programmability, and easy embedding into digital processing environments [5] - [7] . However, analog hardware seems to be more suited for the implementation of massively parallel systems that process continuous or multivalued inManuscript received April 10, 1996 . The authors are with the Departamento de Diseño de Circuitos Analógicos, Instituto de Microelectrónica de Sevilla, Centro Nacional de Microelectrónica, Edificio CICA, Avda Reina Mercedes s/n, 41012 Sevilla, Spain.
Publisher Item Identifier S 1063-6706(97)00751-0.
formation-like fuzzy systems. Probably, the main drawback of analog implementations is their relatively low accuracy, which does not seem a severe limitation in view of the typical demands of many fuzzy control applications [8] - [11] . While some analog realizations are designed for specific applications [12] , [13] , we find it more interesting to implement programmable microcontrollers, following the work in [14] - [16] . Programmability allows the fuzzy chip to be reconfigured to solve different control problems without requiring complex adjusting procedures. This paper presents mixed-signal transistor-only IC's which combine the best of analog and digital worlds to implement programmable fuzzy microcontroller chips. Typical functionalities are interchanged: the analog circuitry provides the required computing power, while the digital part allows a systematic way to program the fuzzy chip with a simple interface. We focus on CMOS circuits realized with currentmode continuous-time techniques because they are especially suited for direct implementation of the basic fuzzy operators. Working in current-mode with transistors biased in strong inversion, dynamic range problems can be handled with flexibility and high-speed operation can be attained [17] . The microcontroller described here processes the fuzzy information in the current domain, but to improve the interface capability the I/O signals are represented as voltage since the majority of existing devices are voltage-processing circuits.
The structure of this paper is as follows. Section II briefly reviews the wide range of possibilities to choose from when designing a fuzzy controller. Studies and results from mathematical and engineering fields are combined with microelectronics criteria to explain the choices made in the presented fuzzy microcontroller. Its ability to provide different control actions is discussed in Section III. Section IV evaluates three types of architecture to implement the chosen inference mechanism (singleton or zero-order Sugeno's method). They are compared in terms of functional block and digital word requirements. The circuits used to realize the different functional blocks are described in Sections V to VII. Experimental results are included to illustrate this analysis. System performance is detailed in Section VIII. The static and dynamic behavior of a programmable two-input one-output controller chip implementing nine rules is illustrated through HSPICE simulations.
II. HARDWARE COMPONENTS OF FUZZY CONTROLLERS
A fuzzy controller maps crisp input into crisp output via IF-THEN rules like the following (in the case of single-output Fig. 1): 1) an interfacing shell providing the mapping between crisp values and fuzzy sets (fuzzification mechanism) and vice versa (defuzzification mechanism); 2) a decision-making or inference core performing the mathematical transformations to derive control actions from the set of rules. The fuzzification interface provides a fuzzy partition of the input and output universes of discourse of the different variables as well as a choice of the membership function which describes each fuzzy set. In general, the larger the number of fuzzy sets, the more complex the fuzzy controller and the higher its potential capabilities [18] - [20] .
Concerning the input variables, the fuzzification operation is performed by circuits known as membership function circuits (MFC's). Several classes of parameterized functions are used to define membership functions: triangles and trapezoids are popular in industrial control applications [10] , [11] due to their simplicity while Gaussian, sigmoidal, or bell-shaped functions are especially used in adaptive neuro-fuzzy controllers [21] , [22] . We have selected triangle and trapezoidal membership functions because they are very easily implemented by currentmode CMOS circuits (this will be discussed in Section V). Unlike memory-based MFC's, e.g., those employed in digital realizations [5] - [7] , an analog MFC implements a continuous membership function by its transfer characteristics. Hence, the input signals are directly processed and the input universes of discourse need not have a discrete representation. The parameters required to define a generic trapezoidal/triangle membership function are: i) the situation in the universe of discourse, ii) the support or interval on the universe of discourse where the degrees of membership are not zero, and iii) the slopes of the trapezoid/triangle that take into account how the support is covered by the function (Fig. 2) . On the other hand, the output universe of discourse needs a discrete representation since it must be generated on-chip. How the output space is dealt with greatly determines the resulting hardware. To better understand this, let us briefly review some commonly used fuzzy inference mechanisms, focusing on hardware implementation.
Two operations have to be performed in the inference mechanism ( Fig. 3) : the calculation of each rule's output or conclusion and the aggregation of all of them to obtain the overall output. To infer each rule's conclusion, the first value to calculate is the firing strength or activation degree of the rule which measures the degree of matching between the controller input values and the antecedent part of the rule. This value results from the connection of antecedents via triangular norms (conjunctions "and") or conorms (disjunctions "or") [3] . Among the associated operations (minimum and maximum, algebraic product and sum, bounded product and sum, etc.), minimum (min) and maximum (max) are very easily implemented by integrated circuits (Section VI). They are also very popular in control applications. The operation which combines the antecedent part (represented by the activation degree) and the consequent part of a fuzzy rule is known as fuzzy implication [3] .
Most of the first fuzzy controllers used fuzzy sets to define the consequent linguistic variables. The inference methods that they implemented were the "min-max" Mamdani's method, which uses a min operator to perform the fuzzy implication and a max operator to aggregate rules [23] , and the "productsum" method, which uses algebraic product and sum instead of min and max [10] . To defuzzify the global fuzzy output, the traditional strategies were the middle of maxima (MOM) and the center of gravity (COG) methods [3] , [24] , [25] . This is shown in Fig. 4(a) .
From a microelectronics point of view, the definition of the consequent variable as a fuzzy set is very costly in terms of area or time. The first analog fuzzy chip, proposed by Yamakawa [14] , represented the output space with 25 elements. The membership degrees of these elements to a particular fuzzy set were provided in parallel via a bus of 25 lines, resulting in high-area occupation. To allow implementing more rules on a single chip, an alternative architecture proposed by the authors [26] employed serial circuits to generate these membership functions at the cost of a slower inference speed.
The key point is that this hardware complexity is not justified by control efficiency. The studies in [24] and [25] reveal that the MOM method can make the controller input-output relation nonmonotone, discontinuous, and even independent of the input membership functions. Regarding the COG method, some authors [25] , [27] have detected the problem that the final output is more influenced by ambiguous rather than by precise consequents, because ambiguous consequents are represented by wider membership functions.
Hardware is significantly reduced if the consequent linguistic variable is defined by several parameters instead of a membership function distributed along a discrete output space. These parameters will concentrate the information that a membership function provides about a fuzzy set. As cited for antecedents, this information concerns the situation of the membership function in the universe of discourse (given by the element with maximum degree of pertenence or by the center of area), its grade of fuzziness (represented by the support or width), and the way of covering that support (represented by the shape) (Fig. 2) . This can be achieved by using inference methods which first defuzzify each rule's output (generally given by a fuzzy set to obtain a representative value and then aggregate all of them via a weighted average [24] , [25] , [28] , [29] [ Fig. 4(b) ]. The following general expression shows how these methods derive the controller output:
where is the activation degree of the th rule. In these methods, the information about the situation and the shape of the consequent membership function is included in the value , while the influence of consequent supports is included in the weight
In particular, the weighted fuzzy mean (WFM) [24] and quality method (QM) [25] take as constants , namely the center of area of the consequent fuzzy sets , while the weights are chosen as the activation degrees scaled by a constant parameter,
In the first method, the parameter is proportional to the support of the consequent fuzzy set, so that the global output is similar to that obtained by the productsum COG method, thus suffering from the same drawback of ambiguous consequent dominance previously commented. The QM avoids this problem by employing a parameter inversely proportional to the fuzzy set's support. The QM and WFM methods consider only the situation and support of the consequent fuzzy sets, not their shapes. The method proposed by Berenji and Khedkar (BKM) [28] takes them into account. If consequent sets are represented by piece-wise linear functions, the rule's conclusion derived by this method is a crisp value that can be expressed as where is the activation degree of the rule and are constants related with the consequent piece-wise linear function. Unlike the former methods, the value now includes information not only about the situation of the consequent fuzzy set, but also about its shape. This value implicitly depends on the input signals via the activation degree In Takagi and Sugeno's method (TSM) [29] , the value is also influenced by the input signals. In this method, the influence is explicitly expressed by the following relation: (2) where are constants. (Table I summarizes all these simplified inference methods.)
Finally, when is equal to and is a constant, , the resulting method is known as fuzzy mean, height method, singleton, or zero-order Sugeno's method [22] , [24] , [25] , [30] , [31] , which can be considered as a simplification of all the above methods. In this case, (1) is reduced to the following expression:
From a hardware point of view, this is the method with the least memory requirements (the least number of parameters to store and program). In addition, it does not need multiplier, but rather scaler circuits, which are simpler to design and ease digital programmability. From a theoretical point of view, although the shapes and supports of consequent fuzzy sets cannot be taken into account, there are enough parameters to choose from so as to ensure efficient control [24] .
III. CONTROL SURFACES PROVIDED BY A SINGLETON MICROCONTROLLER
Generally, fuzzy controllers perform a nonlinear mapping between the input and output space. The design problem is to accurately approximate the desired control surface. The microcontroller selected here (with trapezoidal/triangular membership functions to represent antecedents, min as fuzzy conjunction, product as fuzzy implication, and singleton or crisp values to represent consequents) satisfies the conditions stated by Castro [31] for universal approximator. This means that it can uniformly approximate any real function to an arbitrary degree of accuracy by using enough rules, antecedent fuzzy sets, and consequent singletons.
Although the control of any process is theoretically possible, it is helpful to know the kind of approximation this system performs. Let us consider a single-input single-output (SISO) fuzzy system, where the maximum number of overlapping antecedent membership functions is two. Under this assumption, given an input , two rules are activated
If is then is If is then is
where the fuzzy sets and are assumed to be described by triangle membership functions with slopes and respectively (Fig. 5 ). According to (3) , for an input value the output is given by (4) From the first and second derivatives of the above expression with respect to information can be obtained about the controller output for belonging to the overlapping interval. This result, summarized in Table II and illustrated in Fig. 5 , shows that a SISO system may perform nonlinear approximation without resorting to the use of fuzzy consequents by only changing the shapes and overlapping degree of the input fuzzy sets. Similar outputs (concave or convex nonlinearities) are also obtained when more than two antecedent membership functions overlap each other. This nonlinear behavior is illustrated in Fig. 6 (a), where a cosine function is approximated with four rules, being the absolute error below 3.1%. usually employed in mathematical approximation theory [32] . cludes the circuits to implement the scaling with singleton consequents (CONS). Many analog implementations reported in the literature employ a data path for each rule [13] , [14] , [33] , [34] which means that each rule has its own MFC's, MIN, and CONS blocks. If this architecture is chosen to implement a programmable fuzzy chip, the number of building blocks and parameters to program (associated with antecedents and consequents) are proportional to the number of rules which is fixed in the chip. The main drawback is that the possible repetition of linguistic variables in different rules leads to redundant information and replicated blocks.
An alternative is to fix the number of input fuzzy sets and that all rules share the MFC's. Let us consider a controller chip with rules, input variables, and fuzzy sets covering them (for simplicity, this number is assumed equal for all the input). In this approach, the number of MFC's is reduced from to is usually greater than so that the ratio of MFC saving is given by (5) The total number of bits to program the MFC's is also reduced from to , where is the number of bits required by each antecedent fuzzy set. However, information must be added, defining which MFC's are connected within each rule, implying additional bits. The result is that the ratio of bits saved when the MFC's are shared by all the rules can be expressed as (6) A second alternative is to also share the singleton values of the output variables, that is, to share the CONS blocks by all the rules. If the number of possible values is this means a reduction of CONS blocks from to If is the number of bits required to program each singleton value, bits are saved. However, bits must be added to assign a CONS block to each rule. The expressions of the ratios of CONS blocks and bits saved by this approach, together with (5) and (6) , are summarized in Table III .
The data in Table III help the designer to choose the most suitable architecture for a set of given parameters and In general, sharing MFC's is advantageous when the number of rules is large compared to the number of input fuzzy sets (controllers with input partitions of low resolution Similarly, sharing consequents is preferable when the number of rules is large compared to the number of output singleton values (output partition of low resolution,
In particular, sharing both MFC's and CONS is advantageous when the complete fuzzy rule set is implemented on-chip, which is practical for many typical controllers with two or three input and input/output partitions of low resolution. Fig. 7 illustrates the resulting matrix-like architecture for a two-input controller. It shows the connection of the three functional stages (antecedent fuzzifier, connective, and aggregation stages) and the digital buses to program the antecedent/consequent parameters and the rule set. In this case, the number of MIN circuits is the same as the number of rules, so that no bits are required to define which pairs of MFC's must be connected. The bits which program the rule set are then aimed to defining the CONS block associated to each rule.
V. FUZZIFIER STAGE
The fuzzifier stage, which performs the antecedents fuzzification, is composed of MFC's. The MFC's in our design implement trapezoidal/triangular functions that can be digitally programmed. To understand the mathematical operations involved in an MFC, let us discuss three strategies to generate symmetric trapezoidal functions illustrated in Fig. 8 .
1) The fuzzifier operation shown in Fig. 8(a) is carried out as follows:
where is a rectification or bounded difference operator defined as if otherwise.
Current-mode realizations that implement this operation have been proposed in [12] and [35] . 2) The technique illustrated in Fig. 8(b) performs the following transformation:
The current-mode circuit used in [36] follows this technique.
3) The strategy proposed by the authors in [37] employs the parameters depicted in Fig. 8(c) . It realizes the following transformation: fuzzy
In any case, the basic mathematical operations required are addition subtraction rectification and scaling The current-mode circuit proposed by the authors following the last technique is shown in Fig. 9 . Working with currents, addition and subtraction are reduced to connecting wires (in this manner " -" is implemented at the input node). To realize the bounded difference, the traditional solution is a diode-connected transistor, which can be the input transistor of a current mirror (the operation " " is implemented by transistor A double-bounded difference operation (" " and " ") is efficiently carried out by transistors and which are never conducting simultaneously [38] . Unlike the current-mode circuits in [12] , [35] , [36] is better performance in terms of area occupation and power dissipation. Precision of current-mode circuits is usually limited by mismatching in the current mirrors, due to systematic and random errors [39] . In this sense, an advantage of the proposed MFC regarding precision is the reduced number of replications. The factor which can provoke systematic errors is large voltage swing at the input node. This can be reduced by biasing the gates of transistors and with different voltage sources [26] or by including negative feedback via a simple inverter [40] , as shown in Fig. 9 .
An interesting feature of this MFC is that digital programmability can be included in a straightforward manner. D/A converters have been employed to program and and to perform the scaling by . In this sense, the currentmode approach allows very small data converters capable of operating at low-power and high-speed, contrary to the high cost in area and/or time of traditional voltage-mode data converters. Another advantage is that these parameters can be changed independently and without affecting the value of the maximum degree-of-membership parameter
We have designed a programmable MFC with 4 b to fix and 2 b to fix and . A microphotograph of the circuit is shown in Fig. 10(a) . The active area is 0.07 mm in 1.5 m CMOS technology. Fig. 10(b) shows the experimental results obtained by programming the slope value . Fig. 10 (c) illustrates a family of eight triangular membership functions resulting from different digital codes for Fig. 10(d) shows the test obtained with the same codes in but with different codes for and , resulting in a family of eight trapezoidal functions.
To provide this MFC with voltage input, the differential version of the voltage-to-current converter proposed in [41] can be employed. The last equation enables employing a max operator so that the bounded difference with is not performed within each MFC, but after antecedent connection. This manner was chosen since current-mode max operators are less costly to implement than min circuits, as shown in this section.
In general, fuzzy controllers have multiple antecedents, and thus require a multiple-input operator. The different algorithms proposed for multiple-input max operators are defined by the following expressions:
where are the input, is the output, and is a big number. The first algorithm is analyzed in [42] and a current-mode realization is proposed. Each input current is replicated to inhibit all other input, resulting in an structure with the sequel of large area occupation and slow operation due to many parasitic capacitors. In the other algorithm (proposed in [43] ) the inhibition signal for each input is represented by the mean value of those input which have not been zeroed out. Thus, inhibition signals are simple to generate since they do not involve internal variables. Consequently, structures of complexity are possible. From a hardware point of view, the last algorithm is realized using feedback schemes with high-gain blocks to implement the scaling by
The precision of a max circuit is evaluated by the difference required between input signals to obtain a single dominating one and by the error observed at the output when all the input signals have an equal value. A high-gain value of means that former errors are proportionally reduced. A typical voltage realization is the classical diode gate, used for OR operation in Boolean logic [see Fig. 11(a) ]. This is the structure employed in the bipolar fuzzy rule chip of [14] . A typical solution to obtain high precision is to include a diode in the feedback loop of an operational amplifier [44] , [45] , as shown in Fig. 11(b) . This solution leads to large area (because of operational amplifiers) and low operational speed since it requires a dominant pole to achieve stability (if several gain stages are cascaded).
These drawbacks are avoided by the current-mode circuit proposed by the authors in [46] . Its simple structure, shown in Fig. 12 , is the result of exploiting all the potentialities of MOS transistors. The voltage in the shared node is locally fed back and fixed to the value needed by an MOS transistor to be in saturation conducting the maximum input current, This is the idea proposed by Lazzaro et al. [47] to implement a winner-take-all circuit, which is also used by Sasaki et al. [33] to realize a max circuit. The novelty of the circuit in Fig. 12 is that transistors are not only employed as competitive devices, but also as switches that convey the current from lowimpedance to high-impedance node, thus avoiding additional cascode or regulated output stages as used in [33] . Transistors act as diodes: if the current through is lower than the current through enters ohmic region, so that the voltage at the gate of decreases and goes off; contrarily if enters saturation and acting as a voltage follower, closes the feedback loop which makes
The circuit can be viewed as the connection of improved Wilson current mirrors which share their output diode-connected transistor. This avoids dc matching errors typical of conventional Wilson mirrors [48] .
A three-input max circuit has been integrated in 2.4-m CMOS technology and is shown in the microphotograph of Fig. 13(a) . Fig. 13(b) illustrates the static behavior obtained when two of the input currents vary, while the third current is fixed to A. Precision of 1.16% of the total output current was measured. To illustrate the dynamic behavior, -toand -to-converters were added at the input and output, respectively. Then the circuit performs as a voltage-mode min operator. Despite these converters, high speed is achieved (transient response of 160 ns) as depicted in Fig. 13(c) and (d).
VII. AGGREGATION STAGE
This stage performs the weigthed average of consequent singletons to obtain a crisp output. The mathematical operations involved are addition, scaling, and division [see (3) ]. Division has traditionally been the most difficult arithmetic function to implement with analog computing devices [49] . On one hand, this is due to nature of division which leads to very large output as the denominator approaches zero (and the numerator is not zero). As a result, an ideal divider must have infinite gain and dynamic range. On the other hand, division derives from a linear relation, that is, to obtain the starting point is to attain This is not easy to achieve if working with intrinsically nonlinear devices such as MOS transistors.
Sasaki and Ueno [30] have proposed avoiding division by selecting a particular antecedent connective and a particular input fuzzy partition, so that the denominator in (3) is constant. However, this approach can not be applied to multi-input fuzzy controllers. A way of avoiding division in multiple-input controllers is to use product as an antecedents' connective, input fuzzy sets such that for every input, the sum of membership values is constant, and a complete rule set. However, the overall result can be an increased rather than reduced hardware cost (compare -input multipliers with -input max circuits and a divider).
Other solutions may basically follow three strategies: 1) normalization of weight parameters (activation degrees) followed by a sum of scalings, according to the structure illustrated in Fig. 14(a) output (14) 2) simultaneous weighting and averaging by means of an aggregation circuit as reported by Mead [50] . Since Mead's circuit employs OTA's, it implements another simplified method, namely one in which [i.e., (1)] is proportional to
To realize the singleton method, true multipliers must be used in a feedback loop [ Fig. 14(b) ] to achieve output (15) 3) use of a two-level structure with a first stage of scaler operators followed by a divider, as shown in Fig. 14(c) . Two CMOS current-mode schemes that approximate a true normalizer operator have been reported in literature [11] , [33] , [51] . These approaches do not verify one feature of simplified methods: the importance of each rule must depend only on its activation degree or its consequent parameters [i.e., (1)], and not on the normalization operation. In particular, the normalization locked loop proposed in [11] , [33] changes the value of the maximum degree of membership without proper scaling of the input domain. This causes some rules to be activated when they should not be, and vice versa. The other reported structure [51] is the implementation of Gilbert's normalizer [52] with CMOS instead of bipolar transistors. The MOS version is nonlinear due to the quadratic behavior of MOS transistors in saturation.
Let us now consider the robustness of Strategy B against mismatching and second-order effects. If an ideal multiplier should realize output (16) a real one includes a slope deviation and an offset which can be modeled as output (17) All these errors are added when implementing the aggregation operation If the slope deviations are represented by their medium value and offset by the approximated relative error at the output is error (18) where is the number of connected multipliers. Variations in multipliers' gain can be counteracted to some extent by adjusting parameters -as in programmable or selftuning chips. However, offset errors are difficult to overcome. For digitally programmable fuzzy chips, in addition to these multipliers (possibly requiring compensation circuitry), D/A converters are required to program the parameters Strategy C allows saving area by exploiting the capability of D/A converters to perform as scalers. Compared to analog multipliers, D/A scalers, in addition to their easier design, offer the advantages of digital programmability and storage [53] . In particular, current-mode D/A scalers do not suffer from offset errors, since they are based on current mirrors. Therefore, only one divider circuit must be compensated for offset. Regarding frequency behavior, the load capacitance affecting the divider is lower than the capacitance in the aggregation circuit; thus, Strategy C may offer higher operation speed. These advantages lead us to choose Strategy C to implement programmable singleton fuzzy chips.
A. Scaling Operation
The circuits that implement the fuzzy implication CONS's are then current-mode D/A converters, like the circuits which generate and in the MFC's. Fig. 15(a) and (b) show experimental results of 2-and 4-b converters. Both display no offset and deviations from ideal values of less than 2% of maximum output current. Fig. 15(c) illustrates the schematic of the circuit corresponding to the results in Fig. 15(b) . It uses the common technique of binary-weighted current sources (19) 
B. Addition Operation
To date, we have exploited one of the advantages of currentmode techniques so that currents can be added by simply connecting wires. In particular, this operation is now required to obtain the numerator and denominator of (3), as shown in Fig. 14(c) . It must be pointed out that the precision of this operation depends on the number of currents to sum, the output resistance of current sources and the input resistance in the adding node Fig. 16 shows a circuit to model the addition of current sources. Applying the current Kirchhoff's law in the adding node results that (20) which can be approximated as (21) To increase the output resistance of current sources, the current mirrors involved are preferably cascode mirrors.
C. Division Operation
This section presents a unified framework to describe different solutions for divider circuits, which implement the equation
As commented above, the starting point is to determine gain-variable linear circuits Linearization techniques have been widely studied by analog designers since CMOS transistors are inherently nonlinear devices. For a survey on this topic, the authors refer to [17, ch. 5] . The most simple approach for a linearization technique is to combine two perfectly matched transistors operating in saturation or triode region [17] , [41] . However, the resulting gain-variable elements are not easily controlled, which is a problem for our purpose. This was overcome by resorting to the use of four matched transistors, connected as shown in Fig. 17 . This cell provides a variety of solutions for divider circuits, depending on the region of operation and how input and output are selected. Three types are briefly analyzed in the following. Type I: When the cell in Fig. 17 is biased in saturation, it can be viewed as a four-transistor translinear loop [54] . On this basis, the circuit depicted in Fig. 18(a) can perform as a squarer or a square-rooter [54] , [55] since its transfer characteristic is (22) where the simple square-law model of the MOS transistor has been assumed with (these parameters have their usual meaning).
A current-input current-output divider is then obtained by combining two squarers (as proposed in [54] ) or cascading a square-rooter with a squarer [as illustrated in Fig. 18(b) ]. A divider prototype according to the latter has been integrated in a 2.4-m CMOS process (its active area is 0.0513 mm without considering biasing current mirrors). The transistor geometries were chosen so that the output of the square-rooter is and the output of the squarer is thus obtaining the following overall output:
The main drawback of this divider is that it is very sensitive to deviations from the simple square-law model caused by length-channel modulation, mobility reduction, or mismatching. Experimental results of our prototype confirmed this. Fig. 18(c) shows the output obtained for fixed at 10 A and denominator current changing from 15 to 30 A in steps of 2.5 A, while axis is the numerator current, Offset about 1.5 A (10% of the maximum output current) and 3 A (20%) were typically measured at the input and output, respectively.
Working with currents is more suitable since the numerator and denominator of (3) are represented by currents. However, it is preferable that the controller output be given in voltagemode for interface capabilities. This means that the output of this type-I divider should be converted to voltage, which is performed by a resistance or a transresistance element. The two following approaches configure the basic four-transistor element in Fig. 17 to directly obtain a variable-transresistor with a transfer function On one hand, since MOS transistors are basically transconductor devices some kind of feedback must be included to reverse its behavior On the other, since MOS transistors are voltage-controlled devices, an -to-converter must be added so that the variable-resistance is current rather than voltage controlled. This results in (24) Type II: A voltage-controlled transresistance element is obtained if the basic four-transistor cell is biased in saturation, as shown in Fig. 19(a) . Feedback is introduced by connecting transistors and in a diode configuration. Hence, their gates/drains are low-impedance nodes whose voltages can be taken as output variables. Applying the simple square-law model of MOS transistors, the resulting transfer characteristic is (25) Whenever feedback is included, care must be taken to ensure that the overall feedback is always negative; that is, the poles of the circuit are on the left-half plane-otherwise, the circuit is unstable. In this case, the circuit stability is achieved if is greater than A symmetric CMOS structure [ Fig. 19(b) ] has been chosen to provide high input impedance for both control nodes and thus avoiding voltage sources of low output impedance. This type of structures is widely used in the design of transconductors and multipliers [56] - [58] . Two circuits, like those in Fig. 19(b) , have been combined to obtain an inputcurrent output-voltage divider [see Fig. 19(c) ] (26) A prototype of this divider has been integrated in 2.4-m CMOS technology, occupying an active area of 0.0648 mm (without considering biasing current mirrors and analog output buffer), shown in the microphotograph of Fig. 19(d) . The output voltage swing is 1.6 V for 5-V supply voltage and biasing current, of 60 A, while input ranges are 60 A for denominator current. These results are shown in Fig. 19(e) , where the different lines correspond to changing from 0 to 60 A in steps of 10 A. Measured output offset voltages were typically less than 13 mv (0.8% of the total output range), while input offset currents were less than 0.75 A (1.5% of the input range).
Type III: A voltage-controlled linear element is also obtained when the four-transistor cell operates in triode-region and the voltages at nodes and in Fig. 20(a) have the same value. This circuit has been used in many applica- tions, acting as a multiplier or a resistive network [17, pp. 59-61] . Let us consider the circuit shown in Fig. 20(b) , where the four-transistor cell is in the feedback path of a singleended differential amplifier. This circuit may be seen as a transresistance amplifier which converts an input current into voltage through a voltage-controlled resistance. Its transfer characteristic is (27) where must be greater than to achieve stability. Several advantages result from biasing the core cell in triode region. In particular, it allows a wide dynamic range since there are no stacked transistors, and voltage drops are usually low since the cell is operating in triode region. Additionally, body effect is inherently cancelled [17] . The most significant second-order effect (mobility reduction) can be reduced if fully-differential amplifiers are employed.
Some authors [60] , [61] have used the circuit in Fig. 20 (b) with another four-transistor element at the input so that -is obtained from a voltage difference, thus resulting in a voltagemode multiplier/divider. The objective of our application is just the opposite: to add an -to-converter so that -is obtained from a current
In particular, a prototype has been also integrated in 2.4-m CMOS technology with the same -to-converter as used for the Type-II divider. A twostage single-ended Miller op-amp has been employed. The total active area (not including input current mirrors) is 0.0497 mm It is shown in the microphotograph of Fig. 20(c) . The output voltage swing is 1.8 V for 5 V supply voltage, while input current ranges are 60 A for denominator current Output offset voltages of less than 10 mv (0.5% of the total output range) were measured and input offset currents were less than 1 A (1% of the input range). These results are shown in Fig. 20(d) , where the parametric lines correspond to changing from 10 A to 60 A in steps of 10 A, with fixed at 10 A. Of all the divider circuits analyzed, the last two are good candidates to implement division in the average stage of fuzzy controllers. Design of the saturation-region divider requires a good selection of transistor geometries and careful layout to avoid mismatching. In the case of the triode-region divider, care must be taken in the design of the amplifier used.
VIII. DESIGN OF A FUZZY CONTROLLER CHIP
The circuits described to implement the fuzzy inference (MFC's, MIN/MAX's, CONS, and divider circuits) are input-output compatible. They can be managed as library cells to place and route following an efficient architecture. As an example, we have designed a two-input one-output fuzzy chip which implements nine rules. Three fuzzy sets cover the universe of discourse for input and output variables. Digital words to program the antecedent parameters, and , have 5, 3, and 3 b, respectively, while digital words to codify the consequent parameters have 5 b. According to the discussion of Section IV, three kinds of architecture can be chosen: a) a data-path for each rule, b) sharing MFC's, and c) sharing both MFC's and CONS's. Applying the figures given in Table III to this example and shows that: 1) sharing MFC's allows a reduction of 66% in the analog part Since the complete rule set is implemented, all the MFC combinations are required, and no additional bits must be included to define the antecedent part of the rules. As a result, the number of bits is also reduced 66%
as compared with an architecture where the MFC's are associated to each rule; 2) sharing CONS circuits saves 66% of the analog part and about 6.7% of the digital part as compared with an architecture where CONS circuits are associated to each rule. Consequently, the third architecture is selected. Fig. 21 shows the analog core of the resulting layout corresponding to 2.4-m CMOS technology. It follows the block diagram of Fig. 7 , where the average stage contains three 5-bit D/A converters (CONS blocks) and a Type-III divider followed by an analog buffer. The area occupied is only 700 m 1400 m. The circuits described in this paper allow implementing the singleton fuzzy inference method with high approximation. In this sense, software fuzzy tools capable of performing the singleton method can be used to design the appropriate fuzzy controller for a given problem. The result can be downloaded into the fuzzy chip through programming. For example, let us consider the design of a proportional integral (PI) fuzzy controller which is usually referenced in the literature (Fig. 22) . The output of the system under control is compared with a reference signal. The error signal and its rate of change are the input of the fuzzy controller. The output of the controller is the change of the control signal that must be applied to the plant The plant chosen for this example is a first-order system with a time constant of 0.5 s. The rule base, the membership functions of antecedents, and the singleton values of consequents (shown in Fig. 23 ) have been defined with the help of a fuzzy tool called "Xfuzzy" [62] . The former two-input one-output CMOS chip has been simulated with HSPICE, and the resulting control surface versus and [ Fig. 24(a) ] shows an absolute error below 2.5% [ Fig. 24(b) ]. The transient behavior of the fuzzy chip has been also simulated for different pulse signals at the input and a load of 5 pF at the output (terminal (Fig. 25) . The maximum delay is less than 450 ns. These static and dynamic features are good enough to ensure the control of the plant, as shown in Fig. 26 . This figure corresponds to the HSPICE simulation of the fuzzy chip, working in the feedback-loop configuration of Fig. 22 .
IX. CONCLUSION
Efficient solutions at architectural and operator levels have been provided to implement programmable fuzzy controllers. In particular, the singleton method has been chosen as the inference mechanism. This method has proven very adequate for hardware realization while maintaining sufficient interpolation ability to approximate any control surface. This paper has introduced circuits to implement the required functional blocks: a) membership function circuits to define the input variables of the fuzzy controller, b) min circuits to perform the antecedent connection, c) scaler circuits to perform the fuzzy implication, and d) divider circuits to obtain the control output.
The mathematical operations required in the different functional blocks (addition, subtraction, bounded difference, and scaling) are easily implemented with current-mode techniques, and thus result in very simple circuits. Current-mode D/A converters are used to program antecedent and consequent parameters. Their ability to perform as programmable scalers is exploited in the membership function circuits and the circuits that carry out the fuzzy implication. A design methodology for divider circuits is presented. Three types of realization are obtained from a four-transistor cell capable of implementing gain-variable circuits.
The resulting fuzzy controller is a mixed-signal IC where the computing power is provided by the analog part, while the digital part offers user-friendly programmability. The internal processing is performed in current-mode, but I/O interface is realized in voltage-mode to ease communication with conventional circuitry.
Iluminada Baturone received the Licenciado en Fisicas degree in 1991 from the University of Seville, Spain. She is currently working toward the Ph.D. degree.
In 1991, she joined the Instituto de Microelectrónica de Sevilla of the Centro Nacional de Microelectronica, Seville, Spain. Since 1993, she has been working through a fellowship from the regional government in the Departamento de Electronica y Electromagnetismo at the University of Seville, Spain. Her current research interests include mixed-signal circuit design for fuzzy systems and testing of analog circuits.
Santiago Sánchez-Solano (M'94) received the Licenciado en Fisicas degree, in 1980, and the Doctor en Ciencias Fisicas degree, in 1990, both from the University of Seville, Spain.
Currently, he is a Senior Researcher at the Instituto de Microelectrónica de Sevvilla of the Centro Nacional de Microelectronica, Seville, Spain. His main research interests include VLSI design, CAD tools for microelectronic design, and hardware implementation of fuzzy controllers.
