I. INTRODUCTION

B
UILT-IN self-test (BIST) and design for testability (DFT) of analog circuits are important and necessary to produce highly reliable mixed-signal systems [1] . Due to the constant increase of analog circuit density, the nature of analog faults, and the embedding of analog functions within large digital systems, the detection and isolation of faults in these circuits is becoming more difficult.
Many research groups have suggested techniques to make on-chip frequency-domain testing of mixed-signal circuits and on-chip testing of analog-digital converters (ADCs). The goal of these techniques is to overcome the complexity of integrating a traditional ac characterization approach [2] . Well-defined techniques for reducing the size of the test set while maintaining high fault coverage have been reported [3] , [4] . Some ac BIST techniques inject optimized digital inputs to a linear device under test (DUT) and extract a dc signature [5] , [6] . These approaches are simple, but their precision is limited. On the other hand, Toner and Roberts [7] have proposed several methods to make frequency-domain tests on sigma-delta converters by generating precise on-chip sine waves and analyzing the results with an on-chip digital signal processor (DSP), but the area overhead is large. Several techniques have been published to generate on-chip linear ramps and perform ADC BIST [8] - [11] , but the results either depend largely on the accuracy of the additional components in the test circuitry, or have not been proven experimentally. In [12] , the authors present a self-calibration scheme for linear ramps, but the linearity is not high enough for linearity test of ADCs. A technique very similar to ours is presented in [13] , following a previous publication from our group [14] . However, the former does not provide measurement results. An on-chip ramp generator (OCRG) is desirable to perform monotonicity and histogram tests of ADCs. A novel application of the OCRG is proposed in [15] , which allows the ac test of a linear circuit in time-domain using the ramp generator.
In this paper, we propose a unified approach at designing OCRGs using a discrete-time (DT) adaptive approach. Section II describes the adaptive scheme used in all the implementations. Section III gives the details of two low-complexity implementations. Section IV describes the third OCRG version. Section V presents the measurement results and Section VI gives the conclusions. Fig. 1 illustrates a conceptual system where an OCRG can be used to perform time-domain ac BIST [15] . The goal is to extract several rise or fall time delay values from the DUT. A first step signal is sent to the DUT and the sampler is used to generate the target thresholds (for example, 10% and 90%) from the steady-state value of the DUT. When a second step is sent to the DUT and the response crosses the threshold, the timer is started or stopped, depending on the test (rise or fall delays) being performed. An analog timer is used instead of a digital timer because the latter would require the clock frequency to be too high for the same order of accuracy. For example, if the DUT has a rise time of 10 s (for a second-order low-pass filter with a cutoff frequency of approximately 52 kHz) and an accuracy of 1% is needed, the clock frequency would need to be 10 MHz. If the whole system operates in the range of 52 kHz, the need 0018-9200/03$17.00 © 2003 IEEE for a 10-MHz clock only to test the system would not be an acceptable solution. The analog timer is, in fact, a ramp generator for which the start and stop times are controlled by the delay to be measured. Once the ramp is stopped, the final analog value ( ) is directly proportional to the delay to be measured. Therefore, an accurate OCRG is desirable. In the case where the DUT can be approximated by a first-or second-order system, simple relations between time-domain and frequency-domain can be used by an external computer or tester to compute the frequency-domain characteristics ( ) [16] and/or diagnose a fault. Further arbitrary rise and fall delay tests can also be performed that do not necessarily have a frequency-domain relation. These test results can be used as signature of the DUT, be compared to a predefined signature of a fault-free circuit, and a go/no-go signal can be generated. The measurements can also be sent to an external diagnosis system.
II. ADAPTIVE SCHEME FOR ON-CHIP RAMP GENERATION
A. Application of the OCRG to Time-Domain and ADC BIST
In the case of ADC testing, the practical implementation of the monotonicity test [17] is usually complicated by the presence of digital noise on the ADC output codes. However, simple digital signal processing can be used to reduce the effect of that noise and extract the monotonicity behavior of the ADC. If a very linear ramp is available on chip, linearity measurements based on histogram testing can be computed on chip with digital circuitry. Simple self-testing mechanisms to trace a calibration failure can be added to the technique to ensure that the OCRG is functioning properly before using it.
B. Inaccuracies in the Basic Ramp Generator
The principle of the basic ramp generator is to charge a capacitor with a constant current source. In the circuit of Fig. 2 , and is directly proportional to the pulsewidth of the step signal. In the case of time-domain testing as shown in Fig. 1 , would be the delay under test. To achieve the required performance (small slope and linear ramp in the case of ADC testing, and precisely known slope in the case of time-domain testing), we need a current source with a near-infinite output impedance that sources a very small current and a very large capacitor. Both have to be precise and constant. For most ADC tests, we have , where is the ramping time and represents the longest portion of the ADC test.
is the number of hits per code used for the histogram, is the number of bits of the ADC, and is the ADC sampling period. If we want to use an on-chip ramping capacitor in the range of 10 pF, the design of the extremely small current source required to test a typical ADC becomes a challenge. Moreover, the process variation affecting the absolute value of the current source and on-chip capacitor ( 20%) would result in significant slope errors.
C. Adaptive Scheme
The proposed solution is an adaptive scheme [14] . The advantage of this approach is that a precise and process-independent current reference is not needed since the adaptive scheme will generate the proper value automatically to satisfy the least mean square (LMS) condition [18] . Fig. 3(a) shows the general block diagram of the adaptive scheme.
The discrete adaptive calibration procedure is based on two reference values: the clock signal and a reference voltage used as a target. The calibration procedure consists of letting the output ramp during a constant delay (which, in this case, is derived from the clock, and is not the delay under test) then comparing with the target dc level ( ). The LMS block uses this information to vary the control voltage of the current source. This process of ramping and adjusting the control voltage is repeated until becomes exactly the same as . Once the calibration is completed, we can use the slave ramp generator to measure delays in the DUT. In the case of ADC testing, we can discard the slave ramp generator and use as the Ramp output. Referring to a typical LMS block [ Fig. 3(b) ], our input signal is and the output is . Mult is the ramp generator. As will be seen, Mult is not necessary in our case. Fig. 4 illustrates the basic current source used in the first implementation. Modifications will be made to this current source in the following implementations, but its characteristics, as far as the adaptive scheme is concerned, remain the same.
is the current driver, -form a cascode current mirror and is the charging capacitor.
is used for stopping the current source when step is at logic 1.
is used to reset . If the output impedance of the current source is neglected, is approximately given by (1) where and is the current from the current source. From control-system theory, the power of the error signal is , which means that if , the error power is minimized (
). This error is the result of the capacitor imprecision as well as the current imprecision. The latter includes process variations affecting and . Therefore, the technique compensates for these process variations. The steepest descent algorithm is used to ensure that the system minimizes the error. (2) where the usual constant has been replaced by the variable so that no confusion is made with the electron mobility. The system has only one weight, represented by . It can be shown that by writing the right part of (2) as and assuming (in Fig. 4 ) is always in saturation, we obtain (3) where is the control voltage at time . Equation (3) describes the functionality needed for the LMS block in Fig. 3(a) . Since the implementation of an exponential function is complex, it was decided to simplify this task by implementing a first-order Taylor approximation of (3) (4) where . The gain of the integrator will define the speed and stability of the system. We can also see that the gain should be affected by the initial value of the integrator. This effect will not be implemented in our case since the gain will be set by a constant. It can be shown that the same approximation is valid if operates in weak inversion.
D. Error Sources and Performance Criteria
There are a number of error sources that can affect the performance of the circuit both at system and implementation levels. The closed-loop system will compensate for the inaccuracy of on-chip capacitor and current source. The linearity of the ramp depends on the output impedance of the current source and the intrinsic capacitor linearity. In the case of ADC testing, the chosen process should offer capacitors with linearity higher than the ADC under test. In a typical process such as the AMI 0.5 m from MOSIS, linearity of 15-16 bits can be obtained for on-chip poly-poly capacitors [19] . Offsets also affect the precision and stability of the generated ramp. An offset at the input of the LMS block (on the subtractor) will cause a slope error. Any other constant offset in the adaptive loop (gain stages, current source) will be compensated by the adaptive scheme. Offsets caused by charge injection on the basic ramp generator (Fig. 4) will cause start and stop errors on the ramp. Considering the time period range of interest (1 s for time-domain testing, 1 ms for ADC BIST), clock jitter is negligible. The matching between the master and slave ramp generators [see Fig. 3(a) ] is also a source of error. Careful layout techniques (especially for the matching of the current sources and the capacitors) can reduce the slope mismatch below 0.5%, which is acceptable for our application. Performance criteria also include stability, calibration time, and programmability of the slope.
We have identified four application categories, each of which necessitates different performance levels from the ramp generator (see Table I ). The implementations presented in Sections III and IV can be used to fulfill any requirement in Table I at minimal area and complexity costs.
III. LOW-COMPLEXITY OCRG IMPLEMENTATIONS
A. First Implementation: Continuous-Time Integrator
The first OCRG implementation is shown in Fig. 5 . An operational transconductance amplifier-capacitor (OTA-C) integrator according to (4) now replaces the LMS block.
allows for a DT integration to be realized (the error must be integrated only once the ramp has finished) and ensures that does not discharge through the output impedance of OTA1 while the timers are ramping. Because is not a continuous function, we have to use a DT implementation of the adaptive scheme (5) where defines the delay during which the integration of the error is performed by the OTA-C (controlled by ). Switches and limit the leakage of into the resistor and in Fig. 4 . The gain must be set so that the feedback gain is low enough to guarantee stability. The delay must be set in such a way that the rise or fall on from one time step to the other are converging. In order to guarantee convergence, each OTA is made of a differential input stage with long channel length pMOS drivers to ensure low (0.7 A/V) and a cascode current-mirror output stage to ensure high output impedance (see Fig. 6 ).
was set to 1 s.
B. Second Implementation: Error-Cancellation LMS Integrator
When the calibration delay becomes large (for monotonicity or histogram tests of ADCs) and the ramping capacitor is on chip, the control voltage has to be very precise. The second implementation of the LMS integrator (Fig. 7) uses an error-cancellation approach with two single-ended integrators to reduce the effect of charge injection. Also, the basic ramp generator ( Fig. 4) was modified by removing and placing it as a switch between the drain of and . This allows the elimination of the resistor in Fig. 4 and the switches and in Fig. 5 . The OTAs are identical to the one used in the first OCRG version. The charging capacitor is 14.9 pF. Leakage current in the switches [20] causes the stored voltage on the capacitor to decrease as (6) where is the capacitor voltage, is the initial capacitor voltage, is the leakage current of the switch, and is the capacitor used to store the control voltage of the adaptive loop. The leakage current of a wide (20/0.6) transistor in the AMI 0.5-m technology is less than 2.5 pA/ m, which is a very conservative value for our small switches. The use of transmission gates and the fact that some capacitors are connected to two switches (Fig. 7 ) raises the leakage current to 10 pA/ m. Using a 15-pF capacitor and a 1-ms delay results in a maximum voltage drop of 0.67 mV. This has virtually no effect on the output current charging the ramping capacitor, and is smaller than the clock feedthrough and charge injection error sources.
The second OCRG implementation shows significant improvement on the precision of long ramps with on-chip capacitors. Fig. 8 shows a simulation of the calibration process for a ramp ranging from 0 to 2 V within 1 ms. Fig. 9 shows a more detailed view (zoom) of the ramp once calibrated, along with the integral nonlinearity (INL) of the ramp. 
IV. THIRD IMPLEMENTATION: SC FULLY DIFFERENTIAL LMS INTEGRATOR
The purpose of the third version of the OCRG was to assemble a BIST scheme for pipeline ADCs. The design constraints were to improve the linearity and stability of the ramp at the cost of a slightly more complex circuitry, as well as to convert it to a fully differential topology. In the case of an ADC histogram test, the maximum INL of the input ramp should be , where is the full-scale range of the ADC under test, is the number of bits of the ADC, and is the number of desired hits per code. The linearity provided by the previous implementations is insufficient for typical ADC histogram tests.
A. Improvement of Ramp Stability
In order to perform ADC histogram testing, a very stable lowcurrent source with high power-supply rejection ratio (PSRR) is needed. We decided to use the self-biased Widlar current source [21] shown in Fig. 10 .
, operating in triode region, is added to control the current. Also, by using additional design variables (such as the relative sizes of and , as well as the sizes of and ), the of the current source can be lowered, which reduces the OCRG loop gain and, thus, decreases the sensitivity to noise and leakage on the charging capacitor. By using the change of variable , we have and (7), shown at the bottom of the next page, where and are the current mirror ratios -and -, respectively. When ( ) becomes large, the whole equation becomes a constant. Fig. 11 shows a plot of this function for typical component values. Proving that this current source can be used in an adaptive scheme such as Fig. 3 (a) involves solving (2) and (7), which is quite complex. Instead, we will take advantage of the fact that with the proper choice of components, the shape (curvature) of the relation described by (7) becomes fairly linear in the range of interest. In that case, we have , where incorporates the effect of the choice of components. It is now easy to show that the control voltage is given, according to the LMS algorithm, by the same equation as (4). Moreover, it has been found from behavioral and transistor-level simulations that the adaptive scheme gives very good results in the case where the versus relation of the current source is not linear.
B. Improvement on Ramp Linearity
In the simplified ramp generator Fig. 12(a) ], represents the real capacitor voltage, including the effect of the output impedance of the current source. An output-dependent offset current will flow through and add to the nominal current causing nonlinearity on the output ramp. The capacitor is reset to on (which corresponds to in the following analysis). Fig. 12(b) shows the Thevenin equivalent, which simplifies the analysis. It can be shown that the maximum error in the ramp is given by is the final voltage of the ideal ramp (with ) and . If we want to test a 12-b ADC with V and make a 16 hits per code histogram, (8) predicts that we need an output impedance of about 300 G . Note that this is dc output impedance. Still, this is a very large value and it is unlikely that we can achieve such a high value in a low-voltage process. The present current source (Fig. 10) has a dc output impedance of 19 G , based on simulation results.
A reduction on the effect of can be achieved by keeping the capacitor node constant, which is done by using an operational amplifier (opamp) in feedback of the charging capacitor as shown in Fig. 13 . It can be shown that for an opamp gain of , the maximum error is now given by (10)
The linearity improvement is in the order of . The slope of the curve is given by the derivative of with respect to time and is When becomes large, the slope becomes (using L'Hopital's rule) (13) which is a constant for a given current . This proves that the slope is constant at all times and, thus, that the curve is linear. Also, when becomes large, the end point of the curve becomes independent of . We can set and and find the required nominal value for in Fig. 13   (14) A systematic methodology can now be used to define the value of needed for a particular application where a ramp is needed to go from to in a delay of , with a maximum . 1) Start with a small value of . For example, . 2) Choose a design value for such that its associated offset current will never get higher than the total needed. We already know that we need , therefore, . This will set a design constraint on the current source. 3) Find the nominal value of needed with (14). 4) Find the resulting with (10) . means that is too low. In that case, increase and repeat the procedure from step 3.
The simulation result from a transistor-level implementation of the current source with the feedback opamp is shown in Fig. 14. The INL error is now only 27 V. This precision is sufficient to test the 12-b ADC with a 2-V input range and 16 HPC described earlier.
C. Implementation of the Third Version
All the internal signals of the third version, except for the control signal, are fully differential in order to reduce the effect of offsets and the noise in the system and because the ADC under test has a differential input. We now have one target voltage for each ramp. The input range of the ADC is from 0.75 to 1.25 V and the power supply is 1.8 V (see Fig. 15 ).
The fully differential opamp is a two-stage topology with a telescopic cascode input stage (see Fig. 16 ). It was decided not to put the cascode on the output stage so that the output swing would be large enough. Since the ramping phase is long, the opamp contains a continuous-time common-mode feedback circuit in addition to the reset of its input nodes to at every clock cycle. The common-mode feedback circuit is a simple differential pair with diode-connected loads. The dc gain of the opamp is 104 dB and the gain bandwidth is 46 MHz. The phase margin is 75 . The linear-gain output swing is 0.55-1.5 V. The input and feedback capacitors of the LMS integrator are 0.5 and 4 pF, respectively (gain of 1/8). The differential-to-single-ended block is used to convert the differential output of the LMS integrator to the single-ended input of the current source in the ramp generator. The topology is a simple differential stage with diode-connected active loads and a gain around unity.
The clocking necessary for this system is made of three nonoverlapping phases. The first phase is the ramping phase and lasts for 1.024 ms. The two other phases, and (integration and reset phases, respectively), last for 1 s each. In addition to these basic phases, we also need their inverse and their delayed version for bottom-plate sampling. The clock generator is a three-flip-flop state machine and a counter to create the 1.024-ms delay on .
V. MEASUREMENT RESULTS
All three versions of the OCRG implementations have been fabricated and tested. The first implementation was fabricated on an ORBIT 2-m process from MOSIS (see Fig. 17 ). The of 10 kHz. Extra capacitance was added externally to the ramping and LMS capacitors. We can see that the output ramp (OUT) gets precisely calibrated to the target final value. This shows the validity of the adaptive scheme. The measured transconductance of the OTA is 0.7 A/V, and the error on the final value of the ramp is 20 mV, which translates into a slope error of 0.6%.
The second version of the OCRG has been fabricated on an AMI 0.5-m process from MOSIS (see Fig. 19 ). Post-layout The third version of the OCRG was fabricated on a 0.18-m process from Texas Instruments Inc. (see Fig. 21 ). Dimensions are 0.6 0.3 mm . In measurements, the ramp goes from 0.7 to 1.3 V within 1.024 ms with a start and stop error of 12 mV. The average slope error is 0.6%. The calibration process is shown in Fig. 22 . The INL of the ramp was measured with a test setup using a PXI6071E multifunction data acquisition card from National Instruments. A LabView program was written to perform several tasks such as curve fitting to extract the INL shape from the noise generated by the digital signals. It also computes the best-fit straight line from the acquired ramp in order to calculate the INL (see Fig. 23 
VI. CONCLUSION
This paper presented three versions of on-chip adaptive ramp generator designs to be used in time-domain BIST for analog circuits and ADC testing. The adaptive approach allows the generation of very long on-chip ramps without the need for manual fine tuning and makes the system process independent. The technique needs only the system clock and a reliable reference voltage, which, in the case of time-domain testing, can be almost any voltage reference already present in the system. Table II provides the applicability of each implementation in consideration of its performance and complexity. It also compares the obtained performance with other published implementations. Many performance results were unavailable since the performance metrics for on-chip ramp generation are application dependent and not well established. It is clear that we can trade complexity with performance, depending on the application we are considering. The third version of the OCRG is currently being used in a pipeline ADC BIST technique.
