This work presents a novel circuit for detecting transient faults in combinational and sequential logic. The detection mechanism features a built-in current sensor connected to the bulks of the monitored logic. The proposed circuit was optimized in terms of power consumption and enhanced with low-power sleep-mode. In addition, a calibration method for bulk built-in current sensors is presented. Overhead results indicate an increase of only 15% in power consumption which represents an improvement of factor 7 compared to similar existing sensors.
Introduction
Error resilience is a drastically rising demand on integrated systems while, at the same time, the application of ultra-deep submicron technologies requires higher robustness to natural aging processes or environment sources like radiations from cosmic origin or every day material [1] . In addition to these natural phenomena, malicious fault-based attacks can be used to bypass security mechanisms of secure systems and extracting information on confidential data [2] . Both these natural or malicious phenomena on integrated circuits can induce transient effects that provoke bit-flips of stored results during the system lifetime.
Until the early 2000's, researches on transient faults focused essentially on memory elements, which were considered the system's most vulnerable circuits. Many concurrent error detection and/or correction mechanisms were proposed to mitigate soft errors induced by transient faults in memory cells. In the last decade, however, more sensitive deep-submicron technologies as well as the increasing demand for secure systems have also pushed for the development of countermeasures against transient faults in combinational parts of the circuits. These faults indeed can propagate up to storage elements and thus cause soft errors as well. On the other hand, if the transient fault does not induce any error due to an electrical, logical or latching-window masking effect, its detection is crucial all the same in secure applications since the fault itself reveals an attempt of attack.
The today's trend in efficient protections against transient faults is applying mitigation techniques at different abstraction levels of the design [3] [4] . The idea behind is the avoidance of costly fault-tolerance mechanisms like triple modular redundancy, taking advantage of cheaper mitigation techniques that ensure satisfactory soft error coverage for the system's most recurrent operations. This strategy is exemplified through system's recovery schemes fired in function of the indication of concurrent error detection (CED) circuitries.
CED mechanisms implemented at transistor or gate level guarantee an early detection, as soon as the faults happen, preventing more critical failure scenarios such as the induction and propagation of multiple errors to other clock cycles, stages, or parts of the system. In case of a fault, the generated error flag is able to activate, for instance, recovery machines already implemented in modern systems for dealing with branch misprediction 2 [3] [4] . Thereby, faulty operation can be repeated in faultfree conditions, adapting the system to perform again its normal computational sequence. This paper proposes a new low-cost CED scheme that efficiently identifies transient faults. The proposed circuit monitors transistor bulks of system's blocks such as similar existing Bulk Built-In Current Sensor (BBICS) [5] [6] , which we further discuss in section 2. Our solution, though, is optimized to satisfy today's need for low-power transient-fault robust systems. More precisely, to the best of our knowledge, the innovative contributions of this paper are:
 An optimization of the original BBICS's circuitry [5] [6] to achieve reasonable overheads in power consumption (section 3);  The introduction of the sleep-mode for BBICS that allows additionally energy savings when the system is on standby (section 3);  A design method for calibrating BBICS with the capability of detecting a minimum transient-fault profile (section 4);  An extensive transistor-level simulation-based exploration of the BBICS size limits in function of their response times and power consumptions (section 5).
Fundamentals
This section presents preliminary information regarding soft errors, built-In current sensors and leakage currents in nanometer technologies.
Transient faults in integrated circuits
Transient voltage variations during the lifetime of combinational or sequential circuits are defined as transient faults. The first harmful effects of transient faults are soft errors by inverting stored results of system operations (i.e. bit-flips of storage elements).
Due to the transistor shrinking and the growing communication of confidential data, soft errors can happen today even at ground level by means of perturbation events arisen from environmental or intentional sources. Examples of environment events are alpha particles released by radioactive impurities and more importantly neutrons from cosmic rays [1] . On the other hand, intentional perturbation events are usually produced by optical sources such as flashlights or laser beams [2] that can maliciously induce transient effects on secure circuits like smartcards to retrieve their secret information.
Soft errors and transient faults are also known as single event upsets (SEU) and single event transients (SET) in fault-tolerance-related fields. If provoked by malicious fault-based attacks, such circuit misbehaviors provide fundamental information for cryptanalysis methods that break security applications.
The modeling of transient faults in CMOS circuits is done by injecting a double exponential current pulse I fault at the sensitive node. Thereby, the shape of the pulse can be approximated by following equation [7] :
f r Q I ee tt (1) with Q f is the charge collected due to the particle strike, t f means the decay time of the current pulse, and t r labels the time constant for initially establishing the ion track
Built-in current sensors detecting transient faults
Built-In Current Sensors (BICS) were initially proposed as a mechanism for detecting large increases in the current I DDQ consumed by a CMOS circuit during its quiescent state, i.e. when the circuit is not switching. The mechanism allows thus testing CMOS circuits against permanent faults [8] . Further, BICS were also adapted for detecting transient faults in memory cells (i.e. bit-flips) [9] [10][11] [12] . Recently, efforts were made for monitoring transient faults in combinational logic as well [13] . All these techniques connect BICS to the power lines (V DD and GND) of the monitored circuit to distinguish anomalous transient currents from normal currents. The today's problem is that the amplitude of transient currents induced by radiation effects or fault attacks can have the same order of currents normally generated by switching activities in combinational logic circuits. Hence, schemes monitoring power lines are very limited for detecting just small range of transient faults.
On the other hand, BICS connected to the bulks of the monitored circuit's transistors are able to detect a wide range of transient faults [5] [6] [14] . As Fig. 1 (a) and (b) illustrate, Bulk-BICS (BBICS) identify anomalous transient currents I fault flowing through the junction between a bulk and a reversely-biased drain of a perturbed transistor (MOSFETs "off" in Fig. 1 ). BBICS indeed take advantage of two facts:
(1) In fault-free scenarios (i.e. I fault = 0) the bulk-todrain (or drain-to-bulk) current I B is negligible even if the MOSFET is switching in function of new input stimuli;
(2) During transient-fault scenarios, I fault is much higher than the leakage current flowing through the junction.
The range of detectable transient faults is easily adjustable by calibrating the size of the transistors that constitute the BBICS. Hence, schemes based on BBICS can be designed to latch a flag of fault indication for abnormal currents within a defined range that represents a risk of resulting in soft errors. Note that in such a strategy the connection between the monitored circuit (e.g. system's block 1) and the BBICS circuitry (e.g. NMOS-BBICS 1) is done via metal from the body-ties of each monitored transistor (e.g. T11, T12, and T13) up to the input of the BBICS circuitry. Thereby, the peak of the anomalous transient current (i.e. the transient fault) is almost not attenuated, ensuring thus an efficient detection [14] . In fact, this very small attenuation is a function of the local distance between the struck zone of the monitored transistor and its body-tie.
The work in [3] shows that area overheads imposed by BBICS's mechanisms for protecting adder circuits can be up to 13.4 % without impact on the system's operating frequency. The costs therefore are considerably smaller than the ones due to classic CED schemes [15] . Moreover, BBICS approach is much more efficient for dealing with transient faults of long duration and multiple faults. Nevertheless, the negative issue of existing BBICS is the elevated power consumption to provide high detection sensitivity in ultra-deep submicron technologies [5] [6] [16] .
Leakage in nanometer technologies
Ideally, CMOS cells draw no current or rather dissipate no power when idle. Unfortunately, this is not true for real cells realized in nanometer technologies. A major impact originates from sub-threshold leakage current I sub which is the current between source and drain of a transistor when the device should in fact be cut off. A commonly used approximation of I sub is [17] :
with a 1 is a technology factor, W means the gate width, L labels the gate length, q corresponds to the charge of an electron, n means the sub-threshold swing coefficient, k B is Boltzmann's constant, T labels the operating temperature, V gs is the gate-source voltage, and v th labels the threshold voltage. Further, the threshold voltage can be modeled with [17] :
with v th0 is the zero-bias threshold voltage, γ is the bodybias coefficient, a 2 and a 3 label technology constants, η corresponds to the Drain Induced Barrier Lowering (DIBL) coefficient, V sb labels the source-bulk voltage, and V ds is the drain-source voltage.
Sleep-mode improved bulk built-in current sensor
The circuit of our Sleep-mode Improved Bulk Built-In Current Sensor (SIBBICS) is presented in Fig. 2 (a) . If the mode of operation is identical to the original BBICS circuit [6] shown in Fig. 2 (b) , our structure is optimized in such a way that the power consumption could be largely reduced.
The following subsections detail the basic structure and the low-power features of our proposed scheme.
Basic structure
BBICS circuits in Fig. 2 are designed for monitoring NMOS bulks in pull-down networks, and thus they represent the blocks NMOS-BBICS in Fig. 1 (c) . W min represents the minimum diffusion width of the transistors, L min is the minimum channel length, and design factors X and Y are used for calibrating the sensors (see section 4).
The PMOS-BBICS structures for monitoring pull-up networks are the complement counterparts of the schemes in Fig. 2 . For the sake of simplicity, the similar illustrations and behaviors of PMOS-BBICS are omitted in this section, although their results have been also taken into account in analysis of section 4 and 5. The NMOS-BBICS's basic structure in Fig. 2 is composed of a latch (transistors 5, 6, 7, and 8) that is responsible for amplifying the anomalous transient currents coming from the bulk "NMOS_Bulk" of the monitored block (e.g. "virtual_ gnd_1" of system's block 1 in Fig. 1  (c) ). Higher gain of amplification is obtained by increasing factors X and Y, hence higher BBICS's sensitivity in detecting transient faults is also determined in terms of these design factors.
BBICS's latch has, moreover, the function of memorizing a flag in case of a transient fault within a defined current rangei.e. "Flag_N" in Fig. 2 keeping V DD level. On the other hand, as soon as the flag of fault is processed by higher instances of the system, BBICS's latch must be reset (through the input "Reset" in Fig. 2) to detect other transient faults.
Low-power features
Previous subsection highlights that growing factor X of Fig. 2 's transistors 6 and 8 allows improving the BBICS sensitivity in detecting transient faults. However, it also contributes considerably to the increase of static power consumption, which is today responsible for up to 50% of the power dissipation of systems based on ultra-deep submicron technologies [19] .
In fact, if higher BBICS sensitivity is desired, the sensor amplification has to be increased by higher X, i.e. greater diffusion widths and decreased v th of transistors 6 and 8. As consequence, lower v th makes possible "Flag_N" switches (from GND to V DD level) with lower amplitudes of anomalous transient currents on "NMOS_bulk". The sensor is thus able to detect smaller profiles of transient faults (I fault from section 2) that propagate from the "NMOS_bulk" (through transistor 2) to the node "Flag_N". On the other hand, following from equation (2), lower v th results in considerably higher I sub , and thus greater static power consumption.
One could argue that V gs of transistors 6 and 8 are zero in fault-free condition, and then from equation (2) I sub would be negligible. However, we have to consider that there is a very small voltage drop on nodes "Flag_N" and "node1" (complement) due to transistor 5 and 7, which act as high ohmic transistors (large channel lengths), creating an offset on V gs of transistors 6 and 8. In addition, a small offset is also induced on V sb of monitored transistors (e.g. T11, T12, and T13 in Fig. 1 (c) ), and thus, from (3) and (2), their v th are slightly lower, and their I sub are higher.
Our solution for reducing this static power consumption is introducing transistor 9 such as illustrated in Fig. 2 (a) . It allows the utilization of a sleep-mode when the system is left on standby. Transistor 9 is, in this case, set "on", making a less resistive path between the node "NMOS_bulk" and GND. Consequently, V gs of transistors 6 and 8 and V sb of monitored transistors approach zero, I sub becomes much lower, and thus the static power consumption is drastically reduced.
Furthermore, based on simulation experiments by using a 32-nm CMOS technology [18] , we identified that the costly transistors 9, 10, and 11 from original BBICS in Fig. 2 (b) are not necessary to efficiently and quickly detect short and long duration transient faults.
Method for calibrating bulk built-in current sensors
The previous section defines the two factors X and Y in Fig. 2 that allow calibrating the amplification of the anomalous transient current, and thus, adjusting the BBICS sensitivity in detecting transient faults. This section defines a calibration method that searches, at design time and in function of X, for the smallest and the largest Y able to detect a minimum transient-fault profile (I fault in section 2).
In order to find these factors Y max and Y min , several sensor versions are designed in function of different values for Y and X. Transistor-level simulations are performed with the injection of a single transient fault. Firstly, for a determined X, if the simulation result for a chosen value Y shows that a flag is not latched (i.e. the transient fault could not be detected), it can be concluded that the chosen Y is lesser than Y min . Then, greater amplification is required and another higher Y must be tested up to a simulation result illustrating a sensor version that detects (a) "sibbics_esref2012" (b) "bbics_tns2008" lirmm-00715117, version 1 -6 Jul 2012 the fault. In contrast, if the simulation result shows that a flag is always latched even before the fault occurrence, the sensor is not capable of identifying a fault, and the chosen value Y is greater than Y max . Then, other lower Y must be tested to search a value that makes a good detection sensor. This calibration method, thus, allows finding, for any technology, the optimal BBICS design factors within a range between Y max and Y min . The definition of the smallest transient-fault profile I fault that the sensor must be capable of detecting is given in function of a chosen reasonable minimum amplitude and duration. A minimum profile must be able to propagate without masking through a combinational logic and result in soft error (i.e. a bit-flip of a storage element). A larger profile than the smallest chosen one will be then also detectable since BBICS are capable of amplifying and latching its higher and longer induced voltages. Evidently, the largest profile of transient fault is limited by the highest current that does not cross the technology specification for causing permanent effects on the circuit.
The shortest durations of transient faults induced by radiation effects on integrated circuits are of the order of tens of picoseconds [20] . Hence, this paper had defined the minimum profile of transient fault that the sensors must detect as a double-exponential transient current I fault with duration of 50ps (measured at the half amplitude of the current). Moreover, the resultant transient voltage must have minimum detectable amplitude of 50% of V DD .
Simulation results
Transistor-level simulations were performed for comparing the proposed "sibbics_esref2012" with the previous BBICS labelled as "bbics_tns2008". The circuit versions were designed with V DD of 0.9V and in nominal conditions of a 32-nm CMOS technology [18] . Table 1 illustrates the resultant values for Y max and Y min of the schemes in Fig. 2 by using the method of section 4. Fig. 3 shows the normalized power consumption and area of a reference circuit (10 chains of 10 inverters) in which all its transistors are monitored by only one BBICS cell composed of two parts (PMOS and NMOS). The "bbics_tns2008" and "sibbics_esref2012" versions were designed with Y max , Y min , and different X. In addition, Fig.  3 illustrates the results when the sleep mode is active. The 200 transistors of the chains were designed with minimum size to analyze the technology's smallest capacitances, which represent the most sensitive nodes. Thus, this casestudy circuit allows inducing the smallest profiles of transient fault as well as evaluating the minimum detection sensitivity of the BBICS. We can conclude from Fig. 3 that "bbics_tns2008" versions present the largest power consumption and area overheads. On the contrary, "sibbics_esref2012" versions have smaller area and very lower power consumption, which is further reduced during sleep-mode. Note that the results allow defining the minimum and maximum increases since our calibration method searches for the smallest and the largest possible Y. Moreover, the method identifies the optimal trade-off between power and area.
Finally, Fig. 4 illustrates that the response times of "sibbics_esref2012" versions are at least equivalent to "bbics_tns2008" versions, and they can be of the order of hundreds of picoseconds.
Conclusions
The paper presents a promising solution for detecting transient faults in integrated circuits. With no performance penalty and much lower overheads in terms of area and power consumption than state-of-the-art's CED techniques, the SIBBICS are perfectly suitable for system design flows based on CMOS standard-cell layout approach. In fact, the standard cells of commercial libraries are never modified but a set of SIBBICS cells is designed in function of them. Ongoing works are the fabrication of a prototype and the validation of the approach by using a laser beam to inject transient faults. Table 1 Results of the calibration method for a profile of fault: the minimum and maximum factors Y for a range of X. It means, in function of X, the largest and the smallest sensor versions that can detect a transient fault with amplitude of 50% of V DD and 50ps of duration. 
