I. INTRODUCTION
T HE physical limit of the minimum subthreshold slope (SS) of CMOS technologies (SS > 60 mV/decade) makes it impossible to achieve efficient tradeoffs between low threshold voltages and acceptable leakage currents that could allow reducing supply voltage without degrading circuit speed. Thereby, this translates into power density problems for high-performance applications requiring nominal supply voltages and energy inefficiency in low-voltage applications, where increased long time delays raise leakage current energy so much that any advantages obtained by scaling dynamic power with supply voltage are cancelled out. In this context, intensive research is being conducted into devices with steeper SSs (SS < 60 mV/decade) by which low voltage operation at acceptable speeds can be achieved including significant power and energy savings. Tunnel transistors are one of the most attractive steep SS devices [1] - [4] . Recently, in [5] , benchmarking of many beyond-CMOS devices reinforces that tunnel FETs (TFETs) are the leading low-power devices.
Although the limited ON current of these devices is one of their main causes for concern, TFETs with improved ON currents around 1900 μA/μm of channel width with a supply voltage of 0.4 V have been projected [6] . Moreover, band-to-band TFETs based on 2-D transition metal dichalcogenide semiconductors, with an average SS of 31.1 mV/decade for four decades of drain current, at a supply voltage of V DD = 0.1 V at room temperature have been recently reported in [7] .
Emerging devices need to be evaluated at circuit level for a number of reasons and different remarkable benchmarking experiments comparing different emerging devices have been carried out [5] , [8] . Benchmarking is necessary to evaluate gains over CMOS and, thereby, identify the devices which are the most promising candidates for replacing or complementing CMOS under different metrics or in different application areas. Several works have shown that TFETs offer significant power and energy reductions [9] - [13] . Many of them have compared realizations of a given circuit implemented with TFETs with its CMOS counterpart at a nominal supply voltage, often producing application-dependent figures of merit. Others have evaluated the impact of reducing supply voltage in CMOS too, comparing a single TFET with a single CMOS. It would be very interesting, however, to make a broader comparison, taking into account not only different application scenarios (high performance and low stand-by power), but also CMOS devices targeting different objectives (HP and LP), operated at nominal and reduced supply voltages, and different TFET devices.
Avedillo and Núñez [14] reported a preliminary work in which power versus frequency curves were built and compared for several TFET and CMOS technologies at different supply voltages. In this paper, we present a considerably more comprehensive, in-depth, and complete evaluation and comparison, redressing some of the limitations of our previous work. More TFET devices and benchmark circuits have also been included, since TFETs can be designed for different targets of OFF current (I OFF ) and ON current (I ON ), which translates in different performances in terms of speed, power, and energy.
Many earlier comparative studies relied on analytical expressions to calculate delay and power from a reduced set of technological parameters, such as I ON , I OFF , input capacitance, and supply voltage. However, several recent papers have illustrated the great impact of certain features specific to TFET transistors, such as superlinear onset, unidirectional conductance, enhanced Miller capacitance, and dominant gate-to-drain capacitance during performance [15] - [18] . To take these features into account, we carried out an evaluation-based simulation, carefully choosing the characterized circuits and completing the typical fan-out of 4 (FO4) inverter analysis with more complex circuits.
The technologies were evaluated and compared assuming given operating frequency targets. We believe that this provides a clearer picture than the energy versus delay or power versus frequency (at a given supply voltage) curves shown in other papers [12] . Architectural aspects such as logic depth (LD) were also included in the study, as suggested by the results reported in [19] . This paper is structured as follows. Section II describes the experiments we carried out. The results obtained for the FO4 inverter are shown and discussed in Section III. Section IV analyzes the results obtained for more complex gates in order to explore the impact of the fan-in; 8-b adders are evaluated and compared in Section V, and finally, some conclusions are presented in Section VI.
II. EXPERIMENTAL DESCRIPTION

A. Transistors
TFET devices can be designed for different targets of I OFF and I ON . Several contributions [5] , [12] report results for a set of TFETs exhibiting distinct performance in terms of speed, power, and energy. This motivates the inclusion in our work of different types of TFETs. Those public models available in the NANOHUB website [20] satisfy these criteria.
Five different tunnel transistor models were used in this paper. Two of them were derived by Pennsylvania State University (PSU) [21] and the other three by Notre Dame (ND) University [22] , [23] . The TFETs from PSU were lookup tablebased Verilog-A models for III-V interband TFETs based on calibrated Synopsys TCAD device simulations. Models with the gate lengths of 20 nm are available for both a double gate InAs Homojunction TFET (PSU HOMO ) and a double gate GaSb-InAs heterojunction TFET (PSU HETE ). The TFET models from ND were based on the Kane-Sze formula for tunneling. In this paper, we used models for a planar double-gate InAs TFET (ND HOMO ), a double gate AlGaSb/InAs TFET (ND HETE,1 ), and a single gate GaN/InN TFET (ND HETE,2 ).
Four different CMOS transistors were also evaluated for comparison purposes, all of them predictive models obtained from the PTM web page [24] . The ones selected were those with channel lengths similar to the available TFETs [25] for the n-type transistors from each of these technologies. From Table I , it can be inferred that ND HETE,1 is suitable for those scenarios in which static power is dominant (low frequencies). On the other hand, PSU HETE exhibits the largest I ON , thus being the best option for high-speed applications. Experiments described below provide a more accurate evaluation in terms of selected target operation frequencies, using V DD as a design parameter. Fig. 1(a) shows the circuit we used to evaluate the FO4 inverter performance. Note that an intermediate inverter (in blue) was evaluated. The importance of taking into account nonideal inputs in order to evaluate actual performance is well known. The results are very different if the first stage (in red) is used. In [18] , these differences were studied with regard to delay and power in TFET inverters. Their effect on supply voltages differs considerably for each technology, and this impacts not only the absolute values obtained for selected figures of merit, but also the comparison itself. Note that the simple analytical models used for evaluation did not take into account the effect of nonideal inputs. Fig. 1(b) shows the circuit used to evaluate and compare logic gates with different fan-ins. An inverter, a two-input NAND gate (NAND2) and a three-input NAND gate (NAND3) were evaluated (in blue). Note that, again, the gates being tested were loaded with four minimum inverters and their inputs were not ideal but generated with the chains of inverters.
B. Circuits and Measurements
A circuit level benchmark was also included, to take into account some issues which had not appeared in previous experiments but which could have an impact on speed or power. These phenomena included glitches due to the propagation of signals through paths with different delays, leading to extra power, and sustained noise voltage pulses due to capacitive coupling and the asymmetric conduction exhibited by TFET devices, leading to delay degradation [16] , [25] . An 8-b ripple carry adder (RCA) was chosen for this circuit level analysis. An RCA is built by interconnecting full adders (FAs). Fig. 1(c) shows the logic diagram we used for the FA. Note that it comprises inverters and NAND gates.
C. Transistor Sizing
In all the benchmarking circuits, transistors were sized using a minimum gate length. In each case, the n-type transistor width was also the minimum allowable (one finger for the FinFETs). MOSFET p-type transistors were widened (to twice the minimum value) to compensate for mobility differences. Minimum p-type TFET transistors were used, because the models already assumed identical drive-ON currents for both types of transistors.
Applying the typical scaling rule, the n-type transistors in the NAND2 and NAND3 gates were doubled and tripled in width, respectively, to maintain similar rise and fall characteristics in all the technologies except for the FinFET circuits, where a single finger was used for all the gates.
D. Measurements
The benchmarking circuits were characterized in terms of delay and power by simulation at different supply voltages (V DD from 0.05 to 1 V with a voltage step equal to 0.05 V), in order to take into account the impact of the transistors' distinctive characteristics on performance, as mentioned earlier. For each circuit and technology, the minimum allowable V DD was determined as the minimum supply voltage at which correct functionality could be observed with a maximum logic swing degradation of 10%.
Worst case high-to-low and low-to-high propagation delays were measured (at V DD /2) for the gates and the RCA. As a figure of merit, we used the average value for these delays.
Average power was evaluated at different operating frequencies, switching activities (α), and LDs for each of the gates under characterization. The average power for the RCA was measured by applying 100 random input combinations at different frequencies.
The technologies were evaluated and compared assuming given target operating frequencies. That is to say, average power and average energy per operation were compared using the minimum V DD required to operate at each target frequency ( f TG ). This V DD value could be different for each technology. Thirteen target frequencies were selected, from 2 KHz to 6 GHz. For the gate-level experiments, it was assumed that they were being conducted within a logic network with a given LD. Fig. 2 shows the power versus frequency curves for LD = 50 and α = 0.1 (with a switching activity of 10%), with logarithmic scales applied to both axes. For each target operating frequency, we evaluated the minimum V DD at which 
III. FO4 INVERTER RESULTS
is the maximum achievable frequency for certain values of LD, and the delay of a minimum V DD -dependent FO4 inverter, FO4 . The results were normalized with respect to MOSFET HP , the negative values thus corresponding to technologies displaying better performance than MOSFET HP . Note that some technologies were not able to achieve the larger frequency targets.
For low frequencies and switching actives (i.e., f TG < 10 MHz), ND HETE,1 clearly had the best performance, as expected from Table I . At these frequencies, static and dynamic dissipations were similar in all the technologies, but static power in ND HETE,1 was around three orders of magnitude smaller than in the others, resulting in a significantly lower total power performance. Note from Table I that I OFF differences between ND HETE,1 and MOSFTE LP or FinFET LP devices were not so significant (around one order of magnitude in the worst case) when compared at the same V DD . However, our results show more important improvements because, for low frequencies, ND HETE,1 required lower V DD values than LP CMOS technologies. This was the only TFET technology to achieve power savings at very low frequencies. The other four TFETs analyzed obtained worse power results than MOSFET LP and FinFET LP at the smallest frequency target.
When the operating frequency was increased ( f TG ≥ 100 MHz), PSU HETE was the most power-efficient Table I , now we can accurately define the frequency range in which PSU HETE is competitive. PSU HETE consumed more power than MOSFET HP at 2 GHz and could not achieve the 3 GHz target. PSU HETE was the only technology that could compete with MOSFET HP for target frequencies over 200 MHz. The five TFET technologies were competitive with respect to both LP CMOS technologies (FinFET LP and MOSFET LP ) over that frequency.
The impact of the variation of LD is shown in Fig. 3(a) , where this parameter is halved with regard to Fig. 2 . A reduction in LD implies that the minimum V DD at which correct operation is possible (at a certain frequency) can be decreased (thereby also lowering power consumption). This results in larger maximum frequencies up to which power performance is better than for MOSFET HP ( f EFF ). This effect is more significant for TFET than for the MOSFET and FinFET technologies. In PSU HETE , for instance, the frequency is shifted up from 1.8 GHz (marked with an arrow in Fig. 2 ) to 3.7 GHz. On the other hand, in CMOS technologies, f EFF does not vary significantly with LD ( f EFF ≈ 2 MHz for MOSFET LP and FinFET LP ), because for such low frequencies their minimum V DD cannot be reduced when LD decreases, and power performance cannot therefore be improved.
The effect of changes in α is shown in Fig. 3(b) , where this value is reduced to 0.01 [LD = 25, as shown in Fig. 3(a) ]. Static power contribution to total power is significant up to higher frequencies for lower values of α. In this scenario, the most notable power reductions in comparison with MOSFET HP were observed in MOSFET LP and FinFET LP , which, with static powers approximately one order of magnitude lower, became more efficient up to around f EFF ≈ 100 MHz ( f EFF ≈ 2 MHz at α = 0.1). In the TFETs, the frequency up to which ND HETE,1 was the best increased, rising from 10 MHz (α = 0.1) to 100 MHz (α = 0.01), as shown in Fig. 3(b) . Fig. 3(c) shows the results for the experiment in Fig. 3 (a) (LD = 25 and α = 0.1), when the minimum explored V DD (V DD,MIN ) was raised from 0.05 to 0.25 V to take into account the possible practical limitations of lowering supply voltage so much, especially in terms of variability issues [26] . The significant differences between Fig. 3(a) and (c) are not qualitative but quantitative. For example, the power savings in MOSFET LP improved with respect to MOSFET HP at the lowest frequency target. The relative order of ND HOMO and PSU HOMO , and also of MOSFET LP and FinFET LP , was interchanged at that frequency. In general, these quantitative changes were seen at the lower frequencies for which operation with a V DD of under 0.25 V was possible. The exception was PSU HETE . For this technology, differences were observed up to higher frequencies. Power savings fell for the intermediate target frequencies (from tenths of kilohertz to around 1 GHz) that can be achieved with supply voltages of under 0.25 V with these transistors.
We next evaluated power consumption versus frequency for three values of α (0.01, 0.1, 0.5) and two of LD (25, 50) . For each frequency, LD and α, we chose the most power efficient technology and normalized it with respect to the best CMOS (MOSFET/FinFET) technology. The results are summarized in Table II . In each cell, the first row reports the technology which achieved the lowest power among the nine we analyzed ("Best tech."). The second row shows the CMOS technology with the best power result ("Best CMOS tech."). The third row reports the power consumed by "Best tech." normalized with respect to "Best CMOS tech.". Note that, as shown in Fig. 2 , none of the technologies worked for LD = 50 and fTG > 3 GHz.
It is clear from the table that the power advantages of TFET devices depend heavily on required operating frequency, switching activity, and LD. ( f TG , LD, α) combinations were identified for which no TFET was competitive (the cells with MOSFET HP as the "Best tech.").
The range of normalized power (or power ratio) values obtained was very wide. ND HETE,1 had the lowest power consumption ratio (0.0010) with respect to MOSFET LP (at f TG = 1 MHz and LD = 50 and α = 0.01, marked in blue). This means that ND HETE,1 power was 0.1% of the best CMOS power, representing a power saving of 99.9%, that is, ND HETE,1 power 1000 times smaller than the best CMOS power. In general, very low ratios were also observed for this technology at low frequencies (reductions of over one order of magnitude in all cases and of over two orders of magnitude in many cases). This is explained by the technology's ultralow OFF-state current (static power).
The best case in which PSU HETE was better than MOSFET HP gave a power ratio of 2.8% ( f TG = 20 MHz and LD = 25 and α = 0.5, marked in green). The worst case in which PSU HETE was better than MOSFET HP gave a power ratio of 98% ( f TG = 2 GHz and LD = 50 and α = 0.01, marked in red). That is to say, the values varied widely depending on frequency, LD and α. From f TG = 2 GHz upward, MOSFET HP began to be the most efficient node. In no case were the homojunction transistors the most competitive.
It was also observed, as described earlier [ Fig. 3(a) ], that the frequency values at which the same normalized power was achieved in comparison with CMOS nodes were larger for LD = 25 than for LD = 50. As a case example, for ND HETE,1 , normalized power values (with respect to FinFET LP ) around 3% are obtained at 20 and 10 MHz for LD = 25 and LD = 50, respectively (with α = 0.01).
On the other hand, for the same f TG , normalized power was equal or better for the lowest LD. Improvements were greater for higher frequencies. At f TG = 1 GHz and α = 0.1, for example, the normalized power of PSU HETE compared with MOSFET HP was over twice as small for LD = 25 (0.1094) than for LD = 50 (0.2312). And for f TG = 2 GHz and α = 0.01, it was more than four times smaller. For both technologies, power fell when LD was decreased from 50 to 25, since V DD required to operate at a given frequency decreases when the logic paths for signal propagation are shortened (smaller LD). The results therefore show that the amount by which power is reduced is larger for PSU HETE than for MOSFET HP . This is due to the different delay versus V DD behaviors shown by TFETs and CMOS.
In TFETs, there is a larger, flat region in which delay slightly increases with V DD reduction. The reduction in V DD achieved by changing from LD = 50 to LD = 25 is larger for PSU HETE than for MOSFET HP . This illustrates the limitations of benchmarking by simply comparing the performance of FO4 inverters or using a fixed LD value, and suggests that evaluation experiments should ideally take into account architectural aspects (like LD). In other words, different architectural options may be more appropriate for different technologies. With regard to the LD parameter, for example, LD = 25 can be interpreted as a two-stage pipelined implementation of an original circuit with LD = 50 (assuming ideal pipeline registers). In this scenario, our results show that pipelining to reduce power consumption is more efficient for the TFET technology than for CMOS. We compared power for both LD values in each technology. For PSU HETE , power with LD = 25 was 13% of power with LD = 50, and for MOSFET HP , it was 65%. The significantly larger saving shown by PSU HETE suggests that pipelining can be an efficient technique for obtaining power and energy savings (more remarkable for TFETs than for CMOS) even taking into account the power overheads associated with its registers.
Finally, it should be noted that FinFET HP did not emerge as "Best CMOS Tech" in any of the cases analyzed. This, however, should not be interpreted as an indication that this technology is not competitive with respect to MOSFET HP , but as the result of the sizing selected for the experiment, which produced larger capacitance for the FinFET technologies. This will be clarified in Section IV, where more complex gates and functional circuits are analyzed. In spite of the sizing strategy, in Table II , FinFET LP still appears as the best CMOS technology for the lowest frequency targets (see Figs. 2 and 3) , since in these cases, power is not dominated by the charging and discharging of transistor capacitance.
Average energy per operation (E) versus frequency curves in the FO4 inverter is shown in Fig. 4 for LD = 50 and α = 0.1. Energy was calculated as E = P/ f , using the minimum V DD required to operate at such a frequency. No explicit general comments on these figures are made, because the conclusions drawn regarding power performance also apply to energy. Our interest here is in analyzing the minimum energy point. It can be observed that TFET inverters have smaller minimum energy values than CMOS inverters. In particular, ND HETE,1 , the best one, has values around 1.6 orders of magnitude lower than its best CMOS counterpart (MOSFET HP in this case). Unlike energy versus V DD , which is usually used, this representation also makes it possible to evaluate the energy-speed tradeoff achieved by each technology. The minimum energy points of all the TEFTs except PSU HETE were obtained at frequencies lower than in HP CMOS, although slightly higher than in LP CMOS. In fact, the only TFET technology to show an advantage in terms of E/ f (a figure of merit for estimating energy-speed tradeoff) was PSU HETE . The minimum energy for each technological node does not necessarily correspond to the lowest V DD . The critical V DD value for minimum energy (V DD,OPT ) is much lower in TFET than in MOSFET/FinFET, because the ON/OFF current ratio in tunnel technologies is larger (the SS is steeper), as corroborated in previous works [19] , [13] , [26] .
The impact of LD and α on energy performance is shown in Fig. 5 . In Fig. 5(a) , LD was reduced to 25, without significantly modifying the energy curves but with slight differences in the frequency at which minimum energy was achieved. On the other hand, a downward shift in energy was seen when the switching activity factor was decreased by ten, as shown in Fig. 5(b) for α = 0.01. Note that LD was only multiplied by two (the doubling frequency for a given V DD ), which explains the more notable impact of the variation in α with respect to the experiment in which LD was varied. We also found that energy is also impacted by V DD,MIN . The advantages of ND HETE,1 with respect to CMOS in terms of minimum energy, for example, dropped to around one order of magnitude for V DD,MIN = 0.15.
IV. FAN-IN EXPERIMENTAL RESULTS
This section analyzes the performances of more complex gates. More specifically, the inverter is compared with the NAND2 and NAND3 gates as a case study. The test bench for this experiment is shown in Fig. 1(b) . The experiment shown in Table II in Section III was repeated for all three gates. That is to say, for each frequency, LD and α, we chose the technology with the best power consumption and normalized its average measured power with respect to the best MOSFET/FinFET technology result. The best technology ("Best tech.") and CMOS technologies ("Best CMOS tech.") at each target frequency are shown in Table III for one of the LD and α combinations (LD = 25, α = 0.1). The regions in which MOSFET LP or FinFET LP are the "Best CMOS tech." (low frequencies) are distinguished from those in which MOSFET HP /FinFET HP are the most efficient by a bold line. Note that, unlike in the previous Section, FinFET HP now appears as the "Best CMOS tech." for several target frequencies of NAND2 and NAND3. The benefits of the FinFET HP technology could not be shown in the inverter experiment described in Section III, but now, with more complex gates, they could. The border frequencies obtained between regions increased with fan-in. Note the clear correspondence between the regions described above and the frequency ranges at which ND HETE,1 and PSU HETE (highlighted in Table III) are the best "Best tech."
Care should be taken when comparing the normalized power results obtained for the different gates (the third row in each cell), because in many cases, the comparison would be between data corresponding to different technologies and, moreover, normalized with respect to different CMOS technologies. However, it is still interesting to look at the results obtained for NAND2 and NAND3 in those cases in which "Best CMOS tech." and "Best tech." match each other. Two different behaviors can be observed. For low frequencies, between 1 and 20 MHz (FinFET LP as "Best CMOS tech."
and ND HETE1 as "Best tech."), normalized power is smaller in NAND3 than in NAND2. At these frequencies, the contribution of static power due to leakage currents dominates in ND HETE,1 and, so, advantages in terms of the stack factor could be better in this technology than in FinFET LP . This was also reported in [27] . For high frequencies between 1 and 4 GHz (FinFET HP as "Best CMOS tech." and PSU HETE as "Best tech."), the opposite behavior is observed. That is, the normalized power of TFET is higher in NAND3 than in NAND2. In this case, dynamic power dominates and the results obtained can be put down to the sizing strategy, with pull-down transistors that are wider in the TFET NAND3 with respect to NAND2 but identical in the FinFET gates. When PSU HETE power is normalized with respect to MOSFET HP , the results for both gates are much more similar.
Finally, note that there are differences between the results reported for the inverter in Section III and those reported in this experiment. The advantages of PSU HETE can now be observed up to frequencies at which it was not previously competitive. These dissimilarities are caused by the different circuits used to drive the gate under characterization. Input is now produced by a fan-out 1 inverter, whereas in the experiment described in Section III, a fan-out 4 inverter was used. This confirms that benchmarking experiments are also heavily dependent on selected input waveforms, as discussed earlier. It also justifies the circuit level evaluation experiment described in Section V and the aforementioned study of how the different features of TFET devices impact circuit operation.
V. CARRY PROPAGATION ADDER RESULTS
Table IV summarizes the power results for the 8-b RCA adders implemented with the four CMOS transistors and the three heterojunction TFET technologies. The homojunction TFETs are omitted, since they were not shown to be competitive in any of the comparisons carried out at gate level in the previous sections. The best CMOS and best TFET results for each frequency are shown in bold type.
Note that the results obtained agree with those obtained at gate level. For each of the moderate frequency targets explored, the most competitive power solution corresponded to a TFET technology, with large power and energy savings being achieved in all cases. The ratio between the smallest power achieved with TFET technologies and the smallest power obtained with CMOS transistors was evaluated and is shown in the last row in Table IV . Power reductions ranged from 92% (1 GHz) to 97% (1 MHz), although they did not decrease monotonically with frequency. Again, this can be explained by the different delay versus V DD behavior shown by the TFET and CMOS transistors and by experimental discretization.
Normalized power values for NAND2 (NAND3) in Table III (up to 1 GHz) were between 0.02 and 0.12 (0.01 and 0.29), and slightly smaller for the RCA (between 0.03 and 0.08). However, it should be noted that, in this experiment, LD and α were different. That is to say, the power savings obtained for the benchmark at circuit level were larger than those obtained in the gate level experiment with LD = 25 and α = 0.1. It is also interesting to compare the last two columns of Table IV . For all the CMOS transistors, power at 0.5 GHz is less than three times smaller than power at 1 GHz. Assuming dynamic power dominates at these frequencies, and that power halves when the circuit is operated two times more slowly, the additional power savings stem from supply voltage reductions associated with smaller frequency targets. For TFET transistors, the reduction factor is over 4 (4.40 for ND HETE,1 , 6 for ND HETE,2 , and 5.3 for PSU HETE ). Power savings associated with supply voltage reduction are therefore larger for TFETs, and when frequency targets are relaxed, supply voltage reductions supported by TFETs, are larger than in CMOS. This fully concurs with the results described at gate level, and may be specifically related to our analysis of the impact of LD when comparing power and energy advantages. Experiments carried out at circuit level also show that the impact of techniques to improve throughput used for power optimization is larger in TFET technologies. From this perspective, these results show that the power benefits that can be obtained by using parallelism in the implementation of the adder with a throughput target of 1 GHz (each individual copy operating at 0.5 GHz) are larger in TFET than in CMOS. They also seem to indicate that evaluation experiments should take into account architectural aspects.
VI. CONCLUSION
The experiments carried out in this paper show that estimating TFET power benefits or energy savings simply from the reductions obtained in V DD with respect to nominal supply voltages in CMOS technologies can be misleading. The advantages of TFET devices depend heavily on required operating frequency, switching activity, LD, and the minimum supply voltage that is practical for actual circuit applications. As in conventional CMOS technologies, a single device is not competitive in all application domains.
Of the five TFET technologies analyzed, two were identified as competitive devices in two different application fields. ND HETE,1 is suitable for applications in which static power dominates (very low frequency-switching activity products), but has severe speed limitations. It offers significant advantages in power and energy with respect to CMOS, even with regard to LP devices in this domain. PSU HETE was seen to be greatly advantageous in terms of power and energy in a given frequency range, the exact position of which depends on LD and switching activity. In addition, we have found that only the two most promising transistors in terms of speed (PEN HETE and ND HETE,2 ) exhibit I 60 in the 1-10 μA/μm range, identified in [25] as the required range to be competitive with CMOS. For these devices, power-related advantages are associated with lower required supply voltages than those applicable in CMOS, to fulfill with the constraint on path delay imposed by working frequency. This suggests the existence of limitations in benchmarking experiments in which identical CMOS and TFET circuits are evaluated and compared. The adoption of architectural solutions involving path delay constraints capable of fully exploiting TFETs distinctive speed versus supply voltage behavior can make TFET logic circuits competitive up to higher frequencies, increasing their power (energy) efficiency.
Circuit-level evaluation using an 8-b RCA showed that TFET implementations using ND HETE,1 for low frequencies and PSU HETE for larger frequencies have power (energy) consumption values between 3% and 8% of the best values obtained with CMOS. Although CMOS RCAs achieve higher operating frequencies, our results suggest that architectural techniques such as pipelining or parallelization could be applied to TFET designs to enable them to operate at those higher frequencies without losing their power and energy advantages.
