Abstract -Nonvolatile SRAM (nvSRAM) has emerged as a promising approach to reducing the standby energy consumption by storing the state into an in situ nonvolatile memory element and shutting down the power supply. Existing nvSRAM solutions based on a nonvolatile backup in magnetic tunnel junction and ReRAM, however, are costly in backup and restore energy due to static current. This cost results in a long breakeven time (BET) when compared with a lowered voltage standby volatile SRAM. This brief proposes an nvSRAM based on ferroelectric FETs (FeFETs) that are capable of fully avoiding such static current. A simple differential backup and restore circuitry is proposed, achieving sub-fJ /cell total energy per backup and restore operation at the 10-nm node. This leads to hundreds of times BET improvement over existing ReRAM nvSRAM solutions. This nvSRAM also indicates the future FeFET design trends for such memory-logic synergy.
such as using low-standby-power transistors, proper transistor sizing, and supply voltage modulation especially in the idle phase. Using intrinsically nonvolatile memory (NVM) such as ReRAM, spin-transfer torque magnetic random-access memory, and phase-change random-access memory, leads to zerostandby cell leakage, but is constrained in use as an SRAM due to limited speed, endurance, or dynamic energy or power [4] .
In contrast, nonvolatile SRAM (nvSRAM) embeds NVM to store an in situ nonvolatile copy of a CMOS SRAM bit and saves energy by completely shutting OFF the leaky supply [18] [19] [20] [21] [22] [23] . However, the energy consumed in an existing nvSRAM to backup and restore from the NVM, E B&R , is high. For example, the state-of-the-art E B&R in experiments is ∼200 fJ [18] . The fundamental causes include: 1) the high static current during a backup or restore operation; b) long backup time to survive variations; and c) long restore time to cover the supply recovery ramp. As a result, the minimum power-OFF time to save static leakage energy to compensate E B&R , i.e., break-even time (BET), is high for existing nvSRAM designs. This prohibits energy savings with frequent data access.
This brief proposes an nvSRAM, shown in Fig. 1(a) , with sub-fJE B&R and thus significant BET improvement over existing nvSRAMs. It is achieved with a novel backup and restore circuitry using Ferroelectric FETs (FeFETs) that completely get rid of the static current in prior nvSRAMs. FeFET is an emerging beyond-CMOS transistor for Boolean logic and memory [5] [6] [7] [8] [9] [10] [11] , [16] . It fundamentally differs from other NVM devices in its capability of concurrent logic and memory operation with the drain-source conducting route separated from the gate control signal. It also works under a low voltage with very high distinguishability in the drain-source resistance between different FeFET memory states. Such an FeFET nvSRAM also indicates the significance of designing future FeFETs with embedded logic-memory synergy in a single device for more energy-efficient computing.
II. PROPOSED FEFET NVSRAM OPERATIONS
This section first describes FeFET device for nvSRAM and the operation theory of FeFET nvSRAM in different modes. Then device-circuit codesign and optimizations are discussed.
A. FeFET Device Characteristics
An FeFET is implemented with a ferroelectric (FE) material layer electrically and preferably physically sandwiched in the gate-stack [6] . Recent FE development promises good FeFET scaling capability with low-voltage operation [8] . Various FeFETs have been reported. While some of them exhibit hysteresis-free switching with a steep slope, others are highlighted with tunable I -V hysteresis curves [6] [7] [8] [9] . The proposed nvSRAM in this brief makes use of the FeFET I -V hysteresis covering zero gate bias, as shown in Fig. 1(b) . This could be achieved by tuning the FE layer thickness T FE to obtain proper capacitance matching and coercive voltage [6] .
In the FeFET I -V hysteresis, the positive and negative polarization could be set by applying a positive and negative gate-source voltage V GS , respectively. When V GS stays away from the rising and falling edge in the two directions, the polarization state does not change. Even if the gate voltage is removed, the stable remnant positive or negative polarization of the FE material maintains a nonvolatile state of low or high FeFET drain-source resistance, respectively [5] .
In this brief, simulations are based on the 10-nm LSTP PTM model in [14] and the calibrated FeFET model based on L-K equation in [15] that has been adopted by [10] , [11] , and [16] . The FeFET device model parameters for the L-K equation are included in Fig. 1(b) .
B. SRAM-Mode Operation
During normal SRAM-mode operations, the supply voltage V supply and the gate voltage of the two access transistors (M1 and M2) V rstr are kept stably at VDD and GND, respectively. As M1 and M2 are turned OFF, the backup and restore circuitry now acts as a high-resistance OFF-state route from Q and QN to ground, regardless of the polarization states of N1 or N2.
The operation of the rest of the nvSRAM is similar to an existing volatile 6-T SRAM in read and write. For read, the bit lines BL and BLN are both precharged to be high in voltage and then the word line signal WL turns on the access transistors M3 and M4. The internal inverter that outputs zero discharges the corresponding bitline and causes a lower sensed voltage at the bitline. The sense amplifier reads the bitline voltage difference and obtains the Boolean value. To write the nvSRAM, the external bit lines are kept stable at high or low voltage, according to the values to be written. Then M3 and M4 are turned on and the internal inverters are regulated accordingly. It is noted that the additional backup and restore circuitry adds some minor parasitics to the internal nodes Q and QN. This reduces the read access time as these parasitics help discharging the bit lines, and also increases the write access latency. The impact is very minor because the extra parasitics are orders lower in magnitude than bitline capacitance. Read and write noise margins are dc performance metrics, and will be nearly the same as a conventional 6-T SRAM as the extra circuitry is floating (OFF from GND) in the SRAM mode.
As the two FeFETs (N1 and N2) are floating in the SRAM-mode, their gate voltage V bkp does not affect the SRAM-mode functionality. If the SRAM access is not frequent and the FeFET endurance is sufficiently high, V bkp can be set to VDD or GND or somewhere in between. In contrast, when endurance is a concern, V bkp should be kept at an appropriate voltage so that the FeFETs do not wear out soon due to too many polarization switching activities with frequent SRAM state updates in some applications. A possible V bkp for this purpose could be VDD/2, to make sure that the largest FeFET voltage stress |V GS | is minimized to be VDD/2 to avoid unnecessary polarization switching.
For the same reason, given an FeFET nvSRAM design, VDD should be within a certain range. For the backup purpose, VDD should be higher than the coercive voltage to update the polarization. For FeFET endurance considerations, VDD should be less than twice the coercive voltage so that updates of the SRAM data do not flip the FeFET polarization. If a dynamic VDD is available, VDD could also be further lowered in the SRAM-mode to save more power and then raised temporarily to enable backup operations when necessary. Fig. 2 illustrates the 2-step backup operation. In this mode, V supply is stable at VDD, and V rstr is stable at GND. The SRAM external access transistors (M3 and M4) are also turned OFF by a low WL voltage. The state of the SRAM is assumed to be VDD at Q and GND at QN. At step 1, V bkp is raised to VDD, above the FeFET coercive voltage, to ensure that N2 is positively polarized with low drain-source resistance. At step 2, V bkp is lowered to GND to ensure that FeFET N1 is negatively polarized to have high drain-source resistance. Note that N2 polarization stays stable at step 2.
C. Backup-Mode Operation
If the nvSRAM backup operation is followed by a power supply cutoff, V bkp could stay as GND. If the nvSRAM backup is a run-time backup followed by subsequent SRAM-mode operations, V bkp could be set back to its SRAM-mode value. Fig. 3 shows the transient waveforms for backup and restore operations before and after a power failure, respectively. Before a restore, all nodes are assumed to be zero in voltage. The polarization of N1 and N2 is assumed to be negative and positive, respectively. At first, V rstr is raised to VDD (V bkp remains zero). Accordingly, the positive N2 polarization leads to a low-resistance path from QN to ground, and the negative N1 polarization results in a high-resistance path from Q to ground. After that, V supply is raised to VDD. The different pull-down-to-ground strength at nodes Q and QN by the restore circuitry regulates the main SRAM inverters, so that Q follows V supply and QN stays zero. After Q and QN settle down, V rstr is grounded. After a restore, if VDD is higher than twice the coercive voltage, V bkp could be adjusted to its SRAM-mode value for endurance considerations. Note that restore operations do not change the FeFET polarization.
D. Restore-Mode Operation

E. Device-Circuit Co-Design and Optimization
The main nvSRAM performance features include backup and restore energy and delay, retention time, yield, and reliability. The FE layer thickness T FE could be tuned accordingly for optimizations. Fig. 1(b) has shown how T FE affects I -V . Related key FeFET features include the hysteresis window width and ON-and OFF-state currents.
Increasing T FE increases the FeFET retention time with a higher coercive voltage, which also leads to longer time till the polarization is flipped, as shown in Fig. 4(a) for the backup latency. T FE also affects the backup and restore energy E B&R , showing more energy consumed with a smaller T FE , as shown in Fig. 4(b) . For Fig. 4(a) and (b) , the V supply wakeup ramp is set to be 1 μs to mimic typical scenarios. At the array or processor level, restore speed is usually limited by VDD recovery. For fine-grained power-gating scenarios, a fast recovery of a local supply is possible, and the FeFET nvSRAM intrinsic restore speed becomes meaningful. Fig. 4(c) shows the restore latency in such a scenario that is slightly affected by T FE under a 0.1-ns-recovery VDD. This is significantly faster than ReRAM nvSRAM in [18] because of the single-step restore operation with a positive feedback loop in the nvSRAM.
III. PERFORMANCE EVALUATIONS AND DISCUSSIONS
A. Performance Evaluations
When operating at different supply voltages, Fig. 5 For comparison, the state-of-the-art 7T1R ReRAM nvSRAM in [18] is also evaluated with the PTM technology. The low-resistance state of the ReRAM is set to be 5.5 k , to provide sufficient pull-down strength against the preset pull-up PMOS in the SRAM (pull-up resistance with 1 fin is ∼22 k at 0.7-V VDD). The adopted switching time t SW is 0.3 ns at 0.8-V SET/RESET from an ultrafast energy-efficient HfOx ReRAM in [22] . Even if only write energy is considered here, the ReRAM nvSRAM consumes ∼597 × higher energy (more than 35 fJ), assuming no ReRAM device variations. The huge E B&R difference is due to the required static current for at least t SW to flip the ReRAM states [18] , [22] . When ReRAM device variations are considered, E B&R will be much higher as the write pulse duration will be determined by the slowest ReRAM [25] .
The E B&R advantage also leads to BET benefits, as shown in Fig. 5(b) . This indicates that the minimum power-OFF window width during power-gating to win energy savings is only ∼0.2% of that in ReRAM nvSRAM, which benefits frequent-access scenarios. Scaled magnetic tunnel junction (MTJ)-based nvSRAM is far outside the plot due to similar backup energy to ReRAM nvSRAM and large wakeup energy at a typically >μs V supply wakup ramp [22] , [23] .
B. Area Overhead Analysis
The proposed FeFET nvSRAM needs four additional transistors in a cell for nonvolatility, indicating more area overheads than the prior 7T1R design in [18] in which ReRAM could be placed on top of the SRAM cell. Note that this ReRAM nvSRAM has extra area overhead due to the need for various voltage power supplies, which is significant when the SRAM array is small.
Meanwhile, density may not be the key priority for SRAM in some scenarios, such as in 8T and 10T SRAM designs, for which the portion of area overhead reduces. Therefore, the proposed design shows the most advantage when the backup and restore energy overhead is significant, as the proposed FeFET nvSRAM is far lower than the existing nvSRAMs.
More importantly, the area overhead of FeFET nvSRAM can be significantly reduced by moving the accessory circuitry to another layer using new fabrication methods such as the monolithic 3-D process [24] , [26] .
C. Retention, Reliability, and Endurance
If the number of FeFET polarization switching activities exceeds beyond the device endurance, the FeFET may be worn out and will exhibit a different ON-OFF resistance and affect the restore behavior. Thanks to the huge ON-OFF state resistance ratio (up to seven orders in magnitude as reported in [6] ), its degradation of up to even 90% is still acceptable for functionality as long as ON-state resistance is sufficiently low. Future work on retention time and reliability analysis will be meaningful for FeFET device design and comprehensive evaluation when a proper model or experiment becomes available.
IV. CONCLUSION
This brief has proposed FeFET nvSRAM design by exploiting the embedded logic-memory integration in the FeFETs. With ultralow energy consumption in backup and restore operations, it enables a new SRAM paradigm for more energy savings.
