The near-threshold design is widely employed in the energy-efficient circuits, but it suffers from a high sensitivity to process variation, which leads to 2X delay variation due to temperature effects. Hence, it is not negligible. In this paper, we propose an analytical model for gate delay variation considering temperature effects in the near-threshold region. The delay variation model is constructed based on the log-skew-normal distribution by moment matching. Moreover, to deal with complex gates, a multi-variate threshold voltage approximation approach of stacked transistors is proposed. Also, three delay metrics (delay variability, ±3σ percentile points) are quantified and have a comparison with other known works. Experimental results show that the maximum of delay variability is 5% compared with Monte Carlo simulation and improves 5X in stacked gates compared with lognormal distribution. Additionally, it is worth mentioning that, the proposed model exhibits excellent advantages on −3σ and stacked gates, which improves 5X-10X in accuracy compared with other works.
Introduction
By technology downscaling deep into the nanometer era, the near-threshold design has become one of the most efficient ways for high energy efficiency application [1] [2] [3] [4] . However, it suffers from a much higher sensitivity to process, voltage and temperature variations [5] [6] [7] . In view of transistor operation mechanisms, MOSFET changes from strong inversion to weak inversion with decreasing voltage, and its primary current also changes from drift to diffusion current [8] . In the near-threshold, since these two kinds of current cannot be ignored, its current formula is different from that of sub/super-threshold regime [9, 10] . Kumar et al. [11] and Saurabh et al. [12] reveal the relationship of nominal delay with temperature in low voltage, which show the temperature has important effects in the low voltage. Kaul et al. [2] show that delay variability has a 2X improvement due to temperature effects. Consequently, the effects of temperature play an important role in the delay and delay variation, and cannot be ignored.
A lot of works have studied on the effect of temperature on the delay variation. In [13, 14] , they present the model of delay variability and temperature in the subthreshold regime, which is inversely proportional to temperature. Kiamehr et al. [15] introduce the adjustment of temperature-aware voltage scaling considering the process, but from the view of current, subthreshold regime is adopted. For stacked gates, a method is proposed in [16] , which shows the dependence of mean and variance on temperature based on lognormal (LN) distribution for a CMOS inverter and gates with transistor stacks in the subthreshold regime. Also, for stacked transistors, they assume that each transistor is independent. However, due to the connection of the intermediate node, the stacked transistors are dependent, and no longer follow the same Gaussian distribution. Nevertheless, the relationship of delay variation with temperature under near-threshold supply voltage has not been revealed.
For near-threshold region, Balef et al. [17] use log-skew-normal (LSN) distribution to model the delay distribution across all voltage regions with its distribution parameters determined by fitting from the Monte Carlo (MC) samples. But it does not reveal the relationship with the process parameters and the environment parameters and takes numerous MC simulations. A transregional current and delay model was developed in [18] in closed-form for near-threshold. But LN distribution is introduced for the delay variation computing. Nevertheless, due to the difference in current expression, LN is no longer suitable for near-threshold, so the computation of maximum/minimum delay is unsuitable. Therefore, it does not provide a reasonable analytical delay variation model for the near-threshold regime and does not have analysis on stacked gates.
In this paper, a novel analytical delay variation model with temperature in the near-threshold is proposed, and also extended to gates with stacked transistors. The main contributions of the work are as followed.
• A novel analytical delay variation model with temperature is derived by LSN distribution with moment matching for the near-threshold regime.
•
Due to the existence of the intermediate node, the two stacked transistors are no longer independent. So a multi-variate threshold voltage approximation approach of stacked transistors is proposed for the computation of delay variation for staked gates with temperature consideration, which shows remarkable accuracy advantage compared with previous works.
The rest of the paper is organized as follows. Section 2 introduces the related properties of LSN distribution. The delay distribution models for a single transistor and stacked transistors are addressed in Section 3. For stacked transistors, a multi-variate threshold voltage approximation approach is proposed. And the delay variation model is derived by LSN distribution with moment matching. Section 4 shows the experimental results that three metrics (delay variability, maximum and minimum delay) are compared with SPICE MC simulation and other previous works. Section 5 concludes the paper.
The Properties of LSN Distribution
Since this paper mainly analyzes and derives based on LSN distribution, some basic properties used are introduced in this section.
Properties of Skew Normal Distribution
If X is a random variable and follows a skew-normal (SN) distribution, that is X ∼ SN ε, ω 2 , λ , its PDF ( f SN (X)) and CDF (F SN (X)) are represented as
where φ and Φ are the PDF and CDF of the standard normal distribution, ε, ω, and λ are the location, scale, and shape parameters of the SN distribution, T (H, A) is Owen's T function and can be given by
The mean (µ), variance σ 2 , and skewness (γ 1 ) (they are also called first, second and third moment) of the random variable X are written in Equation (4) [19] .
where
Properties of Log-Skew-Normal Distribution
If Y is a random variable and has an exponent relationship with X Y = e X , it follows log-skew-normal distribution, that is Y ∼ LSN ε, ω 2 , λ . And its PDF ( f SN (X)) and CDF (F SN (X)) are represented as
According to [19] , the mean and variance of Y are illustrated in Equation (8) . Besides,
For this work, maximum/minimum delay (±3σ percentile points) is also the important metric. In order to obtain ±3σ percentile points, Equation (9) must be solved.
But it cannot be easily solved for the LSN distribution due to the existence of Owen's T function. To resolve this issue, a reasonable approximation is introduced in this paper to transform the form of the Owen's T function. If the value of shape parameter (λ) in LSN distribution is close to 1, Owen's T function has the following function.
Equation (9) can be derived by
It can be clearly shown from Equation (11) that the maximum/minimum delay at 3σ/ − 3σ percentile point of Y locates at Y max /Y min , which is Table 1 . The property of lognormal, log-skew-normal and Gaussian distributions.
Delay Variation Model Based on LSN Distribution
In this section, delay variation models of a single transistor and stacked transistors based on LSN distribution are introduced, respectively. By moment matching, the distribution parameters of LSN can be computed. The detailed process of the proposed model is introduced in the following.
Delay Variation Model for Single Transistor
Recently, Keller et al. [18] pointed out that near-threshold on-state current is expressed as follows.
where µ is the mobility of carriers, C ox is the oxide capacitance for unit area, n is the subthreshold slope factor, W and L are width and length of a transistor, k is the Boltzmann constant, q is the elementary charge, K 0 , K 1 and K 2 are process-independent constants, which are 0.54, 0.69 and −0.033 [18] . V DD is the operation voltage, V thn0 is the threshold voltage at the temperature T 0 . κ is the temperature coefficient for threshold voltage, and T is the absolute temperature. Simplicity, Equation (13) can be rewritten as
Although µ is also related to temperature, the mobility contributes only 2% compared with thermal and threshold voltage [12] . Therefore, in this work, the dependence of mobility with temperature is also ignored.
And X is noted as
Since the random variable X is a quadratic function of threshold voltage V th which follows Gaussian distribution, the mean (µ), variance (σ 2 ), and skewness (γ 1 ) of X can also be calculated by definition and expressed as
where µ 0 and σ 0 are the mean and standard deviation of V th0 , and
can be computed by integration and the final form is illustrated in
With a simper linear RC-delay model, the delay of a gate can be written as
Delay Variation Model for Stacked Transistors
In this section, we broaden the proposed model to gates with stacked transistors , including stacked transistors, such as NAND, shown in Figure 1 . Due to the existing of intermate node V x , the stacked transistors are not independent. In order to extend the proposed model, a multi-variate threshold threshold voltage approximation approach is proposed.
I U and I D are the current through the up and down series-connected transistor as shown in Figure 1 , which can be computed by
where V x is the intermediate node voltage of the two stacked transistors, V thU and V thD are the threshold voltage for up and down transistors and illustrated in Equation (21), respectively.
where V thU0 and V thD0 are the threshold voltage of up and down transistor at temperature T0. Due to the two transistors are series-connected, so the I U = I D . However, V x cannot be solved analytically, because it causes a transcendental equation. Therefore, a linear approximation method is introduced to determine the relationship between V thU and V thD . Through running MC simulation for a NAND2 cell at 0.55 V and 25 • C, related simulation results are plotted in Figure 2 , which show that the voltage V x changes approximate linearly with V thU and V thD . Moreover, the slope with different temperatures can be considered constant whose specified fitting parameters are listed in Table 2 . The maximum error is 3.12% across the entire temperature. Thereby, the equation form of V x can be determined by a bivariate linear model, which is shown in
where k U , k D , and k C are the fitting parameters. By substituting V x into Equation (20), the discharge current for the NAND2 gate can be written as
where V th_st is the equivalent threshold voltage in the stacked gate and is given by
Because the V thU0 and V thD0 both follow Gaussian distribution, the parameter of V th_st can be obtained easily by Gaussian distribution operation.
Assuming that V thU0 ∼ N (µ U0 , σ U0 ) and V thD0 ∼ N (µ D0 , σ D0 ), the mean and variance of V th_st can be expressed by
If X is noted as
So we can expand the single transistor model to stacked transistors, and the expression of T d_st is formulated by Equation (27), which has the same form as single transistor.
Moment Matching
According to [17] , LSN distribution is suggested for the modeling of the delay distribution and shows remarkable accuracy in the near-threshold region. Therefore, X is assumed skew-normal distribution, the distribution parameters are ε, ω and λ, and the basic propertied of SN and LSN are introduced in Section 2. The specified flow of moment matching is illustrated in Figure 3 . The mean, variance and skewness can be computed from SN distribution Equation (4) and definition Equation (16), respectively. Then, the three distribution parameters can be calculated by equaling the two equations, which are given as:
where Equations (19) and (27) can be transformed into
Then, according to Table 1 , the variability and maximum/minimum of delay can be calculated.
The delay variability and maximum/minimum delay value increase with decreasing temperature and voltage. However, due to the near-threshold regime with complicated current expression, a log-skew-normal distribution is adopted and its distribution parameters are obtained by moment matching, they have no explicit relationship with temperature and voltage. But these expressions can be verified by the following experiments and described in the next section.
Experiments and Result Comparisons
According to the above proposed delay variation model, experiments are carried out on the gates with a single transistor and stacked transistors under all kinds of metrics, such as delay variability, and maximum/minimum delay value, in SMIC40nmLL technology. For a single transistor, we take INV as an example; for stacked transistors, NAND2 is used as an instance. Because the proposed method is analytical, it has the inherent advantage in terms of runtime. Taking NAND as an example, under the same server resources, it takes 40 s to obtain three metrics by SPICE MC method, while it only needs 0.6 s by the proposed model in this paper. Therefore, it has increased by more than 60 times in runtime. For the perspective of accuracy, detailed comparisons with the similar analytical methods in [13, 16, 18] are done. Since they do not have the contrast of ±3σ point, the corresponding calculations are listed in Table 3 35 V than that of in 0.55 V, and have a tendency that they increase with decreasing temperature in all model. In all models, our LSN model and MC simulation match best with maximum error 5% in NAND. For INV, due to the same current formula with [18] , the delay variabilities are the same as each other, which are better than the result of LN distribution in [13, 16] . Besides, in terms of the complex gate at 0.55 V, the stacked threshold approximation is more suitable and the error is about 2% at the 0.55 V which is less than 10% and 20% in the others. It improves 5X at least in accuracy. In addition, the error keeps constant across the entire temperature, which shows the excellent stability. [18] performs best for INV and worst for NAND, but [13, 16] perform worst for INV and have the same accuracy with proposed method for NAND. Therefore, from overall view, LSN performs better in the two kinds of gates compared with the published works. Figure 6 shows the result of minimum delay. The four figures show that the minimum delays have same tendency with maximum, which are increase with decreasing voltage and temperature. From left figures, no matter under what kinds of gates and voltage, the LSN model and MC data show remarkable consistency with each other. More detailed error information can be seen from error figures. The error is almost 2% in all four conditions, which improves 5X-10X and shows a distinct advantage over the other two state-of-the-art works. Therefore, as for −3σ, the proposed model features more obvious advantages, because of the reasonable of LSN equivalent model and the effectiveness of the threshold approximation of stacked transistors.
Above all, LSN model proposed in this work outperforms the model in previous works under different voltages and gates across entire temperature and has a strong agreement with SPICE MC simulation result. Equation (25) The specified Equations (15) and (5) 
Conclusions
This paper proposes a LSN-based methods for current and delay variation model. In order to obtain the distribution parameters, the moment matching is adopted. Besides, a multi-variate threshold voltage approximation approach of stacked transistors is proposed and makes the method easily extend to stacked gate. The model is validated at different voltages and gates across the entire temperature. In addition, it also provides a detailed analysis on the dependence of the three important metrics (delay variability and maximum/minimum delay value)on the temperature, which are consistence to SPICE MC simulation with maximum delay variability 5% and performs very well , especially at minimum delay with a 5X-10X error improvement compared with other works. 
