Along with the fast development of dual-threshold voltage (dual-Vt) and multi-threshold technology, it is possible to use them to reduce static power in low-voltage highperformance circuits. In this paper, we propose a new method to realize CMOS digital circuits that are implemented with dual-Vt technology. We first present a new signalpath-level circuit model which effectively deals with the fact that there can be two threshold voltages assigned to a single gate. In order to assign proper threshold voltage to all the signal-paths in the circuit, our new algorithms introduce the concept of subcircuit extraction and include the hierarchy algorithms which are effective and fast. Experimental results show that our algorithms produce a significant reduction for the ISCAS85 benchmark circuits.
Introduction
With the growing scaling of integration and the increasing usage of battery-operated devices, power dissipation has become a critical issue of VLSI circuits and systems designs. It is especially true, in the design of portable and wireless electronic systems where power issues have already reached a bottleneck. The total power dissipation, consists of switching power, short circuit power and leakage power, can be expressed as:
where f is the operation frequency, V dd is the supply voltage, and N is the number of gates. α, C i , I l,i , and Q short,i are transition probability, load capacitance, leakage current, and short circuit charge of the ith gate, respectively. The behavior of the short circuit power dissipation remains at around 10% of the total power dissipation. 1 As we can see, lowering the supply voltage is the most effective way to reduce the total power dissipation. However, to maintain the performance at the lower supply voltage, the threshold voltage of transistors must be decreased to the same degree as V dd . Unfortunately, lowering the V t will lead to an exponentially increase in leakage current thereby leading to a dramatic increase in the standby power dissipation.
2
An approximate expression for the subthreshold current which is the main contributor to the total leakage current is given by 3 :
where V GS , V DS , and V SB are the gate-source, drain-source, and source-bulk voltages, respectively, V t0 is the zero bias threshold voltage, and A, γ, η, k, T and n are technology-dependent constants.
With the development of the fabrication technology, leakage power dissipation has become comparable to switching power dissipation. 4 At the 90 nm technology node, leakage power may make up 42% of total power.
5
The rest of the paper is organized as follows. In Sec. 2, the overview of leakage control methods is presented. In Sec. 3, we give out preliminaries including our new circuit model, delay models and leakage power models. The problem definition is provided in Sec. 4 . The details of our algorithms are presented in Sec. 5 . The implementation and experimental results are given in Secs. 6 and 7, respectively.
Overview of Leakage Control Methods

Leakage power control techniques
Inevitably, techniques are necessary for reducing the increasing leakage power. These leakage control methods can be broadly categorized into two main categories: process-level and circuit-level techniques.
At the process-level, leakage reduction can be achieved by controlling the dimensions (length, oxide thickness, junction depth, etc.) and doping profile in transistors. Here we mainly talk about circuit design techniques. There are several circuit design techniques, namely, input vector control, 6 power gating 7, 8 and multi-V t design. The input vector control method suffers from inefficiency with large circuits and extra control logic which brings power and area overloads, and finding the minimum leakage vector is still an NP problem. In power gating method, the extra area and delay due to the insertion of sleep transistors have considerable influence on the circuit performance. Furthermore, with the supply voltage scaling down, it is becoming harder to turn the circuit on under a very low supply voltage. The multi-V t design method includes VTCMOS, 9,10 DVTS 11,12 and dual-V t assignment. Both VTCMOS and DVTS suffer from large area and power penalty due to the extra control logic. The circuit using DVTS also suffer from increasing substrate capacitance. The substrate noise becomes another problem.
Among these, the dual-V t process, which allows both low-V t and high-V t transistors on the same chip, is commonly used.
13 A dual-V t assignment method means that a higher threshold voltage can be assigned to some of the transistors in the noncritical paths, in order to reduce the leakage current, while the performance is maintained due to the low-V t transistors in the critical paths. A source-to-well reverse bias can be applied to some transistors to achieve high thresholds. Furthermore, a dual-V t MOSFET process was developed, 18 which makes the implementation of dual-V t logic circuits more feasible. Dual-V t method results in a significant reduction in total power dissipation and energy. Therefore, determining which gate should be the high-V t becomes a major emphasis in the research field.
Dual-V t optimization review
The method described in Ref. 14 is, for the first time, using the idea that some high-V t transistors are assigned in the noncritical paths. All the transistors within the gate are either at V T Hhigh or at V T Hlow . Each gate is checked where it can be changed to V T Hhigh without decreasing the minimum slack over all the gates. This method finds a subset of gates which can be transformed to V T Hhigh . In Ref. 15 , a method is presented to gain a "near optimal approach" which has further reduction of leakage power.
While these two methods demonstrated significant savings in leakage power without degradation in performance, they have shown significant drawbacks too.
16
Their selected gates to be transformed into V T Hhigh are not sufficient. In fact there are more gates that can be assigned. Notice that after the assignment of V T Hhigh to a gate, the critical path may change. This dynamic change in critical path has not been taken into consideration in Refs. 14 and 15.
In Ref. 16 , a different idea is presented which initialized the circuit by assigning a high threshold voltage to all the gates of the circuit, i.e., it essentially configures the circuit to give the minimum power. The algorithm selects a gate which is on the critical path and then assigns V T Hlow to it. Every time a gate changes, an update of the whole circuit is necessary. The algorithm iterates until there exists at least one gate on the critical path, which is yet to be assigned with V T Hlow . This method gets better assignment than the two mentioned before, but it has to reiterate the whole circuit every time we decide whether a single gate can be changed into V T Hlow or not.
Three algorithms are presented in Ref. 17 . Algorithm 1 is very similar to the previous one in Ref. 16 . Every edge in the circuit graph has a weight in order to decide which gate should be changed. Algorithm 2 considers the signal probability for each node, and reduces the delay subject while minimizing the increase in the standby power. The problem of finding an optimal V T Hhigh gate assignment is NP complete, and in Algorithm 2, an iterative improvement procedure called Swep is carried out as an escape from a local optimal solution. However, the drawback in Ref. 16 that one has to reiterate the whole circuit every time one gate changes still remains in these two algorithms. Algorithm 3 brings an improved version of Algorithm 2. After initializing all the gates with V T Hhigh , gates in critical subcircuits are changed into V T Hlow to meet the timing constraints. Algorithm 2 is used to decide which gate in the subcircuits can be changed into V T Hhigh to consume less standby power dissipation. The experimental results are almost the same as Algorithm 2, while the CPU time is up to two times less.
The possibility of different transistors having different threshold voltages within a logic gate is not considered in any of the above algorithms. In Ref. 19 , a methodology for MVT (mixed-V t ) CMOS circuit design is presented. For MVT (mixed-V t ) CMOS circuits, the transistors within a gate can have different threshold voltages with certain process constraints. Therefore, more transistors can be assigned to V T Hhigh and larger leakage current reduction can be achieved. However, the algorithms to assign the V T Hhigh encountered the same drawbacks with the methods described in Refs. 14 and 15.
Our algorithms
In this paper, we assume that all the gates are using the low threshold voltage in order to get the best performance (timing characteristic). The signal-path-level circuit model we used is different from the circuit model which consider a gate as a vertex in a graph. This is to make our algorithms useful for transistor-level leakage control. We use look up table method in our signal-path-level static timing analysis to get the critical paths and noncritical paths of the circuit much faster and with more accuracy. The gates in the critical paths will remain unchanged to maintain the performance; and the gates in the noncritical paths are extracted into several subcircuits. Without reiterating the whole circuit, we focus solely on the subcircuits in which we use new heuristic algorithms to get an optimal result faster.
Preliminaries
Signal-path circuit model
A combinational circuit is represented by a directed acyclic graph (DAG) G = (V, E). Traditionally a vertex v ∈ V represents a CMOS transistor network which realizes a single output logic function (a logic gate), while an edge (i, j) ∈ E, i, j ∈ V represents a connection from vertex i to vertex j. In this way, the transistors within a vertex that are driven by the same logic signal will be assigned to the same threshold. The assignment of threshold voltages to the transistors in the circuit can be represented as assigning a threshold voltage to a vertex v ∈ V , [14] [15] [16] or assigning a threshold voltage to an edge. 17 Thus, this allows treating the dual-V t optimization problem as a kind of graph problem. It greatly simplifies delay analysis and standby power estimation during V t assignment. The effects on delay when a V t change is made can be easily modeled by static timing analysis (STA). In Fig. 1 , a combinational circuit is presented at the left side ( Fig. 1(a) ); the traditional circuit model is at the right side ( Fig. 1(b) ).
In our circuit model, a vertex v ∈ V represents a pin of a CMOS logic gate or a primary input/output; an edge (i, j) ∈ E represents a connection from vertex i to vertex j. In our model, an edge is the abstraction of a wire connecting two gates or a signal-path in a logic gate from one of its input pins to an output pin. Furthermore, we have added a virtual input vertex and a virtual output vertex to our model. The virtual input vertex is connected to all the primary inputs (PIs) and the virtual output vertex is connected to all the primary outputs (POs). The fan-in of a logic gate's input pin refers to the number of pins which connect this input pin. The fan-out of a logic gate's output pin refers to the number of pins which is connected with this output pin. Pins which have a fan-in of zero constitute primary input pins; similarly pins which have a fan-out of zero constitute primary output pins. Figure 1 shows the traditional graph abstraction and our signal-path circuit model ( Fig. 1(c) ) of circuit C17 from ISCAS85 benchmark. If vertex i ∈ V represents one of input pins in gate A and vertex j ∈ V represents gate A's output pin, we define edge (i, j) ∈ E as a "signal-path" and this signalpath belongs to gate A. There are several reasons for using this new circuit model. Firstly, the signal arrival time may be different for every input pin of a gate. More detailed delay information for every gate is presented since the delay information for every pin of the gate is computed by STA. Secondly, through the definition of signal-path, it is possible to have transistors with different V t in a single gate at the same time, which means transistors in every signal-path of one gate may have different V t . Thus, the dual-V t optimization problem is changed into an assignment of high-V t to the possible signal-paths. If we neglect the possibility of assigning different threshold voltage to signal-paths which belong to the same gate, it will get the same solution as previous methods.
14 The edge E in the graph represents two kinds of connections. One is "signal-path", the other is the connection of two pins belonging to different gates respectively which represents a wire between two pins in most cases. Hence, it is possible to consider the interconnect delay during STA in order to get more accurate model of the circuit.
Delay model
In order to get the delay attributes, we levelize the vertexes in the graph, make sure every two vertexes belong to the same level have no edges between them. Each pin's fan-ins are not at the same level as itself, its fan-outs are not either; thus an edge (i, j) ∈ E's two vertexes i, j ∈ V are not at the same level. The delay of an
We define three attributes for every vertex v ∈ V , they are namely, the arrival time t a (v), the required time t req (v), and the slack time t slk (v). The arrival time t a (v) is the worst case of delay from the primary inputs to pin v. t req (v) is the latest time the signal needs to arrive at pin v. We define them as:
By comparison to the traditional circuit model, the arrival time of a gate is the maximum of its input pins' arrival time, and the required time of a gate is its output pin's required time (if the gate is a CMOS transistor network which realizes a single output logic function). The slack time of a gate is also defined as the difference of its arrival time and the required time. The critical path of the circuits is constituted by the set of gates that has the minimum slack time value. We define every edge (i, j) ∈ E, i, j ∈ V in the graph G also has the attribute s i,j which represents the slack time of the edge:
Signal-Path-Level Dual-Vt Assignment for Leakage Power Reduction 203
Finally, the slack time of a vertex v ∈ V is defined as the minimum slack time of its fan-in edges:
In our delay model, we define the critical path of the circuits as the set of edges that has the minimum slack time value. If there is no negative slack in the circuit, then timing constraints are satisfied. 20 The delay of a circuit is computed by STA tools under the signal-path-level.
Gate delay data are obtained by a table of the gate delay for standard cells which is provided by the IC manufacturers. We use a circuit scheme for the implementation of each signal-path. Consider a two-input NAND gate, Fig. 2 shows the four conditions of the threshold voltage changes. The original NAND gate with all the transistor having low threshold voltage is given in Fig. 2(a) ; Figs. 2(b) and 2(c) show how one of the two signal-paths' threshold voltage changes. If both signalpaths in the NAND gate can be changed, then all the transistors in this gate are changed into high threshold voltage and this is illustrated by Fig. 2(d) .
Notice that every signal-path in the same gate can have different delay difference when it changes between high threshold voltage condition and low threshold voltage condition; and when several signal-paths can be simultaneously changed in one gate, the delay difference is even more complicated because of the infections between the changed signal-paths. Here, we select the largest delay difference of all the signalpaths' change schemes as the reference delay difference of the signal-path in this kind of gate. The signal-path delay data are then derived from the look up table of the standard cells and HSPICE simulation.
Leakage power model
Leakage power of a large scale circuit can be estimated by the summation of every gate leakage power. As each gate may have several signal-paths, the leakage power change due to the signal-paths' threshold voltage change should be well estimated. Our circuit model makes it possible to assign different threshold voltages to each signal-path of one logic gate.
Using HSPICE and a typical library for each circuit scheme of the signal-path, we can create a table of leakage power for the signal-path's threshold voltage change. Consider the two inputs NAND gate again. It has three kinds of changes: no signalpath is changed, one of the two signal-paths is changed, and all the signal-paths' threshold voltage is changed. Table 1 shows the standby power for a two-input NAND for the four signal-path change schemes according to Fig. 2 .
When all the signal-paths are changed in a gate with two signal-paths, the leakage power saving is larger than twice the leakage power saving of changing only one signal-path in that gate. We also find out that the leakage power change due to only one signal-path's change is always the same and furthermore, if there are k signal-paths which can change their threshold voltage in a gate with w signal-paths (k < w), no matter how to choose the k signal-paths, the power change due to k signal-paths' threshold voltage change is always the same. The leakage power saving due to k signal-paths' threshold voltage change is nearly the same as k times the leakage power saving due to only one signal-path's change. However, if all the w signal-paths in the gate is changed, the leakage power saving is larger than w times the leakage power saving due to only one signal-path's change. Finally, we use two values to represent each signal-path's leakage power attributes: the larger one is for all the signal-paths in that gate can change into high threshold voltage, and it equals to the leakage power saving due to the gate's threshold voltage change divided by the number of signal-paths in the gate; and the smaller one is for other conditions which equals to the leakage power saving due to only one signal-path's change. We do not consider the signal probability at each pin of the gates, and we may use logic simulation or local probability propagation in our future work to make it possible to combine transistor stacking effects with the circuit analysis to get a more accurate leakage power estimation table.
Problem Definition
We first give some definitions to represent attributes of the above models. Transistors in every signal-path can have different threshold voltage V t , thus different V t is represented by labeling each signal-path by x i,j , where x i,j = 0 means that the transistors in signal-path (i, j) ∈ E, i, j ∈ V have a low threshold voltage, i.e., V t = V T Hlow ; x i,j = 1 means that the transistors in signal-path (i, j) ∈ E, i, j ∈ V have a high threshold voltage, i.e., V t = V T Hhigh . Assuming there are L kinds of gates in the given circuit. We define ∆D i,j (k) as the difference between d i,j of signalpath (i, j) ∈ E with V T Hhigh and V T Hlow . 1 < k ≤ L represents the signal-path's type associated with the gate type. We use ∆P i,j (k) to represent the signal-path's leakage power saving attribute where k also represents the signal-path's type. As we mentioned before, ∆P i,j (k) may have two values under different circuit scheme.
The dual-V t optimization is generally defined as a problem to assign one of two threshold voltages, V T Hhigh and V T Hlow , to each transistor, to satisfy the timing constraints. Thus, the problem can be formally expressed as:
or max
Y. Wang, H. Yang & H. Wang
where λ(i, j, k) of a signal-path (i, j) ∈ E, i, j ∈ V is defined as:
In order to select the signal-path which can lead to larger leakage power reduction, we also define the priority for signal-path (i, j) ∈ E, i, j ∈ V whose slack time is not zero as following expression:
Notice that this priority of a signal-path may have two values, since ∆P i,j (k) may have two values. Changing high priority signal-paths to high threshold voltage will get high return because it achieves leakage power reduction at low delay penalty.
As we described before, if we neglect the possibility of assigning different V t to signal-paths belongs to the same gate, we will get the same solution of the dual-V t gate-level assignment problem.
The Algorithm
Initialization
We have assumed the DAG representation G(V, E) of a signal-path-level combinational circuit. This graph is levelized to indicate the depth of the vertex in the graph. The level of the virtual input (source vertex) is defined to be 0 and it is also labeled as 0. Therefore, the level of any vertex v ∈ V , l(v), is defined as:
and the level of any signal-path (i, j) ∈ E, i, j ∈ V , l(i, j) is also defined as:
The algorithm for levelizing a graph G(V, E) is given below: We initialize the circuit by assigning a low-threshold voltage (V THlow ) to all the signal-paths of the circuit, i.e., it essentially configures the circuit to have the minimum delay. In the initialization procedure, we decide the delay attributes of every vertex and edge in the graph: the arrival time t a (v), the required time t req (v) and the slack time t slk (v), the edge slack time s i,j , the edge propagation delay d i,j . All these attributes can be calculated using static timing analysis and the formula we have denoted before. The fan-ins of a vertex are the former level vertexes which are connected with this vertex; the fan-outs of a vertex are the next level vertexes which are connected with this vertex.
Since every edge has a slack time, we extract all the nonzero slack time edges to construct a set of subgraphs G sub1 , G sub2 , . . . , G subn . The critical paths' delay attributes are not affected when the V t of some signal-paths on noncritical paths are changed. Therefore, the assignment of the V t in the whole circuit is decomposed into several small problems, which have much smaller solution space and thus are more easier to get the optimal assignment of V t . When the V THlow in the circuit is given, we will just focus on the subgraph to decide the optimal high threshold voltage value without reiterating the whole circuit.
The algorithm for the initialization Initialization (G) is given below:
Initialization(G)
1 Assign V THlow to each signal-path; 2 Perform static timing analysis, determine all the delay attributes for the circuits; 3 Extract Subgraph (G sub1 , G sub2 , . . . , G subn ) of noncritical paths.
Assignment of dual-V t
The assignment of dual-V t to the whole circuit is converted into the assignment of dual-V t to several subgraphs. Two methods are given to assign dual-V t to signalpaths in a subgraph. In the third part of this section, we will consider gates other than signal-paths as the optimization object.
Algorithm 1: Forward depth-first low-V t assignment
This first algorithm starts with all the signal-paths being high threshold voltage in the subgraph. Our purpose is to select the signal-paths, which can be assigned to low threshold voltage in order to decrease the delay of the circuit. Thus, we try to assign low threshold voltage to the signal-paths on the critical paths. Using the delay attributes which are gained by the former STA process, we will get the arrival time of the primary inputs and the delay constraints of the subgraph. When we perform the STA to the subgraph; we will get a new set of delay attributes for each vertex and signal-path. Algorithm 1 uses a depth first signal-path selection from the primary outputs to the primary inputs and then assign low threshold voltage to them. The algorithm to a subgraph is similar to the gate-level algorithm presented in Ref. 16 which can assign more high-threshold gates on the noncritical path than the algorithm presented in Ref. 14. We update the delay attributes every time a change occurs in order to get the correct delay attributes of other unvisited signal-paths. 
//Get the vertex i with maximum
// push i into queue Q Update the delay attribution of the subgraph G sub ; } 6 Do 2-5 until all the signal-paths in the critical paths are V THlow .
Algorithm 2: Priority-based high-V t assignment
The dual-V t optimization problem in the subgraph can be regarded as an optimal slack distribution in a subgraph in which every signal-path has a positive slack time value. Levelize the subgraph based on the signal-paths and label every signal-path again. Assume the subgraph G sub (V sub , E sub ) has n levels. Since the subgraph is also a DAG, if we consider any signal flow from one primary input to one primary output, we will have: 
t slk (P O) is the slack time of the primary output pin on the path which passes the signal. Therefore, the slack distribution in subgraph G sub (V sub , E sub ) can be expressed as:
Signal-Path-Level Dual-Vt Assignment for Leakage Power Reduction 209
The second algorithm aims to find an optimal solution to satisfy the constraint above in a fairly fast way using a priority-based method. This method starts with all the signal-paths of the subgraph in V THlow configuration. The objective here is to reduce the standby power as much as possible without increasing the delay. The main idea in the algorithm to achieve the objective is to change the signal-paths with high priority as much as possible without delay influence. So, we select the level with highest priority in the subgraphs and then change the threshold voltage of the signal-paths in that level. This procedure is clearly illustrated in Step 2 of algorithm Deal with subgraph(G sub ). Notice that we only add the priority of all the signal-paths with only one fan-out to gain the priority for each level. The reason is changing the V t of a signal-path with only one fan-out has a much smaller effect on slack attributes on the graph compared to changing the V t of a signal-path with multiple fan-outs. When signal passes the pth level signal-path (i, j) to the primary outputs, the minimum slack time of these primary outputs is defined as t 
Gate-level optimization
If we do not consider the condition that signal-paths in the same gate can have different threshold voltages, we can get the solution for gate-level dual-V t optimization. Therefore, during the subgraph extraction, we will only consider the gates in which all the signal-paths' slack times are positive. It could be easily realized by mapping a whole gate to a single vertex in the graph. The arrival time of the gate is the maximum of the arrival times of the gate's input pins. The required time of the gate is the output pin's required time. The slack time of the gate is the difference between the arrival time and the required time of the gate. Through a little change in Algorithms 1 and 2, we can get the gate-level optimization of the circuits.
Get optimal V THhigh
Due to the exponential relationship between threshold voltage and substrate leakage current, a higher threshold voltage will significantly reduce the leakage power. However, the higher threshold voltage will result in a higher propagation delay. The high threshold voltage is empirically assumed to be 0.2V dd < V THhigh < 0.5V dd .
16
Typical value of V THlow is 0.2V dd due to the noise margin and other parameters constraints. 21 Thus, it is important to decide the value of high threshold voltage. If the value of the V THhigh is close to the value of the V THlow , then there will be much more signal-paths that can be changed into V THhigh , and it also gives small leakage current improvement. On the other hand, if the V THhigh is close to 0.5V dd , there might be less signal-paths that can be assigned with V THhigh despite the fact that each of them will bring a large amount of leakage current reduction. Therefore, there must be an optimal V THhigh corresponding the largest saving of the whole circuits.
The algorithm of obtaining the optimal V THhigh is given blow:
1 Initialization(G); // get subgraphs consisted of signal paths // with nonzero slack time 2 For all the subgraph {G sub } {
Estimate the standby leakage power P leakage ; If (P leakage < P min ) { P min = P leakage ;
The V THhigh opt is the optimal value of V THhigh .
V THhigh start and ∆V t here depends on the technology. The algorithm is only dealing with the subgraphs. Thus, it is much simpler and uses less computation time and space compared to the methods stated in Refs. 14 and 16.
Implementation
The above algorithms have been implemented in C++ under signal-path-level static timing analysis environment. The value of various transistor parameters have been taken from the TSMC library, the effect channel length is 0.13 µm and the gate oxide thickness is 2.4 nm. The circuit temperature is assumed to be 110
• C. The leakage power table and delay look-up table is created by HSPICE simulation. In our analysis, the low threshold voltage and the supply voltage of the original circuits are assumed to be 0.2 V and 1.2 V, and high threshold voltage during the dual-V t optimization is assumed to be 0.3 V. In the optimal high-V t acquirement algorithm, the high-V t changed from 0.25 V to 0.7 V.
Experimental Results
First, we can easily get the optimized circuit of C17 which belongs to ISCAS85 benchmark circuits. In Fig. 3 , the signal-paths labeled in red can change their threshold voltage into high threshold voltage. If we perform gate-level optimization to C17, only NAND A can be changed into high threshold voltage. The leakage power saving of C17 is respectively 16.3% and 28.7% for gate-level and signal-pathlevel optimization. Figure 4 shows the leakage power savings for ISCAS benchmark circuits using Algorithms 1 and 2. Gate-level and signal-path-level optimization's lead to different results for leakage power saving, and obviously more leakage reduction can be achieved through signal-path-level optimization since there are actually more transistors in the implementation of the circuit which can be assigned to high threshold voltage.
The shortage of gate-level algorithms 14−16 was addressed in the introduction part, here we only compare the gate-level algorithm derived from Algorithm 2 (Priority-based(PB) High-V t Assignment) with the signal-path-level Algorithm 2. Obviously the signal-path-level algorithm will take more memory and computational time, for the DAG extracted from the circuit is several times larger than the one in gate-level. Since we perform the same signal-path-level STA process, the gate-level algorithm is taking some extra time to extract gate-level timing attributes which leads to smaller gap comparing to the signal-path-level algorithm. As we can see from Table 2 , by introducing the subcircuit extraction concept, signallevel algorithm takes approximately 4.2X times larger memory and 1.6X more time than the gate-level algorithm. Thus our signal-level algorithm is comparable with gate-level algorithm, meanwhile achieves about 12% more average leakage power reduction.
22 Table 3 reports the leakage power savings and CPU time of different algorithms for signal-path-level dual-V t assignment. The results indicate that Algorithm 1 (Forward Depth-First (FDF) Low-V t Assignment) takes more CPU time. On the contrary Algorithm 2 (PB High-V t Assignment) spends much lesser CPU time with lower leakage reduction. The optimal high threshold voltage for ISCAS85 benchmark circuits is shown in Fig. 5. 
Conclusion
In this paper, we have proposed a new circuit model for combinational circuit. We have given two algorithms for the assignment of high threshold voltage to a maximum number of signal-paths defined in our new circuit model without violating the delay constraints. The algorithms are sped up by the proper extraction of subgraphs. By using a delay look-up table and a leakage power table generated by HSPICE simulation, we find that approximately 12% more leakage power savings can be achieved under the signal-path-level optimization than the gate-level optimization.
