Abstract-3D Integrated Circuits (ICs) have been recently proposed as a solution to the increasing wire delay concerns in scaled technologies. At the same time, technology scaling leads to increasing variability in manufacturing process parameters, making it imperative to quantify the impact of these variations on performance. In this work, we take, to the best of our knowledge, the first step towards formally modeling the impact of process variations on the clock frequency of fully-synchronous (FS) 3D ICs. The proposed analytical models demonstrate theoretically and experimentally that 3D designs behave very differently under the impact of process variations as compared to equivalent 2D designs. In particular, for the same number of critical paths, we show that a 3D design is always less likely to meet a pre-defined frequency target compared to its 2D counterpart. Furthermore, as opposed to models for 2D ICs, the 3D models need to accurately account for not only within-die (WID) critical paths, i.e., paths that lie entirely within one of the die layers, but also D2D critical paths that use throughsilicon vias (TSVs) to span across multiple dies in the 3D stack. Finally, we show, theoretically and experimentally, that the mapping of critical paths to the die layers of a 3D IC can also affect the timing yield of a design, while the mapping issue does not arise in the 2D case since there is only a single die layer in a 2D IC. The accuracy of the proposed models is experimentally verified and found to be in excellent agreement with detailed SPICE and gate-level Monte Carlo (MC) simulations.
I. Introduction
Wire delay has been shown to be an increasing fraction of gate delay with technology scaling, making it difficult to cross the length of a die in a single clock cycle [11] . 3D IC technology is a promising solution to the growing wire delay problem and is being aggressively pursued in industry [9] and academia [6] , [2] , [12] . While there exist numerous flavors of 3D IC technology, in this work, we concentrate primarily on stacked-die 3D integration. In this technology, each device layer is manufactured separately using a conventional 2D fabrication process and the planar (i.e., 2D) dies are subsequently stacked and bonded vertically on top of each other to realize a 3D IC. Wires can cross from one layer in the stack to another using die-to-die interconnects * This research was supported in part by NSF award CCR-0702451.
called through-silicon vias (TSVs). 3D integration is therefore able to significantly reduce the average and worst-case wirelength of a conventional 2D design, thereby alleviating the performance impact of slow global interconnects.
Along with wire delay, the increase in manufacturing process variations (PVs) is another major cause for concern in scaled technologies, since it leads to variations in the performance and power characteristics of fabricated dies. Moreover, process variations manifest themselves as both die-to-die (D2D) variations that affect each die differently but all transistors on a particular die in the same way, and within-die (WID) variations that affect each transistor on every die differently. From a performance perspective, there has been significant amount of prior work to accurately and efficiently model the impact of PVs on the cycle time (or clock frequency) of fully-synchronous 2D ICs. While a number of authors have proposed so called Statistical Static Timing Analysis (SSTA) algorithms to compute the process variation driven distribution of the maximum critical path delay for specific designs, [3] introduced the generic critical path (GCP) model that encapsulates the low-level implementation details of a circuit using two high-level parameters (the number of critical paths in the circuit and the number of stages per critical path), which are then used to compute the maximum critical path delay distribution. The authors use the proposed model to analyze the impact of technology scaling and increased design complexity on the mean and variance of critical path delay. Moreover, the GCP model has since been widely used in a number of studies at the micro-architecture and systemlevel to efficiently characterize process variations without having to resort to time consuming circuit level timing statistical simulation or analysis [14] , [15] , [13] .
Unfortunately, for a number of reasons that will be made clear later in the paper, the GCP model introduced by [3] cannot be simply extended for the case of 3D ICs. Therefore, we introduce for the first time, a generic critical path model for 3D ICs (3D-GCP) that allows us to analytically characterize the impact of process variations on the cycle time distribution for fully-synchronous 3D systems. The proposed models provide the following advantages: (1) they allow for efficient characterization of the impact of increased design complexity, increased integration complexity (i.e., more die layers) and increased magnitude of process variations on the maximum critical path delay distribution of 3D ICs; (2) they serve as efficient high-level process variation models that can be used at the micro-architecture and system level design abstraction; and (3) they serve to theoretically motivate and explain variability-aware, design time optimization approaches that are unique to the 3D IC design flow. The first two advantages are analogous to those provided by the original GCP model to 2D circuits, while the third is specific to the case of 3D ICs and arises from gate/module to die layer mapping flexibility that exists in 3D designs.
II. Related Work
Performance analysis of 3D ICs in terms of improvement in timing characteristics has been studied extensively before. [6] and [2] study the change in the wirelength distribution, and implicitly the increase in clock frequency, obtained by moving from a 2D to a 3D implementation. On a related note, [12] studies the improvement in system performance metrics such as latency and throughput obtained by reducing the average number of hops between modules using a 3D network-on-chip (NoC). However, none of these works have investigated the role of process variations on performance, which is the focus of our work.
Over the last few years, the analysis of impact of process variations on cycle time and frequency for 2D circuits has been the focus of extensive research. We point the interested reader to a comprehensive tutorial paper on this subject [7] . While some of the previously proposed SSTA algorithms could potentially be adapted to handle the case of 3D ICs, the goal of this work is more aligned with the high-level GCP model introduced by [3] . However, compared to our work, [3] focuses exclusively on conventional 2D circuits, which as we will show, cannot be trivially extended for the case of 3D ICs. Finally, the only other work to look at variations for 3D ICs [8] makes use of Monte Carlo simulations and has a very different goal, variabilityaware post fabrication assembly strategies, compared to our work.
III. Preliminaries and Assumptions
We begin by introducing the mathematical notation and assumptions about fully-synchronous 2D and 3D designs that we make in this work. Before introducing the proposed model for 3D systems, we will first briefly review the high-level GCP model proposed by [3] to study the impact of process variations for 2D systems. In particular, [3] assumes that a generic 2D circuit can be fully characterized, from a timing perspective, by N cp critical paths, where each path consists of a chain of n cp two-input NAND gates. Furthermore, the D2D component of process variations is modeled using a single random variable (RV) G and the uncorrelated WID component is modeled using a set of independent and identically distributed (iid ) RVs L ij (1 ≤ i ≤ N cp ,1 ≤ j ≤ n cp ) that represent the impact of WID random variations on gate j of critical path i. The maximum deviation in delay for a fully-synchronous 2D design, ∆T 2D max , can therefore be written as:
where α is the sensitivity of the gate delay to process variations. Now, assuming that variations in process parameters are normally distributed [3] 
, we can write the probability that ∆T 2D max ≤ τ as:
(2) where * represents the convolution operation, and f X () and F X () represent the probability density function (pdf ) and cumulative distribution function (cdf ) of RV X respectively.
To model 3D systems, we assume, without any loss of generality, a system consisting of L die layers 1 stacked on top of each other. Connections between die layers are accomplished using through-silicon vias (TSVs). As shown in Figure 1(b) , the critical paths in such a system can be classified as either:
• Within-die (WID) Critical Paths, i.e., paths that lie fully in one of the L die layers. The total number of WID critical paths in the system is represented as N
W ID cp
, with
). Each WID critical path is assumed to consist of n W ID cp gates. Finally, without loss of generality, the mapping function m() maps each WID critical path to one of the L device layers.
• Die-to-die (D2D) Critical Paths, i.e., critical paths that use TSVs to cross from one die layer to another. The total number of D2D critical paths in the system is represented as N . We note that while all the results that follow will be based on this canonical model for the critical paths in a 3D circuit, more complicated models (for example, one in which the number of stages for a WID critical path vary from one layer to another) can be analyzed using simple extensions of the proposed methodology.
As opposed to 2D systems where there is a single RV representing D2D variations, each layer in a 3D system has a different D2D variation component. We therefore define, for each layer i in the 3D system, a RV G i (1 ≤ i ≤ L) that corresponds to the impact of D2D variations in that layer. Furthermore, for homogeneous stacked-die 3D systems in which each die is fabricated in the same process technology, the RVs G i and G j (i = j) can be assumed to be independent and identically distributed (iid ) RVs. On the other hand, for heterogeneous integration, the D2D RVs of two layers fabricated in different processes may not be identically distributed, although they would still be independent. While the proposed framework can handle both cases, in the rest of the paper, we will focus specifically on the former case of homogeneous integration. Finally, the random WID component of process variation is modeled by RVs L ij and L ij that represent variations in the process parameters of gate j in the i th WID and D2D critical path respectively. For illustration, the example 3D circuit in Figure 1 (b) has been annotated with its corresponding D2D and WID RVs.
Under these assumptions, we can now write the variation in the critical path delay of the i th WID critical path, ∆T
W ID i
, as:
and the variation in the critical path delay of the i th D2D critical path, ∆T D2D i , as:
where, as before, α and β represent the sensitivity of a gate in a WID and a D2D critical path to process variations respectively. Note that in general, the sensitivities of WID and D2D paths to process variations can be different due to the differences in the electrical properties of wires and TSVs. In the experimental results section, we provide a detailed SPICE based characterization of these values. Finally, the maximum deviation in critical path delay for a 3D system ∆T 3D max can simply be written as: ∆T
IV. 3D-GCP Variability Model Having described the canonical structure of a 3D circuit in terms of a few high-level parameters, we would now like to derive analytical expressions for the cdf of the maximum deviation in critical path delay, ∆T 3D max in terms of these parameters. However, compared the conventional GCP model, the increase in model complexity for the 3D case is immediately clear. In particular, instead of only two parameters that were used to describe a 2D circuit, the canonical model for a 3D circuit has 2L + 3 parameters. In addition, as opposed to a single component of D2D variations, each layer of a 3D design has its own RV corresponding to D2D variations. Finally, the introduction of D2D critical paths, which have very different timing characteristics under process variations as compared to WID paths, adds another degree of complexity.
To reduce the complexity, we adopt a constructive approach towards developing the 3D-GCP model -we begin with analyzing 3D designs that consist of only WID critical paths and provide exact analytical expressions for the same. Next we introduce D2D critical paths in the system, which makes the computation of an exact analytical expression for the cdf of ∆T 3D max intractable. We therefore derive analytical stochastic lower and upper bounds on the cdf of ∆T 
By noticing that the maximum critical path deviation for each layer in the 3D system is independent of every other layer, we can write the cdf of ∆T 3D max , F ∆T 3D max (τ ), as:
Note that Equation 7 is the most general form of the analytical cdf for 3D ICs with only WID critical paths, i.e., it makes no assumption about how critical paths are mapped to die layers. To make the relationship between the cdf and the number of device layers L clearer, we examine the specific case in which the WID critical paths are evenly divided between the device layers, i.e.,
. Under this assumption, we can write F ∆T 3D max (τ ) as:
From this equation, it is clear that besides the exponential dependence of the cdf on the number of critical paths in the system, the cdf is also exponentially dependent on the number of layers, L, in the system. Furthermore, while D2D variations only impact the variance of the maximum critical path delay for 2D circuits [3] , in the 3D case, both the mean and the variance are impacted by D2D variations.
A.1 2D Vs. 3D
While the analytical results help to intuitively understand the difference between 2D and 3D circuits from a timing variability perspective, we now try to theoretically explore the difference between the two. Indeed, we prove that for the same number of total critical paths, and assuming the 3D circuit has no D2D critical paths, a 2D circuit is always more likely to meet a given timing specification τ than a 3D circuit, i.e., P r{∆T
The proof is based on the following result on comparing the maximum of multivariate Gaussian random vectors:
Proof: A detailed proof of this theorem can be found in [1] , and is not repeated here for clarity of exposition.
We note that the ≥ st symbol refers to a stochastic inequality: A ≥ st B implies that P r{A ≤ τ } ≤ P r{B ≤ τ }, ∀τ ∈ R. Using this result, we now prove formally the relationship between 2D and 3D circuits. 
Therefore, using the result from Theorem 1, this implies that ∆T
Significance: The number of critical paths in a design is a useful proxy for design complexity [3] . From this perspective, Theorem 2 implies that a 3D system is always worse impacted by process variations than a 2D design with the same design complexity. Therefore, while the theorem establishes process variations to be a potentially even greater cause for concern in 3D circuits, it should not, in general, be used to compare a specific 2D circuit with its equivalent 3D implementation, since a gate-level 3D place-and-route process may alter the number of critical paths in the design [6] .
A.2 Mapping of WID Critical Paths
Mapping of gates or modules in a design to die layers is an important step in the physical design flow for 3D ICs that affects both the final wirelength histogram [6] and the temperature profile [5] of the design. From Equation 7, we can see that the mapping of critical paths to die layers also affects the cdf of maximum critical path delayi.e., for the same number of total critical paths, different mappings of critical paths to die layers can produce different results. Indeed, we prove theoretically that, for a design with only WID critical paths, the worst case impact of process variations on critical path delay occurs when the critical paths are evenly divided between the die layers, i.e.,
Before describing the proof, we first define some useful notation. To explicitly consider the relationship between the cdf and the mapping of critical paths, we let
, where from Equation 7 we can see that:
Theorem 3: For a 3D system with L layers, N
W ID cp
critical paths, and zero D2D critical paths, P r{T
We begin by defining a function
. Now consider the following optimization problem:
subject to:
Since the log() function is strictly monotonic, R τ and Q τ achieve their minimum values at the same point. Furthermore, we observe that the function
) is convex because: (a) r τ (N ) can be shown to be log-convex 2 , and therefore, log(r τ (N )) is convex; and (b) the sum of convex functions is convex [4] . 
, we can restate this inequality as Q τ (S * ) ≤ Q τ (S 1 ), where S 1 can be any point in the feasible region of the optimization problem. In fact, barring the degenerate cases in which either σ L = 0 or σ G = 0, the convexity of the objective function implies that S * is a point of unique minima in the feasible region, in other words, Q τ (S * ) < Q τ (S 1 ) ∀S 1 = S * . Therefore, R τ (S * ) < R τ (S 1 ) ∀S 1 = S * Significance: Theorem 3 demonstrates that a mapping solution that is unaware of the interaction between the mapping of critical paths to die layers and process variations may result in a sub-optimal design -in other words, it introduces critical path mapping as an important element of the variability-aware design space for 3D ICs.
A.3 Summary of Results
The results presented so far are summarized in Figure  2 , which compares the cdf of maximum critical path delay for three cases: (a) a 2D design with four critical paths, (b) a 3D design with the same number of WID critical paths as the 2D design, with an uneven mapping of paths to die layers; in this case, the first layer has three paths and the second has one, and (c) a 3D design with the same number of WID critical paths as the 2D design, with critical paths evenly divided between die layers; i.e., with each layer having two critical paths. Based on the Theorem 2 and Theorem 3, we can conclude that the 3D design with an even mapping of paths, i.e., case (c), will always have the least likelihood of meeting a specified timing constraint, τ , while an equivalent 2D design, i.e., case (a), will have the highest likelihood of meeting timing constraints. This can be observed graphically in Figure 2 .
B. Modeling WID+D2D Paths
We now consider the most general case in which N 3D cp > 0, i.e., a 3D system with both WID and D2D critical paths. Unfortunately, in the general case, it is not possible to obtain a closed-form analytical expression for the cdf of ∆T E(∆T
Instead of providing exact analytical expressions for the cdf, we therefore concentrate on computing provable stochastic lower bounds and upper bounds on ∆T 3D max .
B.1 Stochastic Lower Bound
The lower bound on ∆T 3D max can be obtained by simply ignoring the impact of D2D critical paths on the maximum delay variation. Let ∆T LB max be the RV that stochastically lower bounds ∆T 3D max . We therefore write:
We now prove formally that the RV T LB max is indeed a guaranteed lower bound on the actual critical path delay distribution.
For any two random variables, X and Y , P r{X, Y } = P r{X}P r{Y /X} ≤ P r{X}. Therefore it follows that P r{max(∆T
While ignoring the D2D critical paths may seem to be a loose lower bound, we show experimentally that this is indeed not the case. Intuitively, this is because the path delay of a D2D critical path involves a summation over the D2D RVs G i , which reduces the standard deviation of delay of a D2D path with respect to a WID path, hence reducing the probability that a D2D path will be the speed limiting path of the design. In fact, this intuition can be proved formally:
and α = β, then probability that a WID critical path is the speed limiting path in the design is greater than the probability that a D2D path is the speed limiting path in the design, i.e., P r{max(∆T ), where the inequality is deterministically true, i.e., it is true for any realization of the RVs. Therefore, we can write:
Now, we can write the RV Z = {max(∆T
)} as:
In the above expression, n
cp G j is zero mean normal RV, and is therefore also symmetric about the ori-gin. Furthermore, max(L ij ) and max(L ij ) are iid RVs and therefore max(L ij )−max(L ij ) is a zero mean RV symmetric about the origin. Finally, since the sum of two zero mean symmetric RVs is also zero mean and symmetric, we know that the RV Z is zero mean and symmetric. Therefore, Significance: Theorem 5 shows that, assuming all other things being equal, i.e., for a system with the same number of WID and D2D paths and the same number of stages, a D2D path is always less likely to be the speed constraining path in the system as compared to a WID path. This is, as mentioned before, because of the averaging affect on a D2D path of crossing multiple die layers. From a design perspective, the result suggests allocating as many critical paths as possible as D2D paths as a strategy to mitigate the impact of process variations on performance, unless the sensitivity of a D2D path to PVs is significantly greater than that of a WID path (β >> α). We note that our results suggest that for typical interconnect and TSV dimensions, β ≈ α.
B.2 Stochastic Upper Bound
Let ∆T U B max be the RV that corresponds to the provable upper bound on ∆T 3D max . We will first provide the expression for ∆T U B max before explaining its physical significance. In particular we can write:
where
and
The physical significance of the upper bound can be understood by setting L j=1 βn j cp G j = G L+1 . In particular, the upper bound corresponds to a 3D system in which an additional dummy die layer L + 1 has been introduced and to which all the D2D critical paths in the system have been allocated.
The proof follows directly from Theorem 1 by observing that E(∆T
]) while all other elements of the two co-variance matrices remain unchanged.
From Equation 24 and Equation 25
, we can now write an analytical expression for the cdf of ∆T U B max as:
and s 4 = β n D2D cp .
B.3 Summary of Results
For the generic case of 3D systems with both WID and D2D critical paths, we observed that it is difficult to obtain a precise analytical expression for the cdf of the maximum critical path delay. Therefore, we derive theoretically guaranteed lower and upper bounds on the desired cdf, in other words, for any timing constraint τ , we provide a lower and an upper bound on the probability of meeting that constraint. Finally, to compare the relative impact of D2D and WID paths on the maximum critical path delay distribution, we show in Theorem 5 that all other things being equal, WID paths are more likely to be the speed-limiting path in a system than D2D paths.
Having described in detail the proposed models for 3D ICs with only WID critical paths and then for the more general case, we now present our experimental results that validate and demonstrate the effectiveness of the proposed 3D-GCP framework.
V. Experimental Results
We now present our experimental results that first validate the proposed models against detailed SPICE-based Monte-Carlo simulation and then investigate the application of the models to characterize the impact of variations on 3D designs.
A. SPICE-based Validation
To comprehensively test the accuracy of the proposed models, we compare the cdfs obtained from the analyt- 
TABLE I
Parameters for 3D Designs in Figure 3 Design N ical expressions with Monte Carlo simulations on input SPICE models of the canonical 3D circuits. In keeping with the methodology described by [3] , each critical path in the SPICE netlist is modeled as a chain of two-input NAND gates in a 90 nm PTM technology. Furthermore, wire (via) delays are inserted between gates that lie in the same layer (cross die layers) using a standard π model as shown in Figure 3 (a). The RC parameters associated with the wire models are computed using the average dimensions for Metal 2 wires reported in [16] , while vias are assumed to be 1.2µm × 1.2µm, with a 2.4µm pitch and 20µm length as reported in [9] . Furthermore, to equalize the nominal delay of the D2D and WID critical paths, the gate widths of the D2D critical paths are sized up. We observed that in practice, the additional delay introduced by a TSV with the given parameters is only slightly larger than the delay of a regular interconnect, requiring the gates on a D2D path to be sized up by 10% for a 2 layer design and 30% for a four layer design. Finally, variations in process parameters are modeled by introducing both D2D and random WID variations in gate length, each with a σ of 5% of the nominal value. Under these assumptions, we modeled two 3D systems with parameters as shown in Table I and performed 500 runs of Monte Carlo simulations in SPICE to determine the maximum critical path delay distribution for each design.
We note that we were restricted to relatively small circuits due to the prohibitive run-time and memory requirements of larger designs. Finally, using the models proposed in this paper, we computed the cdfs corresponding to the analytical lower and upper bounds for each design. The obtained cdfs for the two designs are shown in Figure 3 (b) and (c). It is clear that in both cases, the stochastic lower and upper bounds are in excellent agreement with the SPICE results. Furthermore, the upper bound behaves ideally for both cases, i.e., always predicts a lower yield than predicted by the SPICE-based cdf for any cycle time constraint, while the lower bound behaves ideally for all but high yield values (> 80%). The non-ideality arises because of two reasons: (1) The assumption made in the original GCP model about gate delays being linearly dependent on process variations parameters (also used in our model) is not perfectly accurate, although prior results and the results from Figure 3 indicate that this is still a good approximation, and (b) the cdf obtained from SPICE simulations itself has some inherent error since it is obtained from Monte Carlo simulations. Nonetheless, given the complexity of the SPICE transistor models, we believe that the results in Figure 3 demonstrate the effectiveness of the proposed methodology.
B. Design Space Exploration
Having validated the accuracy of the proposed models, we now use the models to explore the impact of various design decisions on the cdf of maximum critical path delay of 3D ICs and also verify experimentally, some of the theoretical results shown before. We begin by analyzing designs that consist of only WID critical paths and then look at the general case of designs with both D2D and WID critical paths.
B.1 WID Critical Paths Only
For all the results that follow, process variations are modeled by introducing both D2D and WID variations in the gate length parameter with a total standard deviation of 10% of the mean [7] , though we note that our framework is general and can handle other sources of variation as well.
represents the contribution of D2D variations to total variations. Finally, to reflect the design complexity in 90nm technology, we choose total number of critical paths in the all the designs that we evaluate to be 1, 000, as suggested by [3] .
We begin by looking at the impact of increasing the number of die layers in a 3D IC for the same number of total WID critical paths. In Figure 4 (a), we plot the percentage increase in the mean of maximum critical path delay with increasing number of die layers and for three values of γ = {0.25, 0.5, 0.75}. From the figure, we can see that for the medium and high values of γ, i.e., when D2D variations have moderate to high contributions to total variability, increasing the number of device layers has a significant impact on the mean critical path delay. For example, for γ = 0.5 the mean delay increases by 9.5% when going from a 2D design to a six layer design. On the other hand, when WID variations dominate, we observe that the impact is less pronounced. This result confirms our prediction from Theorem 2, i.e., in all cases we see that the 3D designs are worse than their equivalent 2D counterparts from a variability perspective.
Figure 4(b) shows the impact of different WID critical path to die-layer mapping on the cdf of maximum critical path delay. As proven in Theorem 3, we can see that the mapping in which the 1, 000 critical paths are equally split between the four layers, i.e., {250, 250, 250, 250}, stochastically upper bounds the other two mappings. In particular, the design with equally split paths provides only 36% yield at the 50% yield point of the best design, i.e., {950, 17, 17, 16} in which 95% of the critical paths lie in a single layer. We note that this is not an arbitrary design point - [10] have shown that up to 95% of the critical paths of a typical microprocessor lie in the cache arrays -from this perspective, the best design would correspond to a case in which the caches are implemented in one layer and the logic in the rest. We point out that while the 2D and 3D GCP models do not explicitly model timing variations for memory structures, [10] have shown that from a variation perspective, memory arrays can be equivalently modeled using chains of logic gates as well.
B.2 WID+D2D Paths
We now analyze the accuracy of the proposed stochastic upper and lower bounds for the more general case of 3D ICs with both D2D and WID critical paths. While previously we used SPICE based Monte Carlo simulations to obtain the actual cdf for small designs, we now use gate-level MC simulations (using sensitivity information extracted from SPICE) so as to analyze larger systems. Figure 5(a) shows the impact on mean critical path delay (relative to the nominal delay) of varying the fraction of D2D paths in the system from 20% to 80% for L = {2, 4, 6}. Also shown are the corresponding mean values obtained from the computed stochastic lower and upper bounds. We observe that except for the upper bound for the two layer design, all the other bounds are extremely tight.
We note that these results confirm the intuition developed from Theorem 5, i.e., as the fraction of D2D critical paths in the design increases, the mean of the critical path delay distribution improves. Finally, in Figure 5 (b), we plot the actual cdfs of maximum critical path delay for three cases, i.e., when D2D paths make up {10%, 50%, 90%} of the total critical paths in the design. While it is clear from the figure that the 90% D2D paths case has significantly higher timing yield than the 10% and 50% cases, we note that the 90% case would come with an associated overhead in terms of the number of TSVs required to implement the design.
VI. Conclusion and Future Work
In this paper, we present 3D-GCP, a high-level analytical model for the impact of process variations on stacked-die 3D ICs, that takes as input a canonical description of a 3D circuit encapsulated within a few parameters. Using this model we are able to provide an exact analytical expression for the cdf of maximum critical path delay for 3D circuits with only WID critical paths and guaranteed stochastic lower and upper bounds for circuits with both D2D and WID paths. The model is validated against SPICE based Monte Carlo simulations and demonstrated to be in excellent agreement with SPICE results. Furthermore, using this model, we prove theoretically that a 3D circuit is always less likely to meet a specified frequency target compared to a 2D design with the same number of critical paths. Finally, we show, theoretically and experimentally, that the mapping of critical paths to die layers and the ratio of WID to D2D critical paths in the design can both have an impact on the timing yield of the design.
As future work, we plan to look in to variability mitigation techniques for 3D ICs, both using variability-aware design time mapping of critical paths and variability-aware post-fabrication assembly techniques.
