In this chapter, we discuss yield loss mechanisms, yield analysis, and common physical design methods to improve yield. Yield is defined as the ratio of the number of products that can be sold to the number of products that are manufactured. The estimated cost of a modern 300mm (12 inch) wafer fabrication plant for a 0.13 µm process is $2-4 billion, and a modern integrated circuit typically requires more than 150 processing steps.
Introduction
Total yield for an integrated circuit, Y_total, can be expressed as follows [3]:

Y_total = Y_line × Y_batch
Here Y_line denotes line yield or wafer yield, the fraction of wafers which survive through the manufacturing line. Y_batch is the fraction of integrated circuits on each wafer which are fully functional at the end of the line. A steep yield ramp means a quicker path to high batch yield and hence to volume production. Earlier volume production means higher profitability for the semiconductor manufacturer in today's market with its time-to-market pressures.
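As a worked example with hypothetical numbers: if Y_line = 0.95 and Y_batch = 0.80, then Y_total = 0.95 × 0.80 = 0.76, i.e. even healthy line and batch yields compound to a noticeably lower total yield.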
Y_batch can be further classified based on either the type of defect or the type of failure. The failure-type taxonomy is as follows.

• Catastrophic Yield Loss. These are functional failures, such as open or short circuits, which cause the part to not work at all. Extra-material or missing-material particle defects are the primary causes of such failures. A picture of a bridging fault, drawn from [63], is shown in Figure 1. Critical area analysis is used to predict this type of yield loss and is discussed later in this chapter.
• Parametric Yield Loss. Here the chip is functionally correct but fails to meet some power or performance criterion. Parametric failures are caused by variation in one or a set of circuit parameters, such that their specific distribution in a design makes it fall out of specifications. For example, parts may function at a certain VDD but not over the whole required range. Another example source of parametric yield loss is leakage in deep sub-micron technologies [28]. Parametric failures may be caused by process variations. Several kinds of integrated circuits are speed-binned (i.e. grouped by performance). A common example of such a class of designs is microprocessors, wherein lower-performance parts are priced lower. The other class is typical ASICs, which cannot be sold if the performance is below a certain threshold (for example due to compliance with standards). In the latter case, there can be significant performance-limited yield loss, which is why such circuits are designed with a large guardband. In the former case too, there can be significant dollar-value loss even if there is little yield loss.
Additionally, there is testing-related yield loss, as no testing process can detect all possible faults (and potential faults). Such yield loss is related to defect level (e.g. see [39]) and field returns (e.g. see [57]). We will not include such yield losses in our discussion as they are not physical design related. Another aspect of field returns is long-term reliability of designs (e.g. see [21]). Reliability is typically treated as a separate topic, and we will discuss yield loss only in its most common definition: the number of bad parts at the end of the manufacturing line.
Defect types can be classified as follows.
• Random Defects. These are randomly distributed defects, such as those caused by particle contamination.
• Systematic Defects. These kinds of defects are predictable. Example sources include CMP (Chemical Mechanical Polishing) and photoresist pattern collapse.
It is important to understand that both random and systematic defects can cause parametric or catastrophic yield loss. For example, lithographic variation, which is typically systematic and pattern dependent, can cause catastrophic line-end shortening, leading to a gate (polysilicon over diffusion) not forming and hence a functional failure. A less drastic rendition of lithographic variation is gate-length variation causing gates on critical paths to speed up too much, leading to hold-time violations under certain voltage and temperature conditions. Various defect types and failure modes are shown in Figure 2. Systematic-mechanism-limited yield loss is projected to be the dominant source of yield loss in current and future technology generations [3].
The decision at the IC manufacturing site as to which parts are not working and should be discarded is an important one. Though this is more a discussion of testing and testability, a very brief introduction is essential to understand final yield measurement at the foundry.
For a more detailed discussion, see [4]. Inline process monitoring is another common way to make sure that the fabrication line is running fine. It is also used for characterization purposes (e.g. to characterize process variation). The most common way to accomplish this is to place and measure simple test structures such as ring oscillators in the scribe-line area of the wafer (i.e. the empty area on the wafer between functional chips). Such measurements are done by wafer-level probing and do not require dicing and packaging of the structures. In addition, scanning electron microscope (SEM) measurements of critical dimension (CD, the commonly used term for the smallest and hence most critical linewidth in the design) are also used for inline monitoring.
Sources of Yield Loss
As mentioned earlier in the chapter, yield loss can be due to systematic as well as random defects. Contamination-related spot defects are discussed later in the chapter. In this section we focus our attention on variations. There are several ways to classify variations depending on the axis of classification:
• Process vs. Environmental. Variations occurring during circuit operation (e.g. temperature, power supply, etc.) are environmental in nature, while those occurring during the manufacturing process (e.g. mask misalignment, stepper focus, etc.) are process variations. We will focus only on process variations.
• Systematic vs. Random. As discussed earlier, systematic variations (e.g. metal dishing, lithographic proximity effects, etc.) can be modeled and predicted, while random variations (e.g. material variations, dopant fluctuations, etc.) are inherently unpredictable.
• Inter-die vs. Intra-die. Depending on the spatial scale of the variation, it can be classified as die-to-die (e.g. material variations) or within-die (e.g. layout-pattern-dependent lithographic variation). Inter-die variations correspond to variation of a parameter value across nominally identical dies. Such variations may be die-to-die, wafer-to-wafer or even lot-to-lot. Inter-die variations are typically accounted for in design by a shift in the mean of a parameter value. Intra-die variations, on the other hand, correspond to parameter fluctuations across nominally identical circuit elements such as transistors. Intra-die perturbations are usually accounted for in design by guardbanding and prevention; a simple decomposition of measured data into these two components is sketched below. Variation compensation in design is further discussed in the next section.
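To make the inter-/intra-die split concrete, the following minimal sketch decomposes measurements into a grand mean, per-die offsets (the inter-die component) and within-die residuals (the intra-die component). The data layout, one row per die and one column per within-die measurement site, and all numeric values are hypothetical:

```python
import numpy as np

# Hypothetical measurements: rows = nominally identical dies,
# columns = measurement sites within each die (e.g. gate length in nm).
rng = np.random.default_rng(0)
true_mean = 65.0                              # nominal gate length (nm)
die_offsets = rng.normal(0, 2.0, (20, 1))     # inter-die variation
within_die = rng.normal(0, 1.0, (20, 16))     # intra-die variation
meas = true_mean + die_offsets + within_die

grand_mean = meas.mean()
die_means = meas.mean(axis=1, keepdims=True)

inter_die = die_means - grand_mean            # per-die shift of the mean
intra_die = meas - die_means                  # residual fluctuation within each die

print(f"grand mean      : {grand_mean:.2f} nm")
print(f"inter-die sigma : {inter_die.std():.2f} nm")
print(f"intra-die sigma : {intra_die.std():.2f} nm")
```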
An interesting point to note here is the level of abstraction for sources of variation. For logic designers, variation may manifest as cell-delay or transistor-delay changes. Such modeling is evident, for example, in most statistical timing analysis tools (e.g. [51, 5, 72]). For circuit designers, the level of abstraction may go down to (say) transistor gate-length variation, which leads to cell or transistor delay variation. Going further down, a lithographer may attribute critical dimension (CD) variation to focus variation, which may in turn be blamed on wafer flatness imperfections.
Variation in process conditions can manifest itself as dimensional variations or material variations. Dimensional variations include the following.
• Lateral dimension variation (e.g. linewidth or gate CD variation arising from lithography and etch).

Path-based statistical timing methods work on a set of critical paths instead of the full design and as a result are better equipped to handle arbitrary distributions and correlations using Monte Carlo simulations. (The key operations when propagating delay distributions through a timing graph are the sum, min and max of random variables.) Correlations, whether spatial, logical or otherwise, play an important role in such statistical timing analysis. From a foundry perspective, it is very difficult to characterize the process to identify all sources of variation and their magnitudes, compute correlations between these sources, and determine the spatial scale over which they extend. To add to the complexity, most of these sources of variation have very systematic interactions with layout and cannot be easily split into inter- and intra-die components. Nevertheless, with the magnitude and sources of variability increasing, statistical power and performance analysis coupled with accurate modeling of systematic variations will lead parametric yield analysis to become part of standard design sign-off.
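As a minimal illustration of these sum and max operations, the following sketch estimates a circuit delay distribution as the max over a few correlated path delays via Monte Carlo. All distributions, path counts and the shared inter-die component are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(1)
n_samples = 100_000

# Hypothetical path delays (ns): each path is the SUM of its stage delays.
# A shared inter-die component correlates all paths; the rest is independent.
inter_die = rng.normal(0.0, 0.05, n_samples)       # common to every path
n_paths, n_stages = 8, 12
stage_mu, stage_sigma = 0.10, 0.01

path_delays = np.empty((n_paths, n_samples))
for p in range(n_paths):
    stages = rng.normal(stage_mu, stage_sigma, (n_stages, n_samples))
    path_delays[p] = stages.sum(axis=0) + inter_die  # sum of random variables

circuit_delay = path_delays.max(axis=0)              # MAX across paths

print(f"mean circuit delay : {circuit_delay.mean():.4f} ns")
print(f"99th percentile    : {np.percentile(circuit_delay, 99):.4f} ns")
```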
Random defect yield modeling and critical area computation
A number of models for the prediction of yield of a semiconductor device due to random manufacturing defects have been proposed over the years. The common focus of all models is a measure called critical area that represents the sensitivity of a VLSI design to random defects during the manufacturing process.
A majority of random defects are introduced into the IC layers by the lithography process.
These are spot defects caused by various contamination particles. Spot defects may result in circuit failure depending on their size and location. They are classified into extra-material defects (also referred to as bridges or protrusion defects), missing-material defects (also referred to as breaks or intrusion defects), and pinhole defects occurring in the insulator region between patterns at different photolithographic levels (see e.g. [66]). Shorts, opens (breaks), and via-blocks are the main types of random manufacturing defects resulting in circuit failure.
The yield of a chip considering random manufacturing defects is computed as

Y_chip = ∏_i Y_i
where Y_i is the random defect yield associated with the i-th step of the manufacturing process (see e.g. [73, 47, 26]). For convenience the subscript is omitted and Y is referred to as the yield of a single processing step. There are a number of models to compute random defect yield, such as the Seeds model, the Poisson model, the negative binomial model, and Murphy's model (see e.g. [25]). The main difference between the various yield models is in the choice of statistics that are assumed to govern the spatial distribution of defects. For example, choosing negative binomial statistics results in the widespread negative binomial yield model, which is given by the following equation for a single processing step:

Y = (1 + d·A_c/α)^(−α)
where d denotes the average number of defects per unit area, α is a clustering parameter, and A_c denotes the critical area. All yield models, independent of the statistics used, result in yield equations that are functional forms of the same quantity, termed critical area.
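A minimal sketch of the negative binomial model as stated above; the defect density, critical area and clustering values are illustrative only:

```python
def negative_binomial_yield(defect_density: float, critical_area: float,
                            alpha: float) -> float:
    """Yield of a single processing step under negative binomial statistics.

    defect_density : average defects per unit area (d)
    critical_area  : weighted critical area A_c, in matching area units
    alpha          : clustering parameter (large alpha approaches Poisson)
    """
    return (1.0 + defect_density * critical_area / alpha) ** (-alpha)

# Illustrative numbers: 0.5 defects/cm^2, A_c = 0.4 cm^2, moderate clustering.
y_step = negative_binomial_yield(0.5, 0.4, alpha=2.0)
print(f"single-step yield: {y_step:.3f}")

# Chip yield over several independent steps: Y_chip = product of step yields.
y_chip = 1.0
for a_c in (0.4, 0.3, 0.25):
    y_chip *= negative_binomial_yield(0.5, a_c, alpha=2.0)
print(f"chip yield       : {y_chip:.3f}")
```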
Critical area is a measure reflecting the sensitivity of the design to random manufacturing defects and is defined as follows:

A_c = ∫_0^∞ A(r) D(r) dr
where A(r) denotes the area in which the center of a defect of radius r must fall in order to cause circuit failure, and D(r) is the density function of the defect size. A(r) is referred to as the critical area for defect size r. The total critical area integral A_c over all defect sizes is also referred to as the weighted critical area. The defect size density function has been estimated as follows [73, 27, 66]:
D(r) = c·r^q / r_0^(q+1)   for 0 ≤ r ≤ r_0
D(r) = c·r_0^(p−1) / r^p   for r_0 ≤ r < ∞

where p, q are real numbers (typically p = 3, q = 1), c = (q + 1)(p − 1)/(q + p), and r_0 is some minimum optically resolvable size. In the following we discuss further the computation of the total critical area integral over all defect sizes given this defect size distribution.
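As a numeric sanity check of these formulas, the following sketch integrates A(r)·D(r) with the typical p = 3, q = 1 distribution. The A(r) curve used here is a made-up placeholder; a real flow would extract A(r) from the layout:

```python
import numpy as np

def defect_density(r, r0, p=3.0, q=1.0):
    """D(r): rises like r^q below r0 and decays like 1/r^p above it."""
    c = (q + 1.0) * (p - 1.0) / (q + p)
    return np.where(r <= r0,
                    c * r**q / r0**(q + 1.0),
                    c * r0**(p - 1.0) / np.maximum(r, 1e-12)**p)

# Placeholder critical-area curve A(r): zero below the minimum spacing s,
# then growing with defect size and saturating at the chip area (illustrative).
s, chip_area = 0.2, 1.0e6                      # um, um^2 (hypothetical)
A_of_r = lambda r: np.clip((r - s) * 4.0e3, 0.0, chip_area)

r = np.linspace(1e-6, 50.0, 200_000)           # integration grid (um)
integrand = A_of_r(r) * defect_density(r, r0=0.1)
A_c = float(np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(r)))
print(f"weighted critical area A_c ~ {A_c:.1f} um^2")
```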
The extraction of critical area for various types of faults poses the major computational bottleneck in VLSI random yield prediction. In the following we review the main computational paradigms proposed in the literature for the extraction of critical area and focus on their algorithmic aspects. Pinhole defects are ignored because extracting their critical area is straightforward. Pinhole defects are modeled as points of no area and their critical area is simply the area of overlap between patterns at different photolithographic levels (see e.g.
[66]). We first give some mathematical insight into the various defect models.
Defect models
Random manufacturing defects are typically modeled as circular disks with a size distribution. In particular, a defect of size r is modeled as a circle of radius (or diagonal) r. The radius r is a random variable with a known probability density function D(r) as given above.
When calculating critical area it has been normal practice to approximate the circular defect by shapes that are computationally easier to deal with, such as squares or regular k-gons. Modeling defects as squares corresponds to measuring distance in the L∞ metric, and modeling them as diamonds corresponds to computing distances in the L1 (Manhattan) metric, instead of measuring distance in the ordinary Euclidean way where defects are circles; a treatment of circular defects is given in [53]. Often in practice the purpose of critical area computation is the prediction of relative yield, that is, to predict how a new design will yield in comparison to existing designs by comparing their critical areas. In this respect, the consistency of the defect model is far more important than the actual numerical values. In the following we review the main computational paradigms that have been proposed for the extraction of critical area.
Statistical methods
There have been two types of statistical methods proposed for critical area estimation at the chip level: Monte Carlo simulation [75, 67] and layout sampling [6]. Monte Carlo simulation is the older and more widely implemented technique. The method is as follows:
Draw a large number of defects with their radii distributed according to D(r), check for each defect whether it causes a fault, and divide the number of defects causing faults by the total number of defects. The main advantages of the Monte Carlo method are its simplicity and the ease of conceptualizing the procedure necessary to detect almost any type of failure mechanism. A tool can potentially be constructed upon most Design Rule Checking (DRC) platforms. The method is computationally intensive, and a naive implementation could result in prohibitively long runtimes. However, adaptive sampling and adaptive integration techniques can greatly benefit the basic method.
Distributed processing can significantly improve performance further as reported in [74] . Despite potential inefficiencies the Monte Carlo simulation method is widely accepted and it provides a good standard for comparison.
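A minimal sketch of the Monte Carlo procedure for shorts on a single layer, using square defects in the L∞ metric. The layout, net names and defect-size parameters are hypothetical, and a production tool would use DRC-grade geometry operations instead of this brute-force check:

```python
import random

# Hypothetical layer geometry: axis-aligned rectangles tagged with a net id.
# (x_lo, y_lo, x_hi, y_hi, net)
rects = [
    (0.0, 0.0, 10.0, 0.5, "netA"),
    (0.0, 1.0, 10.0, 1.5, "netB"),
    (0.0, 2.5, 10.0, 3.0, "netC"),
]
X, Y = 10.0, 3.0          # layout extent
R0 = 0.05                 # minimum optically resolvable defect size

def sample_defect_radius(u: float) -> float:
    """Inverse-CDF sampling of the p = 3, q = 1 size distribution D(r)."""
    if u < 0.5:
        return R0 * (2.0 * u) ** 0.5          # rising part, r <= r0
    return R0 / (2.0 * (1.0 - u)) ** 0.5      # 1/r^3 tail, r >= r0

def nets_touched(cx: float, cy: float, r: float) -> set:
    """Nets overlapped by a square (L-infinity) defect of radius r at (cx, cy)."""
    hit = set()
    for (x0, y0, x1, y1, net) in rects:
        if cx + r >= x0 and cx - r <= x1 and cy + r >= y0 and cy - r <= y1:
            hit.add(net)
    return hit

random.seed(7)
n, faults = 200_000, 0
for _ in range(n):
    r = sample_defect_radius(random.random())
    cx, cy = random.uniform(0, X), random.uniform(0, Y)
    if len(nets_touched(cx, cy, r)) >= 2:     # a short bridges two nets
        faults += 1

# Fault probability times the sampled area estimates the weighted critical area.
print(f"fault probability      ~ {faults / n:.4f}")
print(f"weighted critical area ~ {faults / n * X * Y:.4f} (layout area units)")
```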
Layout sampling, in combination with a deterministic method to compute critical area over a layout window, can give an alternative statistical technique to estimate the critical area of an entire chip. Layout random sampling in combination with standard shape shifting techniques to compute critical area were introduced in [6] . The method is as follows:
Randomly sample the layout to obtain random sample windows, compute critical area in every sample using a deterministic method, combine results on sample windows to obtain a critical area estimate for the entire chip. Stratified sampling can increase the accuracy of the prediction by dividing the layout area into a number of regions (strata) for which critical area is estimated using sampling techniques. The performance as well as the accuracy of the sampling methodology depends heavily on the method to generate samples as well as on the method to compute critical area in a selected sample. Details of the method as implemented in the EYES system combining stratified random sampling and shape shifting techniques are given in [6] .
Deterministic iterative methods
Critical area estimation methods in this category iteratively compute A(r), the critical area for a specific defect size r, for several different values of r. Those values are then interpolated and combined with the defect density function D(r) to approximate the total critical area integral. The majority of these methods are based on shape-shifting techniques (see e.g. [9, 16, 38]). For shorts, the typical shape-shifting method to compute A(r) can be described as follows (a code sketch follows the steps):
1. Expand each geometry shape by r/2.
2. Compute the region of overlap among two or more shapes in different nets, and let the area of the resulting region be A(r).
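A minimal sketch of these two steps using the shapely geometry library (an assumption of this example; any polygon engine would do). The two-net toy layout is hypothetical, and mitre joins keep the expansion rectilinear:

```python
from shapely.geometry import box
from shapely.ops import unary_union

# Hypothetical Manhattan layout: rectangles grouped by net.
layout = {
    "netA": [box(0.0, 0.0, 10.0, 0.5), box(4.0, 0.5, 4.5, 2.0)],
    "netB": [box(0.0, 1.0, 3.5, 1.5), box(5.0, 1.0, 10.0, 1.5)],
}

def critical_area_shorts(nets: dict, r: float) -> float:
    """A(r) for shorts: area where a defect of size r bridges two nets."""
    # Step 1: expand each net's geometry by r/2 (mitre joins stay rectilinear).
    expanded = {
        net: unary_union(shapes).buffer(r / 2.0, join_style=2)
        for net, shapes in nets.items()
    }
    # Step 2: accumulate the pairwise overlaps between different nets.
    names = list(expanded)
    overlap = unary_union([
        expanded[a].intersection(expanded[b])
        for i, a in enumerate(names) for b in names[i + 1:]
    ])
    return overlap.area

for r in (0.4, 0.6, 1.0):
    print(f"A({r}) = {critical_area_shorts(layout, r):.3f}")
```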
The main advantage of the shape-shifting methods is the ease of implementation using widely available shape-manipulation functions through most DRC platforms. The disadvantage is that step 2 can be expensive, demonstrating quadratic behavior even for smart scanline-type implementations. The reason is that the number of intersections between the expanded polygons, denoted as I, can be Ω(n²) in the worst case, where n is the number of layout edges.

Most of the investigation on critical area extraction in this category has been done for shorts. Opens have been studied less and are often approximated as a dual problem, substituting shape expansion by shape shrinking enhanced with other shape-manipulation operations to derive critical regions. The reader is referred to [58] for details on such a treatment of opens based on DRC operations. Via-blocks are also treated in [58] for a simplified definition where a defect is considered a fault if it simply overlaps any portion of a contact or via. For a discussion on the sensitivity of via chains to metal opens and via-opens see [24].
The Voronoi (non-iterative) deterministic method
This method, originally proposed in [55, 53], computes the entire critical area integral in O(n log n) time, where n is the number of layout edges, in a single pass of the layout. It is based on the following concept: divide the area of the layout into small regions such that the critical area within each region is easy to compute. The total critical area integral can then be derived as the summation of all partial critical areas obtained within each region. Assuming that within each layout region the critical area integral can be computed accurately, the total critical area of the layout can be easily extracted once the appropriate layout subdivision is available.

The critical radius of a layout point t, denoted r_c(t), is the radius of the smallest defect centered at t causing a circuit fault. Given a layer of interest C and a fault type, the Voronoi method subdivides C into regions such that for any point t the critical radius is easy to compute. In particular, r_c(t) is given by the distance of t from the layout element owning the region where t belongs. In the L∞ metric (similarly for L1 and the octagon metric) r_c(t) becomes a simple linear function, allowing for simple critical area integration.
In the following we describe the Voronoi diagram for shorts and refer the reader to [53, 54] for the case of opens and via-blocks. The L∞ metric is assumed throughout the section. The concepts are easily extendible to the octagon metric with some penalty in the complexity of the Voronoi diagram construction (see e.g. [19] for k-gons). For circular defects no analytical formulation for critical area integration is known.
Voronoi diagram for shorts. A short at a layout point t is a defect centered at t overlapping at least two shapes in two different nets. Let P be the polygon nearest to t. The critical radius of t is then determined by the 2nd nearest polygon to t, say Q, such that Q is in a different net than P, and r_c(t) = d(t, Q). Thus, 2nd-nearest-neighbor information is needed, which can easily be obtained by the 2nd order Voronoi diagram on the layer of interest, defined as follows: for every polygon P, partition the interior of reg(P) by the Voronoi diagram of all polygons other than P. In Figure 6, the thick lines illustrate the 2nd order subdivision of reg(P), where P is shown in dotted lines. Note that only Voronoi neighbors of reg(P) can contribute to the 2nd order subdivision of reg(P). More formally, given a layer C, the 2nd order Voronoi region of an element s ∈ C − P within the Voronoi cell of P is reg_P(s) = {x ∈ reg(P) | d(x, s) ≤ d(x, s′), ∀ s′ ∈ C − P}.
To avoid counting shorts between disjoint polygons of the same net, any neighboring Voronoi regions of the same net can be united prior to the 2nd order Voronoi computation.
Critical Area Integration. Assume that the appropriate Voronoi subdivision of a layer for a fault type is available. Each Voronoi region can be partitioned into simple subregions such as rectangles and triangles (assuming the L∞, L1 or octagon metric), where the critical area integral can be computed analytically given the defect size distribution D(r).
Once analytic formulas are established for each type of simple region, the total critical area integral can be derived as a simple summation of those formulas. As the formulas are analytic, there is no integration error. In [55, 56] analytic formulas were derived for the widely used defect size distribution D(r) = 1/r³ assuming the L∞ metric, and were shown to simplify into terms derived directly from Voronoi edges. As a result, critical area extraction becomes trivial once the appropriate Voronoi diagram is computed. In case A(r), the critical area for a specific defect size r, is also needed, it can easily be derived in linear time from the same Voronoi subdivision of the layout.
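To see why such closed forms exist, the critical area integral can be rewritten as an integral over layout points. The following is a sketch of the standard argument, stated up to the normalization of D(r), using its 1/r³ tail for r ≥ r_0 and assuming r_c(t) ≥ r_0 everywhere of interest:

```latex
% A(r) is the area of the set of points whose critical radius is at most r:
%   A(r) = \int_{\text{layout}} \mathbf{1}[\, r_c(t) \le r \,]\, dt .
% Substituting into A_c and exchanging the order of integration gives
\begin{align*}
A_c \;=\; \int_0^{\infty} A(r)\, D(r)\, dr
    \;=\; \int_{\text{layout}} \int_{r_c(t)}^{\infty} D(r)\, dr\, dt .
\end{align*}
% With the tail D(r) = r_0^2 / r^3 for r >= r_0 the inner integral is elementary:
\begin{align*}
\int_{r_c(t)}^{\infty} \frac{r_0^2}{r^3}\, dr \;=\; \frac{r_0^2}{2\, r_c(t)^2},
\qquad\text{hence}\qquad
A_c \;=\; \int_{\text{layout}} \frac{r_0^2}{2\, r_c(t)^2}\, dt .
\end{align*}
% Within each Voronoi subregion r_c(t) is a linear function of position in the
% L-infinity metric, so the integral over each subregion has a closed form.
```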
Scanline construction of the Voronoi diagram
The Voronoi diagram of a layout can be constructed by a scanline approach as described in [55, 56]. Results on the wide use of the Voronoi method to compute critical area and predict yield by IBM Microelectronics are given in [48].
Other non-iterative approaches
The grid method of [73] assumes a fine grid over the layout and uses it to perform critical area integration. The grid resolution provides a trade-off between accuracy and speed.
The method computes the critical radius for every grid point and uses this information to compute the critical area integral. The approach is appropriate for an interactive tool and can be sped up as shown in [55] .
FedEx [65] is a fault extractor for shorts. That is, instead of computing critical area, it extracts a list of all two-node intra-layer bridges (shorts). It also computes an approximate weighted critical area for each bridge, and provides approximate fault locations. As pointed out in [65], FedEx trades accuracy for speed and memory. It assumes Manhattan layouts. FedEx starts with a hierarchical design description, incrementally flattens the layout, and writes bridging faults out in a flat manner. For circuit and fault extraction it uses a scanline algorithm that first converts polygons into rectangles. Memory consumption is relatively small as only a moving window of geometry is kept, i.e., approximately O(√n), where n is the size of the layout (number of rectangles). Bridge fault sites are written flat to the output file. There are several performance similarities between FedEx and the Voronoi method.
Both methods start with a hierarchical design, using a scanline that only locally sees the layout geometry flat. Memory consumption is relatively small as only a neighborhood of the design near the scanline is kept in memory. The 1st order Voronoi diagram of the layout geometry also provides information on same-layer two-node bridges as obtained by FedEx. FedEx quickly outputs an approximate critical area for each bridge, while the Voronoi method uses the 2nd order Voronoi diagram to obtain an accurate same-layer critical area number while maintaining O(n log n) worst-case performance.
Methods for Yield Optimization
Aggressive technology scaling has made process variation control from a purely manufacturing perspective very tough. Design-related yield losses have been projected to increase [77], which implies that greater cooperation between the physical design and process communities is necessary.
Yield optimization methods follow the "measure, model and mitigate" flow. Measurements are usually done by targeted test structures which are measured on silicon for physical parameters like linewidth and thickness, as well as electrical parameters like sheet resistance and transistor saturation current. A good publication to keep track of, for those interested in test-structure design and measurement, is ICMTS [1]. Process models extracted from such test-structure measurements are usually abstracted into simpler models or sets of rules for physical design and verification tools to use. In this section, we briefly discuss the evolution of yield optimization physical design techniques.
Critical Area and Catastrophic Yield Optimization Methods
BEOL yield and manufacturability optimization is a complicated task. Methods for yield improvement vary, ranging from critical-area-based wire spreading to metal fill and the development of new rules and optimizations for routers. We start with a review of available methods for wire spreading and critical area reduction.
Methods for critical area reduction fall into two broad categories: methods that alter the topology of the layout by attempting critical area optimization at the routing phase, and methods used as a postprocessing step that keep the layout topology fixed while attempting to alleviate congestion and increase wire spacing. The two categories can be regarded as complementary and both can be incorporated into the design cycle.
In the first category the most representative method is [36], where a general routing cost function is described that takes into account critical area in conjunction with traditional routing objectives. The cost function combines most types of major defects, i.e., shorts, opens, number of vias, and pinhole defects. Results verify that taking critical area into account at the routing phase can result in effective critical area reduction and therefore effective optimization for yield. In [44], channel routing is modified to reduce critical area between wire segments. [44] also minimizes the number of vias, as their presence increases manufacturing complexity and degrades yield.
The methods in the second category attempt to redistribute spacing between adjacent wires without changing the layout topology. They are usually based on compaction techniques using the following observation: in a VLSI layout, distances between shapes can vary as long as the minimum value imposed by the design rules is met. The slack between two shapes is defined as the difference between the current distance between the two shapes and the minimum distance required by the design rules. Carefully redistributing the slacks can result in a layout with a better yield. Several slack redistribution techniques have been proposed, see [20, 11, 35, 14]. In their majority they are based on principles of layout compaction and are formulated as a one-dimensional layout optimization problem. They start with a constraint graph representation of the layout and perform layout modification for yield in one direction at a time, for the most part using a one-dimensional yield objective function. The main drawback of a one-dimensional yield objective function is that, although it optimizes critical area in one direction, it fails to take into consideration a potential critical area increase in the orthogonal direction. Figure 7, reproduced from [35], illustrates one such situation where movement of a layout element decreases critical area in one direction but increases it in the orthogonal direction. To address this problem [35] combines the one-dimensional movement for slack redistribution with a two-dimensional yield objective.
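A toy sketch of the one-dimensional idea: a real tool works on the full constraint graph, but here a single row of parallel wires with fixed boundaries is simply spread to equalize gaps, so that no one tight gap dominates the short critical area. All geometry values are hypothetical:

```python
# Hypothetical row of parallel wires, each given as (left_edge, width).
wires = [(0.0, 0.2), (0.3, 0.2), (0.6, 0.2), (2.0, 0.2)]
row_left, row_right = 0.0, 3.0   # fixed boundaries of the routing row
min_space = 0.1                  # design-rule minimum spacing

def spread_wires(wires, left, right, min_space):
    """Redistribute horizontal slack so all gaps become equal (if legal)."""
    widths = [w for (_, w) in wires]
    free = (right - left) - sum(widths)          # total whitespace in the row
    gap = free / (len(wires) + 1)                # equal-gap target
    assert gap >= min_space, "row too congested to spread legally"
    placed, x = [], left + gap
    for w in widths:
        placed.append((x, w))
        x += w + gap
    return placed

for (x, w) in spread_wires(wires, row_left, row_right, min_space):
    print(f"wire at x = {x:.3f}, width = {w}")
```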
The first compaction based algorithm to improve yield was given in [20]. A heuristic algorithm increases the spacing of layout objects through a series of spacing iterations in one direction. Only objects off the critical path are allowed to move, maintaining the original layout area. The defect sensitivity of open-circuit type faults is reduced by increasing the width of certain non-critical elements in the layout. In [11] the slack redistribution problem in one direction was transformed into a network flow problem which can be solved via the fast wirelength minimization algorithm of [70]. The layout is represented by a constraint graph where a node corresponds to a layout object and an edge links the nodes of two adjacent layout objects. The cost of each graph edge is an estimate of the fault probability between the two corresponding objects, expressed as a function of the length of the graph edge, which can be approximated by a convex piecewise-linear cost function. Another one-dimensional compaction based formulation is given in [14], where first the critical area rectangles for one chosen defect size are computed. The standard compaction area optimization objective is enhanced with the additional terms of minimizing the critical area rectangles, which are reduced into functions of the original layout variables. In this manner the original compaction algorithm is upgraded with yield enhancement without introducing extra variables or constraints.
A non-compaction based approach in this category is based on post-route optimization using a rubber-band wiring model [69] . The layout is given in a rubber-band sketch (RBS) form which represents every layer of interconnect as a set of flexible rubber-bands with elastic properties. Wire spreading is achieved by estimating the critical area of the RBS and exploiting the flexibility of the rubber band behavior while maintaining wire connectivity.
For more information see [69] . Heuristic layout changes to improve yield are described in [8] through the use of a set of local rules for contacts, metal and polysilicon layers. A system that allows the user to first evaluate layout modifications by applying them to samples only of the chip layout, rather than the entire layout, is described in [7] . The results from those samples can be used to define the modifications to be applied to the whole chip.
An effective way to reduce open faults is the introduction of redundant interconnects. The problem can be stated as follows: given a rectilinear feasible routing region FRR, a rectilinear Steiner routing tree T within FRR, and a wirelength budget W, find a set of augmenting paths A within the FRR such that the total length of the augmenting paths is at most W, and the total length of edges of T that are non-bridges in G = T ∪ A is maximized. An exact algorithm based on an integer programming formulation, and a greedy heuristic that iteratively adds augmenting paths between vertices, were given in [40]. Experimental results show that the greedy augmentation method achieves a significant increase in reliability, as measured by the percentage of biconnected tree edges, with only a small increase in wirelength.
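A minimal sketch of the greedy flavor of this idea (not the algorithm of [40] itself): repeatedly add the candidate edge that converts the most bridge length per unit of added wirelength, until the budget W is exhausted. The tree topology, coordinates and budget are hypothetical; the networkx library (an assumption of this example) supplies the bridge computation:

```python
import itertools
import networkx as nx

# Hypothetical routing tree: nodes with (x, y) positions, rectilinear edges.
pos = {0: (0, 0), 1: (4, 0), 2: (4, 3), 3: (8, 3), 4: (4, 6)}
tree_edges = [(0, 1), (1, 2), (2, 3), (2, 4)]

def manhattan(u, v):
    (x0, y0), (x1, y1) = pos[u], pos[v]
    return abs(x0 - x1) + abs(y0 - y1)

G = nx.Graph()
for u, v in tree_edges:
    G.add_edge(u, v, length=manhattan(u, v))

budget = 12.0
while True:
    bridges = {frozenset(e) for e in nx.bridges(G)}
    best, best_ratio = None, 0.0
    for u, v in itertools.combinations(pos, 2):
        if G.has_edge(u, v) or manhattan(u, v) > budget:
            continue
        cost = manhattan(u, v)
        # Benefit: bridge length on the existing u..v path that the new
        # edge would place on a cycle (i.e. make non-bridge).
        path = nx.shortest_path(G, u, v, weight="length")
        gain = sum(G[a][b]["length"] for a, b in zip(path, path[1:])
                   if frozenset((a, b)) in bridges)
        if gain / cost > best_ratio:
            best, best_ratio = (u, v, cost), gain / cost
    if best is None:
        break
    u, v, cost = best
    G.add_edge(u, v, length=cost)
    budget -= cost

remaining = {frozenset(e) for e in nx.bridges(G)}
protected = sum(d["length"] for a, b, d in G.edges(data=True)
                if frozenset((a, b)) not in remaining)
print(f"total non-bridge wire length: {protected}")
```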
In addition to reducing the potential for opens, redundant interconnects have also been proposed in clock networks to overcome the clock skew variation problem. In [60] cross links are inserted into a regular clock tree, converting it into a non-tree with lower skew variability and only a small increase in wire length.
Redundant via insertion provides another effective way of increasing design reliability and yield. Vias have an inherently low reliability (e.g. due to stress-related via voids), and thus redundant via insertion is a good way to reduce the yield loss caused by via failure. Typically redundant via insertion is done post-routing on a "wherever space is available" basis, but considering redundant vias during detailed routing has also been proposed [78]. Note that an increased number of vias can have a negative impact in terms of routing area, and may reduce the critical area for via-blocks at the cost of increasing the critical area for shorts. Overall, however, with appropriate tradeoffs design reliability can increase considerably.
Antenna fixing is another topic related to design reliability. Since VLSI layers are formed one at a time during fabrication, "dangling" metal1 routes (i.e. nets not yet fully fabricated) connected to a polysilicon gate can cause stray charge deposition on the gate, damaging it. Methods to correct such situations include inserting jumpers in routes such that the maximum dangling route length is limited (see for example [68]). Diffusion diodes can also be inserted to provide a discharge path if space is available.
Design Rules
The abstraction of manufacturing constraints into a set of geometric constraints, or design rules, for layout designers to follow has traditionally been the foundry's main method of ensuring manufacturability. Manufacturability implications of technology scaling have led to three major trends in design rules:
• More complicated rule sets. The sheer number of design rules has been growing at a rapid pace with every technology generation. More process constraints have required new kinds of rules [41, 59]. This has made physical verification, routing, as well as custom layout, very difficult and time-consuming tasks.
• Restrictive design rules. To cope with sub-100nm manufacturability concerns, where manufacturing equipment is not keeping pace with feature scaling, radically restricting layout options has been proposed as a viable option [46, 45]. One common restriction is to enforce regularity in layout, which aids printability. An example of such a rule is allowing only one or two pitches on the polysilicon layer.
• DFM rules. Most 90nm and 65nm design rule manuals include a separate set of non-minimum design rules. These design rules, if obeyed by the layout, enhance its manufacturability. For example, the minimum metal-via enclosure may be 20nm while the corresponding DFM rule is 30nm. The increased enclosure reduces the chance of loss of contact between the metal route and the via, at the cost of increased routing area (a toy check of such paired rules is sketched after this list).
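A toy illustration of checking a hard minimum rule against its recommended DFM counterpart. The 20nm/30nm enclosure values follow the example above; the via records are hypothetical:

```python
MIN_ENCLOSURE = 20.0   # hard design rule (nm)
DFM_ENCLOSURE = 30.0   # recommended non-minimum DFM rule (nm)

# Hypothetical extracted metal-via enclosure measurements, in nm.
via_enclosures = {"via_17": 35.0, "via_42": 22.0, "via_99": 18.0}

for via, enc in via_enclosures.items():
    if enc < MIN_ENCLOSURE:
        print(f"{via}: VIOLATION   ({enc} nm < {MIN_ENCLOSURE} nm minimum)")
    elif enc < DFM_ENCLOSURE:
        print(f"{via}: DFM warning ({enc} nm < {DFM_ENCLOSURE} nm recommended)")
    else:
        print(f"{via}: ok")
```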
Though design rules have served the industry well in the past as the abstraction layer, the inadequacy and sub-optimality of such yes/no rules has led to a slow but steady adoption of model-based checking methods [59].

Corner-Based Design Analysis

These manufacturability challenges have led to the following trends in corner-based design analysis and optimization:
• More corners. As more complicated process effects emerge, and as a result of the non-monotone dependence of delay on many of the process parameters, the number of process-voltage-temperature (PVT) corners at which a design needs to be signed off is increasing.
• On-Chip Variation (OCV) analysis. To model within-die variation, static timing tools implicitly analyze clock paths and data paths at separate corners [76]. For example, for setup-time analysis, the launching clock path may be analyzed at a slow corner while the capturing clock path is analyzed at a fast corner and the data path is analyzed at the slow corner. This in essence tries to model the worst-case impact of on-chip variation. Additional techniques, such as common path pessimism removal (CPPR), which identifies the logic shared between launching and capturing paths to avoid pushing it to different corners, are used to reduce the inherent pessimism of OCV analysis, as the numeric sketch following this list illustrates.
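A minimal numeric sketch of the idea, with hypothetical derate factors, delays and shared-segment length: the launch clock is derated late, the capture clock early, and CPPR credits back the pessimism charged to the clock segment the two paths share.

```python
# Hypothetical nominal delays (ns).
launch_clock = 1.00     # clock path to the launching flop
capture_clock = 1.00    # clock path to the capturing flop
common_clock = 0.60     # portion of the clock tree shared by both paths
data_path = 2.10        # combinational data path delay
period, setup = 2.50, 0.05

late, early = 1.10, 0.90   # OCV derate factors (slow / fast)

# OCV setup check: launch + data at the slow corner, capture at the fast corner.
arrival = launch_clock * late + data_path * late
required = period + capture_clock * early - setup
slack_ocv = required - arrival

# CPPR: the shared clock segment cannot be simultaneously slow and fast,
# so the derate difference applied to it is credited back.
cppr_credit = common_clock * (late - early)
slack_cppr = slack_ocv + cppr_credit

print(f"slack with OCV only : {slack_ocv:+.3f} ns")   # fails without CPPR
print(f"slack after CPPR    : {slack_cppr:+.3f} ns")  # passes with the credit
```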
Though the runtime overhead of an ever-increasing number of corners, the excess pessimism of corner-based analysis, and the fear of missing some corners in a high process-variability regime have led to increasing interest in statistical analysis tools, deterministic corner-based design optimization still remains the mainstay of commercial parametric yield optimization.
Futures of Parametric Yield Optimization
As mentioned earlier, explicit parametric yield analysis and optimization is a relatively new field of research. Several interesting published works in the past few years have attempted to deal with the problem of manufacturing variability.
Methods for Systematic Variability
There are several pattern-dependent process effects which are systematic in nature. These can be compensated for during physical design to aid manufacturability and hence improve yield. The biggest contributors in this bucket are CMP and photolithography. Metal filling and slotting techniques for CMP are discussed elsewhere in the book. Traditionally, design rules have been the method of optimizing for systematic variation. Recently, more explicit mitigation of the impact of systematic variation on circuit power and performance has been studied. For instance, some methods have tried to reduce CD variability by avoiding lithography-induced forbidden pitches during detailed placement [30] or detailed routing [62, 49]. Making circuits more robust to focus variations has been studied in [31, 42].
Statistical Optimization
Like statistical analysis, statistical physical design is an active area of research with very little in terms of well-accepted optimization methods. Deterministic physical design tends to generate a "wall" of near-critical slack. As the number of uncorrelated critical paths in a design increases, any of them can "pop up" to being critical and hence be the determinant of circuit delay. As a result, a higher wall of slack can mean a slower circuit delay distribution.
Intentional "under-optimization" by assigning a penalty to paths that are close to critical has been suggested as a simple technique to overcome this issue [10] . Another approach in same vein assigns a delay penalty to every gate proportional to its delay variability [15] and uses standard static timing analysis in optimization. Other approaches explicitly rely on a statistical timing engine in a statistical sensitivity [33, 34] or nonlinear programming based optimization [37] . The biggest challenge in statistical physical design besides computational complexity is accurate modeling of physical reality. For example, ignoring parametric or spatial correlations (i.e. assuming independence or perfect correlation between performance or process random variables) can undo any benefit from statistical optimization.
Conclusions
In this chapter we have touched upon various sources of manufacturing yield loss in modern sub-micron processes. We have briefly described methods of yield calculation and optimization with emphasis on well-known methods related to random-defect driven yield loss.
We have also discussed the emerging area of parametric yield analysis and optimization in physical design.
