Abstract-With the goal of building an hierarchical design methodology for quantum-dot cellular automata (QCA) circuits, we put forward a novel, theoretically sound, method for abstracting the behavior of circuit components in QCA circuit, such as majority logic, lines, wire-taps, cross-overs, inverters, and corners, using macromodels. Recognizing that the basic operation of QCA is probabilistic in nature, we propose probabilistic macromodels for standard QCA circuit elements based on conditional probability characterization, defined over the output states given the input states. Any circuit model is constructed by chaining together the individual logic element macromodels, forming a Bayesian network, defining a joint probability distribution over the whole circuit. We demonstrate three uses for these macromodel-based circuits. First, the probabilistic macromodels allow us to model the logical function of QCA circuits at an abstract level-the "circuit" level-above the current practice of layout level in a time and space efficient manner. We show that the circuit level model is orders of magnitude faster and requires less space than layout level models, making the design and testing of large QCA circuits efficient and relegating the costly full quantum-mechanical simulation of the temporal dynamics to a later stage in the design process. Second, the probabilistic macromodels abstract crucial device level characteristics such as polarization and lowenergy error state configurations at the circuit level. We demonstrate how this macromodel-based circuit level representation can be used to infer the ground state probabilities, i.e., cell polarizations, a crucial QCA parameter. This allows us to study the thermal behavior of QCA circuits at a higher level of abstraction. Third, we demonstrate the use of these macromodels for error analysis. We show that low-energy state configurations of the macromodel circuit match those of the layout level, thus allowing us to isolate weak points in circuits design at the circuit level itself.
Ç

INTRODUCTION
Q UANTUM-DOT Cellular Automata (QCA) is an emerging technology that offers a revolutionary approach to computing at nano-level [1] . What sets it apart is that it exploits, rather than treating as nuisance properties, the inevitable nano-level issue of device to device interaction at nano-scales to perform computing. Each cell consists of two electrons that can occupy four dots, resulting in two ground states configurations, which can be taken to represent the logic states of zero or one. Two or more cells interact by Coulombic interaction, with an arrangement of cells settling to the lowest energy state. Since there is no flow of electrons involved, there is no need for traditional interconnects and it has potential for extremely low-power computing, even below the traditional kT [2] . Both individual QCA cell (semiconductor and metallic) and multiple QCA arrangement have been fabricated and tested [3] , [4] . Significant progress is also being made in using molecules to implement QCAs [5] , [6] , which will make it possible to operate in room temperature, possibly alleviating the initial criticisms of this technology. It will also connect the areas of molecular computing and QCAs.
The time is ripe to look beyond just device-level research in emerging devices and explore circuit level issues so as to scope out the types of circuits that can be built [7] , [8] , [9] , [10] , [11] , [12] . However, the QCA modeling tools available for such designs have been at the layout level. There are several approximate simulators available at the layout level, such as the bistable simulation engine and the nonlinear approximation methods [13] , [14] , [15] . These methods are iterative and do not produce steady state polarization estimates. In other words, they estimate just state assignments and not the probabilities of being in these states. The coherence vectorbased method [16] , [15] does explicitly estimate the polarizations, but it is appropriate when one needs full temporal dynamics simulation (Bloch equation) and, hence, is extremely slow; for a full adder design with about 150 cells, it takes about 500 seconds for eight input vectors. Perhaps the only approach that can estimate polarization for QCA cells, without full quantum-mechanical simulation, is the thermodynamic model proposed in [17] , but it is based on semiclassical Ising approximation. In [18] , [19] , it was shown that layout-level QCA cell probabilities can be modeled using Bayesian probabilistic networks.
To advance design with QCA, it is necessary to look beyond the layout level. Hierarchical design at multiple levels of abstraction, such as the architectural, circuit, layout, and device levels, has been a successful paradigm for the design of complex CMOS circuits. It is only natural to seek to build a similar design structure for emerging technology. Henderson et al. [20] proposed a hierarchical CMOS-like top-down approach for QCA blocks that are analyzed with respect to the output logic states; this is somewhat similar to the functional logic verification performed in CMOS. We also advocate building a hierarchical design methodology for QCA circuits. However, such a hierarchy should be built based not just on the functionality of the circuit, but it should also allow the abstraction of important nano-device parameters. It is not sufficient just to abstract a QCA circuit in terms of 0-1 Boolean logic-based majority gates and other logic components; we also have to represent the probabilistic nature of the operations. Thus, for each logic variable X, we have to assign the probabilities associated with the logic values, i.e., P ðX ¼ 1Þ or P ðX ¼ 0Þ. In the parlance of QCA, the specific design variable is the "polarization" of cell, which is P ðX ¼ 1Þ À P ðX ¼ 0Þ. These probabilities (or polarizations), which are governed by quantum mechanics, are dependent on temperature, which is an important design variable for QCAs that needs to be represented at upper design levels. Another need for probabilistic representations arises due to the nature of the QCA operations. QCA circuits are designed so that the intended logic is mapped to the lowest-energy (ground state) of the cell arrangement. So, it is important that the circuit be kept near ground state during operations, using mechanisms such as four-phased adiabatic clocking. Logical errors in QCA circuits can arise due to failure to settle to the ground state. It is important to compute the difference between the probability of the lowest-energy state configuration that results in correct output and the lowest-energy state configuration that results in erroneous output. It would indeed be useful to be able to compute these erroneous configurations at higher levels of design. Building a device-level characterization sensitive macromodel will facilitate answering the following kinds of questions at higher design levels of abstraction: What is the expected polarization of the outputs? How does it change with temperature? How sensitive is the design with respect to operational errors?
In this work, we formulate a probabilistic framework for a higher level of abstraction of QCA circuits that would enable one to characterize designs with respect to thermal profiles and errors, the two most important design issues in nano-circuit design. Standard QCA circuit elements such as majority logic, lines, wire-taps, cross-overs, inverters, and corners are represented using conditional probability distributions defined over the output states given the input states. The probabilistic macromodels allow us to model QCA circuits at an abstract level above the current practice of the layout level; we term this higher level the "circuit" level. The full circuit level model is constructed by chaining together the individual logic element macromodels. This circuit is represented using the graphical probabilistic models known as Bayesian networks, where the nodes of the graphs are the individual macromodels and the links represent the connection between them. The nodes are quantified by the macromodel conditional probabilities. The complete network represents a joint probability distribution over the whole circuit. Since conditional distribution over the inputs and outputs are obtained based on quantum mechanical probabilistic characterization, the circuit level model is also faithful to the underlying quantum-mechanical phenomena.
Computations using the macromodel translate to different kinds of probabilistic inference problems. For instance, the computation of ground state polarization is done using the average likelihood propagation on the built Bayesian network macromodel. Similarly, the most likely configuration of the internal nodes corresponding to the first-excited, also called near-ground state or the most likely error state at the outputs, can be isolated at the macromodel circuit level itself using maximum likelihood propagation on the same Bayesian network macromodel. We demonstrate and validate our model using commonly studied QCA circuits and elements whose behaviors are fairly well understood by others. First, we show that the ground state polarization probabilities of the output nodes as well as the intermediate nodes in the macromodel of the QCA logic circuit closely match with those obtained from a full layout level implementation [18] at different temperatures. We show examples of the characterization of thermal behavior of a QCA logic circuit that can be carried out. Second, we demonstrate that both the ground and the next excited (error) state configuration of the macromodel exactly match the corresponding configurations of the detailed layout cells. The mismatch between the ground and the next excited error state configuration can be used to identify weak spots in circuit design. Using the macromodel, this can now be done at a higher level of abstraction. Isolation of error-prone components would be useful in applying redundancy selectively to the necessary components rather than to the whole circuit. Third, we use the circuit level implementation to vet between alternate design choices. We show examples of this design space exploration process with the example of two adders. We find that one adder design, Adder-1, in spite of its larger area, is better in terms of polarization, which is an extremely important measure for the QCA circuits. Also, we see that, for Adder-1, the number of error-prone components is less than a second adder design, Adder-2, and, hence, the needed redundancy measures would be less for Adder-1.
The organization of this paper is as follows: In Section 2, we begin by explaining the hierarchical modeling scheme used in this work. Then, we proceed in Section 2.1 to summarize the quantum-mechanical nature of the probabilities associated with the QCA cells. In Section 2.2, we show how an arrangement of QCA cells can be modeled by a joint probability function, represented as a Bayesian network. Further down, in Section 2.3, we present the theory behind the macromodels. We demonstrate how, using these macromodels, we can 1) model full circuits (Section 2.4), 2) explore design space exploration in QCA circuit layouts (Section 4.3), and 3) conduct error studies (Section 3). We comment on the computational advantage of the circuit level representation over the layout level one in Section 4 and we conclude with Section 5.
MODELING THEORY
In this section, we explain the hierarchical modeling scheme. We focus on two levels: the layout level and the circuit level, where groups of QCA cells, corresponding to a basic logic element, are represented as one macroblock. For both these levels, we will use the graphical probabilistic model called Bayesian Networks to represent the underlying joint probability of the entire set of nodes. Note that probabilistic representation is essential to capturing the inherently uncertain nature of computing with QCAs.
Bayesian Networks [21] are efficient representations of the joint probability distribution over a set of random variables using a Directed Acyclic Graph (DAG). Each random variable of interest is represented as a node and links between the nodes denote direct dependencies (cause-effect interactions) between the random variables. For our problem, the random variables are the states of the QCA cells at the layout level or the I/O states of the macromodels. The links are guided by the interaction neighborhood of the cells and the logical flow of information from the inputs to the outputs. For QCA circuits, these cause-effect directions would be determined by the direction of propagation of quantum-mechanical information propagation with change in input. Clocks determine the causal order between cells. Within each clock zone, ordering is determined by the direction of propagation of the wave function [22] . Since the Coulombic interaction between cells falls off faster than the fifth power of the distance between them, we need to consider links between cells that are within a small neighborhood of each other, typically two cell distance.
In Fig. 1a , we show the QCA layout of a NAND gate. In this work, we will use X to represent the random variable denoting the states of a QCA cell at the layout level (Fig. 1b) . The input cell states will be denoted by X 1 ; Á Á Á ; X r , the noninput QCA cells will be X rþ1 ; Á Á Á X N , and X s will denote one of the output cells, where r þ 1 ! s ! N. 
The nodes of the Bayesian network are quantified by the conditional probabilities. At the layout level, we need to specify the conditional probability of the state of a cell given the states of parent neighbors, i.e., P ðxjpaðXÞÞ, where P aðXÞ are the direct causes of the random variable X or the parents of the node X in the directed graph representation. 1 We estimate this using the quantum mechanical modeling of QCA cells. At the circuit level, we need to specify the conditional probability of the output states of a macromodel given the states of the inputs, P ðyjP aðY ÞÞ. These conditional probabilities are estimated from the conditional probabilities in the layout level model of the QCA cells comprising the macromodel, at different temperatures.
In general, a Bayesian network encodes the joint probability function as a set of factored conditional probabilities, of minimal representational complexity. Proof of minimality can be found in standard Bayesian network texts such as [21] .
In the conditional probability term P ðxjpaðXÞÞ, paðXÞ represents the values taken on by the parent set, P aðXÞ. Inference or computation with Bayesian networks exploits the sparsely connected graph structure. The most common schemes involve passing messages among the nodes. As we shall see, we will need to conduct both average case and maximum likelihood inferences. For both the average and maximum likelihood propagation, we adopt the cluster-based exact inference scheme. We refer the reader to [21] , [23] , [19] for details on the inference scheme. However, it suffices to note that the propagation schemes are based on message passing and are similar, differing only in the kinds of messages that are passed. The original Bayesian network, which is a DAG structure, is first transformed into a junction tree of cliques and then marginal probabilities are computed by local message passing between the neighboring cliques. These methods result in exact inference of probabilities.
In the rest of this section, we provide details of the process. We start with a discussion of the macromodel construction process by the Bayesian network model at the layout level, which was proposed in [19] . Then, we present the construction of the macromodels and circuit-level Bayesian representation.
Quantum Mechanical Probabilities
We sketch how the state probabilities of a QCA cell are dependent on the state probabilities of its layout neighbors, distance to the neighbors, and temperature. Each cell has two electrons that can occupy four possible dots. Among all the possible occupancy configurations, there are two lowest 1 . We use lowercase to indicate the value of a random variable, i.e., P ðxÞ denotes the probability of the event X ¼ x or P ðX ¼ xÞ. energy configurations corresponding to the diagonal occupancy of the cells. These represent the two logical states, 0 or 1. So, following Tougaw and Lent [22] and other subsequent works on QCA, we use the two-state approximate model of a single QCA cell. We denote the two possible, orthogonal, eigenstates of a cell by j1i and j0i. The state at time t, which is referred to as the wave-function and denoted by jÉðtÞi, is a linear combination of these two states, i.e., jÉðtÞi ¼ c 1 ðtÞj1i þ c 2 ðtÞj0i. Note that the coefficients are a function of time. The expected value of any observable hÂðtÞi can be expressed in terms of the wave function as hÂi ¼ hÉðtÞjÂðtÞjÉðtÞi or, equivalently, as Tr½ÂðtÞjÉiðtÞhÉðtÞj, where Tr denotes the trace operation, Tr½Á Á Á ¼ h1j Á Á Á j1i þ h0j Á Á Á j0i. The term jÉðtÞihÉðtÞj is known as the density operator,ðtÞ. The expected value of any observable of a quantum system can be computed if ðtÞ is known.
A 2 by 2 matrix representation of the density operator in which entries are denoted by ij ðtÞ can be arrived at by considering the projections on the two eigenstates of the cell, i.e., ij ðtÞ ¼ hijðtÞjji. This can be simplified further.
The density operator is a function of time and, using Loiuville equations, we can capture the temporal evaluation of ðtÞ in (3) .
where H is a 2 by 2 matrix representing the Hamiltonian of the cell and using Hartree approximation. The expression of the Hamiltonian is shown in (4) [22] .
where the sums are over the cells in the local neighborhood. E k is the "kink energy" or the energy cost of two neighboring cells having opposite polarizations. f i is the geometric factor capturing electrostatic fall off with distance between cells. P i is the polarization of the ith cell. And, is the tunneling energy between two cell states, which is controlled by the clocking mechanism. The notation can be further simplified by using " P to denote the weighted sum of the neighborhood polarizations, P i P i f i . Using this Hamiltonian, the steady state polarization is given by
ð5Þ Equation (5) can be written as
where E ¼ 0:5 P i E k P i f i is the total kink energy,
is the Rabi frequency, and Á ¼ kT is the thermal ratio. We use the above equation to arrive at the probabilities of observing (upon making a measurement) the system in each of the two states. Specifically, P ðX ¼ 1Þ ¼
ss Þ, where we made use of the fact that ss 00 þ ss 11 ¼ 1.
Layout Level Model of Cell Arrangements
To enable us to form macromodels of various cell arrangements, we need to represent the joint state probabilities of a collection of cells at the layout level. In this section, we summarize how this joint probability can be efficiently represented using Bayesian networks, as shown in [19] , [18] . We will use the majority logic arrangement of QCA cells in Fig. 2a to illustrate the process.
Each cell is represented by a random variable, taking on two possible values, shown in the Bayesian network in Fig. 2b . Each node in the network has a conditional probability table (CPT), capturing the probabilities of that node, given the states of the parent (cause) nodes. For example, the center node, X4, will be associated with the conditional probability P ðx4jx1; x2; x3Þ. The product of these CPTs determines the joint probability distribution over all the variables in the network. Thus, the joint probability is P ðx1; x2; x3; x4; x5Þ ¼ P ðx4jx1; x2; x3ÞP ðx5jx4; x3; x2; x3Þ:
The polarization of the output cell X5 is a function of the remaining four cells in the layout. The center node, X4, is actually the one which gets polarized based on the majority of inputs. The output cell depicted here receives the polarization of the central cell, X4, and also the three inputs, X1, X2, and X3. The interaction between the output cell and the central cell will be much more than the inputs. This is because the kink energy (which determines the amount of interaction between two neighboring cells) decays as the fifth power of distance.
For a given set of possible parent node assignments, the conditional probability values are computed using the Hartree-Fock approximation, applied locally. The parent states are constrained to be as specified in the required conditional probability. We fix the children states (or polarization) so as to maximize ¼ ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi ffi 
The steady state density matrix diagonal entries (6) with these children state assignments are used to decide upon the conditional probabilities in the Bayesian network (BN).
P ðX ¼ 0jpaðXÞÞ ¼ Note that, once the conditional probabilities between the nodes and its parents are obtained, the Bayesian Network is quantified completely. Some of the important parameters used in this model that effect the polarization of a cell apart from temperature are:
relative permitivity ¼ 12:9; radius of effect ¼ 4; cell dimension ¼ 20nm; cell to cell pitch ¼ 10nm;
CLOCK HIGH ¼ 6:1 Ã 10 À2 eV ;
and CLOCK LOW ¼ 1:9 Ã 10 À15 eV .
Macromodel
The basic circuit elements of a QCA circuit consist of typical logic elements, such as Majority, NAND, AND, OR, and NOT, and QCA specific elements, such as wires and crossbars. The macromodels of different circuit elements are the conditional probability of output cells given the values of the input cells. We compute this by marginalizing over the internal cells. The underlying premise of the macromodeling is that, if the joint probability distribution function P ðx 1 ; Á Á Á ; x n Þ over all the n cells in the layout is available, using the process outlined in Section 2.2, then we can always obtain the exact distribution over a subset of cells by marginalizing the probabilities over the rest of the variables. For instance, the joint probability over just three cells, x i , x j ; and x k , can be obtained by
Hence, at the circuit level, we do not represent all the m internal cells. Note that, at circuit level, we only represent P ðx i ; x j ; x k Þ and represent them with different variable Y , which essentially captures the input-output dependence but is faithful to the layout level quantum interaction since the macromodel is built by marginalizing the layout level cells. This marginalizing is achieved by conducting average likelihood inference [21] , [23] on the Bayesian network representation over all the cells in the macromodel unit. Note that (9) will yield different results at different temperatures and we store the conditional probabilities at various temperature points. Fig. 2d shows the thermal models for the majority gate in Fig. 2a . The macromodel probability distribution is defined over the output and the three input nodes. At a temperature of 1K, if the inputs are 0, 0, and 0, then the probability that the output node is at state 0 is "0.999963." As the temperature is increased, this probability decreases. We also notice that the thermal behavior is dependent on the input values. Note that, for correct operation, the probability of correct output should be greater than 0.5. In the rest of this section, we present results for other basic building blocks: clocked majority gate (Table 1a) , inverter (Table 1b) , line (Table 1c) , corner (Table 1d) , inverter chain (Table 1e ), even tap (Table 2a) , odd tap (Table 2b) , crossbar (Table 2c) , AND gate (Table 2d) , and OR gate (Table 2e) . For each macrocell, we show the QCA layout, layout-level Bayesian model, circuit-level inputoutput relation, and magnitude of polarization drop with temperature. All of the conditional probabilities are stored at various point of temperatures.
We make three important observations. First, a clocked majority gate, which is necessary to synchronize all the input signals reaching the majority gate, has weaker polarization at higher temperatures compared to the simple majority shown in Fig. 2d as the number of cells is higher in the clocked majority gate. Hence, if inputs to a majority gate arrive at the same time, then simple majority yields better polarizations at higher temperatures. Second, inverters have a larger drop of polarization over the odd-tap structure at higher temperatures. Third, the crossbar structure, which allows two signals to cross each other in a coplanar way, has a different drop for the two signals. Table 3 lists all the symbols used for macromodel design blocks that we have used in our designs. A macromodel library stores the input-output characteristics (output node probabilities for each input vector set) of each macromodel block based on temperature. That means that, for each temperature, we have a library of macromodel blocks listed in Table 3 . Once we know the logic components required to build a circuit, we simply extract the macromodel logic blocks and the required connectivity blocks (e.g., Line, Corner, Inverter Chain, etc.) from the library at a given temperature and use them to build the logic circuit. We form a Bayesian macromodel using the input-output probabilities of each block. The output from one macromodel block is fed to the input(s) of the next macromodel block. We illustrate the process using the full adder circuit, Adder-1, shown in Fig. 3a . It consists of five majority gates with no inverters. Fig. 3b shows the corresponding layoutlevel Bayesian network. We model the circuit-level QCA macromodel shown in Fig. 3c , which is the circuit level abstraction of Fig. 3a . The Bayesian macromodel is shown in Fig. 3d . Each signal (node) can either be a primary input, or an output cell of a macroblock like line, inverter, etc. The links are directed from the input to the output of each macroblock and are quantified by the device macromodels. Thus, we arrive at a directed acyclic graph easily from the circuit model in Fig. 3c .
Circuit Level Modeling
ERROR COMPUTATION
Apart from the computation of the polarization of each QCA cell or macromodel line, which we can arrive at by using average case propagation, another analysis of interest when comparing designs is the comparison of the least energy state configuration that results in correct output versus those that result in erroneous outputs. What is the probability of the minimum energy configuration that results in error at the output, x s , for a given input assignment, x 1 ; Á Á Á ; x r ? This can be arrived at by conditional maximum likelihood propagation. In essence, we compute arg max x 1 ;x 2 ;ÁÁÁ;x r P ðx rþ1 ; Á Á Á ; x N jx 1 ; Á Á Á ; x r ; x s Þ and the minimum energy configuration of all the cells t h a t g e n e r a t e s t h e e r r o n e o u s o u t p u t x s i s fx i , the ith cell is considered sensitive to error at output x s (also termed weak spots).
The above computational problem of the maximization of a product of probability functions can be factored as a product of the maximization over each probability functions; these maximizations can also be computed by local message passing [21] . The exact maximum likelihood inference scheme is based on local message passing on a tree structure, whose nodes are subsets (cliques) of random variables in the original DAG [23] . This tree of cliques is obtained from the initial DAG structure via a series of transformations that preserve the represented dependencies. The details of the inference scheme can be found in [19] . At this transformed point, we have a tree of cliques, where each clique is a subset of random variables. Two adjacent cliques that share a few common variables play a key role in inference. The joint probability of all the variables can be proven to be the product of individual clique probabilities. Since the problem of the maximization of a product of probability functions can be factored as the product of the maximization over each probability function, this maximization can also be computed by local message passing [23] . The overall message passing scheme involves the neighboring cliques using the maximum operator, where the clique probabilities are updated until the marginal probabilities of the shared variables are the same. This kind of maximum likelihood analysis can be conducted both at the layout and the circuit levels. In the next section, we will present results that show that the error modes of the circuit and layout levels match. That is, whenever Y j is sensitive to the first-excited error state for output Y s , the corresponding layout level model shows the set of fX i g that constituted the macroblock Y j is also sensitive. This is an extremely important finding that indicates that a weak spot in the design can be identified at the circuit level itself without obtaining the cell layout. Also, this is an important design metric and can be used to vet one design over and above the thermal profile of the output polarization.
RESULTS
We present results using the full adder design, which has been widely studied by others. We also use a multiplier design, which is a somewhat larger design. First, we will show that the ground state polarization probabilities of the output nodes as well as the intermediate nodes in the macromodel of the QCA logic circuit closely match with those obtained from a full layout-level implementation [18] at various temperatures. Second, we demonstrate that both the ground and the next excited (error) state configuration of the macromodel exactly match the corresponding configurations of the detailed layout cells for two full adders designs. Third, we use the circuit level implementation to vet between alternate design choices. We show examples of this design space exploration process with the example of two adders. Fig. 4 plots the polarization estimates at the layout and the circuit levels for various temperatures and for different inputs for the Adder-1 architecture shown in Fig. 3a (layout level) and Fig. 3c (circuit level) . Fig. 5a shows second adder architecture (Adder-2), consisting of three majority gates and two inverters [24] . Fig. 6 plots the polarization estimates at the layout and the circuit levels for various temperatures and for different inputs for the Adder-2. We see that the difference in the probability of a correct output node between the circuit and layout-level model designs is low for both the adders. We also see that, in both the layout and circuit-level designs, the probability of the output node is dependent on the input vector set.
Polarization
Similar trends are also seen for the 2 Â 2 multiplier circuit shown in Fig. 7 . The multiplier circuit is somewhat larger than the full adder circuit and consists of two AND gates and two half adders. We made use of a half adder similar to Adder-2 full adder design for the simple reason that it occupies less area. Figs. 8 and 9 represent the macromodel representation of the multiplier circuit and the Bayesian network for the macromodel design, respectively. The polarization of the output nodes in the multiplier layout is almost similar to that obtained at the outputs of multiplier circuit designed using the macromodel blocks. In Figs. 10 and 11 , we show the variation of output nodes C0, C1, C2, and C3 of the multiplier with respect to temperature for both layout and macromodel design.
Error Modes
We compute the near-ground state configurations that result in errors in the output carry bit C out of the QCA full adders (Adder-1 and Adder-2) using both the layout and circuit level models. These are shown in Figs. 12 and 13 and Figs. 14 and 15. We show four cases, for input vectors (0, 0, 0), (1, 0, 0), (0, 1, 0), and (1, 1, 1) . The other four input vector sets will have similar results due to symmetry in design. We have marked the nodes in both the layout and circuit-level Bayesian networks to point to the components that are weak (high error probabilities) in both the layout and circuit levels. We can easily see that the nodes with high error probabilities in the QCA layout are the ones that are clustered to form an erroneous node in the macromodel circuit design. In other words, if a node (a macromodel block) in macromodel circuit layout is highly error prone for a given input set, then some or all of the QCA cells forming that macromodel block are highly prone to error. This indicates that a weak spot in the design can be identified early in the design process, at the circuit level itself.
Design Space Exploration
We show that, even at the macromodel circuit level, we have the ability to explore the design space with respect to different criteria. In addition to obvious criteria such as gate count, we can use polarization as a design metric. The probabilistic macromodel allows us very fast estimates of polarization that correlate very well with layout level estimates. As an example, we use the two adders in Fig. 3a and Fig. 4a . The two adders shown here have been designed using different macromodel blocks, occupying different design areas. The outputs of the Adder-1 circuit are given by
A; B; C in Þ; mðA; B; " C in Þ; mðA; " B; C in ÞÞ; C out ¼ mðA; B; C in Þ; ð10Þ where mðA; B; C in Þ is the majority gate containing A, B, and C in as inputs. Similarly, for the Adder-2 circuit, the outputs are given by [24] :
C out ; C in ; mðA; B; " C in ÞÞ;
We see that the Adder-1 circuit uses five majority gates and three inverters for implementation, while the Adder-2 circuit uses three majority gates and two inverters. Hence, the design circuit design of Adder-2 is certainly superior to Adder-1 in terms of area. However, as can be seen from the thermal study, inverter has one of the worst polarization drops with respect to temperature and inverters in the series path will reduce the overall polarization to a great extent. Hence, for larger circuits, a design criteria might look at Adder-1 in a different light.
Note that, in the context of the error modes presented earlier, we saw that Adder-1 again shows less error-prone nodes than Adder-2 ( Fig. 12 shows error-prone nodes for the first-excited state at carry output) for most likely errors in the outputs. Note that, ideally, this conclusion requires the detailed layout; however, maximum-likelihood propagation of the circuit-level Bayesian Network yields the same error modes as the detailed layout. This measure indicates that the cost of addition error correction required for Adder-2 would be more than that of Adder-1.
Last but not least, we observe that an odd tap shown in Section 2 is a good target for one inverter as the polarization loss is less than an inverter and an even tap works better than an even number of inverter chains. The multiplier design that we show utilizes these facts to arrive at a better design with respect to output polarization and this, in turn, improves the multiplier's thermal characteristics.
Computational Advantage
To quantify the computational advantage of a circuit-level macromodel with a layout-level model, we consider the complexity of the inference based on the Bayesian net models for each of them. As we mentioned earlier, in the cluster-based inference scheme, the Bayesian Network is converted into a junction tree of cliques and the probabilistic inference is performed on the junction tree by local computation between the neighboring cliques of the junction tree by local message passing [21] , [25] . The space complexity of Bayesian inference is Oðn:2 jC max j Þ, where n is the number of variables and jC max j is the number of variables in the largest clique. Time complexity is Oðp:2 jC max j Þ, where p is the number of cliques in the junction tree. We tabulate the complexity terms for the two adder designs in Table 4 , along with the corresponding values for n, p, and jC max j. We can see that the macromodel is an order of magnitude faster, especially due to the reduction in jC max j, which would be important in synthesizing larger networks of QCA cells. Another observation is that Adder 2 is less expensive in terms of computation even though polarization drops are greater due to the presence of inverters.
As we can see from the Table 5 , the simulation time required to evaluate a circuit is orders of magnitude lower than that in the QCADesigner tool. Moreover, we see that the simulation timing for Bayesian macromodels of the adder circuit is much lower than for the Bayesian full layout model. The graphs depicted in Fig. 5, Fig. 6, Fig. 10 , and Fig. 11 present the crux of this work. The drooping characteristic of output node polarization with a rise in temperature is a universally known fact. What we have shown in this work (as depicted in these graphs) is that the polarization of the output node in our macromodel design shows the same drooping characteristics and is almost the same as that of the full layout. We can see that the macromodel is orders of magnitude faster, especially due to the reduction in jC max j, which would be important in synthesizing larger networks of QCA cells. Another observation is that Adder 2 is less expensive in terms of computation even though polarization drops are greater due to the presence of inverters.
CONCLUSION
We proposed an efficient Bayesian Network-based probabilistic macromodeling strategy for the QCA circuit that can estimate cell polarizations, ground state probability, and lowest-energy error state probability, without the need for computationally expensive quantum-mechanical computations. We showed that the polarization estimates at the layout and circuit levels are in good agreement. In our previous work [19] , we had validated the layout level Bayesian network model with quantum-mechanical simulation Hartree-Fock, Self-Consistent Analysis (HF-SCA)-based estimates (see Fig. 16 ). In this work, we illustrate our macromodeling idea using two full adder macro model design implementations and a somewhat larger QCA design of a 2 Â 2 Multiplier implementation. We found that both the polarization and the error mode estimates at the circuit level match those at the layout level.
The Bayesian macromodel should be useful for vetting QCA circuit designs at higher levels of abstraction in terms of not only the ground state, but also polarization, thermal dependence, and error modes. The contributions of this work can be broadly classified to be in the area of "Design for low Error" that considers error-tolerant circuit synthesis, taking into account circuit overhead considerations. The developed models in this work can be used to selectively identify weak components in a design early in the design process. It would then be possible to reinforce those weak spots in the design using reliability enhancing strategies. The error modes can also be used to compare multiple designs early on in the process. One possible future direction of this work involves the extension of the BN model to handle sequential logic. This is possible using an extension called the dynamic Bayesian networks, which have been used to model switching in CMOS sequential logic [26] .
Sanjukta Bhanja received the BE, MSc (eng.), and PhD degrees from Jadavpur University, the Indian Institute of Science, Bangalore, and the University of South Florida, respectively. She has been an assistant professor in the Department of Electrical Engineering at the University of South Florida since 2002. Her research interests include design automation, low power VLSI design, error and reliability-centric statistical modeling and CAD, graphical probabilistic belief models, nano-CAD tools modeling quantum-mechanical behaviors for emerging technologies, and nano-computing. She is a member of the IEEE.
. For more information on this or any other computing topic, please visit our Digital Library at www.computer.org/publications/dlib.
