Abstract-Embedded implementations of cryptographic algorithms require countermeasures against side-channel attacks (SCAs), that exploit physical variables measured during the computation. These countermeasures increase cost, power consumption and latency of the device. One class of countermeasures, hiding, consists of a balanced circuit style, including balancing of the capacitances and delays; it requires full connection to avoid memory effect that is an effect caused by repeatedly recharged energy after being only partially discharged at the internal parasitic capacitance. This paper proposes binary decision diagrams (BDDs) to derive complex pull-down networks that fulfill all these requirements while being compact at the same time; it uses sense amplifier-based logic (SABL) to obtain well-balanced pre-charge circuits. An attack based on mutual information analysis (MIA) is applied to the AES S-boxes implemented in our novel secure logic style. After the evaluation at pre-layout SPICE level, the balanced circuit with BDD leaks less information than comparable logic styles, even though the implementation area is reduced by 40.6%, the power consumption up to 46.1% and the delay by 35.2% compared to the classic SABL approach.
I. INTRODUCTION With current advances in technology, we face diverse and unprecedented security threats. New vulnerabilities in electronic devices have come to light. Such devices are often exposed to security threats even when mathematically strong algorithms are implemented. In particular, side-channel attacks (SCAs), which is a type of physical attack, use physical leakage information (e.g., power, time or electromagnetic signals) while a target device is running cryptographic algorithm. The protection of devices against SCAs can only achieved using carefully designed countermeasures.
Ever since power analysis as proposed by Kocher et al.
[1], a broad range of SCA techniques have been introduced along with a set of countermeasures, that include masking and hiding techniques. While masking randomizes data during vulnerable points of the computation, hiding eliminates the dependency of side-channel leaks on sensitive intermediate data.
One of the most powerful and efficient hiding methods for hardware implementations is the use of a secure logic style: this is a fundamental solution, as it suppresses sidechannel information at the lowest level. The SABL style [2] is one of the more well-known secure logic styles based on the dual-rail pre-charge (DRP) logic style. Dynamic current mode logic (DyCML) [3] is another typical secure logic style from among the current mode logic (CML) styles. These secure logic styles transform basic gates such as the XOR gate to a specific instantiation. Compared to standard cell designs, there is a substantial increase in implementation area and power consumption, and above all a large increase in latency. This paper proposes a new simple balanced (SB) method, which is secure and fully connected. The proposed SB method offers a protection to SCAs comparable to other secure logic styles, but at a reduce cost in area, power, and latency: compared to SABL, the area is reduced by 40.6%, the power by 46.1%, and the latency by 35.2%.
The main principle of the proposed SB method is to achieve a balanced capacitance load for every input transition. The only source of unbalancedness in our solution are process variations, which we consider outside the scope of this paper. We achieve a balanced capacitance load by having the same logic depth for each path of the binary decision diagram (BDD) [4] after circuit optimization. Our algorithm examines every logic path of the BDD; if a path is identified that is not balanced (i.e. has an unbalanced logic depth), we insert dummy path transistors with the same input and a complementary input. After this transformation, every logic path has the same number of NMOS transistors stacked against each input transition. The resulting BDD has the property that all internal nodes are automatically connected to the output nodes, which removes the memory effect: indeed, the charged energy is not consumed in the internal parasitic capacitances during the evaluation phase and those capacitances are recharged during the succeeding pre-charge phase. Although Tiri et al. [5] proposed in 2005 a fully connected method for a differential pull-down network (DPDN), in their solution it was difficult to manipulate complex Boolean functions.
In order to test our hypothesis, we design AES S-boxes using the proposed SB method and other secure design methods. We compared these AES S-boxes using Mutual Information Analysis (MIA) [6] . The conclusion is that our novel logic style provides an effective countermeasure against SCAs that is more efficient than earlier work.
II. PREVIOUS SECURE LOGIC DESIGN METHODS

A. Secure Logic Styles with custom basic gates
Secure logic styles with custom basic gates provide a solution to eliminate the core information leakage of sensitive data at the transistor level. They transform basic gates (e.g., XOR) in a way that the power consumption during gate switches from different input transitions is constant. SABL, one of the best-known secure logic styles, has fully connected internal nodes and a discharge phase in a clock cycle. When the SABL circuit is pre-charged, the output nodes store the same amount of energy due to these properties. In a second phase, the charged energy at the capacitance load is consumed through each rail of the circuit with SABL. Therefore, the power consumption per clock cycle is the same regardless of the input values. Another secure logic style, DyCML, uses a current mode logic style, such that it has a small swing during a clock cycle. This small change makes it very difficult to distinguish the intermediate values for different input values. Therefore, a power analysis attack against SABL or DyCML requires far more power traces than against an implementation based on DRP style logic. However, secure logic styles convert basic gates into custom gates, which increases the area and latency. Such means restrict the use of the custom basic gates with the secure logic styles.
B. Previous fully connected methods
Memory effect is an effect that still remaining energy at the parasitic capacitances after being discharged is recharged during the pre-charge phase. This effect induces unbalanced power consumption, even if the circuit has the balanced configuration. Consequently, the information leakage depending on input values is increased. In order to remove the memory effect, Tiri et al.
[5] suggested a fully connected DPDN method, which connects each internal node to the output nodes. When the fully connected node is in an evaluation phase, the nodes eliminate all the charged energy both at the internal and output nodes. This characteristic renders the power consumption constant, independent of input transitions. Despite plenty of advantages as mentioned above, this method requires the transformation of a Boolean function before circuit configuration. Such a transformation is very cumbersome if the targeted Boolean function is complex and has many input variables. In addition, it only considers AND and OR operations; another, more complex, transformation is required for XOR gates, which are very common, further increasing the complexity. In addition, a final step is required to check that full connection is achieved. As an advanced method, Tiri et al. also propose a fully balanced circuit using pass transistors. It adds pass transistors with a different input value of the NMOS transistor in the unbalanced logic path. If the number of input variables to the Boolean function is increased, the number of pass transistors has to be increased in order to create a balanced logic path. This further increases the complexity, which is in particular a problem for cryptographic algorithms that are typically based on rather complex Boolean functions.
C. Previous BDD-based secure design methods
Earlier work on BDD-based secure logic to achieve balanced circuits can be divided into two types: solutions that perform manipulations at the transistor level and solutions based on standard cell logic. The methods described in [7] and [8] are of the second type. They use additional basic gates, hence the increase in area is comparable to the secure logic styles that transform basic gates as discussed in Section. II.A. Another secure logic style [9] uses complementary pass transistors for dynamic operation. However, it also converts basic gates using a special logic style. Similarly, it increases implementation area compared to designs with a compacted NMOS tree. De et al. [10] proposed another BDD-based secure logic style with a compacted NMOS tree. It uses additional pre-charge configuration at the leaf nodes of the BDD and a one-rail pullup network makes unbalanced charged energy at the output nodes results in unbalanced power consumption. The precharge part connected with leaf node of BDD increases with the complexity of the target Boolean function.
III. SECURITY EVALUATION CRITERIA
Many security evaluation methods have been proposed, including normalized energy deviation (NED), normalized standard deviation (NSD), and maximum current variation (MCV).
A. NED, NSD, and MCV
These three methods compare the variation of power consumption, maximum current, and standard deviation of energy per cycle depending on input transitions. While these criteria provide a good starting point, they are insufficient to evaluate the quality of countermeasures against SCAs as they fail to consider the worst case scenario (for the defender): these criteria assume that the target device operates in a specific environment under fixed measuring conditions. However, in practice operating environments can vary broadly from RFID chips to smart cards and USNs; these devices have different operating frequencies, operating power, and sensitivity from noise. All these elements strongly influence the evaluation criteria. Moreover, if unsophisticated attackers can succeed with low quality measuring devices under noisy conditions, there is no doubt that powerful attackers with highly sensitive measurement equipment will be even more successful. Attacks becomes also more effective when more measurement data is accumulated, even if the variation of the data is small. In order to consider attacks that consider such worst case scenarios for operating environments and measuring conditions, Mutual Information Analysis (MIA) has been proposed.
B. MIA
MIA performs information theoretic analysis of the leaked information, which is independent of operating environments and measuring conditions, because it only uses entropy of the implementation. In the following we assume that both power trace and noise n q , with q the number of samples (power traces), follow a Gaussian distribution. Denote with H[S g ] the entropy of the key class S g before applying a SCA, where S g and s g are the correct target signals of the key value and particular key candidates respectively. In addition, E is the expected value of the key candidates:
Denote with d q sg the deterministic value of intermediate leakage. The leaked information L q sg , which is a random vector that contains a correct key class s g , can be expressed as follows:
By combining equations (1) and (2) one can compute the conditional entropy for quantifying information leakage:
Finally, the mutual information between the trace and the leakage is defined as follows:
According to the information theoretic method, we can compare the amount of information leakage between several logic styles under a Gaussian leakage model. In addition, MIA uses the maximum likelihood function:
It chooses a time sample that maximizes the perceived amount of information. The subsequent analysis evaluates and exploits the information leakage using an information theoretic metric at a different noise level. The mutual information (4) indicates the strength of the hardware countermeasures, dependent of operating environments and measuring conditions; in other words, this information theoretic metric is uncorrelated with a particular implementation. This makes MIA a suitable method to compare secure logic styles.
IV. OUR PROPOSED DESIGN METHOD
A secure logic style contains two important components: the differential pull-up network (DPUN) and the differential pull-down network (DPDN); both have a strong influence on the security of a logic style against SCAs. During the operation of security logic styles, the output node capacitance is charged by energy dissipation through the pull-up network that mostly consists of PMOS transistors that perform the pre-charge from the voltage source. Therefore, if different amounts of energy are stored in two output node capacitances, the amount of discharge energy also differs. Hence, the DPUN should store in advance the same amount of energy at the differential output nodes. In addition, during the evaluation phase, the energy which is charged in the output node capacitance dissipates through a logic path of DPDN depending on input transitions. To consume constant power through DPDN, it should have balanced internal capacitances and a balanced NMOS transistor configuration. This paper proposes a novel efficient design method for both components to improve a secure logic style. Our method uses SABL for the DPUN to charge the balanced energy at the differential output nodes. A balanced DPDN is achieved using a BDD: this results in a compact DPDN reducing the overall area and it alleviates the delay problem of earlier SABL circuits. Furthermore, the BDD method produces a fully connected configuration and removes the memory effect of the internal nodes.
A. BDD
Binary Decision Diagrams (BDDs) have made a substantial impact on digital hardware design, as they allow for efficient manipulation of large Boolean formulae. BDDs have two remarkable properties. First, an ordered and reduced BDD has a canonical form. Therefore, two correct implementations of the same Boolean function have identical BDDs. Second, BDDs are very effective in representing complex Boolean functions that have a large number of inputs and complicated combinational logic circuits. These properties are extremely useful in efficient circuit design optimization, testing, and A BDD represents the Boolean function as a rooted and directed acyclic graph. For example, Figure 1 represents the function f = (x1 + x2) · x3, defined by the truth table on the left. On the right side, a graphical representation with a tree structure is shown. According to the bottom-up approach of the graph, each terminal vertex (leaf nodes at the bottom) is labeled 0 or 1. Above them to the top node (root node), each non-terminal vertex v is labeled by a variable var(v) and has arcs directed towards two children vertices that are expressed with a dashed line lo(v) and solid line hi(v). Dashed lines are assigned to 0 and the other lines to 1 for the corresponding input value of the line. The graph has a different configuration depending on the ordering of the variables. It suggests that a BDD is not unique for a given Boolean function. However, the ordered BDD (OBDD) for a given variable order is unique. In Figure 1 , due to the way branches are ordered, the terminal vertex has a value matching the truth table. The right-most column of the truth table matches the value of the terminal vertex of the graph from left to right. The variable ordering follows from top to bottom in the BDD, as shown in Figure 1 . However, in the truth table it follows from left to right. To be specific, there are two important rules for BDDs, as described below.
1) Variable ordering:
An OBDD basically needs a parent vertex u and a non-terminal child vertex v with input variables. The vertices u and v are connected to each other in a way that var (u) and var(v) are in the order being named. In the decision tree on the right side of Figure 1 , for instance, the variables are ordered as x2, x1, and x3. The variable ordering is selected in view of the purpose of the BDD. However, choosing a right variable ordering is essential for the efficient manipulation of complex Boolean functions. Furthermore, a compact implementation can be achieved by selecting a good variable ordering.
2) Reduction rule: Currently, a BDD is often referred to as a reduced ordered BDD (ROBDD). This is because most compacted implementations consist of configurations with an ROBDD. The transformations follow two steps of merging and elimination.
• Step 1. Merging: First, duplicate terminals are removed, merging all equivalence leaves. Afterwards, redirects all arcs are redirected to the removed nonterminal vertices connected to the remaining ones. 
