Compact realizations of reversible logic functions are of interest in the design of quantum computers. In this paper we present an exact synthesis algorithm, based on Boolean Satisfiability (SAT), that finds the minimal elementary quantum gate realization for a given reversible function. Since these gates work in terms of qubits, a multi-valued encoding is proposed. Don't care conditions appear naturally in many reversible functions. Constant inputs are often required when a function is embedded into a reversible one. The proposed algorithm takes full advantage of don't care conditions and automatically sets the constant inputs to their optimal values. The effectiveness of the algorithm is shown on a set of benchmark functions.
Introduction
Research in quantum circuit synthesis is motivated by the growing interest in quantum computation [17] . Synthesis for quantum circuits can be approached from two different angles. Reversible logic circuits can be designed with generalized Toffoli gates [20] , that are later mapped into quantum gates. The alternative is to target elementary quantum gates directly during the synthesis process.
The minimization of generalized Toffoli networks has been studied extensively. It is possible to construct optimal circuits for all 2 3 ! = 40320 three-input functions [18] and store the resulting circuits in a table. Once such a table has been constructed, the optimal circuit for any three-input functions can be found in constant time. For the synthesis of larger reversible functions methods based on Boolean Satisfiability (SAT) [7, 22] and Quantified Boolean Formula (QBF) Satisfiability [24] have been proposed. These methods guarantee to find an optimal solution. However, too complex functions have to be synthesized with heuristic methods (see e.g. [12] ). Once the circuit is obtained, the generalized Toffoli gates are then translated into elementary quantum gates [2] . During the linear translation the interaction between the gates is ignored, and the resulting circuit may be suboptimal. Techniques such as template matching [14] can be used to reduce the gate count in the resulting quantum circuit.
In order to avoid the mapping of Toffoli gates into quantum gates, elementary quantum gates may be targeted directly. For this case, heuristic as well as exact methods have been proposed. While the former ones (e.g. [1] ) can be employed with no guarantee of minimality the exact approaches are limited to small functions. The results of a breath-first search that determines all optimal quantum circuits with three input variables are described in [13] . Another exact approach uses symbolic reachability analysis to find optimal quantum circuits [9] .
In this paper we present an exact synthesis algorithm based on Boolean Satisfiability (SAT). The synthesis problem is expressed as a Boolean function such that it is satisfiable iff there exists a network of quantum gates with a given depth. Thereby a multi-valued encoding is used for representing the respective qubits used by the quantum gates. Our method can be applied to synthesize completely specified function, as well as those with don't care conditions. All quantum circuits are by necessity reversible. To realize a non-reversible function it must be embedded into a reversible one. This results in garbage outputs and constant inputs. We refer to [11] for a detailed treatment on garbage in reversible networks. Garbage outputs are by definition don't cares. Additionally, when there are constant inputs, these can be set arbitrarily. The proposed synthesis procedure assigns the optimal values to the constant inputs during this process. Our implementations shows significant speedups over the algorithm described in [9] , i.e. in the best case quantum circuits are synthesized up to a factor of 45 faster.
The remaining paper is structured as follows. Section 2 describes the elementary quantum gates used in our target networks and briefly reviews SAT. The exact synthesis algorithm is presented in Section 3. Section 4 introduces the improvements for incompletely specified functions. Experimental results are discussed in Section 5. Finally, the paper is summarized and conclusions are given.
Background

Elementary Quantum Gates
The basic building block for a quantum computer is the qubit. A qubit is a two level quantum system, described by The state of a quantum system with n > 1 qubits is given by an element of the tensor product of the single state spaces and can be represented as a normalized vector of length 2 n , called the state vector. The state vector is changed through multiplication of appropriate 2 n × 2 n unitary matrices [2] . Elementary gates are the building blocks for quantum computation. The following elementary gates are used in this paper (the notation for the gates with the corresponding matrices are shown in Figure 1 ):
• Inverter (NOT): A single qubit is inverted.
• Controlled inverter (CNOT): The target qubit is inverted if the control qubit is 1.
• Controlled V gate: The V operation is also know as the square root of NOT, since two consecutive V operation are equivalent to an inversion.
• Controlled V+ gate: The V+ gate performs the inverse operation of the V gate, i.e. V + ≡ V −1 .
All elementary gates are assumed to have unit cost [2] . Additionally, when a CNOT and a V (or V+) gate are applied to the same two qubits, the cost of the pair can be considered as a unit as well [19] . The possible pairs (also known as double gates) are shown in Figure 2 . Using this cost metric allows us a direct comparison to [9] . Note that our approach is not limited to these metrics -other metrics can be applied as well. In this paper, the input to the circuit and each control line of a gate is restricted to 0 and 1. This has the effect that the value of each qubit is restricted to one of {0, 1, V 0 , V 1 }. That is we are dealing with 4-valued logic [9] , where V 0 = 
Boolean Satisfiability
The Boolean Satisfiability problem (SAT problem) is defined as follows:
Definition 1 Let h be a Boolean function in Conjunctive Normal Form (CNF), i.e. a product-of-sum representation. Then the SAT problem is to determine an assignment to the variables of h such that h evaluates to true or to prove that no such assignment exists.
The CNF is a conjunction of clauses. A clause is a disjunction of literals and each literal is a propositional variable or its negation.
In the past several algorithms (so called SAT solvers) were proposed [3, 10, 16, 6] . Most of them are based on three essential procedures: (1) The decision heuristic assigns values to free variables, (2) the propagation procedure determines implications due to the last assignment(s) and (3) the conflict analysis tries to resolve conflicts by backtracking. Advanced techniques like e.g. efficient Boolean constraint propagation [16] or conflict analysis [10] are common in state-of-the-art SAT solvers. Today, SAT is not only used in formal verification, but in many application domains like automatic test pattern generation or logic synthesis.
Exact Synthesis Approach
In this section the exact synthesis algorithm is presented. The basic idea is to check if there exists a quantum gate network representation for the reversible function f with d gates, where d is increased in the next iteration if no realization is found. The iterative checks are performed by (1) encoding the synthesis problem as a SAT instance and (2) testing the SAT instance for satisfiability using an SAT solver. In the following the Boolean encoding of the synthesis problem is presented.
Boolean Encoding
For a reversible function the synthesis problem is expressed as a Boolean function S d that is satisfiable iff there exists a quantum gate network representation of size d. Therefore, the following definitions are given: 
That is, if z k ij is 0 the Boolean domain is considered, otherwise the non-Boolean quantum states V 0 and V 1 are selected.
) defines a Boolean vector representing the chosen quantum gate at depth k. Here g denotes the number of different quantum gates in n variables.
Lemma 1 For a reversible function with n variables there exists 3n(n − 1) + n different elementary quantum gates.
Proof 1 A CNOT, V and V+ gate has exactly one target line and one control line. With n variables there are n(n − 1) possible choices for the target-control pair. Additionally there are n possible NOT gates (one for each line). In total we get 3n(n − 1) + n different elementary quantum gates.
Lemma 2 For a reversible function with n variables there exists 4n(n − 1) double gates.
The proof is similar to Lemma 1. In the following elementary quantum gates and double gates are called quantum gates. Thus, there are 7n(n − 1) + n quantum gates that have be considered at each depth.
The application of the definitions and lemmas are illustrated with the following example. Figure 3 shows the variables needed to formulate the constraints for a function with n = 3 inputs/outputs and depth d = 2. The possible positions for the quantum gates are marked with dashed rectangles. For each of the 2 3 = 8 lines in the truth table n = 3 lines in the network with the respective vectors for input, temporary, and output variables are used (i.e. overall 3 · 8 = 24 lines are considered). As mentioned above the multi-valued variables of the vector x 2. The functional constraints for the possible quantum gates that are chosen by an assignment to w k are:
Example 1
These constraints state that if in line i at depth k a concrete quantum gate is selected (i.e. w k is completely assigned), the variables in the next depth k + 1 are computed from the variables at depth k with the quantum gate function q( x k i , w k ). The function q( x k i , w k ) covers the functionality of a quantum gate represented by w k .
3. Illegal assignments to w k are excluded in this constraint, since not all values of w k are legal to enumerate all possible quantum gates:
We now have a Boolean formulation for the synthesis problem which uses multi-valued logic for encoding the quantum signals. It is well known that with the introduction of new variables the CNF form for any Boolean formula can be produced in time and space linear in the size of the original Boolean formula [21] . Therefore, we first define methods for the simple logic functions like AND, OR, etc. that generate the corresponding clauses. Then, we extended this scheme for more complex logic. This allows a mapping of our formulation including the quantum gate function q( x k i , w k ) to CNF. Furthermore, the assignments of the input/output constraints are applied by using unit clauses. Illegal assignments to w k are expressed by explicitly enumerating all values that are not allowed in form of a blocking clause [15] .
This results in a CNF representing the synthesis problem of a reversible function f with d quantum gates that can be given to a SAT solver. The instance is satisfiable iff there exists a network realization for f with exactly d gates. In this case, the resulting quantum gate circuit can be extracted from the assignments to w k for each depth.
Incompletely Specified Functions
It is well known that many practical logic functions contain don't care conditions. That is, the output for certain input combinations is irrelevant. Furthermore, in order to make a function reversible, it is often necessary to add constant inputs and garbage outputs [11] . Garbage outputs are by definition don't cares. In our SAT based procedure the don't care outputs can be left unspecified (the reversibility is still guaranteed since only reversible gates are part of the solution). The constants have to be assigned with a fixed value.
However, to synthesize a quantum gate circuit with minimal costs for a function with constant inputs, both values 0 and 1 for each constant input have to be checked. For example, while a minimal quantum gate circuit is found for a half adder at depth 5 when the constant input of this function is set to 1, a cheaper network representation with costs 4 can be found if the constant is set to 0 (as shown later in the experiments). For synthesizing the function Decod24 [8] we can chose the values for the constant inputs from {00, 01, 10, 11}, i.e. more than one constant input is needed and thus four independent runs of the synthesis algorithm.
In this section we present two improvements for the initial exact synthesis formulation of functions containing garbage outputs and constant inputs. First we show, how a SAT instance representing the synthesis problem can be reduced by removing constraints that only define garbage outputs. Then, a modification of the encoding is introduced, that includes all possible combinations of constant values and thus allows to find a minimal quantum gate circuit by solving only one instance.
Removal of Constrains
As explained in Section 3 constraints for each line i encode the input/output specification and the quantum gate functionality. The latter constraints ensure, that based on the selected gates (by assignments to w k for each depth k) the inputs of each truth table line are mapped to the corresponding output. However, if the right side of a truth table line only consists of garbage outputs (i.e. don't cares), then all constraints created for this line can be removed.
Table 1. Embedding of function
Example 2 Consider the function f = a · b that was transformed into a reversible one by adding a constant input and two garbage outputs as shown in By removing the useless constraints the number of clauses is significantly reduced. Thus, a speed-up of the overall synthesis time can be expected.
Constraining Constant Inputs
If constant inputs are used, different options exist on how to assign these constants. In Example 2 the constant input was assigned to 0. But the resulting quantum network is not necessarily minimal. A smaller network may be possible if the constant input is set to 1. Thus, both values for each constant have to be checked. For example as already mentioned for function Decod24 we can chose the values for the constant inputs from {00, 01, 10, 11}. All these combinations are encoded as single instances and separately solved by the SAT solver. Our new approach uses an encoding that only needs one instance for each depth. Here, all variables in x 0 i , that represent a constant input will not be initially set to 0 or 1 by the input/output constraints. Instead constraints are added, which ensure that all variables representing the same constant are equal. This way, the SAT solver determines the values for the constants. Since the considered constant inputs are now modeled symbolically (the value of each constant input is not fixed to 0 or 1) only 2 n−c truth table lines have to be considered (where c is the number of constants). In total the number of SAT instances can be reduced significantly and thus speed-ups in the overall synthesis time are achieved as shown in the experiments. 
Experimental Results
We implemented the presented synthesis algorithm in C++. As a SAT solver we use MiniSat v2 [6] including simplification [5] . Unless mentioned otherwise, experiments have been carried out on an AMD Athlon 3500+ with 1 GB of main memory. All benchmarks have been taken from [23] .
Incompletely Specified Functions
We tested our approach for a set of benchmarks that are embedded in reversible functions. The results are shown in Table 2 . For each function there are 2 c entries, where c is the number of constant inputs. Each line below the function name corresponds to one assignment of the constant inputs. Columns labeled D show the number of gates in the optimal circuit and columns labeled TIME list the corresponding run-time in CPU seconds. Removal of constrains as introduced in Section 4.1 lead to better run-times. In the case of Decod24, the run-time is improved by more than a factor of two. Only for the full adder more run-time is required. This can be explained by a different decision order of the SAT solver due to the structure of the respective instance.
However, the combination of both improvements, i.e. removing useless constraints as well as constraining the con- stant inputs, offers a significant speed-up for all examples.
The reduction of run-times is between 70% and 95%. Reductions are more substantial if at least one of the assignments has a solution with more gates than the optimal constant input assignment. This can be observed for all functions in Table 2 except Decod24. It should be noted that the constraining of constant input variables requires some computation time (i.e. run-times may be higher than those for solving the function with a fixed constant input assignment). However, this overhead is easily compensated by the fact that only one instance needs to be solved.
Effect of Double Gates
Double gates (as shown in Figure 2 ) may not be available at unit cost in the target implementation [2] . In fact, most minimization methods (for example, [1, 14] ) consider the cost of a double gate to be two, since they are composed of two elementary gates. To the best of our knowledge [9] describes the only minimization procedure that assigns a single cost unit to double gates. Hence, there are compelling reasons to consider minimizations that rely only on elementary gates.
In one experiment we ignored double gates. This reduces the number of possible gates at each level from 7n(n−1)+n to 3n(n − 1) + n (according to Lemma 1 and 2). The results are summarized in Table 3 . In the first column the name of the function is given. The next columns provide the depth and the run-time in CPU seconds for the elementary gates and the quantum gates (i.e. elementary gates and double gates), respectively.
In general, it is expected that more choices of possible gates at each level will increase the time to find a correct solution. This is clearly seen for the benchmark functions where the inclusion of double gates offers no advantage (i.e. both values for D are the same). For example, the run-time for graycode6 increases by one order of magni- 
Comparison with Previous Work
To compare our approach with the results obtained in [9] we used a 733 MHz Pentium III with 512 MB of main memory that is significantly slower than a 850 MHz Pentium III processor, the system used there. The outcome is shown in Table 4 . RA denotes the run-times of the reachability analysis obtained from [9] and SAT the run-times of our approach in CPU seconds (using both elementary and double gates). IMPR gives the run-time improvement, i.e. the run-time of RA divided by the run-time of SAT.
We were able to synthesize all functions from [9] that were considered there for exact synthesis in significant shorter run-time -even on a slower processor. Improvements of at least a factor of 2 have been achieved. In the best case an improvement of a factor of 45 is observed.
Furthermore, we compared the synthesized results for the functions q4-example, Peres-double and Toffoli-double. Here, the authors of [9] constrained the search space for these functions. In fact they restrict the target-line of the V and V+ gates to be the forth line. Therefore they cannot guarantee optimal solutions.
The comparison in Table 5 shows, that our approach is able to find the optimal results for these functions with a low run-time increase (the results of the heuristic approach of [9] are denoted by RA heu ). Thereby, we prove that for Peres-double and Toffoli-double the minimal quantum gate networks have been found in [9] . Additionally, in case of q4-example our approach synthesizes an optimal quantum gate representation with costs 5 instead of the sub-optimal circuit of size 6 obtained in [9] . Note, again all these benchmarks are carried out on a slower system than the one used in [9] . For absolute run-times on a fast computer see the rightmost column of Table 3 .
Conclusions
In this paper we presented an exact synthesis algorithm that finds the minimal realization for a given reversible function. We have demonstrated our synthesis algorithm on a set of benchmarks. The resulting circuits consist of elementary quantum gates. We presented significant improve- ments of our approach for incompletely specified functions. Furthermore, the algorithm can be modified to accommodate different gate libraries. We have shown this by including/excluding double gates. As in case of optimal logic synthesis of Boolean functions [4] , large libraries have a negative effect on the run-time of the synthesis procedure, but may lead to smaller circuits. In comparison to the previous exact synthesis method, in the best case we synthesized the functions up to a factor of 45 faster.
