We consider the relationship between size and depth for layered Boolean circuits and synchronous circuits. We show that every layered Boolean circuit of size s can be simulated by a layered Boolean circuit of depth O( √ s log s). For synchronous circuits of size s, we obtain simulations of depth O( √ s). The best known result so far was by Paterson and Valiant [17], and Dymond and Tompa [6] , which holds for general Boolean circuits and states that D(f ) = O(C(f )/ log C(f )), where C(f ) and D(f ) are the minimum size and depth, respectively, of Boolean circuits computing f . The proof of our main result uses an adaptive strategy based on the two-person pebble game introduced by Dymond and Tompa [6] . Improving any of our results by polylog factors would immediately improve the bounds for general circuits.
Introduction
In this paper, we study the relationship between the size and depth of fan-in 2 Boolean circuits over the basis {∨, ∧, ¬}. Given a Boolean circuit C, the size of C is the number of gates in C, and the depth of C is the length of the longest path from any input to the output. We will use the following notation for complexity classes. DT IM E(t(n)) and SP ACE(s(n)) are the classes of languages decidable by deterministic multi-tape Turing machines in time O(t(n)) and space O(s(n)), respectively. Given a Boolean function f : {0, 1} n → {0, 1}, define C(f ) to be the smallest size of any circuit over {∨, ∧, ¬} computing f , and define D(f ) to be the smallest depth of any circuit over {∨, ∧, ¬} computing f . Note that C(f ) and D(f ) are not necessarily achieved by the same circuit.
Pippenger and Fischer [18] showed that for t(n) ≥ n, DT IM E(t(n)) can be simulated by logspace uniform families of circuits of size O(t(n) log t(n)).
Email addresses: panni@cs.utexas.edu (Anna Gál), keith@cs.utexas.edu (Jing-Tang Jang)
1 Supported in part by NSF Grants CCF-0830756 and CCF-1018060 2 Supported in part by MCD fellowship from Dept. of Computer Science, UT Austin
Borodin [4] showed that for s(n) ≥ log n, languages computed by logspace uniform families of circuits of depth s(n) are contained in SP ACE(s(n)), and SP ACE(s(n)) can be simulated by logspace uniform families of circuits of depth O(s 2 (n)). Furthermore, circuit depth is related to parallel computation time [22] . These results show that the study of circuit size versus depth helps to investigate the relationship between sequential and parallel computation time, as well as time versus space in sequential computation. However, very little is known about the size versus depth question for general Boolean circuits. The best known result so far is the following theorem, which was first proved by Paterson and Valiant [17] , and later proved by Dymond and Tompa [6] using another method.
On the other hand, it can be easily shown that D(f ) = Ω(log C(f )). Theorem A leaves a huge gap (log C(f ) versus C(f )/ log C(f )) for circuits of any size. McColl and Paterson [14] showed that every Boolean function depending on n variables has circuit depth at most n + 1. There is an even stronger result by Gaskov [8] showing that circuit depth is at most n − log log n + 2 + o(1). This gives a much stronger bound on depth than Theorem A for functions that require circuits of large size. In particular, for f : {0, 1} n → {0, 1} such that C(f ) is exponential in n, [14] and [8] give essentially tight bounds on depth. However, for functions that can be computed by subexponential-size circuits, there is still a large gap. Note that Theorem A gives a stronger result than [14] and [8] only when C(f ) = o(n log n). Improving Theorem A would yield improvements over [14] and [8] for larger C(f ) as well.
Because of the connections mentioned above, there are other important consequences if Theorem A can be improved. Hopcroft, Paul, and Valiant [11] proved the following analogous theorem about sequential time and space, and Adleman and Loui [1] later gave an alternative proof.
By the results of [18] and [4] mentioned above, improving Theorem A by at least a polylog factor in a uniform setting immediately improves Theorem B.
For general Boolean circuits, the simulating depth O(t(n)/ log t(n)) in Theorem A is very close to the circuit size. On the other extreme, consider tree-like circuits, where every gate has fan-out at most 1. Spira [21] showed that given any tree-like Boolean circuit C of size t(n), we can always simulate C by another tree-like Boolean circuit of depth O(log t(n)). Note that tree-like circuits are commonly referred to as formulas in circuit complexity. We will use the term tree-like circuits to avoid any ambiguity. It is unlikely that Spira's result holds for general Boolean circuits, since that would imply P = N C 1 . Still, it is possible that Theorem A can be improved. We indeed achieve improved simulations for special classes of Boolean circuits.
Our results
We consider the size versus depth problem for special classes of Boolean circuits. As far as we know, previously no better bounds were known for these classes than what follows from the bounds for general circuits [17, 6] . We obtain significant improvements over these general bounds for layered circuits, synchronous circuits, and planar circuits as well as classes of circuits with small separators. Informally, a circuit is layered if its set of gates can be partitioned into subsets called layers, such that every wire in the circuit is between adjacent layers. A circuit is synchronous if for any gate g, every path from the inputs to g has the same length. Synchronous and planar circuits have been extensively studied before. Synchronous circuits were introduced by Harper [10] . Planar circuits were introduced by Lipton and Tarjan [13] . Layered circuits are a natural generalization of synchronous circuits, but as far as we know they have not been explicitly studied. Layered graphs have been studied by Paul, Tarjan, and Celoni [15] (they call these "level graphs" in their paper). Belaga [3] defined locally synchronous circuits, which is a subclass of layered circuits, with the extra condition that each input variable can appear at most once. The synchronous circuits form a proper subset of layered circuits. (See next section for more details.) Furthermore, Turán [23] showed that there exists a function f n such that any synchronous circuit for f n has size Ω(n log n), but there exists a layered circuit for f n with size O(n). See Belaga [3] for the same gap for functions with multiple outputs. This distinguishes synchronous circuits and layered circuits with respect to their computational powers. Notice that every Boolean function can be computed by circuits from each of the classes we consider.
Our main result is for layered circuits. We can obtain slightly better bounds for synchronous circuits. A circuit is planar if its underlying graph can be embedded in the plane without crossings of the wires [13] . In [7] we showed that planar circuits of size s can be simulated by planar circuits of depth O( √ s). For planar circuits, we used the fact that every planar circuit of size s has a separator of size O( √ s) [12] . Informally, the separator of a graph is a subset of the nodes whose removal yields two subgraphs of comparable sizes. This allowed us to use a simple divide-andconquer strategy. Graphs with small separators include trees, planar graphs [12] , graphs with bounded genus [9] , and graphs with excluded minors [2] . In fact, we can get similar results for arbitrary classes of circuits with small separators.
On the other hand, not all synchronous circuits and layered circuits have small separators. See [20] for many examples. So we need strategies other than the divide-and-conquer approach. Our idea is to consider cuts, which separate the graph into two subgraphs that are not necessarily comparable in size. For synchronous circuits, our technique is to find a relatively small cut such that the function can be computed by the composition of two circuits of small depths. This gives a simple proof for synchronous circuits, but the same method cannot be applied to the more general layered circuits. For layered circuits, we develop an adaptive strategy in the two-person pebble game, such that the sizes of the cuts are taken into account during the game. Note that both [17] and [6] use the notion of separators in their proofs. Our results for synchronous circuits and layered circuits show that the minimum circuit depth does not necessarily grow with the separator size of the minimum-size circuit.
Finally we note that an arbitrary circuit of size s can be converted to either a planar or a synchronous circuit of size O(s 2 ) [25] . Thus improving our results by polylog factors for any of the classes we considered would also yield improvements over the best known bounds for general circuits.
Definitions and Backgrounds

The Circuit Model
A Boolean circuit is a labeled directed acyclic graph (DAG), where every node is labeled by either a variable from {x 1 , . . . , x n }, or an operation from {∧, ∨, ¬}. The inputs of a Boolean circuit are the nodes with in-degree (fanin) zero, and the outputs of a Boolean circuit are the nodes with out-degree (fan-out) zero. The size of a Boolean circuit is the number of its gates. We will consider Boolean circuits with gates of fan-in at most 2 from the basis {∧, ∨, ¬}. We refer interested readers to [25] for more background on Boolean circuits.
Definition 1. [10]
Synchronous circuits A circuit is synchronous if for any gate g, all paths from the inputs to g have the same length. Definition 2. Layered circuits A circuit is layered, if its set of gates can be partitioned into subsets called layers, such that every wire in the circuit is between adjacent layers. For circuits with one output, the following is an equivalent definition: A circuit with one output is layered if for any gate g all paths from g to the output have the same length.
Definition 3. Depth and height Let C be a circuit, and let g be any gate in C. The depth of g is the length of the longest path from any input to g. The depth of C is the maximum depth of an output gate, over all outputs of C.
For circuits with one output, the height of g is the length of the longest path from g to the output.
Definition 4. Levels and layers
The ith level of a circuit consists of all gates with depth equal to i. For circuits with one output, the ith layer of the circuit consists of all gates with height equal to i.
Note that the 0th layer in a circuit with one output consists of the output gate, and the 0th level in any circuit consists of the inputs. Also, "levels" and "layers" are usually used interchangeably in the literature, and distinguishing them this way is just our terminology. The following lemma is straightforward from the definitions, and it shows that every synchronous circuit is layered. A simple example shows that the converse is not true: consider the circuit with inputs x 1 , x 2 , x 3 , gates g 1 = x 1 ∧ x 2 , g 2 = g 1 ∧ x 3 , and g 2 as the output.
Lemma 3. A circuit C is synchronous if and only if every wire in C is between adjacent levels. Thus, every synchronous circuit is also a layered circuit.
The Two-Person Pebble Game
Several variants of pebble games have been invented to study questions related to the space requirements of computation, e.g. [16, 5] . See [19] for a survey. Here we focus on the two-person pebble game, which was defined by Dymond and Tompa [6] . The game is played on a DAG G. There are two players, the challenger and the pebbler. The challenger starts the game by choosing to challenge any single node of G, then the pebbler puts some pebbles on a subset of the nodes. From this point on, the challenger can only choose either the currently challenged node or a node that was pebbled in the pebbler's most recent move. The game continues until at the beginning of the pebbler's move, all immediate predecessors of the currently challenged node w are already pebbled. Then we say the challenger loses G at w. If, under the best defense of the challenger, the pebbler can win with t number of pebble placements, then we say that G can be two-person pebbled in time t. Note that pebbles are never removed: once a node is pebbled it remains pebbled during the rest of the game.
The next two theorems give an alternative proof of Theorem A.
Theorem C.
[6] Let G be a DAG with node set V . Then the pebbler can win the game in time O(|V |/ log |V |).
Theorem D.
(Theorem 3 in [6] ) Let C be a Boolean circuit computing a function f . If the underlying graph of C can be two-person pebbled with t pebbles, then there exists a tree-like circuit of depth 2t + 1 that also computes f .
Paul, Tarjan, and Celoni [15] gave a pebbling strategy for layered graphs but used the rules of a different pebble game, which does not imply bounds on the depth.
Size versus Depth for Layered Circuits
The following lemma gives an adaptive strategy in the two-person pebble game for layered circuits. Proof. In our proof, the pebbler will always place pebbles only on nodes with larger height than the currently challenged node. Since the challenger can choose only either the currently challenged node or a node that was pebbled in the pebbler's most recent move, in the proof we can assume without loss of generality that the circuit C has only one output, and that the first move of the challenger is to challenge the output gate. Furthermore, our strategy forces the challenger to choose vertices with increasing height within C, except when rechallenging the currently challenged node. Let L(0), . . . , L(d) be all the layers, where d is the depth of C, and L(i) is the set of gates with height i. (See previous section for definitions.) Note that L(0) consists of the output gate. We say that a layer L(i) is large if |L(i)| > y and small otherwise. We shall determine the value of y later.
The strategy of the pebbler has two phases. During the first phase, the pebbler forces the challenger to go into a subcircuit between two small layers such that every layer between the two small layers is large, or into a subcircuit such that all nodes of the subcircuit belong to large layers. In the second phase, the pebbler will win the game within that subcircuit.
During the first phase, in each round the pebbler places pebbles on every node of a small layer S α with α > β, where S β is the layer where the challenged node resides in that round. The pebbler continues the first phase until the small layer S α with α > β closest to the challenged node is pebbled, or until there are no more such small layers. The pebbler chooses the next small layer in a divide-and-conquer way depending on the location of the challenged node in each round. Since there are at most s small layers, the number of pebbles used in the first phase is at most y log s . Phase I.. Let S 0 , . . . , S m be the small layers numbered starting from the output. Note that S 0 = L(0) since L(0) contains only one gate. Define h(j) to be the height of the gates in the jth small layer S j . We define the strategy inductively.
At the beginning of the game (round 1), the challenger chooses the output node, which belongs to S 0 = L(0).
Suppose that for r ≥ 1 at the beginning of round r the challenger chooses a node w ∈ S j . Since during phase I pebbles are only placed on nodes in small layers, the challenged node w belongs to a small layer in every round within phase I. According to our strategy, a small layer either has a pebble on all of its nodes (in this case we say that the layer is pebbled), or has no pebbles on any of its nodes. We have three cases. Phase II.. The pebbler's strategy in the second phase is as follows: Suppose that the challenger chooses node w in the beginning of the kth round. Then the pebbler puts pebbles on the two inputs of w, say u and v. In the (k + 1)st round, if the challenger stays on w, then the pebbler wins the game. On the other hand, if the challenger chooses one of the inputs of w, WLOG u, then the pebbler puts pebbles on the two inputs of u in the (k + 1)st round. The game continues inductively this way until at the beginning of pebbler's move either the currently challenged node w is an input of C, or the two immediate predecessors of w are already pebbled. Thus the pebbler wins in the second phase.
Note that in this phase, the pebbler only spends at most two pebbles in each round, and the two pebbles are put on nodes in large layers of C. Moreover, during k rounds of the second phase, the pebbler places pebbles on nodes from k different large layers. Since the number of large layers in C is at most s y , the second phase must terminate in at most s y rounds. Thus, the number of pebbles used in this phase is at most 
Size versus Depth for Synchronous Circuits
The following simple lemma was given in [25] . The results by McColl and Paterson [14] , and Gaskov [8] mentioned in the introduction give stronger results. But for our purposes, this slightly weaker bound is sufficient, and we include a simple proof for completeness.
Lemma 5.
[25] For every function f : {0, 1} n → {0, 1}, there exists a synchronous circuit of depth at most n + log n + 1 computing f .
Proof. The proof is based on considering any DNF of f . The terms can be computed in parallel with depth at most log n + 1, and the number of terms is at most 2 n . This gives the desired depth. Note that any circuit can be made synchronous without increasing its depth.
Next we prove Theorem 2. Note that in the proof, we use the property of synchronous circuits that given any level LV (i), f is a function of exactly those functions computed at the gates in LV (i). This property allows us to do function composition in terms of two circuits. However, for layered circuits, inputs could be in the jth layer for j > i. Thus the property no longer holds for layered circuits that are not synchronous.
Note that the notation LV stands for "levels", while in the previous section we used L for "layers".
Proof of Theorem 2. Let f be the function computed by C. (If C has more than one output, the proof can be applied by considering each output function separately, and combine the resulting small depth circuits.) Since C is synchronous, every level in C forms a cut. Furthermore, given any level LV (i), f is a function of exactly those functions computed at the gates in LV (i). We shall use this special property of synchronous circuits to compute f by the composition of two circuits.
Let LV (0), . . . , LV (d) be the levels in C, where d is the depth of C, and LV (0) contains the inputs. Let y be an integer to be determined later. We say that a level LV (i) is small if |LV (i)| ≤ y and large otherwise.
If C has many outputs, then it is possible that all the levels are large, but then the depth of C is at most s y . Assume that C has at least one small level. Let LV (k 0 ) be the small level farthest from the output of C. Now let g 1 , . . . , g m be the gates in LV (k 0 ), and let γ i be the function computed at g i . As noted above, f is a function of γ 1 , . . . , γ m . Let f = f (γ 1 , . . . , γ m ). Then by Lemma 5, given γ 1 , . . . , γ m as inputs, f can be computed by a synchronous circuit F of depth O(|LV (k 0 )|) = O(y).
Let C be the multiple-output subcircuit of C with outputs g 1 , . . . , g m . That is, C consists of the levels LV (0), . 
