Recently, Impagliazzo et al. constructed a nontrivial algorithm for the satisfiability problem for sparse threshold circuits of depth two which is a class of circuits with cn wires. We construct a nontrivial algorithm for a larger class of circuits. Two gates in the bottom level of depth two threshold circuits are dependent, if the output of the one is always greater than or equal to the output of the other one. We give a nontrivial circuit satisfiability algorithm for a class of circuits which may not be sparse in gates with dependency. One of our motivations is to consider the relationship between the various circuit classes and the complexity of the corresponding circuit satisfiability problem of these classes. Another background is proving strong lower bounds for TC 0 circuits, exploiting the connection which is initiated by Ryan Williams between circuit satisfiability algorithms and lower bounds.
Introduction
Satisfiability problem gives both an integral view on theory of NP-complete problems which is firstly defined in [6] and [14] , and one of the most useful methods for constraint satisfaction problems in engineering and other practical fields. In particular, heuristic ways on CNF SAT are applied in the practical area of various combinatorial search problems such as boolean circuit design verification.
There are several well known computational problems related to satisfiability problems. The first one is satisfiability for CNF formulas and its generalization, because CNF is one of fundamental concepts about boolean functions. For example, Santhanam [17] gives an algorithm with a nontrivial exponent for linear size formulas of AND and OR gates with fan-in two. The second one is MAX-k-SAT, the optimization version of k-CNF SAT. Even for MAX-3-SAT, no algorithms with constant savings over brute force search are known while such an algorithm is constructed for MAX-2-SAT in [19] . The third one is Integer Linear Programming (ILP) that is very useful in expressing combinatorial optimization problems both in theory and practice.
Satisfiability for depth two threshold circuits contains these problems as special cases. Satisfiability for CNF formula can be solved by algorithms solving satisfiability for depth two threshold circuits. We should note that we do not obtain a nontrivial algorithm for depth two threshold circuit satisfiability algorithm as a corollary of the result in [17] . The reason is that known transformation from a linear size threshold circuit to formula over AND and OR gates yields a quadratic blow-up of size. MAX-k-SAT, the optimization version of k-CNF SAT, can be computed by algorithms solving satisfiability for depth two threshold circuits, since we can regard the top threshold gate as a counting device of the number of satisfied CNFs and an objective function in an optimization problem. Finally, testing the feasibility for a 0-1 ILP is equivalent to testing the satisfiability of a circuit with two levels: the bottom consisting of threshold gates and the top level being an AND gate. So understanding satisfiability of depth two threshold circuits could give us various view points on both theoretical and practical areas including the above three problems. In the paper [13] , Impagliazzo et al. constructed the first nontrivial algorithm with constant savings in the exponent over brute force search for the satisfiability of sparse depth two threshold circuits which has cn-wires for every constant c. As a consequence, they also got a similar result for linear-size ILP. Here we say an algorithm is nontrivial, if its running time is bounded above by 2 n /w(n) where n is the number of input variables and w(n) is a super-polynomial function in n. Note that 2 n is just the number of assignments to n input variables. Their main subroutine is an algorithm for the Vector Domination Problem: given n vectors in R d , decide whether there is a pair of vectors such that the first vector is larger than the second vector in each coordinate. Relationship between this problem and satisfiability problem is studied in [19] .
The Strong Exponential Time Hypothesis (SETH) is a well known conjecture about limitations of efficiency of satisfiability algorithms. The statement of SETH is that for every δ < 1 there is a k such that k-SAT cannot be solved in time O(2 δn ). In particular, an algorithm with constant savings for depth two threshold circuits of super linear size would violate SETH [9] , since k-CNF for all k can be reduced through Sparsification Lemma [10] to superlinear size depth two threshold circuits [3] . Some algorithms solving CNF-SAT and MAX-SAT with constant savings when the formula is linear size are given in [17] and [7] . Assuming the SETH, we can not to solve satisfiability problem for super linear size depth two threshold circuits with constant savings as a direct extension of the result in [13] .
Thus one of natural directions relating with this result is extending classes of input circuits and constructing an algorithm with constant savings under the SETH for such classes. Considering algorithms for a class of circuits of Copyright c 2015 The Institute of Electronics, Information and Communication Engineers polynomial size is also crucial to circuit complexity theory. Ryan Williams proved that nontrivial improvement over the brute force search for the general boolean circuit satisfiability problem implies circuit lower bounds [20] . He also applied this technique to prove super polynomial size bounds for ACC 0 circuits using a novel nontrivial satisfiability algorithm for ACC 0 circuits, solving a long standing open problem [21] . Because of the connection to circuit lower bounds the power of threshold circuits is extensively studied in [2] and [5] . The class TC 0 , which is a class of constant depth polynomial size threshold circuits, is a well known natural circuit class larger than ACC 0 . Current understanding of the limitations of bounded depth threshold circuits is, however, inadequate. Exponential lower bounds for such circuits are only showed for the limited class of depth two and bounded weight [8] . For larger depth circuits, it is barely super linear lower bounds that we obtained on the number of wires [12] . Very recently, Ryan Williams obtained a remarkable result toward the bounded depth threshold circuit lower bounds: circuit lower bounds for the class ACC 0 • THR, using a sophisticated modification of the previous results [22] , but nontrivial satisfiability algorithm for polynomial size depth two threshold circuits is not given. For all above reasons, it is significant to give algorithms for an explicit class which is a subclass of depth two threshold circuits of super linear size.
In this paper, we construct a nontrivial algorithm for a larger subclass of depth two threshold circuits with polynomial number of gates. Two gates in the bottom level of depth two threshold circuits are dependent, if the output of the one is always greater than or equal to the output of the other one. We give a nontrivial circuit satisfiability algorithm for a class of circuits which may not be sparse in gates with dependency and which have polynomial size. Lets consider the following parameterized problem: for given depth two threshold circuit C of size n c which is sparse in independent gates such that there exists an unique maximal independent gate set I of size greater than k , compute YES iff C is satisfiable, where c is a constant and parameter k in this problem k is the maximum size of independent gate set of C except I . This improves the previous result in [13] in the sense that we can construct a nontrivial algorithm when we relax some condition on sparsity in input circuits. We can consider a family of circuits with high dependency which can compute all boolean functions.More details about this fact are mentioned in the later.
The rest of the paper is as follows. Firstly, we define several notions being necessary in other sections in Sect. 2 and formally state our results in Sect. 3. We give an overview of the entire algorithm in Sect. 4. In Sect. 4, we also define a problem: for given a circuit and a graph containing information about dependency of the circuit output YES if and only if the circuit is satisfiable. We give a constructive proof of a reduction from our original circuit satisfiability problem to this problem in Sect. 5. In Sect. 6, we solve the problem defined in Sect. 4. In Sect. 7, we give an algorithm whose subroutine is to solve the problem in Sect. 5 to solve the original satisfiability problem. Finally, we discuss future work in the last section.
Preliminaries
A threshold gate which outputs a boolean value has the label w 1 x 1 + · · · w n x n ≥ t, where each of w 1 , . . . , w m and t is a real number, and x 1 , . . . , x n are input boolean variables. For all boolean inputs (x 1 , . . . , x m ), it outputs 1 if and only if the statement of the label holds. We give a more precise definition as follows. Definition 2.1: Let x 1 , . . . , x n be boolean variables. Let w 1 , . . . , w n , t be real numbers. We define a threshold gate as a gate computing a boolean function T HR w 1 ,...,w n ,t (x 1 , . . . , x n ) such that T HR w 1 ,...,w n ,t (x 1 , . . . ,
A depth two threshold circuit is a circuit which has two layers of threshold gates: the top gate and bottom gates. We assume that there may be some wire from an input variable to the top gate, and we call such wire a direct wire.
Definition 2.2:
. In other words, one of two preimages G We define an extension of this notion which involves dependency in circuits.
Definition 2.4:
A depth two threshold circuit C is sparse in independent gates, if there exists some constant d for an arbitrary independent gate sets I in C (at the bottom level)
Problems We Consider
Let's consider the following parameterized circuit SAT problem.
sparse in independent gates such that there exists an unique maximal independent gate set I of size greater than k, where c is a constant. Parameter: k: Maximum size of independent gate sets of C except I . Compute: YES iff C is satisfiable.
Note that we obtain an instance of k-THR-SAT by eliminating all gates in I and wires connecting to them from C.
Motivation of Our Setting
In this subsection, we describe several facts on circuits with high dependency. We think that these explain why the investigation of threshold circuits parameterized by its dependency is interesting.
Let k-THR be a layer of threshold gates whose maximum independent gate set size is k, and let THR • k-THR denote the class of depth two circuits where the top gate is an arbitrary threshold gate and the bottom level is k-THR.
It is clear that the class THR • k-THR with k = 2 n can compute all boolean functions by emulating DNF formulas. A bit surprisingly, THR • k-THR is universal even for k = 1. We describe below the construction of such a circuit for an arbitrary given function.
Let f (x n−1 , . . . , x 1 , x 0 ) be a boolean function on n variables. For simplicity, we assume f (0, 0, . . . , 0) = 0. For 0 ≤ j ≤ 2 n − 1, let y j denote the binary representation of j of length n, i.e., y j := (x n−1 , . . . , x 1 , x 0 ) with n−1 i=0 x i 2 i . Let G j be the threshold gate whose output is 1 iff
Obviously, there is no pair of independent gates in G. The top gate outputs 1 iff G j ∈G w j G j ≥ 1 where the weight w j is f (y j )− f (y j−1 ) which is 1 or −1. In fact, the value of f is equal to G j ∈G w j G j . We note that the top gate can be replaced by a symmetric gate (i.e., a gate whose output depends only on the sum of its inputs) that outputs 1 iff G j ∈G G j is odd. This says that SYM • k-THR is also universal.
We see by these examples that limiting dependency affects not the universality but the complexity, i.e., the number of gates or wires in a circuit. As was described in Introduction, for every δ < 1, the existence of 2 δn time algorithm for k-THR-SAT of superlinear size for k = ω(n) would refute SETH. It is clear from the definition that the class of functions that can be computed by THR • k-THR circuits of size s(n) contains the one computed by THR • k -THR circuits of the same size for every k ≤ k. Hence it is interesting to see the largest value of k such that k-THR-SAT admits an algorithm with constant savings as well as to study how the time complexity of circuit satisfiability problem varies as the dependency of input circuits does.
Our Results
We show the following main theorem, which is about a construction of nontrivial satisfiability algorithm for depth two threshold circuits with bounded size of independent gate sets.
Theorem 3.1:
There is a satisfiability algorithm for k-THR-SAT with unique exception that runs in time
, and k ≤ n γ for an arbitrary real constant 0 < γ < 1 and d is a sparse constant of a given circuit.
In the rest of the sections, our main goal is to prove the following Lemma 3.1 and we obtain Theorem 3.1 from Lemma 3.1 Lemma 3.1: There is a randomized satisfiability algorithm for k-THR-SAT with unique exception in which all random bits are created by independently tossing a coin, and the algorithm runs in time O(2
In what follows, we give a way to obtain Theorem 3.1 from Lemma 3.1. A way to get a deterministic algorithm from a two sided error algorithm with error probability at most 1/3 is given as follows. This method is generally called the conditional expectation method.
We consider a randomized algorithm that uses m random bits. We can regard all its sequences of coin tosses as corresponding to a binary tree of depth m. We know that most paths from the root to the leaf are good, that is, give a correct answer. It is natural and simple thought to try and find such a path by walking down from the root and making good choices at each step. Equivalently, we try to find a good sequence of coin flips with considering each single bit.
We consider formally this intuition. Fix a randomized algorithm A and an input x, and let m be the number of random bits used by A on input x. For 1 ≤ i ≤ m and r 1 , r 2 , . . . , r m ∈ {0, 1}, we define P(r 1 , . . . , r i ) as the fraction of continuations of a randomized computation that are good sequences of coin tosses. A precise definition is as follows: if R 1 , . . . , R m are uniform and independent random bits, then P(r 1 , . . . , r i ) is defined as Pr
By averaging argument, there exists an r i+1 ∈ {0, 1} such that P(r 1 , . . . , r i , r i+1 ) ≥ P(r 1 , r 2 , . . . , r i ). Thus, at node (r 1 , . . . , r i ), we pick r i+1 which maximizes P(r 1 , . . . , r i+1 ). Finally, we obtain r 1 , . . . , r m such that P(r 1 , r 2 , . . . , r m ) ≥ P(r 1 , r 2 , . . . , r m−1 ) ≥ · · · ≥ P(r 1 ) ≥ P ≥ 2/3, where P is the fraction of good path from the root. Therefore, we have P(r 1 , . . . , r m ) = 1, because P(r 1 , . . . , r m ) is either 0 or 1.
For an implementation of this argument, we just construct a deterministic algorithm to compute P(r 1 , r 2 , . . . , r i ) for each i. Note that if we show an algorithm in which all random bits are created by independently tossing a biased coin then an implementation is given. By the Chernoff bound, we can construct a randomized algorithm which repeats the algorithm in Lemma 3.1 constant times and runs with error probability at most 1/3. Thus, using the condi-tional expectation method, we obtain Theorem 3.1 by repeating the algorithm in Lemma 3.1 constant times.
Note that the statement of Theorem 3.1 improves the following previous result by Impagliazzo et al. [13] in the sense that we can construct a nontrivial algorithm when we relax some condition on sparsity in input circuits. We first give a rough and qualitative sketch of the outline of the algorithm in [13] . For given depth two sparse threshold circuits, they give three reductions: the first one transforms an arbitrary ILP instance with small number of inequalities to an instance of vector domination problem, and the second one transforms any depth two circuit with small number of gates to an union of ILP instances with small number of inequalities, and the third one transforms a depth two threshold circuit with linear number of wires to an union of depth two circuits with small number of gates. Constructing an algorithm with constant savings in the exponent, we can test satisfiability of depth two sparse threshold circuits with our setting. Our meaning of small number is the one that the number of gates or inequalities is less than the number of variables.
The random restriction technique is used to construct these reduction procedures. The second reduction uses restrictions to output wires of bottom gates in depth two threshold circuits with small number of gates. The third reduction uses restrictions to input variables, and we can decrease the number of bottom gates because of this restriction. For each restriction to output wires of bottom gates we obtain an ILP instance with small number of inequalities by the second reduction, and for each restriction to input variables we obtain a depth two circuit with small number of gates by the third one.
Restricting to the output wire of a bottom gate means obtaining a linear inequality whose variables and coefficients and the threshold value agree with the label of the bottom gate. Let's consider when we obtain a depth two threshold circuit with small number of gates. The number of brute force restrictions to bottom gates of which the number is less than the number of variables is still less than the number of brute force restrictions to input variables. Because of this saving, we can constantly save the complexity of the exponent of the running time.
Restricting variables for the third reduction involves a little technical argument. We take a random subset of variables and assign a boolean string to these variables and let the other unchosen variables remaining. When for a random subset of input variables these variables are fixed, we consider the following two cases for an arbitrary bottom gate. In the first case the gate has at most one unfixed fan-in. In the second case the gate has at least two unfixed input fan-ins. In the former case, such kind of gates do not cause any trouble for the reduction, because we can eliminate these gates and decrease the number of bottom gates. In the latter case, however, we cannot take such straight forward argument. It is the sparsity of circuits that gives the nice property that there are not so many such bad gates.
We mention how we obtain an extension of [13] from our setting. In our setting, the number of restrictions to bottom level gates is bounded above because of dependency of bottom gates. We first define some partial order on the set of bottom gates. Hasse diagrams of this relation are useful to formalize the notion of dependency of bottom gates in a circuit. Next, we define a mid-point problem: for given a pair of a circuit and a Hasse diagram relating with the circuit, output YES if and only if the circuit is satisfiable. Our main subroutine is a randomized algorithm solving this problem. Because of an upper bound on the expected number of restrictions to bottom level gates, the running time of the randomized algorithm is faster than the complexity of the trivial exhaustive search. In other words, the expected exponent of the running time is faster than the one of the trivial exhaustive search. Our main subgoal is to obtain an upper bound on the expected exponent of the running time of a randomized algorithm to check satisfiability. We show several lemmas about the bounds on the number of restrictions. We finally design a randomized algorithm contains several subroutines: the reduction procedure from satisfiability problem for given depth two threshold circuits to the mid-point problem and the algorithm solving an intermediate problem. We obtain a deterministic algorithm by repeating this randomized algorithm constant times, using the method called the conditional expectation method.
4. An Overview of the Entire Algorithm in Lemma 3.1
Partial Order on Bottom Gates
We express structures on dependency of bottom level gates using directed graphs. We first introduce a partial order representing the dependency of threshold gates. Definition 4.1: Let C be a depth two threshold circuit. The binary relation on the set of bottom level gates of C is defined as follows:
where G is the set of bottom gates.
We define a problem using the partial order stated in the above.
Definition 4.2:
Name of Problem: L Given: C, H satisfying the following conditions.
• C is an instance of k-THR-SAT with unique exception • There is no pair of gates
• H is a Hasse diagram of a partial ordered set of bottom level gates in C according to the order .
Output:YES iff C is satisfiable
We will give a procedure for the problem L and this procedure is a critical subroutine of the algorithm constructed in Roughly speaking, the reduction described in this lemma generates a Hasse diagram by checking the dependency of every pair of bottom gates by solving an ILP with two constraints. The proof of the lemma is postponed to the next section.
Restriction to the Bottom Gates and Reduction to ILP
Intuitively, when the dependency of a circuit C is limited, the output of C is determined by fixing the output of a small number of bottom gates. By using this property, we can build a set S of small number of ILP instances such that C is satisfiable iff at least one instance in S is feasible and each instance has a small number of constraints. In fact, our algorithm solves k-THR-SAT by solving such set of ILP instances. The following lemma, which will be proved in Sect. 6, states this more formally.
Definition 4.3:
For a depth two threshold circuit C, the set X(C) is defined as {(y 1 , . . . , y n c ) ∈ {0, 1} n c : y i is the output of the i-th bottom gate in C, when C runs for an arbitrary input x ∈ {0, 1} n }.
Lemma 4.2:
Let C be an instance of k-THR-SAT with unique exception. There is a set S of ILP instances with n variables satisfying the following three conditions: (1) It holds that C −1 (1) = S ∈S F(S ), where F(S ) is the set of feasible solutions of S ∈ S, (2) the set S contains at most |X(C)| ILP instances and (3) each instance in S has at most 2k + |I | constraints, where I is unique exceptional independent gate set in C.
We will use this reduction in the main algorithm which will be described in Sect. 6.
The Entire Overview
The construction of an algorithm in Lemma 3.1 is as follows: 1. Call the reduction procedure in Lemma 4.1 to transform the given instance of k-THR-SAT with unique exception to an instance of the problem L . 2. Find the exceptional unique independent set I . 3. Run the main algorithm on the input C, H and I .
We note that we can find I in step 2. as follows. First, let a positive integer l be 1, and repeat increasing l by one until there uniquely exist an independent set of size l. Next, search the unique maximal independent set of size greater than l. Note that the repeating process stops in at most k steps. In step 3, restriction methods to both input variables and outputs of bottom gates and the reduction to ILP in Lemma 4.2 are used.
Partial Order in Circuits and Reduction Lemma
At first we give the following lemma on a binary relation on the set of bottom gates of a depth two threshold circuit mentioned in the previous section.
Lemma 5.1: Assume that C is an instance of k-THR-SAT with unique exception, and that there is no two gates
2 (1). Then, there exists some partial ordered set (G, ), where G is a set of bottom gates of C such that the maximum size of an independent set of Hasse diagram H of (G, ) is k.
Proof First, we prove the existence of a partial ordered set. For any instance of k-THR-SAT with unique exception the following holds. If there is no two equivalent gates at the bottom level in the instance, then there is a partial ordered set (G, ) such that for all
, where G is a set of bottom gates. To prove this statement we show that the relation is reflective, asymmetric and transitive.
It is clear that G
Thus the relation is reflective and transitive.
Finally, G
there is no pair of two gates which is equivalent. Thus the relation is asymmetric.
Next we argue the maximum size of independent sets in a Hasse diagram. Let H = (V, E) be a Hasse diagram of C stated above. It holds that G
Thus, for any bottom gates G, G , the following three conditions are equivalent.
Hence, an arbitrary maximum independent set in H corresponds to some maximum independent gate set in C by the definition of .
In the rest of this section, we give the proof of the lemma describing the reduction from k-THR-SAT to the problem L .
Lemma 4.1(restated)
There is a reduction which reduces k-THR-SAT with unique exception to the problem L defined above and the reduction runs in deterministic time O(poly(n)T ZOLP [2, n] ), where T ZOLP [m, n] is the time complexity of the 0-1 Linear Programming with m constraints and n variables. Proof Let P(x), Q(x) be constraints depending on x = (x 1 , . . . , x n ) ∈ {0, 1} n .
Note that ∀x[P(x) ⇒ Q(x)] is equivalent to ∀x[¬P(x) ∨ Q(x)] and is equivalent to ¬[∃x[P(x) ∧ ¬Q(x)]] and that ∃x[P(x) ∧ ¬Q(x)]
is the YES-condition of the Integer Linear Programming. Thus we have the following procedure for the reduction. In this procedure, the 0-1 Linear Programming with two constraints and n variables is solved in step 4. and the other steps are computed in polynomial time. Thus, the running time of this procedure is O(poly(n)T ZOLP [2, n] ).
The reduction procedure 
in the TOP gate is replaced with (
8. G and all input and output wires of G are removed from C. Replace V with the set of bottom gates in C. For any bottom gate U,
Output the result C, H = (V, D)

Main Algorithm
At first we recap a sketch of the outline of the algorithm in [13] . Remind three reductions for given depth two sparse threshold circuits: the first one transforms an arbitrary ILP instance with small number of inequalities to an instance of vector domination problem, and the second one transforms any depth two circuit with small number of gates to an union of ILP instances with small number of inequalities, and the third one transforms given instance to an union of depth two circuits with small number of gates.
We show several lemmas about the bounds of the number of restrictions. We first define several terms being necessary for formal statements of these lemmas. S is a min-set of We count the number of validly ordered restrictions, and a lemma in this section is about an upper bound on the number of these restrictions. Bottom gates in min-set or max-set are critical to design our algorithm for satisfiability. Satisfiability of a circuit depends on information about bottom gates which are in min-set or max-set of the circuit, when output of bottom gates are fixed. In other words, we can decide satisfiability, even if we consider only some local information about bottom gates of a circuit and ignore the other gates. The covering condition is a condition stating the concept of min-set and max-set from another viewpoint, and is used in our algorithm in this section.
Definition 6.1: A directed graph H = (V, E) is an Induced
H for V.O.R χ, if S = {u min ∈ V ∩ χ −1 (1) : ∀v ∈ V \ {u min }[v u min ⇒ χ(v) = 0]} S is a max-set of H for V.O.R χ, if S = {u max ∈ V ∩ χ −1 (0) : ∀v ∈ V \ {u max }[u max v ⇒ χ(u) = 1]}
Definition 6.3:
Let X H be a set of validly ordered restrictions of H. We define I H as a set of pairs of independent sets in H which satisfies the covering condition, that is, I H := {(I 1 , I 0 ) ⊆ V × V : I 1 , I 0 are independent sets satisfying the covering condition in H}.
The following lemma is a main lemma of this section, and rough meaning of this lemma is that we can construct a satisfiability algorithm using exhaustive search for all independent gate sets.
Definition 4.3(restated)
Lemma 6.1:
For an arbitrary instance C, H ∈ L , let I be the unique maximal independent set of size greater than k.
We first show the following lemma to prove Lemma 6.1, which reduces counting the number of restrictions for a circuit to counting the number of structures in a graph.
Lemma 6.2:
There is a bijection μ H : X H χ → (I 1 , I 0 ) ∈ I H such that if μ H (χ) = (I 1 , I 0 ) then I 1 is the min-set of H for χ and I 0 is the max-set of H for χ.
First, we show two claims. The lemma easily follows from these claims.
Claim 6.1: Let H be an I.H.D and χ be a validly ordered restriction of H. Let I 1 , I 0 be min-set and max-set of H for χ respectively. Then (I 1 , I 0 ) is a pair of independent sets such that the covering condition holds for H.
Proof We give a proof by contradiction. For any fixed χ which assign 0 or 1 to vertices of H and for min-set I 0 and max-set I 1 of H for χ, adding all edges which is in I 0 × I 1 preserves validity of χ. In other words, after adding all edges which is in I 0 × I 1 , χ is still a valid ordered restriction. Let H = (V, E ) be this I.H.D which is obtained by adding edges to H. Thus I 0 , I 1 are maximal independent sets in H .
Assume that I 0 , I 1 do not satisfy the covering condition. Then either case1 or case2 holds for H .
In this case, u 1 u 0 contradicts to the assumption that
In this case, since I 1 , I 0 are maximal independent sets, for all v ∈ V \ (I 1 ∪ I 0 ) it holds that ∀u 1 
Thus we obtain that there exists a vertex v ∈ V \ (I 0 ∪ I 1 ) such that u 0 v u 1 , contradicting to u 1 ∈ I 1 and u 0 ∈ I 0 . In other words, for any c ∈ {0, 1} we obtain that χ(v) = c contradicts to u c ∈ I c .
Claim 6.2:
For any (I 1 , I 0 ) which is a pair of independent sets in H satisfying the covering condition, there uniquely exists validly ordered restriction χ : V → {0, 1} such that I 1 , I 0 are min-set and max-set of H for χ, respectively.
Proof For any validly ordered restriction χ it holds that I 1 is max-set implies χ(I 1 ) = {1}, and I 0 is min-set implies χ(I 0 ) = {0}. First, we consider vertices in I 0 ∪ I 1 . Since I 1 , I 0 are min-set and max-set of H for χ respectively, assign zero-one value to vertices in I 1 ∪ I 0 such that χ(I 1 ) = {1} and χ(I 0 ) = {0}. Finally we consider the other vertices. For any v ∈ V \ (I 1 ∪ I 0 ), the value χ(v) is uniquely determined by the definition of covering condition.
Proof of Lemma 6.2 For any χ ∈ X H , there uniquely exists (I 1 , I 0 ) ⊆ V × V such that I 1 is the min-set of H for χ and I 0 is the max-set of H for χ. By Claim 6.1, there is a map
then I 1 is the min-set of H for χ and I 0 is the max-set of H for χ. By Claim 6.2, this map μ H is a bijective map.
Proof of Lemma 6.1 Fix an assignment χ|
We assign 1 to any vertex u ∈ V such that there is some vertex v ∈ χ| −1 I (1) ⊆ I such that v u. We assign 0 to any vertex u ∈ V such that there is some vertex v ∈ χ|
Remove all vertices whose assignment is fixed and all edges connecting to them. Thus we obtain a subgraph H of H such that size of any independent vertex set in H is at most k because we assume the existence of unique exception. The output pattern set X(C) is a subset of validly ordered restrictions of H.
By Lemma 6.2 there is a bijective map μ H : X H → I H . Thus for any H the cardinality |X H | is bounded above by
, we obtain the de-
The following lemma is similar to a part of work in [13] , which gives a reduction from an instance of circuit satisfiability to a union of ILPs.
Lemma 4.2(restated)
Let C be an instance of k-THR-SAT with unique exception. There is a set S of ILP instances with n variables satisfying the following three conditions: (1) It holds that C −1 (1) = S ∈S F(S ), where F(S ) is the set of feasible solutions of S ∈ S, (2) the set S contains at most |X(C)| ILP instances and (3) each instance in S has at most 2k + |I | constraints, where I is unique exceptional independent gate set in C. Proof For any circuit C and each element in X(C), we obtain the following transformation from a circuit to an ILP instance, according to the following three kinds of gates. (i) For gates whose output is fixed to 1 with weights w 1 , . . . , w n and threshold t, we have
(ii) For gates whose output gate is fixed to 0 we require
(iii) For the top gate, let v 1 , . . . , v n be the weights of the direct wires, and s be the threshold of the top level gate, and w FIX be the sum of the weights of the gates whose output is fixed to 1. Then we require
Thus, the set of these instances satisfies the conditions (1) and (2) . Moreover, the following Fact 6.1 and the definition of min-set and max-set implies that the dependency of gates in C gives at most 2k + |I | constraints, and yields the existence of a set S of ILP instances with n variables satisfying the condition (3), because it holds that for each restriction to gates in I , the circuit C which is obtained by removing all gates in I and all wires connecting to them from C has min-set and max-set of size at most k. Fact 6.1: For boolean functions P 1 (x), . . . , P m (x) : {0, 1} n → {0, 1}, the following statements hold.
When for a random subset of input variables these variables are fixed, we consider the following two cases for an arbitrary bottom gate. The gate has at most one unfixed input wire in one case, and the gate has at least two unfixed input wires in the other case. In the former case, such gates are not harmful for our argument about the reduction, because we can eliminate these gates and decrease the number of bottom gates. In the latter case, however, we cannot use such straightforward argument. We define more precisely such gates to which we cannot directly apply gate elimination argument.
Definition 6.4: BAD gate
Let U be a subset of variables. BAD gate on U is a bottom level gate that depends on at least two variables in U.
In other words, if a gate is not BAD then we can eliminate it or replace it with a direct wire. The following corollary is easily obtained from this lemma.
Corollary 6.1 ([13] ): Consider a depth two threshold circuit C with n variables, which is a part of C, H ∈ L . Let VAR[I ] be a set of variables connecting to the gates corresponding to I , which is unique exceptional independent gate set in C. Let δ > 0 be an arbitrary positive real number and letŨ be a random set of variables such that each variable is not inŨ with some probability p independently. Let I Ũ be a subset of I which are also BAD gates onŨ. There exists Finally we consider an algorithm to solve L under given the unique exception of each instance. Let VAR[I ] be a set of variables connecting to the gates corresponding to I .
We give a description of the main algorithm as follows, using for all above ingredients. Note that all random bits are created by tossing a coin independently in the following algorithm.
Description of the main algorithm Given: An instance C, H , and the exceptional unique independent set I in H. Output:YES if and only if C, H is a YES instance of L .
Choose a random subsetŨ ⊂ VAR[I ] such that each
variable is not inŨ with probability p independently. 2. For each boolean assignment toŨ, fix the value of input variables inŨ, and do the following steps 3. and 4.
3. Eliminate any gate in I whose output is totally fixed. Replace any gate whose output value is the value of some input variable x with direct wire connecting to x. Let V D be a set of variable indices directly connecting to the top gate. Let D i (i ∈ V D ) be the sum of weights at the top gate such that these weights are coefficients of outputs of bottom gates replaced with direct wires connecting to the i-th variable. 4. For each restriction μ to outputs of remaining bottom gates in I do the following steps 5., 6., and 7.
5. Assign 0 to the output of an arbitrary bottom gate G, if there is some G ∈ I such that the output of G is fixed to 0 and G G . Assign 1 to the output of an arbitrary bottom gate G, if there is some G ∈ I such that the output of G is fixed to 1 and G G. 6. Remove all gates whose outputs are totally fixed from the given circuit which is a part of given instance of L . (i) For bottom gates whose output is fixed to 1 with weights w 1 , . . . , w n and threshold t, the corresponding inequality is
(ii) For bottom gates whose output gate is fixed to 0, the corresponding inequality
(iii) For the top gate, let v 1 , . . . , v n be the weights of the direct wires, and s be the threshold of the top level gate, and w FIX be the sum of the weights of the gates whose output is fixed to 1. Then for the top gate, the corresponding inequality is
HALT and return "NO"
Note that three kinds of inequalities in this step. appear in the proof of Lemma 4.2 and that the iterating increment of l in this step stops in at most k steps. Note also that Lemma 6.1 which gives an upper bound on the number of restrictions in steps 2. and 4. is a key to obtain a constant saving in the exponent in the running time of our algorithm.
Analysis of the Expected Savings
In this section our goal is the following lemma about the expected savings.
Lemma 3.1(restated)
There is a randomized satisfiability algorithm for k-THR-SAT with unique exception in which all random bits are created by independently tossing a coin, and the algorithm runs in time O(2
We remind the construction of an algorithm in Lemma 3.1 as follows. 1. Call the reduction procedure in Lemma 4.1 to transform the given instance of k-THR-SAT with unique exception to an instance of the problem L . 2. Find the exceptional unique independent set I . 3. Run the main algorithm on the input C, H and I .
Let TŨ(n) be the running time of the Main Algorithm. To consider savings of this algorithm, we only analyze the exponent log TŨ(n) of the running time of the Main Algorithm, because the time complexity of entire procedure is at most 3 max{2 εn , 2 o(n) , TŨ(n)} for some positive constant ε < 1. Note that three quantities 2 εn , 2 o(n) , and TŨ(n) are respectively corresponding to three steps 1., 2., and 3. in the above algorithm and that by summing up these three terms we obtain the bound. We also note that we can find I in step 2. by the exhaustive search for all i (1 ≤ i ≤ k + 1) and for all i-sets of the vertex set of H and searching maximal independent set of size greater than k. We use the following result. Then we can find a solution in time.
(1/2+λ(log(e)+ log(1+1/2λ)))n · poly(n).
Note that this algorithm is faster than 2 n for λ < 0.136 and has some positive constant saving C such that running time is 2
(1−C )n .
Firstly we prove Lemma 3.1 assuming the following Claim.
Claim 7.1: There exists a constant N 0 for any δ > 0,
where R is a set of remaining variables in the main algorithm.
Proof of Lemma 3.1 Let R be a set of remaining variables in the main algorithm. The expectation of the exponent of the time complexity is bounded above as follows.
, where |{0, 1} |Ũ| | is the number of assignments for brute force restriction to a random subset of variablesŨ. We mention how each operation in the main algorithm contributes to the above expectation. We remind that there are two loops in the description of the main algorithm: the inner loop and the outer loop. Note that the term |X(C ρ[Ũ] )| corresponds to restricting procedure to bottom gates in the steps 4., 5., and 6. and that T ZOLP [2k + |I Ũ |, |R|] corresponds to solving ILP problem in the step 7., where |I Ũ | is the size of the unique exception when a random subset of input variables is chosen and input variables in the subset are fixed. Thus by all these observations we obtain the above bound.
By the linearity of expectation, E[log |{0,
We show upper bounds on the above three terms. Let's consider the case (ii) λ < 1/2. In this case it holds that |I Ũ | ≤ In this case, we consider the following two subcases; (ii-a) for all real constant β > 0, |I Ũ | + 2k ≤ β|R|, and (ii-b) there exists some real constant β 0 (0 < β 0 < 1 2 ), |I Ũ | + 2k > β 0 |R|.
(ii-a) for all real constant β (0 < β < (ii-b) there exists some real constant β 0 (0 < β 0 < 1 2 ), |I Ũ | + 2k > β 0 |R|.
In this case, β 0 |R| < |I Ũ | + 2k < 1 2 |R|. Thus, there exists some real constant α (β 0 < α < 1 2 ) such that |I Ũ | + 2k = α|R|. Hence 0.5|R|+
|R| (log(e)−log(|I Ũ |+2k)+log |R|)|R| = 0.5|R| + (|I Ũ | + 2k)(log(e) − log(α|R|) + log |R|) = 0.5|R| + (|I Ũ | + 2k)(log(e) + log(α −1 )). Remind that |I Ũ | ≤ ≤ 0.5pn+(log(e)+ log(α −1 ))(3δp)n+2k(log(e)+ log(α −1 ))
In the case (ii-a), for any δ > 0, let β be a constant such that β(log(e) + log(1 + 1/2β)) ≤ 3δ. Thus, in the case (ii-a), T ZOLP [|I Ũ | + 2k, |R|]] ≤ 0.5pn + 3δpn + o(n).
Let N 0 be max{log(e) + 1, log(e) + log(α −1 )}. Note that N 0 does not depend on δ because α does not depend on δ. Therefore, there exists some constant N 0 for any δ > 0 it holds that E[log T ZOLP [|I Ũ | + 2k, |R|]] ≤ 0.5pn + 3N 0 δpn + o(n).
Concluding Remark
In this section, we mention future work: finding nonuniform circuit classes having super polynomial size lower bounds against NEXP. We denote O(k)-THR as a layer of threshold circuits such that the size of maximum independent gate sets is at most O(k), where k may depend on the number of input variables.
Let Proving size lower bounds for these circuit classes against NEXP by extending the methods developed in this paper is an interesting future work.
