Abstract -A practical N-user arbiter and its implementation are presented in this correspondence. Because of the asynchronous character of its input variables (request signals), the design proposed is asynchronous and keeps in mind the possibility of metastable operations. The structure of the arbiter is very simple and modular.
Index Terms -Asynchronous arbiter, asynchronous logic design, conflict resolution, metastable operation, modular control logic.
I. INTRODUCTION When in a digital system there are multiple processors operating under the control of independent clocks, and they share the use of a common resource, a circuit is necessary to arbitrate its use and to resolve the conflicts that may occur. A valid solution to this problem would be to design a control mechanism (N-user arbiter) that satisfies the following general conditions. 1) Only one processor may use the resource shared at any one time.
2) Any request for using the common resource must be serviced in a finite time.
An arbiter is not a sequential circuit operating in fundamental mode. Because of the asynchronous character of its input variables (request signals), they may change at arbitrary times, independent of one another, and then it is necessary to define the proper operation of the circuit under the unrestricted input change assumption [5] . Therefore, with reference to arbiters, the traditional design approaches are inadequate [1] . However, we think that these methods can be used for designing some parts of the arbiter; normally, they will make the design easier. On the other hand, it is necessary to bear in mind the possibility of metastable operations in the arbiter's memory elements. Although it is an inevitable problem [8] , [9] , the designs must tend to diminish the probability of any anomalous behavior, and if possible, such situations should not affect the output signals of the arbiter.
The purpose of this correspondence is to present a design of a simple and modular N-user asynchronous arbiter that takes into account the metastable operation. The design is based on the experience of other earlier work [1] [2] [3] [4] . However, we dedicate special attention to design simplicity taking advantage of the power of traditional design methods whenever possible. In Sections 1I-B and II-C the modularity equations of the arbiter are given and a detailed study of the control circuit timing is carried out, relating the complexity of the control circuit and the number of processors that must share the resource. Finally, in Section III we discuss the failure probability of the arbiter due to metastable operations and estimate this probability in a concrete realization of the arbiter.
II. THE ARBITER
Communication between each processor Pi, i = 1**, N, and the arbiter takes place by means of two wires: the request wire ri and the acknowledge wire ai. The processor Pi communicates that it wants to use the common resource and sends a request signal by switching its wire ri to' 1. The arbiter, according to its priority scheme, will give permission to use the common resource to processor Pi with an acknowledge signal by switching its wire ai to 1. When the Pi has finished using the common resource, it resets its request line (ri = 0), which causes the arbiter to reset the corresponding acknowledge line (ai = 0). That is, we adopt the signaling convention shown in Fig. 1 [1] , and in agreement with condition 1), aia1 = 0, Vi,j, and i #j. We propose a design scheme which is based on the following rules.
a) The arbiter must have one module Mi for each processor Pi that has access to the shared resource. Each module Mi will be made up of a memory element to record the request and additional gates for determining the processor Pi priority and for generating the acknowledge signal.
b) The N-user arbiter will be realized by a linear array of N such modules. The priority is determined by the order of the modules in the array. c) A signal CO controls the intervals of time where the request signals must be recorded in the arbiter.
d) The signal CO will order a new recording (CO = 1) of request signals only when all requests recorded in the last iteration are serviced and the common resource can be used again.
e) The design must be asynchronous, simple, and must keep in mind the possibility of metastable operations.
A. Memory Element
In agreement with the above rules, the state table, the transition table, and the implementation of the memory element are shown in Fig. 2 . Notice that the memory element can record request ri = 1 to use the common resource only if Co = 1. When Co = 0, if the request was recorded (Qi = 1), the memory element will change state when the request signal ri switches to 0. After this, where Co is 0, the memory element will remain inhibited for any change of ri.
B. Priority Scheme
After each process of request storage, the arbiter initiates a new iteration of acknowledgments to use the common resource from among the processors Pi that had made the requests. The arbiter will grant the use of the resource to the processor Pi iff the following hold.
1) Its request was recorded (Qi = 1).
2) All processor Pj, with j < i, requests were serviced.
That is, Vi with 1 ' i ' N:
where bi = Qi bi-1 .
If we include the necessary gates for realizing (2) and (3), the module design Mi is shown in Fig. 3 . Of course, for MI, b0 must always be 1. We must complete the arbiter design with the control circuit for generating the signals C0 and I,. The function of the latter signal will be shown later.
C. Control Circuit
In the N-user arbiter that we propose, the control signals C0 and Ia are generated from the output bN of module MN. In agreement with (3), the switch of signal bN to 1 indicates that all requests have been serviced. From this moment the arbiter must initiate a new iteration and record the new requests to use the common resource. For the sake of discussion, the design of the control circuit is represented in Fig. 4 , and its significant signals are shown in Fig. 5. Supposing that all gates used in the design have the same delay Atg, the pulse bN duration Atb has a minimum value equivalent to the delay of five gates (5Atg). This minimum value would corre-0018-9340/86/0100-0067$01.00 C 1986 IEEE The purpose of the delay At, is to avoid runt pulses (see f3 + bN in Fig. 5 
where the first term on the right side of (6) To determine At2, we must select the worst case, which occurs when i = N and k = 1 in (6) and, because of the request of P1, the module M1 is in the situation indicated in point 1) above. Then, At2 must be At2 > 3Atg + (N -l)Atg = (N + 2)Atg.
That is, At2 = (N + 2) Atg + At (7) (8) where the term At must cover the duration of the metastable operation. Logically, as we will discuss in Section III, the larger At is, the smaller the probability that some output ai will show a possible metastable operation.
On the other hand, in agreement with the function of I,,, it is obvious (see Fig. 5 ) that the delay At2 must be smaller than the minimum pulse duration in the signal C, (AteC) A t2 < AtCo min . (9) cl- The last expression suggests to us some simplifications (which reduce the circuit cost) of the control circuit that we have proposed. The utility of these simplifications will depend on the number N of processors and the particular characteristics of the resource shared. We discuss three solutions. Fig. 4 is not necessary. Then, Co -bN, AtCoI min = Atb min = 4Atg
From (8) and (9), 4Atg > At2 = (N + 2)Atg + At. (10) Obviously, N must be smaller than 2, and then this solution is not (4) valid. 
From (8) and (9), 5Atg + Atm + Atdm > At2 = (N + 2) Atg + At.
It is evident from the last two expressions that the monostable output pulse is necessary whenever N ' 7. Taking into account the fixed values of N and At, the value of Atm must be chosen in agreement with (13) . Then, this solution is valid in all cases.
III. DISCUSSION OF THE METASTABLE OPERATION In each module M, the core of the memory element is a simple R-S flip-flop constructed by cross-tying two NAND gates. With relation to metastable operation, numerous studies and experimentation have been carried out on this device [10] [11] [12] [13] [14] . For our purpose, we wish emphasize the works of Chaney and collaborators [15] , [17] ; they have provided abundant experimental data for a good number of bistable devices, relating the temporal characteristics of input signals and some parameters of the devices with probabilistic measures of metastable operation.
For the sake of discussion, we will designate event A as being the negative-going edge of the pulse in signal C0 and event B as being the switching to 1 of the signal ri. In the arbiter that we propose, the metastable operation in some memory element can only be caused by the simultaneity between events A and B, that is, by the transition riCG: 01 --10 in the state table of Fig. 2 . This situation is shown schematically in Fig. 6 .
We define the resolution time of the memory element as the time it takes to produce logically defined and stable outputs, after the time of ocurrence of its excitation. In good behavior conditions this resolution time will be the normal propagation delay time (D), which we have estimated as 3A tg. When the device is in a metastable state, its resolution time is nondeterministic.
In agreement with [15] , [16] , a characteristic of the memory element is a time interval [tl, t2] , termed glitch window, whose location and width ( = t2-tl) are defined with respect to event A. If the relative delay td between events A and B is within this interval (td E [tl, t2j), the memory element will be set in a metastable state where t1 is small enough so that the memory always records the request to use the common resource within its specified propagation delay time, and t2 is sufficiently large so that event B is never recognized.
Assuming that event A occurs at time t = 0 and that the probability density functionf(td) is uniform over the interval 8, that is,   r0 if tl > tdor td > t2 f(td) = I i f t1 _ td _-t2 (14) we can define F(At) as the probability that the resolution time exceeds At. Hurtado [7] has shown that for At > ; this probability can be approximated by We wish to point out that the development carried out in this Section is based on one memory element only. Therefore, the estimated values for F(At) do not correspond to the failure probability, due to metastable situations in an arbiter with N modules. However, they represent a very good approximation. Take into account that during intervals of requests recording, the modules are statistically independent. The worst case occurs when the first module in the array goes into metastable operation. Otherwise, in agreement with (8), the memory element will have a time interval greater than the fixed value of At, in order to resolve its metastable situation. Moreover, if the module in the anomalous situation does not have the highest priority in the iteration, the above time interval will be increased by the time the processors, with higher priority, take to use the resource.
IV. CONCLUSIONS The arbiter described in this correspondence uses a requestacknowledge signaling convention presented by Plummer. The arbiter operates by iterations, each of which begins with a process of request storage. After that, in accordance with a priority scheme for a linear selection, the arbiter initiates a process of acknowledgments to use the common resource from among the processors Pi that have made the requests. Only when all requests have been serviced will a new iteration begin. In this asynchronous and modular design, two characteristics have been pursued: simplicity of design and the intent to reduce the problems related to possible metastable situations. several papers [2] , [3] . It has been shown that the deductive method is faster than the parallel method for most circuits.
The deductive method, introduced by Armstrong [4] , considers each gate in the circuit, and by analyzing the faults that cause incorrect signals at gate inputs, deduces a list of faults that would cause an incorrect signal at the gate output. By processing all the gates in this manner a list of faults causing incorrect signal values at the circuit outputs can be calculated in one simulation pass.
Deductive simulation involves set operations (union, intersection) on fault lists. It is shown in this paper that list operations on fault lists can be greatly simplified by taking into consideration the structural properties of the modules. A method for the deductive simulation of inverter-free circuits is presented in the following section. It is shown that fault list calculations are greatly simplified compared to the conventional deductive simulation. Results of Section II are applied to the deductive simulation of programmable logic array (PLA) faults in Section III. A two-valued simulation is considered. The single-fault assumption is made throughout the discussions. Proof: The proof follows from the fact that an s-a-0 (s-a-1) fault can only cause erroneous 0's (1 's) at gate inputs and outputs on paths from the fault site to the circuit outputs. The fault list of a line with value 1 (0) consists of faults that would change it to a 0(1), and in this case these could only be s-a-0 (s-a-1) faults. Therefore, the same fault could not appear on the fault lists of lines with complementary logic values.
II. GATE LEVEL DEDUCTIVE SIMULATION OF INVERTER-FREE CIRCUITS
Q.E.D. This means that in fault simulation of an inverter-free circuit by the deductive method, the same fault could not appear on the fault lists of inputs to the same gate with different logic values, and this leads to a simplification of operations performed for the calculations of the gate output fault lists.
In the conventional deductive simulation algorithm, the output fault list Lout of an AND gate with output value 0 in the fault-free circuit is calculated using (2) (2) 
