Introduction
B-ISDN is regarded as the future telecommunications system 1]. It will be capable of supporting all kinds of information such as voice, video and data. To realise this, Asynchronous Transfer Mode (ATM) has been selected as a means of switching and multiplexing 1]. One of the most important issues in ATM-based systems is how to construct a switch capable of o ering high performance and reliability 2]. Many ATM switches have been proposed in the literature, most of them based on a class of networks called Multistage Interconnection Networks (MINs) 3, 6] . These are favoured because of several desirable features including distributed control, low complexity, modularity and their suitability for VLSI implementation. However, two main drawbacks of MINs have been noted: their low throughput and lack of fault tolerance. The former is due to the fact that a MIN is internally blocking, whereas the latter is due to the single-path property possessed by most MINs.
To improve both the performance and the reliability of MINs, one approach is to use a number of MINs in parallel to provide multiple paths for every source-destination pair 3, 4, 6] . This approach, also called K -replicated MIN (K is the number of MINs), o ers a number of advantages. In terms of performance, the o ered tra c is shared among MINs, therefore reducing internal blocking in each MIN. Moreover, as a result of multiple-paths, queuing at the output can be achieved|alleviating the output contention problem. From the reliability point of view, both redundancy and fault-tolerance are inherently provided. However, with this con guration, a great deal of complexity is placed in the input port controllers (IPC), used to dispatch the incoming packets among MINs, as they have to decide to which MIN packets should be sent. The easiest approach is to make the selection randomly, but as is shown in 4], this results in poor performance. A more sophisticated mechanism, proposed in 6], achieves a signi cant improvement in performance, but at the expense of increased complexity in the IPC dispatching algorithm. Another point is that when a fault occurs in one of the routing networks, the IPC also has to keep track of the faulty network in order to send packets only to the good ones|a further increase in the complexity of the IPC. This paper presents a fault-tolerant ATM switch which adapts the idea of using MINs in parallel. It has a distribution network to distribute incoming packets to several routing networks arranged in parallel. Consequently, the IPC is simply required to submit packets to the distribution network. As the latter is designed to perform some of the routing itself, the routing networks are not required to carry out the whole routing process; consequently incomplete (i.e. truncated) routing networks are used. As a result, the overall complexity of the switch is not further increased by the introduction of the distribution network. The distribution network is also in charge of re-routing packets in the presence of a fault. Unlike other fault-tolerant networks where the routing algorithm is usually modi ed in order to avoid faults, in the proposed switch there is no modi cation to the routing algorithm in fault conditions: hence the re-routing process is simple and the complexity of switching elements is kept low. Further, it will be shown that the proposed switch gives both improved performance and lower complexity than replicated MINs.
The paper is organised as follows. In section 2, the switch architecture and operation are presented. Section 3 gives the evaluation of the switch performance. A complexity comparison between the proposed switch and replicated networks is made in section 4. The results obtained from analytical models and simulation are presented and discussed in section 5. Finally, conclusions and further work are given in section 6.
2 Switch Architecture and Operation The DN and RNs are both unbu ered. The switch operation is based on a request/acknowledgement protocol, consisting of two phases: path-establish and packet-transfer 5]. At the beginning of the rst phase, an IPC that has a packet generates a request, a form of short packet, and then submits it to the DN and waits for an acknowledgement (ACK). If the returned ACK is a positive one, the IPC can transfer the packet to the appropriate output module during the second phase. If not, it waits for the next path-establish phase and starts the same procedure again. After being admitted to the DN, the request goes along stage by stage attempting to set up a path across the DN and RN until reaching its required output module where a positive ACK is now generated and sent backward following the path that the request has set up. However, in some cases when setting up the path, the request might be unable to get through because of losing contention to another request that also wants the same output of an SE at that stage. In this case, a negative ACK is generated from there.
The DN is in charge of distributing, routing and, if required, re-routing. Basically, it distributes an incoming request to one of the RNs according to some rule, and at the same time routes the request. After receiving a request from the DN, the RN then directs it to an appropriate output module. The routing algorithm used is the same as that of a Banyan network. The 2 4 SEs employed in the DN have two sets of outputs, referred to here as rst and second choices. In each set, there are two outputs| upper and lower outputs like those of a 2 2 SE. As an example, when a request arrives at the upper input of a 2 4 SE in stage i, if bit i of its routing tag is 0, then the request will be routed to the upper output. But which upper output? There are at least two ways of selecting the choice: random and partially-deterministic selections. By using the random selection, a request is randomly granted one of the choices; while with the partially-deterministic policy, a request is forced to take the rst choice, but if two requests want the same output, say the upper one, one gets the rst choice and the other the second on a random basis. Another option that also concerns the function of a 2 4 SE is how it behaves after receiving a negative ACK. In no re-routing mode, the 2 4 SE simply returns a negative ACK to a 2 4 SE in the previous stage. In re-routing mode, the SE attempts to route the request through the other choice. If still unsuccessful (because the request has been blocked or there is a faulty SE in a further stage), a negative ACK is generated back to the previous stage. In order to distribute requests evenly among the RNs when partially-deterministic selection is used, the arrangement of the rst and second choices of 2 4 SEs in the DN is done in such a way that the rst choice of half of the SEs in stage i are connected to their left child node and the other half to their right.
Performance Evaluation
The proposed switch is analysed under the following assumptions.
Packets arriving at di erent IPCs in the same cycle are mutually independent. Packets arriving at an IPC or at di erent IPCs in di erent cycles are mutually independent. The destination of a packet is uniformly addressed to one of the OMs with equal probability. Packets arrive at an IPC in a given time slot with probability p. This is the input rate of the switch.
A blocked packet, after returning to its corresponding IPC, is discarded without being re-submitted. The bu er size of an OM is in nite 3]. It was observed in 7, 8, 5] that the above assumptions taken together imply that the states of the IPCs are statistically independent and SEs in the same stage or di erent stages also behave independently. It follows that it is su cient to study only one SE per stage, as it represents the behaviour of all SEs in the same stage. The rst two assumptions are unrealistic in practice but, as it is shown in 8] by means of simulation, the results obtained are only slightly di erent with those two relaxed. The third assumption is a uniform tra c pattern, which is widely used when comparing di erent networks. The fth assumption is in fact an implication of the second one. The last assumption is there to simplify the analysis. The performance measure used is the probability of acceptance (P A ) which is de ned as the probability that an arbitrary request will be accepted 8].
Probability of Acceptance of the Proposed Switch: No Re-routing
The analysis of the no re-routing version of the proposed switch is based on an approach used in 8].
As the proposed switch consists of a number of stages of SEs, the output of an SE in stage i being the input of a SE in stage i+1, here P A can be thought of as the ratio of the output rate and the input rate of the switch. Consequently, derivation of P A for the switch is done by recursively evaluating the output rate of a SE in each stage. In this approach, the expression of interest is the output rate of a SE in each stage given its input rate. As the proposed switch employs three forms of switching devices (i.e. 2 4 SEs, 2 2 SEs and K 1 OMs), the task here is to derive an expression for the output rate of each device.
Derivation of Output Rate Expressions
I. Output Rate Expression for a 2 4 SE In Fig. 4 (a), denoting P out;u;1 (P out;u;2 ) as the output rate at the rst (second) choice of the upper outputs of a 2 4 SE and P in;u (P in;l ) as the input rate at the upper (lower) input of a 2 4 SE, then: 1) Partially-deterministic Selection P out;u;1 = P out;l;1 = (P in;u =2)(P in;l =2) + (1 ? P in;u =2)(P in;l =2) + (P in;u =2)(1 ? P in;l =2)
(1) P out;u;2 = P out;l;2 = (P in;u =2)(P in;l =2) (2) 2) Random Selection P out;u = P out;l = (0:5)((2)(P in;u =2)(P in;l =2) + (1 ? P in;u =2)(P in;l =2) + (P in;u =2)(1 ? P in;l =2))
Eq. 1 is the probability that a request is present at the rst choice output of a 2 4 SE; this is when there is at least one of the two requests which is addressed to the upper output (second and third terms) or both of them ( rst term). While in Eq. 2, there will be a request at the second choice only if the two packets are destined to the same output.
II. Output Rate Expression for a 2 2 SE In Fig. 4 (b), denoting P out;u (P out;l ) as the output rate at the upper (lower) output of a 2 2 SE and P in;u (P in;l ) as the input rate at the upper (lower) input of a 2 2 SE, then: P out;u = P out;l = 1 ? (1 ? P in;u =2)(1 ? P in;l =2)
III. Output Rate Expression for a K 1 OM In Fig. 4 (c), denoting P out as the output rate of a K 1 OM and P in;i as the input rate at the i th input of a K 1 OM, then: De ne P in as the input load of the switch (or p), P out1;j (P out2;j ) as the output load of the rst (second) choice output of a 2 4 SE in stage j, P out;j as the output load of a 2 2 SE in stage j and P out as the output load of an OM. Then, the Probability of Acceptance (P A ) of the switch is expressed as below:
I. Partially-Deterministic Selection 
where the rest of the equations are the same as those in the case of partially-deterministic selection, i.e. Equations 8, 9, 10 and 11.
Probability of Acceptance of the Proposed Switch: Re-Routing
Simulation was used to obtain the P A of the switch in case of re-routing. The switch was simulated under the same assumptions as those adopted in the analytical model. Two independent runs were carried out, each consisting of ve batches|altogether giving ten values to be averaged for each simulation point. Each batch consisted of 10,000 cycles. In each run, an additional 1,000 cycles were completed before data was collected in order to avoid transient a ects. The 90% con dence intervals are very tight and therefore will not be shown. However, due to time limitations, only 16 16 and 32 32 switches were modelled.
Complexity Comparison
Denoting the number of SE crosspoints (excluding demultiplexers and multiplexers) by N c , then for the proposed switch this is 8 
Results and Discussions
Fig . 6 shows the P A of the switch in case of no re-routing with both selection modes. It is clear that the partially deterministic mode gives better performance. Nevertheless, there is no signi cant di erence between P A in di erent selection modes in the case of re-routing mode as depicted in Fig. 7 . Although, as Fig. 8 shows, P A in the case of re-routing is signi cantly higher than in case of no re-routing for partially deterministic selection. This is due to the fact that in the re-routing version, requests have more choice to get to their requested outputs as a result of re-routing. However, this comes at the expense of a longer set up time. Curves of P A for a no re-routing version of the proposed switch with partially deterministic selection and a replicated MIN, as a function of the number of stages and the number of routing networks, are plotted in Fig. 9 , with the results for the K -replicated MIN (multiple acceptance) taken from 3] . Clearly, Fig. 9 indicates that the proposed switch o ers better performance than this replicated MIN.
Conclusions and Further Work
In this paper, a fault tolerant ATM switch design based on the use of a number of routing networks in parallel has been proposed. The design has a distribution network which is used to distribute the incoming packets among the routing networks. As a result, the input port controller simply submits the packets to the distribution network|in contrast to typical replicated networks where the input port controller has to decide which routing network the packets are to be sent to. The distribution network is also in charge of re-routing when there is failure in the switch. Since the re-routing process requires no additional modi cation, the routing algorithm is simple and the complexity of switching elements is low. Analytical models of the switch have been developed. Finally, it was shown that the proposed design gives better performance but requires less complexity when compared to a replicated network if the number of routing networks is greater than 2.
There are several further aspects of the proposed design to be investigated. These include an analytical model for re-routing mode, performance evaluation in the presence of faults and reliability analysis of the switch. 
