This article presents a discrete-event simulator, called QUARTS-II, that can be used for the design and analysis of routing protocols for ATM networks. This simulator shares many high-level features of the ATM Forum's P-NNI routing protocol; topology-state advertisement, on-demand source routing, call admission control and generic call admission control, hierarchical routing, and crank-back and rerouting. Simulations can be carried out at both the call and cell levels. Although this simulation tool has been designed primarily to simulate routing protocol, it can easily be extended to simulate other elements of an ATM traffic control architecture.
he driving force behind the operation of an ATM network is a set of complex protocols operating over various timescales. ATM networks are connection-oriented, in which a route for a call between two end users is derived through a routing protocol. An important feature of such networks is that they can support a wide range of services with different traffic characteristics and quality of service (QoS) requirements on a single platform. Thus, an ATM routing protocol should take user-specified QoS requirements into consideration while choosing routes. Furthermore, a high-performance ATM routing protocol should be dynamic. Such a routing protocol is capable of adjusting the routes in response to network congestion and failure, and hence leads to an efficient utilization of network resources. After choosing a route, the network invokes its signaling protocol to establish the call on the route. Once the call is established, the ATM cells corresponding to the call are switched from the input port to the required output port at the switches along the route.
Accurate analytical approaches to studying ATM routing protocols often result in models which are very complex with a large number of state variables. Also, the underlying relationships among the variables are nonlinear and therefore further complicate the analytical models. In order to obtain tractable analytical models, simplifying assumptions have to be made. However, such assumptions may be unrealistic in practice [1] . On the other hand, simulation is a flexible tool for evaluating the performance of complex routing protocols. It is possible to design simulation models that capture the important features of an ATM routing protocol with reasonable accuracy. Queen's University ATM Routing Testbed/Simulator (QUARTS) [2] is such a simulation package, designed primarily to simulate various aspects of ATM routing protocols. It can be used for both call-and cell-level simulations. It incorporates many high-level features of the private network-to-network interface (P-NNI) routing protocol recommended by the ATM Forum [3] , such as topology state advertisement, on-demand source routing, call admission control (CAC), and generic CAC (G-CAC). QUARTS-II is a successor to QUARTS and supports two more sophisticated routing features of P-NNI: hierarchical routing, and crank-back and rerouting. The source code of the simulator has been written in C programming language on a UNIX platform.
The goal of this article is to give an overview of QUARTS-II and demonstrate its application through an example. The rest of the article is organized as follows. The second section describes the building blocks of the simulator. The routing protocol and signaling protocol implemented in the simulator are described in the third and fourth sections, respectively. The fifth section presents the implementation details and operation of the hierarchical routing protocol in the simulator. In the sixth section, we use the simulator to examine the performance of a sample network. Finally, the seventh section concludes this article.
BUILDING BLOCKS
The simulator has been designed in a modular fashion using a number of building blocks: a graphical user interface (GUI), an event controller, and a number of network modules and control modules. These blocks interact with each other as shown in Fig. 1 . We now describe each of these building blocks in more detail.
GRAPHICAL USER INTERFACE
A user-friendly GUI simplifies the process of creating the network topology and setting the parameters for simulation runs. A network component (a switch, link, etc.) is created by choosing an item from a menu and setting the parameters in the dialog box associated with that item. An existing network file can also be modified by adding and/or deleting components. A snapshot of the GUI is shown in Fig. 2 .
The GUI provides control buttons to start and stop the simulator. It also permits the user to step through a simulation with respect to time or event. This feature is very helpful during the development or modification phase of the simulator. Moreover, the simulator enables the user to instantly monitor the output statistics during the course of a simulation. However, the use of the GUI has been made optional to speed up simulation runs. Once a network model is created and the appropriate parameters are set through the GUI, the network can be simulated without the GUI for a user-specified time. This approach significantly reduces simulation time.
EVENT CONTROLLER
The simulator is based on discrete-event technique. A network is treated as a discrete-event system where the events of interest occur at discrete points in time, and nothing of interest takes place between these time intervals [4] . At any given time, the state of a network is determined by the states of its components. For instance, the state of a link can be represented by the available bandwidth on that link. An event is an activity that might change the network's state and is the basic executable unit in discrete-event simulation. Some typical events are call arrivals and departures, and cell arrivals and departures.
The function of the event controller is to maintain and update an event list consisting of events that are arranged in chronological order. It also maintains a global clock to keep track of the simulation time. A cycle of discrete-event simulation is depicted in Fig. 3 . The event controller fires the event with the earliest occurrence time and advances the clock to the firing time of this event. When an event is fired it is removed from the event list, and the event routines associated with that event are executed. Furthermore, firing an event may create one or more future events which are then inserted at the appropriate locations in the event list. The heart of the simulation program is a loop executing the above cycle until the simulation is terminated.
NETWORK MODULES
The simulator decomposes an ATM network into three physical components: switches, links, and end systems (ESs). Each of these components is implemented by one or two network modules, as shown in Fig. 4 . A link module is an abstraction of a physical link connecting two switches. A link is characterized by propagation delay, link bandwidth, operational status (whether or not the link is up), and administrative weight.
A physical switch is represented by two modules: a switch module and a route module. The switch module implements the cell-switching function of an ATM switch. A cell at an input port is switched to the corresponding output port decid- ed at the call establishment time. The mapping between the input and output ports at each switch is implemented using a data structure called a translation The route module is the most important element that implements a topology-state (TS) routing protocol. A detailed description of this module is given in the next section. When creating a simulation network topology, a route module must be attached to a switch module. The separation of the route and switch modules facilitates the task of building an entirely different routing protocol without modifying the switch architecture.
The ESs are implemented using two modules: a source ES (SES) and destination ES (DES). An SES module generates call requests according to the specified arrival rate. It also generates the ATM cells for the active calls originated from it. The ATM cells are consumed by a DES module. An SES module should have at least one peer DES module, and only ES modules with a peer relationship are allowed to communicate. The GUI has the facility to visually provide peer relationship between an SES and one or more DES modules.
The simulator supports constant bit rate (CBR) and variable bit rate (VBR) traffic. A CBR source generates cells at the specified bit rate. The cell generation process of a VBR source is governed by a two-state on-off model. When the source is active it generates cells according to either a Poisson or deterministic process chosen before starting the simulation. The holding times of the calls from any SES is exponentially distributed with a specified mean. A switch can be attached to more than one SES module of any traffic type to create a heterogeneous environment.
CONTROL MODULES
In addition to the network modules, there are three other modules implemented to control simulations: console, output, and stopper. The console module is used to set some common simulation parameters (e.g., the seed for a random number generator). It is also used to set some particular parameters of interest globally rather than individually. For example, the bandwidths of all the links in a network can be simultaneously changed to a particular value by making a single change in the console module, rather than making a change in the dialog box of every single link. This feature is very useful, especially with a very large network, because it reduces the time required to configure and modify the simulation topology.
The output module is used to collect and display the values of performance measures of interest. This module is operated in two modes: periodical polling and event-driven update. In the former, the output module periodically visits the network modules to gather information. On the other hand, with event-driven updates whenever there is an event (e.g., a call is blocked), the associated network module invokes the output module to update the performance measures.
Finally, the stopper module is used to specify the running time of a simulation.
COMPONENTS OF THE ROUTING PROTOCOL
The TS routing protocol implemented in the route modules has two components: a routing information base (RIB) and a route computation algorithm. The RIB contains information regarding the connectivity and states of the links and switches in a network. The state of a link (switch) can be used to determine the desirability of using the link (switch) in a route. The route computation algorithm computes routes based on the contents of the RIB.
The route module supports both flat and hierarchical routing protocol. With flat routing protocol, a switch has detailed information (connectivity and state) about all the links and switches in the network. But, with hierarchical routing protocol a switch has detailed information about the links and switches that are close to it (in terms of some nearness measure) and progressively summarized information about the links and switches located further from it.
RIB
In the simulator, the state of a link or switch is specified by metrics and attributes. A metric is quantitative and can be used to determine the cost of using a link or switch during a route computation. On the other hand, an attribute is qualitative and can be used to determine whether or not a switch or link can be considered for route computation. The total and available bandwidth, and the administrative weight of a link are used as metrics, whereas the operational status, available s Figure 3 . A discrete-event simulation cycle. End system Source end system module Destination end system module bandwidth, and QoS parameters of a link are used as attributes. Note that the available bandwidth is used as both an attribute and a metric. This is because if a link does not have the bandwidth required by a call it should not be considered during route computation. The QoS parameters of a link are specified in terms of cell transfer delay, cell delay jitter, and cell loss ratio experienced by the link. The metric used for a switch is its buffer occupancy. This metric represents the degree to which the switch is busy. The attribute of a switch is its operational status. Given the state and connectivity information, the routing protocol can choose routes avoiding heavily loaded links and switches. A switch is assumed to be instantly aware of the activities associated with it and its outgoing links. Thus, it immediately updates its RIB in response to these activities. Furthermore, a switch advertises its local information (states of itself and its outgoing links) to all other switches in the network through topology-state advertisements (TSAs). TSA packets are disseminated using a flooding protocol [5] . Unnecessary duplication of TSA packets are prevented using sequence numbers. Note that with flat routing protocol, a TSA packet visits every switch in the entire network at least once. However, as we explain in the fifth section, with hierarchical routing protocol, the extent to which a TSA packet is distributed within a network is limited.
A switch triggers TSAs on both a periodic and an eventdriven basis. In the former, a switch triggers TSAs periodically regardless of its own activities or those of its outgoing links. In the event-driven approach, a switch triggers a TSA if it detects a significant change in its local information. How to define a significant change is an important design issue of TS routing protocol. Currently, a change in available bandwidth on a link by a specified threshold is defined as significant. We note, however, that it is easy to define more sophisticated events with a minor modification in the route module. Using the local information as well as the information obtained through the TSAs from the other switches, a switch builds a complete RIB that has enough information to compute a route from itself to any other switch in the network. Note that the portion of the RIB containing the local information of a switch is always accurate. The accuracy of the remaining portion depends on the frequency of TSAs.
ROUTE COMPUTATION ALGORITHM
As with the P-NNI routing protocol, on-demand source routing is used in the simulator. Upon receiving a call request from the SES, the source switch (SS) computes a route to the destination switch (DS). When computing a route, the SS does not use the links which do not have enough bandwidth to support the call. The bandwidth requirement of a call is its equivalent bandwidth [6] . The equivalent bandwidth is a function of the traffic descriptors of the call, and, depending on the implementation, it might as well be a function of the QoS requirements of the call. Currently, three traffic descriptors are defined in the simulator: peak and mean cell rates, and mean burst length. The QoS requirements are maximum cell transfer delay, cell delay jitter, and cell loss ratio.
The SS uses its CAC algorithm to compute the equivalent bandwidth to be reserved on its outgoing links. The purpose of the CAC algorithm is to check whether the new call can be accommodated without violating the QoS guarantee provided to the existing calls. As with the P-NNI routing protocol, the simulator allows each switch to use its own CAC algorithm.
The simulator provides a set of CAC algorithms based on the notion of equivalent bandwidth, and it is also very easy to add new CAC algorithms. However, in order to compute the equivalent bandwidth to be allocated on links other than the outgoing links, the SS uses the Generic CAC (G-CAC) algorithm [7] . G-CAC is a standard algorithm that any switch can use to estimate the bandwidth reserved by a possibly different CAC algorithm in another switch. The simulator provides a set of G-CAC algorithms with varying complexity, and new G-CAC algorithms can easily be incorporated. Note that simulations can be run without G-CAC, in which case the equivalent bandwidth is computed purely based on the CAC algorithm of the SS.
The SS computes a route using Dijkstra's shortest path algorithm [5] . A sample link cost function used for the algorithm is given by the following equation:
where k a is the administrative weight assigned to the link, k l is a constant, and U link is the bandwidth utilization (normalized to the full bandwidth) of the link. The switch then examines the computed route to see if it satisfies the QoS requirements of the call. If the QoS requirements are satisfied, the SS is successful in finding a route.
THE SIGNALING PROTOCOL
The signaling protocol used to establish a call on the chosen route is implemented in the route module. We now describe the signaling protocol used in conjunction with the flat routing protocol. The signaling procedure used with the hierarchical routing protocol is slightly different, and we will elaborate on this aspect in the fifth section.
When an SES makes a call request to a DES, it submits a signaling packet to the SS. The packet is a data structure containing a number of fields, as shown in Fig. 5 . The packet type is initially set to SETUP. Using the packet, the SES provides the source switch with the identity of the DES, and the traffic descriptors and QoS requirements of the call. Upon receiving the packet, the source switch attempts to find a route; if unsuccessful, it changes the packet type to REJECT and sends the packet back to the SES. In practice, if a call is blocked the caller may choose to try the call again. Currently, QUARTS-II does not support manual retrial. However, it is easy to incorporate such manual retrials via a probability of retrials and mean time between successive retrials.
If the SS successfully finds a route, a TT entry is created at the switch. The SS also appends an ordered list of all the switches in the chosen route to the packet. The packet is then forwarded to the next switch en route. The simulator supports automatic network rerouting when the chosen route cannot accept the call. The additional fields in the packet, namely the number of rerouting attempts (NRA), rerouting limit (RL), and blocking links, are provided to facilitate rerouting. The network will repeatedly try the call on alternate routes until either the call succeeds or the NRA exceeds the RL. Initially, the SS sets the value of the NRA and RL fields to zero and the specified rerouting limit (a network parameter), respectively.
As the packet progresses, each switch along the route performs an admissibility check via its CAC algorithm. If a call request passes the CAC test at a switch, a TT entry is created at the switch and the packet will be forwarded to the next switch. In the simulator, it is assumed that a DES does not
s Figure 5 . Signaling packet format. deny a call request. Thus, upon receiving the signaling packet, the DS always changes the packet type to ACCEPT and sends it back to the SES. The call setup delay is the time elapsed between the arrival of the call request and the reception of the packet (of type ACCEPT) at the SES. Once the call is accepted, the SES starts to generate ATM cells according to the call's traffic descriptors.
On the other hand, if CAC rejects the call request at a switch, the switch appends the identity of the blocking link to the packet and sets the packet type to REJECT. The packet then cranks back to the SS (Fig. 6 ). When the SS receives the packet, it increments the value of NRA (Fig. 5) . If NRA does not exceed RL, the SS attempts to compute an alternate route after excluding all the blocking links recorded in the packet from its RIB. If an alternate route is found, the SS will begin to establish the call on the alternate route; but if NRA exceeds RL, the SS will reject the call.
A HIERARCHICAL ROUTING PROTOCOL
As mentioned before, with a flat routing protocol a switch has the information about the connectivity as well as the states of the switches and links in the whole network. Even though this approach lets a switch make an accurate routing decision, it does not scale well with the network size. As networks get larger, the number of TSA packets disseminated can be overwhelmingly large and lead to a significant increase in processing, communication, and storage overheads. Hierarchical routing protocols are used to alleviate this problem [3, 8] .
In the hierarchical routing approach, a network is logically divided into subnetworks called peer groups (PGs). At the lowest level of the hierarchy, each node represents a real physical switch and each link represents a physical link. A PG is represented by a single logical switch at the next level of hierarchy, and two logical switches are connected by a logical link. A switch has complete information (connectivity and state) about the switches and links in its own PG. However, it has only a partial information about the links and switches that belong to different PGs. The internal details of a PG are not revealed to the any other PG. Figure 7 shows an example of a two-level routing hierarchy with three PGs (A, B and C) at the first-level. For any switch in the PG A, for instance, the entire PG B appears as a single logical switch B. Similarly the entire PG C appears as a single logical switch C. Because of the hierarchical structure, the size of RIB at each switch is reduced. Also, as becomes evident shortly, the number of TSA packets disseminated in the network is reduced as well. The end result is the reduction in routing related overhead. However, hierarchical routing protocols are more complicated than their flat counterparts. Additionally, routes computed based on inaccurate RIB may be poor.
Currently, the simulator allows only up to two levels of hierarchy, and a future version will extend the number of levels. It is expected that even in a very large ATM network with several hundreds of thousands of switches the number of hierarchical levels will unlikely to be more than 6. So, we feel that a two-level routing hierarchy is adequate to examine the salient features of a hierarchical routing protocol in a network consisting of about 100 switches.
TSA
In the simulator, there are two levels of TSAs, one for each level. The first-level TSA packets are exchanged (via flooding) only among the switches in the same PG. These packets are not sent out to the neighboring PGs. From these packets, a switch gains complete information about the links and switches in its own PG. The state information about the logical links are advertised through the second-level TSAs. These TSA packets are flooded throughout the network as in the case of flat routing. That is, when a switch receives a second-level TSA packet it passes the packet to its neighbors in not only its own PG but also in different PGs. Upon receiving a secondlevel TSA packet, a switch examines the packet to find the origin of the packet and updates its RIB only if the packet has not originated within its own PG. In a given PG, the secondlevel TSA packets are generated by a real switch called PG leader (Fig. 7) . Currently, PG leaders are fixed and are manually selected before the simulations. However, an election protocol can be easily implemented in the route module to dynamically select the PG leaders according to some policy.
We now demonstrate how switch A.1 builds its RIBs using the information gathered from TSAs. How to perform aggregation is an important design issue in hierarchical routing. In the simulator, we use a simple averaging method proposed in [9] . It is very easy to incorporate more sophisticated aggregation techniques in the route module.
A SIGNALING EXAMPLE
With a flat routing protocol, only one switch (i.e., the SS) is involved in route computation. However, with a hierarchical routing protocol, the SS creates a route which is only hierarchically complete. Such a route contains an ordered list of both the physical and logical switches to be visited by the signaling packet. Because of the incomplete RIB, the SS cannot compute a detailed route segment through a PG that is different from its own. As a result, the final route to the DES is decided by not only the SES but also the entry border switches. We now demonstrate the signaling procedure for establishing a call from the SES to the DES, shown in Fig. 9 . We assume that the network keeps trying the call until either it succeeds or a route cannot be found (i.e., there is no limit on the number of rerouting attempts).
The following steps are taken by the signaling protocol when the SES submits a signaling packet of type SETUP to the SS (A. 
A SIMULATION EXAMPLE
In this section, we use the simulator to compare the performance of hierarchical and flat routing protocols in a network with 16 switches, shown in Fig. 10 rates at all SESs are identical. During the simulation, the network load was varied by simultaneously changing the call arrival rates at all SESs. Each switch periodically triggers TSAs every 90 s. It also triggers an event-driven TSA whenever it detects a significant change in available bandwidth in any of its outgoing links. A change in available bandwidth is considered significant if the new value differs from the previously advertised value by at least 10 percent. Also, route computations are carried out using the link cost function mentioned before with k a = 1 and k l = 0.005 for all links. G-CAC is used during route computation. A call is tried only once, and a rejected call is assumed to be lost. Figure 11 compares the performance of the network with flat and hierarchical routing protocols. The performance measures used are the average values of call blocking ratio, call setup delay, bandwidth utilization, and TSA packets processed per second at each switch. Each point in the curves was obtained as an average of seven independent runs.
As mentioned before, with the hierarchical routing protocol the RIBs are incomplete, so we cannot expect a switch to make better routing decisions. This is evident from the blocking performance. With moderate to high load, the hierarchical routing protocol leads to a 3 percent increase in the call blocking ratio. Another observation is that bandwidth utilization is higher with the hierarchical routing protocol. This also can be attributed to the incomplete RIBs used for route computations. We have observed that the average route length (in number of hops) per call is longer with the hierarchical routing protocol than that with the flat routing protocol. The longer the route length the larger the bandwidth required per call, and hence the higher the bandwidth requirement. Another consequence of using longer routes is the increased call setup delay with the hierarchical routing protocol, as shown in the figure.
According to the above observations, the flat routing protocol outperforms the hierarchical routing protocol with respect to almost all the performance measures considered here. However, from the viewpoint of routing overhead the hierarchical routing protocol has an edge. In the simulations we have considered only the processing component of the routing overhead, which was measured as the average number of TSA packets processed per second at each switch. It can be seen from Fig. 11 that the hierarchical routing protocol yields approximately a sixfold reduction in the overhead under moderate to heavy loads.
CONCLUSIONS
Simulation plays an important role in computer-aided analysis and design of communications networks. QUARTS-II is one such simulation package, written in C programming language on the UNIX platform. This simulator can guide the design and performance analysis of ATM routing protocols. It allows one to simulate various high-level features of the ATM Forum's P-NNI routing protocol. A user-friendly GUI facilitates the task of creating and modifying the simulation network models. Also, the discrete-event simulation paradigm makes the simulator execution-efficient.
Even though this tool has been designed specifically for ATM routing protocols, its modular structure permits one to include additional ATM traffic control schemes (flow control, usage parameter control, etc). It is also possible to design an entirely different routing protocol by creating a new kind of route module without significantly modifying the other modules.
