This paper presents a straightforward approach for synthesizing a standard VHDL description of an asynchronous circuit from a behavioural VHDL description. The asynchronous circuit style is based on`micropipelines', a style currently used to develop asynchronous microprocessors at Manchester University. The rules of partition and conversion which are used to implement the synthesizer are also described. The synthesizer greatly reduces the design time of a complex micropipeline circuit.
Introduction
The design of asynchronous circuits generally follows a modular approach, where a system is designed as an interconnection of modules. In the 1988 Turing Award Lecture, Sutherland expounded a modular approach t o building hardware systems based on datadriven asynchronous self-timed logic elements called micropipelines 1 . This micropipeline design methodology has attracted the attention of many researchers, including those in the Department of Computer Science at Manchester University. The modular approach reduces design time and cost. It is being used to develop high performance and low p o w er microprocessing systems by the AMULET group at Manchester University 2 , 3 .
Asynchronous circuits are attracting renewed interest because they avoid the problems caused by the clock in synchronous circuits. The design of asynchronous circuits is in general more di cult than that of synchronous circuits. The control circuits of micropipelines are composed from event-driven logic modules and their interconnections. To implement the control circuits the designers only need to construct the interconnections between the modules. The eventdriven logic modules may be designed by specialists in advance, and replaced by new modules whenever better designs become available. It is not necessary to change the interconnections when such replacements happen. Thus, the design of asynchronous circuits is simpli ed by the application of micropipeline design techniques. Most micropipeline designs are presently done by hand. An automatic synthesis tool can reduce the design time required to obtain high performance circuits. VHDL is an IEEE standard introduced in 1987 4 which supports an environment within which designs may be described using VHDL statements and then simulated. It is easy to check the correctness of the speci cations and requirements of the designs at an early stage. In this paper we i n troduce a straightforward approach in converting top level behavioural VHDL models into structural VHDL models and Verilog models based on micropipelined asynchronous circuits.
Section 2 introduces Event-driven Logic Modules and micropipelines. Section 3 describes the synthesis method which is used by the synthesizer. A brief introduction to VHDL models will be presented in Section 4. Section 5 will present some rules and steps for implementing the synthesizer. A test example used to show h o w the synthesizer works is also included in section 5. Finally, Section 6 will give a short conclusion and suggestions for further research.
Micropipelines
Micropipelines are members of a class of asynchronous pipelines whose operation is based on a twophase bundled data convention and are self-timed, event-driven systems 1 . A " t w o-phase bundled data convention" is a communication system where a twophase handshaking protocol is used and an arbitrary number of data wires must be treated as a bundle together with the request signal wire. In the twophase handshaking protocol, rising transitions and falling transitions of either control wire have the same meaning; they represent request events or acknowledge events. In this signalling scheme, the operating cycle is 1 data available 2 request event, and 3 acknowledge event.
Various circuits have been devised for controlling transition signals. These are called event-driven logic modules 1 . Figure 1 shows the circuit symbols of these event modules. TOGGLE circuit: A TOGGLE circuit sends events from an incoming stream alternately to one of two outputs.
SELECT module: A SELECT module can be used to deliver events to one of two outputs under the control of a boolean signal.
CALL module: A CALL module can be used to give shared access to a sub-process from two independent but mutually exclusive higher-level processes.
ARBITER module: When two e v ents occur simultaneously or nearly simultaneously, an arbitration element must be used to impose an ordering on the events. Only one event is granted at a time. An ARBITER element can be connected directly to the CALL element to enforce mutual exclusively. Arbiter trees can be used to implement arbiters which h a v e more than two inputs 6 . Figure 2 is an example of a micropipeline stage. A micropipeline stage consists of control circuits, combinational logic circuits and storage elements. When data is available on the inputs, there is an event called request' on`Rin' to enable the storage elements to capture this valid data. When the storage elements have latched the input data, there is an event called acknowledge' on`Ain' to inform the preceding stage that the data has been accepted and an event called request' on`Rout' to enable the successor stage to capture the data. When the successor stage holds data, it will place an event on`Aout' to clear the storage elements of the current stage and enable the current stage to capture subsequent data.
The control circuits are composed from eventdriven logic modules, such as the Muller C, TOGGLE, SELECT, CALL ... etc. In this example, the combinational logic circuit consists of two OR gates and one AND gate. The storage element consists of an XOR, an active-low transparent latch and a TOGGLE. The control circuit here is only a Muller C. It is used to implement the rule if predecessor and successor di er in state then copy predecessor's state else hold present state" 1 . This rule allows data to ow through the micropipeline stages. Such e v ent-driven logic modules can also be used to connect micropipeline stages in different con gurations. For example, Figure 3 shows a micropipeline stage which selectively connects to one of two micropipeline stages which call the same output micropipeline stage.
The communication between stages or storage elements in micropipelines is not fully delay-insensitive if control and data signals are considered separately. The delays in data transmission must be less than the delays in transmitting the request signal. The combinational logic circuits for computations within the micropipeline must be carefully designed. High speed electronic techniques, such as dynamic CMOS tech- niques, may be applied when designing this part. The data signals must propagate through a micropipeline faster than the control event through its control circuits. Therefore, special delays are sometimes required in the control path when signi cant processing logic is put between storage elements in the data path. Previous work has been carried out in translation from programs to circuits 7, 8, 9 , 1 0 , 1 1 . A synthesis method of transforming concurrent programs based on a v ariant of CSP 13 i n to self-timed circuits has been developed by S. Burns in 1988 7 . A subset of OC-CAM 12 based on CSP is used to describe computations and a compiler has been constructed that automatically performs the translation and transformation 8, 9 . The translation of Tangram programs based on CSP and guarded-command language 14 i n to handshake circuits and then VLSI circuits has also been developed by Philips Research Laboratories 10 . A silicon compiler which converts a high-level programming language, OCCAMasync, to asynchronous CMOS circuits has been described 11 . However, previous work has not used VHDL which is an IEEE standard introduced in 1987 4 ; this supports an environment within which designs may be described using VHDL statements and then simulated easily. The authors are interested in developing a novel method for translating VHDL behavioural descriptions into micropipeline circuits. The research is ongoing and progress so far is presented in this paper. 3 The method of synthesis 3. Convert all statements into internal circuit models which are denoted by objects of Device and Terminal classes, as shown in Figure 6 , for each stage. These classes have been used to implement a micropipeline simulator 15, 1 6 based on Petri net modelling techniques 17 . The conversion rules are mentioned in Section 5. The internal circuit models of the data path for the above two stages are shown in Figure 7 . 4. Using the synthesis information from each stage produce the relevant control circuits. Because the above two stages only contain assignments, the control circuits for these two stages are the same as the control circuit shown in Figure 2 . The rules for producing control circuits are described in Section 5. 5. Using information from the predecessor and successor of each stage produce the interconnections between stages. For the above two stages, the predecessor of the rst stage is the top level, i.e. the external data inputs and RIN and AIN. The successor of the rst stage is the second stage, i.e. the data inputs and RIN and AIN of the second stage. Similarly, the predeccessor of the second stage is the rst stage, i.e. the data outputs and ROUT and AOUT. The successor of the second stage is the top level, i.e. the external data outputs and ROUT and AOUT. 6. Produce the structural VHDL model for each stage and for the whole network. The steps for producing structural VHDL output les are presented in Section 5. The corresponding structural VHDL model is shown in Figure 8 . 
VHDL
The reasons for choosing VHDL as the input and output models are that :
VHDL is IEEE standard 1076-1987 4 . It is an important industrial design tool for digital systems. The event-driven logic modules shown in Figure  1 h a v e been modelled individually using VHDL. These VHDL models have also been compiled and entered into the library. Therefore, they can be used as components in the synthesized circuits. Input or output models written using VHDL are easily simulated and their functions veri ed. The bene ts of VHDL are described elsewhere 18 and VHDL synthesis examples are also given.
The rst kind of output of the synthesizer is a set of structural VHDL models of the components.
An entity and a structural architecture pair for each rin_stg1 => wire11, aout_stg1 => wire111, ain_stg1 => wire19, rout_stg1 => wire110, U11a0 => wire13, U11b0 => wire14, U12a0 => wire15, U12b0 => wire16, QA0 => bus21( 0 ), QA1 => bus21( 1 ), QB0 => bus22( 0 ), QB1 => bus22( 1 ) ); rin_stg2 => wire110, aout_stg2 => wire12, ain_stg2 => wire111, rout_stg2 => wire112, U11a0 => bus21( 0 ), U11a1 => bus21( 1 ), U11b0 => bus22( 0 ), U11b1 => bus22( 1 ), QC0 => bus23( 0 ), QC1 => bus23( 1 ), wire13 <= A; wire14 <= B; wire15 <= C; wire16 <= D; SUM( 1 ) <= bus23( 1 ); SUM( 0 ) <= bus23( 0 ); Carry <= wire18; signal bus21, bus22, bus23 : MVL_VECTOR( 0 to 1); Figure 8 : The structural VHDL model for the behavioural VHDL model shown in Figure 5 .
stage is written into separate output les. A structural architecture for the whole system is also written to an output le. Only some of the VHDL statements are used for describing the behaviour of the synthesized system. They are the assignments of variables and signals, the IF statement, the WHILE statement and the FOR statement. The operators which can be used to perform comparisons, form boolean equations and perform arithmetic at the right hand side of assignments and the conditions of IF and WHILE statements are as follows: Boolean not, and, or, nand, nor, xor Comparison =; = = ; ; = ; ; = Arithmetic +; ,; ; = The Leapfrog VHDL simulator was used to simulate the input and output les as well as VHDL models of each e v ent-driven logic module shown in Figure 1 .
The library contains standard gates and userde ned gates, such as the event-driven logic modules shown in Figure 1 . The VHDL models for each o f t h e event-driven logic modules are implemented individually. They have been simulated and veri ed independently. The library also contains the components which are generated during synthesizing.
The implementation
The synthesizer is implemented using C++. Classes and their member functions are de ned to handle the input descriptions, such as String and StringList. To represent the internal circuit models, classes and their members are also de ned, such as Terminal, Device, DeviceList, TerminalList, Stage, StageList. Before implementing the synthesizer, rules for partition, conversion, and the production of control circuits must be de ned and followed. They are introduced in the following subsections.
Partition rules
When the synthesizer reads the input le, which contains the behavioural descriptions written in VHDL, the corresponding registers for the variables are created. For example, as shown in Figure 7 , the registers for the variables QA, QB, and QC declared in Figure 5 are created. They are denoted by the rectangles labelled`latch'. Then the partition rules are applied to divide statements into di erent blocks. Such blocks are called stages. If a statement contains shift operations it will be put into a stage on it own. The reason is that a special control circuit is required for shift operations. This control circuit is not able to handle other functional statements at the same time.
If a statement is a LOOP statement, such a s F OR or WHILE, it will be put into a stage on its own. The reason is the same as for the shift operations.
If an IF statement contains unbalanced assignments on its THEN and ELSE branches it will be put into its own stage. An IF statement with balanced assignments on its THEN and ELSE branches can easily be implemented by m ultiplexers. However, an IF statement with unbalanced assignments must be constructed with feedback and the control circuits are more complex. Therefore, an IF statement with unbalanced assignments will be put into its own stage.
Conversion rules
After partitioning the statements into several blocks the synthesizer starts one of main missions presented in this paper, the conversion of statements to internal circuit models. There are also some conversion rules which help to deal with the conversions. They are illustrated as follows:
The expression at the right hand side of an assignment m a y b e a v alue, a mathematical expression, or a logical expression. To convert it just create objects of Device class to denote each of the values, mathematical or logical computations and connections.
An IF statement with balanced THEN and ELSE branches can be converted into objects of Device class to denote multiplexers and connections. To convert multi-level IF statements into their internal circuit models a FILO method is used to calculate the position of the multiplexer.
In micropipeline circuits the input to a latch may be from several di erent sources at di erent times. This means one signal or variable may appear at the left hand side of several di erent assignments in the same description. A selector is applied to connect di erent sources to the input of the corresponding latch if the number of assignments with the same left hand side is greater than 1.
Rules for producing control circuits
Control circuits are required within each stage to deliver the events to related latches for the capture of data as well as to the predecessor and successor stages for handshaking. They are as follows:
If a stage does not contain any SHIFT operation, LOOP statement or unbalanced`IF' statements its control circuits are the same as that shown in Figure 2 . Otherwise, a control circuit similar to that used in a previous 4-bit asynchronous multiplier circuit 19 will be used. Figures 9 and  10 show the control circuits which are used for WHILE statements and SHIFT operations. Following the con guration of a stage, it is necessary to connect its control signals to its predecessor and successor stages. Some event-driven logic modules are required for these interconnections between stages. After the internal circuit models are constructed the synthesizer produces the structural VHDL output les for each stage and for the whole system. 
Experimental results
All the rules and steps mentioned in previous subsections were implemented. A test example of the behavioural description shown in Figure 11 was used to test the synthesizer. This example is a small oating point processor with a 4-bit exponent and a 5-bit One of the goals of designing the synthesizer is to synthesize a micropipelined coprocessor. Therefore, this example was chosen to test whether the basic operations of the synthesizer are correct. When the synthesis was complete three structural VHDL les and their entity les were automatically generated for the three stages which are partitioned by the synthesizer itself. A structural VHDL le for the whole system is also produced automatically. All the output les were executed on a leapfrog VHDL simulator without error. The expected output values and waveforms were produced.
The structural architecture VHDL les of the three stages contain 286, 288, and 405 lines respectively. A structural architecture VHDL le for the whole system contains 404 lines. As the les are long it is not possible to list all the output les here. We only show the number of the components and the internal signals which w ere used inside each stage and the whole system. These components are event-driven logic modules, active-low transparent latches, multiplexers, subtractors, comparators, and some basic logic gates. The numbers of components and internal signals which were used inside each synthesized structural VHDL le are shown in Figure 12 . The three stages, stg1, stg2, and stg3, were treated as components inside the structural VHDL le for the whole system.
The structural Verilog les of the three stages and the whole system were successfully imported into the Design Framework II database.
Conclusions
A synthesizer has been developed which converts behavioural VHDL into asynchronous structural VHDL and Verilog following the micropipeline design style. The synthesizer can support addition, subtraction, multiplication, division, and some logical operations. However, components which execute these mathematical and logical operations must exist in the library. The internal circuit model is similar to a DFG 20 . To optimize synthesized circuits the method used in DFG is proposed to eliminate the redundant circuits and balance the computations within the datapaths.
The research presented in this paper demonstrates that VHDL can be used to describe the behaviour of micropipelined systems. It is also suitable for representing the circuits output from the synthesizer. At the high level the computations of a whole system can be treated as one computation block with micropipelined latches. This simple structure helps the designer to estimate the expected behaviour of an abstract system as early as possible using a VHDL simulator. The simulated results may be stored for comparison with the simulated results from the nal synthesized circuits to verify the correctness of the design. Figure 12 : The numbers of components and internal signals inside each synthesized VHDL le.
Synthesis enables the design time to be greatly reduced. With careful partitioning of the system logic functions into several stages, it is possible to improve the system to achieve optimum performance.
