Abstract. Composition of a concurrent system out of components involves coordination of their mutual interactions. In component-based construction, this coordination becomes the responsibility of the glue-code language and its underlying run-time middle-ware. Reo offers an expressive glue-language for construction of coordinating component connectors out of primitive channels. In this paper we consider the problem of synthesizing Reo coordination code from a specification of a behavior as a relation on scheduled-data streams. The specification is given as a constraint automaton that describes the desired input/output behavior at the ports of the components. The main contribution in this paper is an algorithm that generates Reo code from a given constraint automaton.
Introduction
Composing components into a concurrent system involves coordination of their mutual interactions. The internals of black-box components cannot be modified to implement such coordinated interactions. Coordination, therefore, becomes the responsibility of the "glue-code" that inter-connects the constituent components of a composite system, and of its underlying run-time middle-ware. Reo [3] offers a powerful glue language for implementation of coordinating component connectors that resemble electronic circuits and are based on a calculus of mobile channels. Reo is being used, for instance, in the context of the Cybernetic Incident Management project [13] for composition of web services, which constitute the black-box components of dynamically configured distributed applications [15] ; to model business processes, such as electronic auctions [30] ; and for modeling coordination in biological systems [6] . This paper addresses the synthesis problem of component connectors with Reo as our target implementation language. The input for this problem is a specification of a coordination protocol and its output is a Reo connector circuit that implements this protocol. Synthesis problems address the issue of the (algorithmic) generation of an implementation from a given specification and have a long tradition in computer science. In the context of switching circuits, the synthesis problem was first raised by Church [12] and is nowadays well-understood; see, e.g., [11, 25, 21, 19] . For temporal logical specifications, several synthesis algorithms have been suggested that rely on the close relationship between the synthesis and satisfiability problem [17, 22, 9, 24, 8] or on a game-theoretic view where a system must be designed to meet a specification, no matter how an opponent (the environment) behaves [24, 1, 16, 26, 29, 28, 18] . The output of these synthesis algorithms are some kind of automata or state-transition graphs. Our goal is a step further toward an implementation by generating Reo code from a given automaton specification. Thus, our contribution is more in the spirit of gate-level hardware synthesis from given automata specifications. More precisely, our starting point is a specification of a component connector as a relation over timed data streams [10, 7] , represented by a constraint automaton [5] . Constraint automata are variants of labeled transition systems that operationally describe the maximally parallel data-flow activity through the nodes in a Reo circuit. In [5] , constraint automata are used to provide an operational semantics for coordination mechanisms formalized by composition of Reo connector graphs. In a constraint automaton, the states of the automaton represent the possible configurations (e.g., the contents of the FIFO-channels of the Reo-connector); transitions going out of a state represent dataflow at that state and its effect on the configuration. In this paper we are not primarily concerned with the derivation of (constraint) automata representations from higher-level behavior specifications, such as in temporal logic or relations on timed data streams. Similar derivations, for instance, in the field of digital circuit design, are well-known. The main contribution of this paper is an algorithm that takes as input a constraint automaton A and produces a Reo connector graph that implements the relation on timed data streams specified by A. This is tantamount to compiling an automaton down to actual concurrent executable code for a distributed implementation of the coordination behavior specified by that automaton. The rough idea of our synthesis algorithm is as follows. We first transform the automaton A into an equivalent scheduled-data expression which is a slight variant of an ordinary ω-regular expression. We then construct circuits for the atomic expressions and composition operators on Reo circuits that capture the semantics of concatenation, union, and infinity-closures. The major difficulty is the treatment of the atomic expressions that describe a complex "one-step" coordination scenario with possibly datadependent synchronous and asynchronous behavior. Given an automaton all of whose data constraints are in disjunctive normal form, our algorithm produces a circuit whose size is linear in the length of the given expression. Superficially, compiling constraint automata specifications to Reo circuits seems simple. By analogy, derivation of digital circuits from Mealy automata specifications are well understood. However, constraint automata (and Reo circuits) can exhibit far more complex behavior than digital circuits, including combinations of synchrony and asynchrony, and relational, as well as simple (input/output) functional, interdependencies. In the light of this fact, it is far from obvious if synthesis of Reo circuits from constraint automata is possible at all, and if so, whether it can be done efficiently. The rest of this paper is organized as follows. Section 2 contains a summary of the main features of Reo. Section 3 recalls the definition of constraint automata and their accepted TDS-languages. In Section 4, we show the equivalence of scheduled-data expressions and constraint automata. The construction of a Reo circuit from a given expression is explained in Section 5. Section 6 concludes the paper. The submitted version of this paper contains an appendix which provides an example 
Fig. 1. Basic channel-types in Reo
and a correctness proof for our synthesis algorithm. To meet the length restriction, the appendix can be removed in the final version.
A Reo primer
Reo [3] is a channel-based exogenous coordination model wherein complex coordinators, called connectors, are compositionally built out of simpler ones. The simplest connectors in Reo are a set of channels with well-defined behavior supplied by users.
Components can instantiate, compose, connect to, and perform I/O operations through connectors. Here, as in [7, 5] , we do not consider the dynamic creation, composition, and reconfiguration of connectors by components. We restrict our attention to connectors that have a static graphical representation as a Reo circuit which coordinates the data-flow through the channels connecting the input/output ports of components. Reo's notion of channel is far more general than its common interpretation and allows for any primitive communication medium with exactly two ends. The channel ends are classified as source ends through which data enters and sink ends through which data leaves a channel. Although Reo allows for an open-ended set of channel-types with user-defined semantics, for our purposes in this paper, we restrict ourselves to the channel-types shown in Fig. 1 . The simplest form of an asynchronous channel is a FIFO channel with one buffer cell (called a 1-bounded FIFO channel or simply a FIFO1 channel). We graphically represent a FIFO1 channel by a small box in the middle of an arrow. In the example in Fig. 1 , the left channel-end is a source, and the right end is a sink. The buffer is assumed to be initially empty if no data item is shown in the box (this is the case in Fig. 1 ). The graphical representation of a FIFO1-channel whose buffer initially contains a data element d shows d inside the box. FIFO channels with two or more buffer cells can be produced by composing several FIFO1 channels, as for instance, explained in [7, 5] .
A synchronous channel (depicted as a simple solid arrow) has a source and a sink end, and no buffer. It accepts a data item through its source end iff it can simultaneously dispense it through its sink. A lossy synchronous channel (depicted as a dashed arrow) is similar to a synchronous channel, except that it always accepts all data items through its source end. If it is possible for it to simultaneously dispense the data item through its sink (e.g., there is a take operation pending on its sink) the channel transfers the data item; otherwise the data item is lost. For a synchronous filter channel, its "pattern" P (for our purposes here, formalized as a set P ⊆ Data) specifies the type of data items that can be transmitted through the channel. Any value d ∈ P is accepted through its source end iff its sink end can simultaneously dispense d; all data items d / ∈ P are always accepted through the source end but are immediately lost. The P-producer is a variant of a synchronous channel whose source end accepts any data item d ∈ Data, but the value dispensed through its sink end is always a data element d ∈ P. More exotic channels permitted in Reo are (a)synchronous drains that have two source ends. Because a drain has no sink end, no data value can ever be obtained from these channels. Thus, a synchronous drain accepts a data item through one of its ends iff a data item is also available for it to simultaneously accept through its other end as well. All data accepted by this channel are lost. An asynchronous drain accepts and loses data items through its two source ends, but never simultaneously. Synchronous and asynchronous spouts are duals of their corresponding drain channel types, as they have two sink ends. A complex connector has a graphical representation, called a Reo circuit, which can be produced by applying certain composition operators to channels. In our setting, where we do not consider dynamic aspects of the Reo language, a Reo-circuit is a finite graph where the nodes are labeled with pair-wise disjoint, non-empty sets of channel ends, and where the edges represent their connecting channels. The set of channel ends coincident on a node A is disjointly partitioned into the sets Src(A) and Snk(A), denoting the sets of source and sink channel ends that coincide on A, respectively. A node is called a source node if
In this paper, it suffices to assume that all mixed nodes are hidden. In other words, we abstract away from their names and formalize the behavior of a Reo circuit by means of the data-flow at its sink and source nodes. Intuitively, source nodes of a circuit are analogous to the input ports, and sink nodes to the output ports of a component, while mixed nodes are its hidden internal details. Components cannot connect to, read from, or write to mixed nodes. Instead, data-flow through mixed nodes is totally specified by the circuits they belong to. A component can write data items to a source node of a Reo circuit that it is connected to. A write operation succeeds only if all (source) channel ends coincident on the node accept the data item, in which case the data item is transparently written to every source end coincident on the node. A source node, thus, acts as a replicator. A component can obtain data items from a sink node of a Reo circuit that it is connected to through input operations. 1 A take operation succeeds only if at least one of the (sink) channel ends coincident on the node offers a suitable data item; if more than one coincident channel end offers suitable data items, one is selected nondeterministically. A sink node, thus, acts as a nondeterministic merger. A mixed node is a self-contained "pumping station" that combines the behavior of a sink node (merger) and a source node (replicator) in an atomic iteration of an endless loop: in every iteration a mixed node nondeterministically selects and takes a suitable data item offered by one of its coincident sink channel ends and replicates it into all of its coincident source channel ends. A data item is suitable for selection in an iteration only if it can be accepted by all source channel ends that coincide on the mixed node. shows an implementation of an exclusive router built by composing five synchronous channels, two lossy synchronous channels and a synchronous drain. The intuitive behavior of this circuit is that through its source node A, it obtains a data item d from its environment and delivers d to one of its sink nodes B or C. If both B and C are willing to accept d then (the merger in the mixed node in the middle of) the exclusive router nondeterministically decides to deliver d to either B or C. No data that passes through A can be lost because of the synchronous drain and the two synchronous channels in the middle of the circuit. The synchronous drain ensures that data flow at A is synchronized with data flow through the node at its opposite end. The merger inherent in this mixed node guarantees that at most one of its two coincident synchronous channels transfer data, synchronized with the data flow at either B or C. The circuit in Fig. 2 .b shows an implementation of a shift-lossy FIFO1 channel with source node A and sink node B. This implementation uses four synchronous channels, a synchronous drain, a FIFO1 channel whose buffer initially contains a token data item, o, an empty FIFO2 channel, and an instance of the exclusive router of Fig. 2 .a shown as the box labeled EXR. A shift-lossy FIFO1 channel behaves the same as a FIFO1 channel, except that writing to its source end is never blocked. If at the time of a write operation its buffer is full, the stored data item in the buffer is lost and the new data item replaces it in the buffer. The observable behavior of each of these Reo circuits is represented by a constraint automaton in Fig. 3 . Derivation of these constraint automata as compositions of the constraint automata representing the behavior of the individual primitives used in their respective Reo circuits appears in [5] .
In spite of its simplicity, the semantics of Reo is indeed very rich, yielding a surprisingly expressive language [3] . For instance, the relational (as opposed to functional) dependencies that result in "propagation of synchrony" as well as the way in which the local behavior of, e.g., lossy synchronous channels imposes non-local constraints on a circuit, are already evident in the exclusive router of Fig. 2 .a. (We use this exclusive router later in this paper in our synthesis of Reo circuits.) Examples of Reo circuits with more interesting behavior can be found elsewhere [2] , and the reader is encouraged to see [27] and [7] for the simple, rich, and expressive formal semantics of Reo.
In the remainder of the paper, we discuss the synthesis problem of Reo circuits where the input specification of the desired coordination is given as a constraint automaton, as defined in the next section.
Constraint automata
Constraint automata can serve as an operational model for Reo circuits [5] . The states of an automaton represent the configurations of its corresponding circuit (e.g., the contents of the FIFO channels), while the transitions encode its maximally-parallel stepwise behavior. The transitions are labeled with the maximal sets of nodes on which data-flow occurs simultaneously, and a data constraint (i.e., boolean condition for the observed data values).
We start with a simple example for a constraint automaton that models a component with input port A and two output ports B and C which is modeled by a Reo circuit as shown in the left of the picture below.
The picture on the right shows the corresponding constraint automaton where we assume that only bits 0 and 1 can be transmitted through the channels. The initial state stands for the configuration where the buffer is empty, while the two other states represent the configurations where the buffer is filled with one of the data items. The outgoing transitions from the initial state are labeled with the singleton set {A} which reflects the fact that in the initial configuration only data-flow at A is possible. If the buffer is filled then data-flow at A is impossible and only B and C can take the value from the buffer.
In the sequel, we specify constraint automata using a nonempty and finite set Data consisting of data items that can be sent (and received) via channels and a nonempty and finite set N = {A 1 , . . . , A n } of names. Intuitively, we may think of the A i 's to be the source or sink nodes of a Reo circuit. We refer to the subsets of N as node-sets.
Data assignments, data constraints. A data assignment for
denotes the set of all data assignments for N, and DA the set of all data assignments (on any N). Data constraints, which can be viewed as a symbolic representation of sets of data assignments, are formally defined as propositional formulas built from the atoms "d A ∈ P" and "
denotes the set of data constraints using only names from N, and DC is a shorthand for DC(N ). We simply write "
The symbol |= stands for the obvious satisfaction relation which results from interpreting data constraints over data assignments. Satisfiability and logical equivalence ≡ of data constraints are defined as usual.
synchronous channel synchronous drain asynchronous drain lossy synchronous
with filter synchronous channel Example 3 (Constraint automata). Constraint automata for the various basic channels types, the exclusive router and shift-lossy FIFO1 channel are shown in Figure 3 (where valid guards have been omitted). The automaton for a FIFO1 channel with source A and sink B is the same as the one for the example in the beginning of the section, except that C has to be removed. These automata do not have terminal states as in any configuration data flow at some nodes is possible. The left part of Fig. 4 shows the Reo circuit for an initializer, i.e., a component without input ports (source nodes) and a single output port A init where data-flow at A init happens exactly once. on the right of the picture where the source node B stop can put a value into the buffer exactly once, because afterward the buffer is filled forever as no data-flow is possible for an asynchronous drain with both source ends coincident on the same node. Thus, if
an output port B of a component C is connected via a synchronous channel with B stop then output at B is possible exactly once. In this sense, component "Stop" can serve to terminate data-flow in other components.
In [5] , we formalized the semantics of a constraint automaton as a relation on timed data streams. For the purposes of this paper, an equivalent, but simpler concept suffices which abstracts away from time and describes the "traces" of a constraint automaton by scheduled-data streams: finite or infinite sequences of pairs N, δ , consisting of a set N of all the nodes that are scheduled to be synchronously (i.e., atomically) active in the next step, together with a data assignment δ ∈ DA(N) describing the data values that are input and output.
Definition 4 (Scheduled-data streams, generated language). A scheduled-data stream Θ = Θ(0);Θ (1); . . . is a finite or infinite sequence of pairs Θ(i) ∈ 2 N × DC, denoted by
is a non-empty node-set and Θ.δ(i) a data assignment for Θ.N(i). We write |Θ| to denote the length of Θ (which can be ω). The empty scheduled-data stream is denoted by ε. SDS N or briefly SDS denotes the set of all scheduled-data streams.
such that (1) q 0 = q and (2) either q and Θ are infinite or q consists of |Θ| transitions and ends in a terminal state and (3)
The generated language L(A) of A is the set of all scheduled-data streams Θ ∈ SDS which have a q 0 -run in A for some initial state q 0 ∈ Q 0 .
For instance, the SDS-language generated by the automaton for a synchronous channel consists of all infinite scheduled-data streams Θ with Θ.N(i) = {A, B} and where data assignment Θ.δ(i) assigns the same data item to A and B.
Although the formal definition of scheduled-data streams does not impose a relation between the data assignments Θ.δ(i), for a given constraint automaton, there can be a link between the data constraints Θ.δ(i) and Θ.δ(i + 1). For instance, the automaton for a FIFO1 channel with source node A and sink node B generates the SDS-language consisting of all infinite scheduled-data streams Θ with Θ.N(2i) = {A}, Θ.N(2i + 1) = {B}, and with Θ.
In [5] , we explain how an automaton for a Reo circuit can be constructed in a compositional way. (For the purpose of this paper, the details of that construction do not matter. The only thing that we use later, in Section 5, is that by applying the above definition to the automaton for a Reo circuit R, we obtain an SDS-language L(R) for R.)
In what follows we show, conversely, how to construct a Reo circuit from a constraint automaton.
Scheduled-data expressions
The first step of our construction of a Reo circuit from a given automaton is to transform the automaton into an equivalent ω-regular expression, a so-called scheduled- pressions is defined as for ordinary ω-regular expressions and is omitted here. Similar to the construction of a finite automaton from ordinary regular expressions (see e.g. [20] ), we can assign a constraint automaton to any scheduled-data expression that generates the same SDS-language and which is linear in the size of the expression. Since this construction does not play a role in the present paper, its description is omitted. Instead, we use the reverse construction, i.e., of a scheduled-data expression for a constraint automaton. Although to do so, we may apply the standard algorithms for generating (ω-)regular expressions from automata (see e.g. [20] ), we suggest here an alternative algorithm. Rather than describing the construction in general, we treat a typical example. Consider the constraint automaton as shown on the left of the following picture where a, b, c are pairs of node-sets with corresponding data constraints. 
From scheduled-data expressions to Reo
We now address the issue of constructing a Reo circuit for a scheduled-data expression α 0 . Because the source and the sink nodes of a Reo circuit play different roles with respect to its environment, and this distinction is abstracted away in scheduled-data expressions (and constraint automata), we first need to identify the "input" and "output" of a circuit by partitioning its node set N . That is, our starting point is a description of a component connector by its input ports C 1 , . . . ,C n and its output ports D 1 , . . . , D m and by (the scheduled-data expression α 0 of) a given constraint automaton that specifies the observable data flow at the C i 's and D j 's. 
(R).
For the construction of R, we use a compositional approach that builds a Reo circuit R α for each subexpression α of α 0 . Fig. 5 shows the general structure of R α : if the source node A α is fed from outside with some data element, then it is put into the buffer between A α andÃ α . As soon asÃ α takes the data element from the buffer, the subcircuit in the middle is "activated". Similarly, data-flow inside this sub-circuit stops as soon as a data element arrives atB α , which puts it into the buffer betweenB α and B α . Thus, data-flow at the sink nodeB α can be viewed as a signal that R α has "terminated". The nodes C, D in Fig. 5 are there to indicate that there will be some channels connecting the sub-circuit in the middle of R α with (some of) the source nodes C and (some of) the sink nodes D in N . The construction of a circuit R for an expression α 0 will be completed by a last step, in which "Init" and "Stop" components, defined in Example 3, are added to begin and end the data-flow of in the circuit R α 0 , as shown in Fig. 6 . The construction of the circuit will be such that at any moment, exactly one of the leftmost and rightmost buffers or buffers inside R α 0 will be filled. Thus, we may consider data-flow through R as a token game, where the token is passed on from left to right. The reason why we put R α 0 in the context of an initializer and a terminator is that the circuit R α 0 allows a "restart" of data flow at node A α 0 wheneverÃ α 0 has consumed the data item in the buffer between A α 0 andÃ α 0 . In fact, the initializer ensures that data flow at A α 0 occurs exactly once. The reason for using the stop-component is similar. Concatenation, union and closure. We first explain how to construct a circuit R α , assuming we have already constructed the circuits for α's subexpressions. (If a subexpression α occurs more than once in α 0 , e.g. if α 0 = α;α, then we need a copy of the Fig. 6 . The final Reo-circuit R circuits R α for every syntactic occurrence of α as a subexpression in α 0 .) For α = γ;β the Reo circuit R α results from combining R γ and R β as follows:
Note that the internal FIFO-channels "at the end" of R γ and "at the beginning" of R β (not drawn in the picture) ensure that in the concatenation γ;β data-flow inside R β cannot start before data-flow in R γ has finished. For α = γ ∪ β, the Reo circuit R α is obtained by combining R γ and R β with an exclusive router that nondeterministically chooses to "activate" the data-flow in either R γ or R β :
EXR
The Reo circuit R α where α = β ∞ is obtained from R β as follows: 3
For α = β ω , the Reo circuit has the following structure.
Init empty
Here, "empty Init" is a variant of the initializer in Ex. 3, where the buffer is initially empty. Thus, data-flow never occurs in "empty Init" or at node B α . Being non-reachable, it may be omitted; we keep it here so that the circuit retains the general shape of Fig. 5 .
The empty expression. For α = ε, we simply use a FIFO1 channel with its source end on node A ε and its sink end on node B ε . (Using just a single channel departs from the general schema sketched in Fig. 5 , but the nodesÃ α andB α are not needed in our compositional approach.) Atomic expressions. So far the construction of Reo circuits for composite expressions has followed patterns that are familiar from automata theory. Next we come to the most complicated and most interesting step in our construction, namely the construction of a Reo circuit for atomic expressions N, g . The difficulty lies in the fact that such expressions model a computation step of a corresponding Reo circuit, in which certain channel ends are active and others are not. Moreover, we must ensure that at every active channel end, the right data value is input or output. Let Atoms denote the set of all atomic expressions N, g of α 0 . Recall that N is a nonempty subset of N = {C 1 , . . . ,C n } ∪ {D 1 , . . . , D m } and g is a satisfiable data constraint for the nodes in N. We first describe a general technique to design a Reo circuit for the atoms N, g ∈ Atoms.
(Later we explain how this technique can be made more efficient in various ways.) We first transform g into its canonical disjunctive normal form, which replaces it with an equivalent data constraint h 1 ∨ . . . ∨ h r where each of the h's is a formula of the form
Next, we replace N, g with the equivalent expression N, h 1 ∪ . . . ∪ N, h r , construct the circuits R N,h k (see below) and combine them with the union-operator described above.
Fig. 7. Reo-circuit R N,h
With the formula h as above, a circuit R N,h for N, h is presented in Fig. 7 , which we now explain. For the Reo circuit R N,h of a given N, h , we need a pair of nodes C N,h andC N,h for every source node C ∈ N src , and similarly, one node D N,h for every sink node D ∈ N snk , plus one other node E N,h . The same nodeC must be used for all circuits R M, f where C ∈ M and M, f ∈ Atoms, while the nodes C N,h andC N,h are unique for every atomic data expression N, h ∈ Atoms where C ∈ N. We can think of the node E N,h as a switch that synchronizes the data-flow in the upper sub-circuit with the nodes D N,h , D, and C N,h ,C N For the source nodes C, the situation is a bit more complicated because we must ensure that C accepts an input value iff C synchronizes with exactly one of the nodes E M, f where M, f is a subexpression of α 0 with C ∈ M. The use of perfect synchronous channels is not appropriate because of the replicator semantics of the source nodes. If C were connected with E N,h via perfect synchronous channels only, then data-flow would block when C appears in two or more atomic subexpressions of α 0 . (Note that simultaneous data-flow at different nodes E N,h , E M, f is not possible.) For this reason, we connect C with E N,h via a filter channel, a lossy synchronous channel and a synchronous drain through the nodes C N,h andC N,h . These three channels (1) allow C to pass values even when E N,h is not available to synchronize with C, and (2) force C to pass a value d ∈ P C when it synchronizes with E N,h . To prevent C from passing a value without synchronizing with one of the nodes E M, f where C ∈ M, we use a synchronous channel connecting E N,h withC and a synchronous drain betweenC and C. These channels ensure that for C ∈ M, C is active exactly when data-flow occurs at C and exactly one of the nodes E M, f .
Size of the constructed circuit. In the worst case, the treatment of the atoms N, g leads to an exponential blow-up (because every disjunctive normal form for g may be exponentially longer than g). However, when we assume that all data constraints in α 0 are given in canonical disjunctive normal form and when we measure the length of α 0 as the total length of all data constraints occurring in (one of the atoms in) α 0 then the total number of channels in the constructed circuit is linear in the length of α 0 .
Preprocessing. We now explain how a preprocessing phase of the set Atoms can simplify the construction of the circuits for the atomic subexpressions of α 0 . We first look for pairs C, D with C ∈ {C 1 , . . . ,
plication.) Then, we establish a synchronous channel with its source end on node C, its sink end on node D and remove D in the sense that any N, g ∈ Atoms is replaced with 
However, here we need a Reo connector that checks the equality of two (synchronously) arriving input values.
Optimization. As in other algorithmic constructions, our resulting Reo circuits contain certain redundancies which can be optimized away. We can detect and remove them by applying circuit transformation rules that look for recognizable patterns of (sub-)circuits and replace them with their simpler equivalents. E.g., every occurrence of a synchronous channel preceding or following any other channel X can be simplified to only X.
Conclusion
The main contribution of the present paper is a general construction of a Reo circuit from a constraint automaton. Although similar constructions exist in the classical area of automata and digital circuits, the situation here is far more complicated because of two major differences:
1. The behavior specified by constrained automata is generally not functional (from input to output) but relational. 2. In a digital circuit and the Mealy automaton describing it, behavior is always synchronous. In contrast, in Reo, behavior can be synchronous, asynchronous, or (at different steps) a combination of the two.
Because of in particular point 2, the classical construction of a circuit from an automaton breaks down, and at forehand, it was by no means obvious how to tackle the problem for Reo. We see the algorithm described in the present paper therefore as a major step forward in the automatic synthesis of Reo component connector circuits.
From the theoretical point of view, the results established here and in [5] yield that Reo connector circuits, constraint automata, and scheduled-data streams have the same expressiveness and can be transformed into each other via algorithmic transformations. This result can also be useful in practice as it allows to switch between these three formalisms. For instance, it enables one to use automata-models within the Reo framework to describe (and finally to synthesize) the interfaces of black-box components. On the other hand, our algorithm also illustrates the expressive power of the channel types presented in Fig. 1 . In fact, our construction uses all of them. 4 To some extent, our construction can also be modified to treat real-time constraints, e.g., those formalized by timed scheduled-data expressions of the form α = β ≤t stating that data flow described by β must be completed within t time units. (See [4] for a formal treatment of real-time within the Reo framework.). For this, we just connect nodes A α withB α via a synchronous drain and a timer channel with off-option, i.e., a timer channel that allows the timer to be stopped at any point in time before the expiration of its delay. In the picture below, this timer channel is depicted by an arrow with a circle labeled with the delay t in its middle.
A compositional approach similar to the one we suggest here can also be used to provide "Reo-implementations" for processes specified in terms of CCS-or CSP-like process algebras. In fact, some of the typical operators are already included in regular expressions (CCS-like nondeterminism corresponds to union, sequential composition to concatenation, and ε to, e.g., the CCS-process nil). Parallel composition with CCS-or CSP-like synchronization can be realized by establishing appropriate synchronous channels and Reo's join operator. A LOTOS-like disrupt operator P[>Q can be obtained using a Reo component that realizes a switch; this switch is initially "on" and synchronizes with P as long as it is "on" but is turned "off" by Q's first activity (the inhibitor circuit in [3] can be used to construct this switch from our set of primitive channels). Although the construction presented here is not overly complicated, it can and should still be simplified further and made more efficient. Parts of such considerations have already been sketched in the present paper. In our future work, we will investigate further optimizations and the design of an alternative synthesis algorithm that goes directly from automata to Reo circuits without having the regular expressions as an intermediate step. Furthermore, dynamic reconfiguration of connector circuits is an inherent aspect of Reo that we plan to cover in our future work.
A Example
A concrete example for the Reo-circuit which is constructed from the scheduled-data expression α = N, h ∪ M, f is shown in Fig. 8 . Here, we assume that N = {C, D},
In Fig. 8 , there are multiple candidates that qualify for the application of the circuit transformation rule mentioned at the end of Section 5. There are more specialized patterns that are applicable only in the context of our resulting circuits. For instance, we know that in Fig. 8 , data-flow can occur through only one of the top or bottom branches of the circuit (because there is only one token at a time that passes through the entire circuit; the exclusive router; and because the two branches are isolated from one another by drains). This makes the right-hand-side FIFO1 channels on both top and bottom branches redundant.
B Correctness of the construction
We now provide a proof sketch for the soundness of our algorithm. That is, we show that the final circuit R for a scheduled-data expression α 0 generates the SDS-language L(α 0 ). Recall that R is derived from R α 0 by the addition of an initializer and a terminator as shown in Fig. 6 . In the sequel, we often identify scheduled data expressions and their induced SDSlanguage. For instance, we simply write ε for the singleton set {ε} or N, h for the singleton set { N, h }. Extended circuits. The crucial point in our construction is that the circuits R α as in Fig. 5 generate exactly the scheduled-data streams in L(α), provided that there is exactly one input at A α and exactly one output at B α . However, the generated SDSlanguage of the circuit R α may not agree with the SDS-language associated with α, because in R α data flow at A α is possible in every configuration where the buffer between A α andÃ α is empty. In fact, this is the reason why we switched from R α 0 to the circuit R that puts R α 0 in the context of an initialization-and a termination-component. Moreover, R α returns to its initial configuration after the output at B α . The latter condition is essential for the treatment of the closure-operators α = β ω and α = β ∞ .
To use inductive arguments in our correctness proof, we work with an extension of the circuits R α (for the subexpressions α of α 0 ) by a FIFO1 channel with its source end on B α and its sink end on A α , as shown in Fig. 9 . This circuit is called ext(R α ). To be able to reason about data flow at nodes A β , B β for α = β and the proper subexpressions of α, we assume that all (sink, source, and mixed) nodes are observable. Thus, the underlying node-set of ext(R α ) and its constraint automaton is
where N, h ranges over all atoms occurring in α. Recall that
Constraint automata for the extended Reo circuits. The constraint automaton for ext(R α ) has the form shown in Fig. 10 .Ã α is a sub-automaton with initial stateq α and node-setÑ α = N α \ {A α , B α }, where:
(1) q α is the initial configuration of ext(R α ) where the new buffer connecting B α and A α contains data item 0, while all other buffers are empty. (4) The generated SDS-language L(ext(R α )) over the node-set N α can be described by the scheduled-data expression
where L(Ã α ) stands for a scheduled-data expression that characterizes the accepted SDS-language of the sub-automatonÃ α with node-setÑ α .
Extended scheduled-data expressions. We now do a similar extension with the scheduleddata expressions and switch from α to the expression
β is a subexpression of α . The subexpressions α ext of ext(α) is defined by structural induction:
Relation between extended circuits and scheduled-data expressions. To formalize the relation between the SDS-languages of α and ext(α), we need the following notation.
Let L be an SDS-language with node-set M and let K be another node-set such that
0. Then, we write L| K to denote the restriction of L to the scheduled-data streams for the nodes in K ∩ M . Formally,
where for a scheduled-data stream Θ ∈ SDS M , the scheduled-data stream Θ| 
We then have: 
Correctness of the final circuit. From (1)- (6), we can derive the desired result stating that the final circuit R (shown in Fig. 6 ) generates the same SDS-language as the given expression α 0 . With α = α 0 the argument is as follows. All scheduled-data streams in
One of the Θ i 's may be infinite, in which case the above stream "ends" with Θ i .) Thus, we conclude from (4)-(6) that L(α) agrees with the SDS-language L(R) for the final circuit that is obtained from R α by adding the components "Init" and "Stop" as explained before. Note that "Init" provides an input at A α exactly once while "Stop" can consume a data item at B α exactly once. Thus, the scheduled-data streams of the final circuit R are streams of the form Recall that in R all nodes except for the sources C i ∈ N and sinks D j ∈ N are hidden. That is why we must take the projection on the N -nodes. Proof of (1)-(6). We show (1)-(6) for all subexpressions α of α 0 by an inductive argument.
Basis of induction.
For α = ε, the extended Reo circuit ext(R ε ) and its automaton have the following structure:
That is, here, the sub-automatonÃ ε consists of a single stateq ε =p ε . Conditions (1)- (4) are obvious. Moreover, L(Ã ε ) consists of the empty scheduled-data stream, and hence, agrees with the language of ε ext = ε (which yields (5)). In addition, L(ext(R ε ))
consists of the infinite scheduled-data stream
(which yields (6)). The case α = N, h where h is as in Section 5 can be verified using the join-operator that has been introduced in [5] . Essentially, is a product construction that allows for the compositional construction of automata for given Reo circuits. Applying the join operator to all channels that are involved in ext(R α ) for α = N, h , we obtain the automaton shown in Fig. 11 , which is an instance of Fig. 10 where the sub-automatoñ A α consists of the two statesq α andp α . (In this figure, we skip the data constraints concerning the additional nodes E α ,Ã α ,B α , etc.) We will not consider all the details of the construction of the above automaton by joining the automata for all of its constituent channels. To give an impression of how the construction works, Fig. 12 illustrates the most difficult case of joining the channels that are involved in the sub-circuit for a source node C ∈ N. It is now easy to see from Induction step. For α ∈ {γ;β, β ∞ , β ω } we can again use the join-operator to verify the structure of the resulting automaton and the properties (1)- (6) . Conditions (1) and (3) allow reasoning about the automaton for the original circuit, without the additional FIFO1 channel between B β and A β for the subexpressions β of α. Another crucial point in the argument for union or concatenation is that when combining the automata for the two sub-circuits R γ and R β (via union or concatenation), there is no undesired interference between the common sink nodes D or source nodes C. We skip the details here and just illustrate in Fig. 13 the situation where a source node C occurs in atoms N, h of γ and M, f of β. Moreover, here we assume that h contains the sub-constraint "d C ∈ P C " and f the sub-constraint "d C ∈ T C ". Fig. 13 shows the constraint automaton, on the right, that is obtained for the sub-circuit shown on the left. Intuitively, the constraint automaton in this figure makes it clear that there are two transitions representing the cases where C performs a write operation: one for the atom N, h in γ, and one for the atom M, f in β. 
