In this paper, we present a framework to formally describe and study the interconnection of distributed shared memory systems. Using it allows us to classify the consistency models in two groups, depending on whether they are fast or not. In the case of non-fast consistency models, we show that they cannot be interconnected in any way. In contrast, in the case of fast consistency models we provide protocols to interconnect some of them.
Introduction
Distributed shared memory (DSM) is a well-known mechanism for interprocess communication in distributed environments [21] . Roughly speaking, it consists in using read and write operations for interprocess communication, thus hiding the particular communication technique employed by the programmers to avoid the need to be involved in the management of messages. However, this can cause problems in systems where several processes independently and simultaneously submit reads and writes, since they can see each other's operations out of order. This problem led to the concept of consistency models. A consistency model is a specification of the allowable behavior of memory, and it can be seen as a contract between memory implementation and the program utilizing memory: the memory implementation guarantees that for any input it will produce some output from the set of allowable outputs specified by the consistency model, and the program must be written to work correctly for any output allowed by the consistency model. Depending on the semantics of the memory operations, a number of consistency models has been proposed in the literature (see for instance [21, 7, 13, 11, 16] ).
In this paper, we study the interconnection of distributed shared memory systems. By this we mean the addition of an interconnection system to several existing distributed shared memory systems that implement a given consistency model in order to obtain a single distributed shared memory system that implements the same consistency model. There are two main reasons for interconnecting DSM systems with new protocols instead of using a single protocol for the whole system:
• First, in this way we can interconnect systems that are already running without changing them. They can go on using their protocols at their local level.
• Second, depending on the network topology, it could be more efficient to implement several systems and interconnect them than to have one single large system. An example of this would be a DSM system that has to be implemented on two local area networks connected with a low-speed point-to-point link.
If the protocol that is used broadcasts updates, in a single system with many popular protocols there would be a large number of messages crossing the point-to-point link for the same variable update. In this case, it would seem appropriate to implement one system in each of the local area networks, and use an interconnecting protocol via the link to connect the whole system. With the appropriate interconnecting protocol, many fewer messages cross the link for each variable update. It is interesting to compare our approach with the concept of locality, defined by Herlihy and Wing [12] . Both approaches have to do with the ability to compose DSM systems. However, locality addresses composability of DSM systems with the same set of processes but disjoint sets of memory objects, while our approach studies the composability of DSM systems with the same set of memory objects but disjoint sets of processes.
A first contribution of this work is the introduction of a framework for the interconnection of memory systems and the formalization of the interactions between the existing memory systems and the interconnection system. Furthermore, we identify the fastness of a memory model (a concept that will be defined later in the paper) as the key property that will qualify it to be an interconnectable memory model or not.
In the case of non-fast consistency models, we show that they cannot be interconnected in any way, thus deriving that a number of popular memory models can not be interconnected (e.g., the atomic, safe, regular and sequential models [18] , the PCG and PCD consistency models [10, 1] , the eager release model [9] , the lazy release model [17] , the entry model [6] , the scope model [14] , etc.).
In contrast, we show that several fast consistency models can, indeed, be interconnected (namely, the pRAM [20] , causal [2] , and cache models [10] ). However, whereas the cache model can be interconnected without any restriction, we found that the other two memory models can only be interconnected when the subsystems fulfill certain restrictions. In this last situation, we give sufficient conditions and the corresponding interconnecting protocols to do so.
Regarding previous work that has been carried out on the interconnection of distributed shared systems, as far as we know, it has only been studied in [8] . 1 Here, we extend the results of that paper in a number of ways. First, we consider consistency models other than the causal one (which was the only one considered in [8] ). Second, we provide some impossibility results related with interconnection of consistency models. Third, we use much weaker assumptions on the systems to be interconnected.
The rest of the paper is organized as follows. In Section 2, we introduce the framework for the interconnection of systems. In Section 3, we show the impossibility of interconnection for nonfast consistency models. In Section 4, we study the interconnection of pRAM systems, in Section 5 the interconnection of causal systems, and in Section 6 we show how to interconnect cache systems. In Section 7, we briefly study the performance of the proposed interconnecting protocols. Finally, in Section 8, we present some concluding remarks.
System model
We consider distributed shared memory systems (or systems for short) formed by a collection of application processes that interact via a shared memory consisting of a set of variables. All the interactions between the application processes and the memory are performed through read and write operations (memory operations) on variables of the memory.
Each memory operation is applied on a named variable and has an associated value. A write operation of the value v in the variable x, denoted w(x)v, stores v in the variable x. A read operation of the value v from the variable x, denoted r(x)v, reports to the issuing application process that the variable x holds the value v. To simplify the analysis, we assume that a given value is written at most once in any given variable and that the initial values of the variables are set by using fictitious write operations.
1 In addition, of course, to the preliminary version of this paper, appeared in OPODIS'03 [15] .
Furthermore, we also consider explicit synchronization operations. Synchronizations can be used just to import information, as with the acquiring of a lock, or just to export information, as with the release of a lock.
In order to characterize the system model, we specify the components that form it, the consistency model, the system architecture and the interconnecting system.
The consistency model
Roughly speaking, a consistency model (also called memory model) is a specification of the allowable behavior of the system's operations. To formally define a consistency model, first we introduce what a system's execution is. An execution α of a system S consists of a set of read and write operations, as well as synchronization operations (if any), issued by the application processes that form system S. Such operations must preserve the so called execution order. To define this, first we introduce the process order. Obviously, for this definition to make sense, it is necessary to define what an execution of type M is in each case. The specification of particular types of executions will be dealt with later in the paper. For such a task, we need to define several related concepts. → op to denote that op precedes op in a view β. We will omit the view when it is clear from the context. We will also use α → α , where α and α are sets of operations, to denote that all the operations in α precede all the operations in α . 
Definition 1 (Process Order
)
Definition 4 (View

The system architecture
From a physical point of view, we consider distributed systems as consisting of a set of nodes and a network that provides communication among them. The essence of this model has been taken from [4] . The application processes of the system are actually executed in the nodes of the distributed system. We assume that the shared memory abstraction is implemented by a memory consistency system (MCS). The MCS is composed of MCS-processes that use local memory at the various nodes and cooperate following a distributed algorithm, or MCS-protocol, to provide the application processes with the impression of having a shared memory. The MCS-processes are executed at the nodes of the distributed system and exchange information as specified by the MCS-protocol. They use the communication network to interact if they are in different nodes. Each MCS-process can serve several application processes, but an application process is assigned to only one local MCS-process. For each application process p we use mcs(p) to denote its MCS-process. An application process and its MCS-process have to be in the same node, as stated by the following assumption. An application process sequentially issues read/write/ synchronization operations on the shared variables by sending (read/write/synchronization) calls to its MCS-process. After sending a call, the application process blocks until it receives the corresponding response from its MCS-process, which ends the operation. We assume an asynchronous model. This means that there is no bound on the amount of time instructions and message transmissions take. We do not assume synchronized clocks among processes. We also assume that no system component (processes, nodes, and networks) fails. Fig. 1 shows an example of the system architecture described above.
Regarding the consistency model implemented by a system (i.e., by its MCS), we follow the same approach taken when defining a consistency model:
Definition 6 (System). A system is of type M if all its executions are of type M.
Furthermore, we consider systems in which at least the last write operation on every variable must be eventually visible in every process of the system. This is a very natural property which is preserved by every system that we have found in the literature. In our terminology, it means that their MCSs must satisfy the following property:
Liveness property. Consider any execution α of system S. If there is only one process writing on variable x and its last operation on x was w(x)u, then eventually the response to any read call on x issued by any application process will contain the value u.
The Interconnection system
Interconnecting several systems involves making them to behave as though they were one single system. Using the terminology defined above, this actually means interconnecting several MCSs.
In our model, the load of such an interconnection will fall on an interconnection system (IS). An IS is a set of processes (IS-processes) that execute some distributed algorithm or protocol (IS-protocol). For simplicity in the IS design, we consider the existence of one IS-process for each MCS to be interconnected.
The IS-process of each system is an application process and, hence, it has an MCS-process that by Assumption 1 is in its same node. The IS-process uses the MCS-process to read and write on the shared memory of the local system. In particular, the only way a value written by an application process in some system can be read by an application process in another system is if the IS-process of the latter system writes it. IS-processes exchange information with each other (as specified by the IS-protocol) by using a reliable FIFO communication network. Note that, after the interconnection, the overall system has a global MCS formed by the MCSs of the original systems plus the IS that interconnects them. Fig. 2 presents an example of an IS interconnecting two systems.
Definition 7.
We will say that a consistency model can be interconnected if for any collection of systems implementing this consistency model there is an IS-protocol that interconnects them.
In the rest of the paper we will use N to denote the number of systems to be interconnected. The systems to be interconnected will be denoted by S Regarding how the ISs operate, we note that it is necessary to guarantee that any given IS-process be eventually aware of the writes taking place at the MCS-processes that it manages, so that it could exchange such information with other IS-processes. This functionality can be implemented in a number of ways.
(1) Within the IS-process: for instance, it can be implemented by making the IS-process check for any updated variable, by periodically reading the whole memory. In this case, the ISprocess will behave as a regular application process and no additional assumption is made on the MCS-processes. (2) Within the MCS-processes: in this case the MCS-processes have to communicate explicitly any update to the IS-process.
Whereas such an approach could be more efficient than the previous one, it requires that the MCS-processes be able to perform such a task. (3) By using a combination of both.
In order to maintain it as general as possible, in this paper we only assume that there is an interface (between the MCS and the IS) that provides the above mentioned functionality, without considering how it is implemented. 2 However, in order to ''decouple'' as much as possible the original systems and the interconnecting protocol, this interface does not allow the IS to contact the MCS (except to read or write variables). In particular, the IS cannot block the MCS (as was done in [8] ). More formally, the interface guarantees the following assumption: Assumption 2. When any MCS-process updates its local memory (as a result of a write operation issued by an application process), the IS-process will be asynchronously notified about these events (i.e., about the updated variable, the written value and the application process). Other than this, there is no MCS-initiated interaction between MCS and IS processes.
Fast vs non-fast consistency models
In this section, we show that only systems implementing fast consistency models can be interconnected. Formally, we define a fast consistency model as follows: Definition 8. We say that a consistency model is fast if there is an MCS-protocol that implements it, such that memory operations only require local computations before returning control, even in systems with several nodes.
Since there are several examples of popular fast and non-fast models, this implies that the property of being fast classifies the set of memory models in a non-trivial way. The following observation will be useful to prove some subsequent results. For the second part, we use induction on i to show that i systems can be interconnected for any i ≥ 2. For i = 2 the claim is trivially true. Now, assume that we can obtain a system S by interconnecting the systems S 0 , S 1 , . . . , S i−2 . The result of the interconnection is a single system. Then, the IS-protocol can be used to properly interconnect S and S i−1 .
In what follows, we consider the interconnection of only two systems, and use this observation to generalize our results to several systems. Now, we prove that non-fast memory models cannot, in general, be interconnected.
Theorem 1. There is no IS that guarantees the interconnection of systems implementing non-fast memory models.
Proof. We show the result by contradiction. Assume that there is a non-fast memory model M that can be interconnected. From Observation 1, we can consider the interconnection of two systems. Therefore, let us assume there is an IS I that interconnects two systems implementing M. Let us first take a distributed system with two nodes. In each node we implement a system with one MCS-process, at least one application process, and the corresponding IS-process. By Assumption 1, the MCS-process and the application processes (the IS-process included) are in the same node. Then, in each of these two single-node systems each memory operation only requires local computations. Now, we use I to interconnect these two systems into a unique system implementing M. By Assumption 2, I cannot block the MCSprocesses. Then, every memory operation in the resulting system still requires only local computations, which contradicts the fact that M is not fast.
As a consequence of this theorem, we derive that a number of popular memory models cannot be interconnected. In [4] it is shown that the sequential consistency model is not fast. Hence, it cannot be interconnected, and the same happens with the atomic consistency model and its derivations, i.e., the safe and regular memory models [19] . Similarly, Attiya and Friedman [5] have shown that the processor consistency models PCG and PCD [10, 1] are not fast, and consequently cannot be interconnected. Finally et al. [5] also proved that any algorithm for the mutual exclusion problem using fast operations must be cooperative. This implies that any synchronization operation that guarantees mutual exclusion must be non-fast. Therefore, any synchronized memory model that provides exclusive access cannot be interconnected. As a result, we have that memory models such as the eager release [9] , the lazy release [17] , the entry [6] or the scope [14] cannot be interconnected.
On the other hand, there is a number of consistency models that are fast and for which Theorem 1 does not apply. In the following sections we show that some of the most popular fast memory models (namely, the pRAM [20] , the causal [2] and the cache [10] ) can indeed be interconnected, although, in some cases, in a constrained fashion.
We will assume that these fast systems control the replicas by propagating the new values to update the replicas. This assumption does not significantly restrict the domain of application of our results, since all current implementations of fast models we are aware of have been obtained by using propagation.
Interconnection of pRAM Systems
In this section, we study the interconnection of pRAM systems [20] . In this model, every process performs all its operations locally and transmits updated values to the other processes along FIFO channels. These updates are later performed asynchronously at the remote processes. Formally, we define a pRAM system as follows:
Definition 9 (pRAM System
Following, we show that, in general, the interconnection of pRAM systems is not possible. That is, there is no IS that interconnects every pair of pRAM systems. The proof is based on the fact that some pRAM systems may not be FIFO ordered (the formal definition of FIFO ordered system is provided below). Indeed, as we pointed at the end of Section 2, it is necessary to have a functionality that guarantees that any given IS-process be eventually aware of the writes taking place at the MCS-processes that it manages. If such a functionality is implemented within the IS-process (i.e., by making the IS-process periodically read the whole memory), it is easy to prove that pRAM systems, in general, cannot be interconnected. Basically, the idea is that when the IS-process finds two variables that have been modified since the last time it read them, it cannot know in which order they were written, and may propagate them to the other systems in an incorrect order. Hence, such a functionality must be implemented within the MCS-processes (i.e., by making them to communicate explicitly any update to the IS-process, as described in Assumption 2). Depending on how this update is done, it could happen that local replicas are updated in a different order than the write operations were issued. 3 Then, the communication of the updates between the MCS-processes and the IS-process (which is performed without using read and write operations but using explicit messages) could be out of order.
In what follows in the rest of the paper, we extend our notation and use both subscripts and superscripts to respectively denote the process that performs the operations and the system where such a process is located.
Theorem 2. There is no IS that guarantees pRAM interconnection for every pair of pRAM systems.
Proof. The proof is based on the fact that when some process p in S k issues several write operations, it may update the corresponding variables in its local memory in a different order from p's process order.
Let us assume, by way of contradiction, that there is a system S T which is the result of interconnecting two pRAM systems S 0 and S 1 through some interconnection system I. From Definition 9, we know that for every execution α T there is a legal view β Despite the previous result, we have found that for certain types of pRAM systems, which we call FIFO ordered, it is still possible to do this.
FIFO ordered systems. We say a system is FIFO ordered if for each process p in S k , if p issues two write operations w(x)v ≺ p w(y)u, then the mcs(isp k ) process updates its local replica of x with the value v before updating its local replica of y with the value u.
In Fig. 3 Proof. See Appendix A.1.
Interconnection of causal systems
In this section, we study the interconnection of causal systems [2] . In the causal model, in addition to the conditions of the pRAM executions, read operations are forced to return the value written by the latest causally ordered operation (i.e., read operations preserve the execution order in Definition 2). Formally, we define a causal system as follows: 
Definition 10 (Causal System).
A system S is causal if for every execution α and every process p there is a legal view β p of α p preserving ≺.
As in the case of pRAM systems, here we consider systems that are implemented by using propagation. First of all, we have that since the pRAM model is strictly weaker than the causal model [7, 2] , the result of impossibility in Section 4 is also applicable to causal systems.
Corollary 1 (From Theorem 2). There is no IS that guarantees causal interconnection for every pair of causal systems.
In spite of this result, in the previous section we presented an ISprotocol for interconnecting pRAM systems that are FIFO ordered. Thus, a question that naturally arises is whether it is possible to interconnect causal systems that are also FIFO ordered. However, here we show that this result does not apply to causal systems. That is, there is no IS that interconnects every pair of causal systems, even if they are FIFO ordered. Nevertheless and although in general the interconnection of causal systems is not possible even if they are FIFO ordered, we found that it is still possible to interconnect causal systems that are globally ordered.
Globally Ordered Systems We say that a system is globally ordered if for each two write operations w(x)v ≺ w(y)u issued by (maybe different) processes in S k , each mcs(p) with p in S k updates its local replica of x with the value v before updating its local replica of y with the value u.
In Fig. 4 , we present an IS-protocol that can be used to connect causal systems that are globally ordered. It consists of two concurrent tasks, Propagate k out and Propagate k in , such as in the ISprotocol in Fig. 3 . In fact, the Propagate k in task is the same. The key difference is found in task Propagate out , where a pair x, v is not sent to the other systems until all the MCS replicas of x have been updated.
Clearly, globally ordered systems provide stronger guarantees than FIFO ordered systems. However and similar to this latter type, they do not undergo substantial constraints with respect to systems that are not globally ordered. Theorem 5 states formally the guarantees provided by the above mentioned protocol. Proof. See Appendix A.2.
Interconnection of cache systems
In this section, we study the interconnection of cache systems [10] . Roughly speaking, this memory model forces independent variables considered in isolation to be sequential. That is, data operations on any individual variable must ''appear'' to have been executed atomically in an order that is consistent with the order seen in individual processes. Formally, a cache system is defined as follows:
Definition 11 (Cache System). A system S is cache if for every execution α and every variable x there is a legal view β x of α x preserving ≺ (where α x denotes the subset of operations obtained by removing from execution α all the operations on variables other than x).
We show that, unlike the previous models, the interconnection of cache systems is always possible, independently of how they are implemented. In Fig. 5 , we present an IS-protocol that can be used to connect cache systems of any type. It consists of only one task Propagate k . Note that each IS-process maintains a copy of the latest value propagated from the other system in last(x) for each variable x. That copy must be initialized with a special value (namely, NoData). Note also that initially one of the IS-processes (for instance isp 0 ) must send a message with x, NoData to the other for each variable x to start the interconnection. Theorem 6 states formally the guarantees provided by the above mentioned protocol. Observe that the proposed interconnection algorithm is usually highly inefficient in terms of the network traffic it causes, given that, for each memory object, there is a continuous exchange of messages between the IS-processes. Since the main target of this work is to identify whether consistency models can be interconnected, we are not very concerned about efficiency. However, we note that the algorithm can be easily optimized to decrease the network traffic. For instance, instead of handling each variable independently, the algorithm could work at a larger scale, dealing with several (or even all the) variables simultaneously. This optimization can be done by modifying task Propagate k () so that it is executed only upon reception of a given set of x, v pairs (for different variables), and executing the same code for each one of them. Additionally, instead of continuously transferring messages between the IS-processes, such a transfer could be performed only when a given time interval has passed or when the variable x is updated at the sending system, whichever happens first. This optimizations can be done by adding some simple pieces of code that implement this wait condition. Note, however, that in some cases, these changes could increase the latency. The correctness proofs of these alternative interconnecting protocols are in essence the same as the one for the algorithm shown in Fig. 5 .
Performance
In this paper, we have approached the interconnection problem from a theoretical point of view, trying to decide whether it is possible to interconnect given consistency models. We have not been concerned above with the efficiency of the interconnection algorithms we have proposed and the performance of the resulting interconnected system. However, we do here a brief and simple performance evaluation of algorithms and systems. We compare the performance of a system obtained using our IS-protocols with the performance of a system that uses a MCS-protocol connecting all the processes directly. We assume that the same MCS-protocol is used in the global DSM system of reference and in each of the systems interconnected with our IS-protocols.
First, observe that our IS-protocols should not affect the response time a process observes when issuing a memory operation, since its MCS-process is not affected (in particular, cannot be blocked) by the interconnection. Since the three models that we study are fast, the response time of the algorithms that implement them (e.g., [20] for pRAM, [2, 3] for causal, or [16] for cache) only depends on local computation at a node. This does not change with the interconnection.
Second, let us look at the latency of a DSM system, which is the largest time until a value that is written becomes visible in any other process. In a single DSM system this time depends on the MCS-protocol used. For instance, if we discard the time for local computations, for the pRAM and causal algorithms [20, 2, 3] the latency depends on the time to complete a broadcast in the system. Let us denote this time by T B (n) in a system with n MCSprocesses, which we safely assume is at most linear on n. The only algorithm for cache consistency of which we are aware [16] has instead latency L cache (n) = Θ(nT B (n)). 4 The three proposed IS-protocols propagate as soon as they can any new value they are aware of by sending a message. In the case of pRAM and causal consistencies this is done immediately, while in the case of cache consistency the IS-process may need to wait for a message from the other IS-process. Then, using the algorithms referred above, the interconnection of two systems S 0 and S 1 with n 0 and n 1 MCS-processes respectively (n = n 0 + n 1 ), has latency T B (n 0 ) + T B (n 1 ) + d (IS-processes are assumed to use existing MCSprocesses) in the case of pRAM and causal, where d is the delay of a point-to-point communication.
In the case of cache this latency
In the three cases, since T B (n) is at most linear, the latency of the interconnected system is larger than the latency in the original system. However, since the broadcast delay cannot be smaller than d, the latency increases at most by a constant factor of 3. If this is generalized to N systems the increase factor depends on the topology. In the worst case, systems are connected as a line, and the worst factor becomes 2N − 1. In the best case systems are connected as a star, and the increase factor is again bounded by a small constant 5.
Regarding the network traffic, the pRAM and causal protocols [20, 2, 3] broadcast a message for each write operation. Let 
Conclusions
In this paper, the interconnection of distributed shared memory systems has been studied. We have classified the consistency models in two groups, depending on whether they are fast or not. In the case of non-fast consistency models, we have shown that they cannot be interconnected in any way. In contrast, in the case of fast consistency models we have provided protocols with which to interconnect some of them. Whereas in some cases it is possible to interconnect fast consistency models without any restriction, other fast consistency models need some additional constraints. In this last situation, we gave sufficient conditions and the corresponding protocols to do so. Fig. 6 summarizes these results. At this point, we note that whereas we have shown that cache systems can be interconnected in a more general fashion than pRAM and causal systems, the protocol that we have used for such a task is, in general, less efficient than the protocols used for interconnecting pRAM and causal systems.
A.1. Correctness proof of Theorem 3
Let p be some process in S k , k ∈ {0, 1} 
