Abstract-The numerous intellectual property blocks of a system-on-a-chip must be integrated so that they can meet system-specific requirements. However, such integration is not guaranteed due to mismatches between IP protocols. Protocol conversion algorithms can generate converters that can guarantee correct system behaviour, but the implementation of converters on-chip remains an open question. IPs can be modelled at several layers of the Open Systems Interconnection (OSI) model. Current protocol conversion algorithms either focus on a single layer or worse, blur the boundaries between these layers. We propose a formal framework that allows generating implementable converters for IP protocols modelled at different OSI layers using any existing converter generation algorithm. We apply the framework to an existing conversion algorithm and discuss how it can be as readily used with other algorithms.
I. INTRODUCTION
System-on-a-chip (SoC) systems contain multiple interacting intellectual property blocks or IPs [1] . IPs interact at the seven layers of the OSI standard [2] . Each layer offers a more abstract set of communication constructs, which are sequences of constructs of the layer below it. Sometimes, IPs may suffer from protocol mismatches and fail to communicate and/or meet system-level requirements. We can manually modify IPs, or use layer-specific IPs like bridges at the data-link layer or transducers at the network layer [3] . Formal protocol conversion algorithms like [4] - [7] can automatically generate IPs called converters that guarantee that the system will satisfy implicit or user-specified requirements. Unfortunately, existing algorithms either target single OSI layers [5] , or do not provide a ready path to implementing converters [4] , [6] , [7] .
We focus on the problem of implementing converters at desired OSI layers when IP protocols and requirements are specified at different OSI layers. We assume that physicallayer connections are sound. We also constrain ourselves to synchronous on-chip interconnects like AMBA [8] which are the current de facto standards for SoCs.
We propose an elegant protocol conversion framework for IP protocols and requirements modelled at any OSI layer (Sec. III). Cross-layer models are normalized to a common target layer, enabled by a simple layer-based expansion of constructs. Normalized protocols and requirements are then processed by a chosen converter generation algorithm to generate a converter, if one exists. The generated converter can be implemented at the identified target layer or at a lower target layer. As a case study, we use the algorithm proposed in [7] as it can handle both control and data mismatches. We consider a video SoC (presented in Sec. II) case study and report experimental results in Sec. IV. Concluding remarks appear in Sec. V. Fig. 1 shows an abstracted video SoC based on the AMBA AXI-4 interconnect. The SoC reads a composite A/V stream, splits it into video and audio streams via a splitter, and then processes the streams to produce progressive video-and audioout streams. An ARM processor executes the main control program. We focus only on the processor and deinterlacer IPs in this paper, without losing the generality of the framework. IP protocols are modelled using labelled transition systems, similar to existing protocol conversion approaches [3] - [5] , [7] .
II. VIDEO SOC CASE STUDY

Definition 1 (LTS):
A LTS is a tuple Q, q 0 , A, R where Q is a set of states, q 0 ∈ Q is the initial state, A is a set of actions, and R ⊆ Q × 2 A × Q is a total transition relation (all states have outgoing transitions). The language L is the set of all infinite words over A generated from state q 0 . The protocol proc in Fig. 2 (a) models the control flow of the program executing on the processor at the application layer. From initial state s 0 a transition to s 1 involves the action init() signifying an initialization function. We write transition using the short-hand s 0
States s 1 and s 2 have transitions to each other to signal the deinterlacer to run (d run()) and then to log data (log()). The data-link level deinterlacer protocol deint in Fig. 2(b Existing protocol conversion algorithms expect userprovided formalized requirements to specify how IP protocols must interact. We use the requirements model of [7] that extends LTS to have marked states. Fig. 2(c) shows a singlestate requirement req for the video SoC. The initial state r 0 is also a marked state. The language of req is the set of all infinite words that visit this marked state infinitely often. A system satisfies req if its projected language (with all non-relevant actions and transitions w.r.t. the requirement removed), is a subset of the language of the requirement. There are several issues with enforcing req on a system containing proc and deint. The protocols are modelled at different layers, and a higher-layer action like d run() would mean numerous data-link level transfers. Hence, we cannot compose the protocols to model their combined behaviour or reason about timing. Also, req constrains only the application layer behaviour and does not relate to the data-link layer. Fig. 3 shows the stages in the proposed conversion process. Firstly, protocols and requirements modelled at different layers are normalized to a single target layer. Next, the normalized models are read by a conversion algorithm to generate a converter. The converter is then implemented using an implementation algorithm suitable for the target layer. 
III. LAYER-BASED PROTOCOL CONVERSION
A. Modelling layer-level expansions
The OSI stack contains seven layers. The lower-most physical layer involves driving and sampling pin and wire level voltages. The data-link layer uses the pin and wire constructs provided by the physical layer and implements key issues such as timing, arbitration, and synchronization. It provides logical links to be used by the network layer above it. The network layer implements routing of packets. The transport layer then enables streams of untyped messages by implementing packeting, flow control and error correction over the end-toend packet streams of the network layer. The session layer provides the transmission of end-to-end untyped but named messages and carries out synchronization. The presentation layer provides end-to-end typed and named messages by using the constructs of the session layer. Finally, the application layer implements algorithms that use the presentation layer to activate and transfer data between IPs. A construct of a higher layer sequences constructs of a lower layer. We can capture this relationship as follows.
Definition 2 (Layer-based expansion): Given actions set A, the function Deconstruct reads an action a ∈ A and a targetlayer l ∈ [2, 7] and returns null (no expansion available) or The Deconstruct function decomposes a single action at a higher layer to a sequence of constructs (actions) of the next lower layer. Fig. 4 shows the fragment returned by Deconstruct(d run(), 6). A fragment may contain cycles; If we expand the presentation-layer construct send msg(deint, int) to the transport layer, error-control strategies might introduce intermediate states that have transitions back to previously visited states in the fragment when messages require re-sending.
A key question in building the Deconstruct function, essentially a look-up table, is, "Where does the expansion information come from?". There is no simple answer. We may find such information coded within software or network design layers (layers 3 to 7), or within timing diagrams and datasheets (for layers 2-4). While it is desirable to extract this information automatically, this problem is beyond the scope of this paper. In many cases though, manual specification of such expansion information should be simpler than a complete manual normalization of a protocol. The testing of this hypothesis is an important future work. The restricted number of constructs at each layer helps ensure that the Deconstruct function does not require too much space, however the amount of space will depend on the level of detail captured at each level, which in turn depends on the choice of conversion technique used. Layer-based expansion allows transforming higher-layer protocols and requirements to a lower layer. For the Video SoC, we must normalize the protocol proc and requirement req to the data-link layer at which the deinterlacer protocol is modelled. We introduce the algorithm NormalizeToLayer for this purpose. Its inputs are a LTS, and integers cl and tl indicating the current and target layers respectively. Values 1 . . . 7 for these variables refer to the physical . . . application OSI layers. The LTS is iteratively normalized layer-by-layer (lines 2-14). For each layer, the algorithm replaces a transition in the LTS with an equivalent LTS fragment at the next lower layer obtained using the call Deconstruct(a, cl) (lines 5-11). Fig. 2(a) is replaced by the LTS fragment shown in Fig. 4 . Fig. 5(b) presents the processor model normalized to the data-link layer. At this level, each transition in the model synchronizes with the bus clock, like the deinterlacer model in Fig. 2(b) . We have renumbered the states of the normalized protocol as s 0 . . . s 4 , and have removed any unreachable states after normalizing. Note that Alg. 1 does not explicitly remove any state from a LTS. However, most protocol conversion techniques construct converters by traversing only reachable states, and hence the removal of unreachable states can be seen as a secondary activity. During the normalization of the processor protocol, we assume that local applicationlayer actions of the processor, such as init() and log() from Fig. 2(a) , take only one bus clock because processors tend to be many times faster than buses. This assumption, and the fact that the processor does not use the bus to carry local operations, lead to the outgoing transitions from s 0 and s 4 to be unlabelled. Operations Fig. 2(c) is normalized. The application-layer construct d run() is expanded to a sequence of data-link layer constructs. This sequence is identical to the boxed sequence shown in Fig. 5(b) . While processing requirements, Alg. 1 needs to be slightly extended to correctly handle marked or acceptance states. In Alg. 1, the transition from state q ... S, cl, tl) ) is strongly bi-similar to LT S.
B. Normalizing heterogeneous protocols
− → q is removed and replaced by an LTS fragment LT S F (lines 4-11
The translation of upper-layer constructs into sequences of lower-layer constructs results in a progressively larger model. However, we can control this growth in state space by abstraction. E.g., data values can be replaced by control signals. Similarly, special address bus values such as for decoding slave-select signals can be modelled as control signals too.
C. Conversion and Implementation
The conversion stage applies the chosen protocol conversion algorithm to the normalized protocols and requirements. A conversion algorithm might require additional meta-data. E.g., the protocol conversion technique presented in [7] requires a classification of control signals, signal mappings, and data linkages. The proposed generic approach allows user-provided meta-data to be included and passed to the conversion algorithm along with protocol and requirement models. If required, the generated converter can be further normalized to a lower target-layer using Alg. 1. Finally, the converter can then be integrated into the system by implementing it at the target layer using an appropriate implementation algorithm, such as the one presented in [2] .
IV. RESULTS
We have created a Java-based tool that allows users to specify layer-based expansions by relating a layer-specific construct to a sequence of constructs at the next lower layer. Our tool implements the NormalizeToLayer and layerbased conversion algorithms to normalize multi-layer IP and requirements and then carry out converter generation. Unlike existing approaches, our tool always generates converters that are specific to the target layer. The generated converter can be further normalized to a lower layer if required. We assume that the use of soft-IP cores allows us to exclude modelling the physical layer constructs. Hence, our tool can generate converters until the data-link layer.
Once a layer-specific converter has been obtained, we can use a layer-specific implementation technique to synthesize a converter implementation. Tab. I shows the implementation options for the various layers [2] . As noted earlier, we do not consider the physical layer due to the use of soft IPs and automatic interconnect generation supported by SoC/NoC tools. Note that for layers 3-7, protocols are asynchronously composed, and we may extend existing asynchronous conversion or transducer generation techniques like [3] for these layers. Our chosen technique [7] can work directly with synchronous compositions at layers 2 and 3. We mitigate this restriction by using wait-states or states with self-loops when expanding models at layers 3-7. The wait states may progress to other states only when specific actions happen, allowing us to de-synchronize protocol execution and yet use synchronous composition supported by the chosen approach [7] . Fig. 6 shows how the size of video SoC increases as the processor protocol's size grows from three states at the application layer to 40 states at the data-link layer. We report two system sizes. Ex1 refers to a system comprising the processor and the deinterlacer. Ex2 refers to an extended system that also contains a stream splitter and an abstracted audio converter, modelled at the data-link layer based on Xilinx data-sheets. System size (the number of states in the composition of all constituent protocols) grows proportionally to the size of the processor protocol as other protocol sizes are kept constant, and directly affects protocol conversion times. We present a generic framework that allows converter generation techniques to work with IP implementation methods. We normalize multi-layer models of IP protocols and system-level requirements, allowing the generation of layer-specific converters that can be implemented using layer-specific methods. The normalization depends on user-provided expansions of layer-specific constructs into sequences of constructs at lower layers. We show how our technique works with a recent protocol conversion technique and discuss important issues such as choosing the right target layer, extending to other approaches, and automating layer-based expansions. Experimental results show that our approach can help translate converter logic into implementable converters.
Cross-layer modelling of IP protocols is common as IPs are customised at different levels. Converter generation techniques can generate "correct" converter logic, but overlook timing and relationships between cross-layer signals. Normalized protocols and converters as proposed in this paper can be implemented in hardware (layers 2, 3) or software (layers 3-7). We can utilize the implementation techniques presented in [2] , [9] , [10] to implement IPs. Our approach can work with other protocol conversion approaches, as well as other related approaches for generating bridging logic [3] , [11] , [12] . All existing approaches use finite state machines to model IP protocols and can therefore readily use layer-level expansions and normalization. Additional information needed by a specific technique can be modelled as additional meta-data. Capturing requirements is more tricky; Some approaches have implicit requirements [5] , [12] while others support models in temporal logic [6] or different finite-state machine variants [4] . Hence, normalizing requirements might require additional formalization. We can avoid this problem by normalizing protocols to a specific target layer and then writing requirements on these.
Future directions include looking at distributed layer-based converter generation and creating a robust tool-chain to provide end-to-end on-chip protocol conversion.
