Abstract-The power dissipation at the off-chip bus is a significant part of the overall power dissipation in digital systems. This paper presents irredundant address bus encoding methods which reduce signal transitions on the instruction address buses by using adaptive codebook methods. These methods are based on the temporal locality and spatial locality of instruction address. Since applications tend to JUMP / BRANCH to limited sets of addresses, proposed encoding methods assign the least signal transition codes to the addresses of JUMP / BRANCH operations in the past. Our encoding methods reduce the signal transitions on the instruction addres buses by an average of 88%.
I. INTRODUCTION
Recently interest in VLSI design has shifted to minimizing the power dissipation from the conventional design criteria such as chip area and operation speed. In general microprocessor based systems, a large amount of power dissipation comes from the system buses such as highly capacitive memory buses. In these situations, power reduction of buses has become serious concern, for low power VLSI design, as the power dissipation related to the bus is becoming the major part of the total chip power dissipation. Generally, capacitance of such buses are several orders higher than the capacitance of on-chip internal logic because the buses between microprocessors and memories are off-chip wide long interconnects. It means that some additional encoding and decoding circuits can reduce signal transitions of buses (it corresponds to the power dissipation of buses) even though additional power dissipation of such codec circuits are necessary. Many power reduction methods for bus including data encoding schemes have been proposed [l, 2, 3, 8, 9 , 101. Figure 1 shows basic concept of low power bus encoding / decoding scheme. In this paper we introduce new address bus encoding techniques which are based on adaptive code-book methods. Since the address buses have their own characteristics (temporal and spatial locality), the signal transitions can be easily reduced by bus encoding schemes. In addition our methods use adaptive codebooks to enhance the performance of signal transition reduction, it gives more power reduction compared to the other previous works. Since our methods use codebooks more effectively than other work [7] , the codebook hit rates are still high and higher reduction of signal transitions can be achieved. The rest of this paper is structured as follows: first we describe the previous works related to our work. In section 3, we explain the idea of our encoding methods. In section 4, we present the experimental results by using an instruction set simulator and SPEC2000 benchmarks. Finally we conclude this work in section 5.
PREVIOUS WORKS
In this section we review previous related works. There are a lot of low power bus encoding methods and some of them are reviewed below. In many encoding methods for low power system buses proposed so far, they use one or more redundant bits to reduce signal transitions effectively on buses. In many systems, such redundant bus lines (interconnects) are not acceptable because of implementation problems on the PCBs or MCM. Since our methods are irredundant low power address bus encoding methods, most of the following are irredundant address bus encoding methods.
The following notation will be used in this section: [3] . TO-C coder51 is an expansion of TO encoding to omit the redundant bit. Figure 2 is the pseudo code of TO-C encoding method. In this encoding method, the concept is almost the same as TO encoding. To omit the redundant bit, the exceptional operation is carried out to keep 2N-to-2N mapping between the original and encoded addresses. This expansion also increases the performance of signal transition reduction and it reduces the number of signal transition on address bus by 70%.
In [6] , TO-XOR code and Offset-XOR code are proposed. The former works as follows:
It works the same as TO encoding when an address increments uniformly. In other cases, the signal transitions are reduced compared to TO encoding because the offset address between before and after JUMP operation is usually small. Its concept can be applied to the Offset-XOR encoding and it works as follows:
Usually the offsets are small numbers and the XOR of the previous bus value and the offset has a small number of transitions.
In ALBORZ encoding [7] , the authors introduced anoth.er encoding of the offset for Offset-XOR encoding. For example, the small minus offset (such as "-1") has many Is in binary representation. ALBORZ method changes it to a value with fewer number of 1s. This method also has codebooks and it contains the recently used offsets. In case the offset is hit to the contents of the codebook, the bus signal transitions are reduced by using LWC(Low Weight Code). This encoding method can reduce the number of signal transition on address buses by up to 89%.
ENCODING METHOD
This section explains encoding techniques that we propose in this paper. First we describe TO-C based encoding method with an adaptive codebook. Then we describe Offset-XOR based encoding method with an adaptive codebook. Both encoding methods utilize adaptive codebooks to enhance the performance of signal transition reduction considering temporal and spatial locality of instruction address buses. Please note that both encoding methods are irredundant (no extra bit).
A. TOCAC Code
In this section, we introduce a new address bus encoding method based on TO-C code [5] . As described in the previous section, TO-C code is an expansion of TO code by omitting the redundant bit. To improve the performance, we combined adaptive codebook with TO-C code, we named it TOCAC (TO-C with adaptive codebook) code.
The basic concept is to assign the least signal transition codes to the most frequent patterns. For example, the following case occurs on the instruction address bus without JUMF' or BRANCH operations.
Data(t) = Data(t-1) + S
Our method assigns zero transition code to such cases just like TO code (or TO-C code). As a result, Data(t) is encoded to Bus(t-1). In addition, we use an adaptive codebook which contains the recently accessed destination address of JUMP / BRANCH operation (JUMP/BRANCH destination address history). In case that the address to be transmitted is hit to one of the addresses in the codebook, our method assigns MSB one-hot code to such cases. For example, in case of a loop operation, the same destination addresses of JUMP/BRANCH operations will frequently appear. By using TOCAC code, the number of signal transitions is effec- tively reduced in such cases. Then the encoder and the decoder update their own codebooks to keep consistency between them. Since this encoding method is irredundant, the 2N-to-2N mapping should be maintained in the N-bit buses. In other words, the encoded bus value must be decoded to the original value uniquely. To satisfy these assumptions, these assignments should be swapped in the 2N-to-2N mapping. The detailed operations are described in the following sections. 
Encoding Method of TOCAC code

Data@--
; n i o n n n t i~ + n i o n n n i i
(c) an example of exceptional operation The following are the detailed encoding operation of each
Operate TO-C encoding
First TOCAC encoder perform TO-C encoding to the raw ad-
Check whether the address is contained in {His[i]}
Then the raw address is checked whether it is contained in the destination address history list {His[i]} (it is the codebook). In case that the address is contained in the codebook, Bus(t) = Bus (t-1) @ OH[i] is transmitted to the bus, where
is one of the MSB One-Hot codes. Therefore in such a "hit" case, the number of signal transition of bus is 1, which means that TOCAC code is more effective than TO-C code.
This operation corresponds to the lines 5 to 12 of Fig. 3 . Figure   4 
Update the codebook
Finally the encoder and the decoder update their codebook on the same manner. In the current implementation, the codebook is updated only when JUMPBRANCH operation is performed and the address does not hit to the codebook. For simplifying the codebook circuits, we use the FIFO buffer for the codebook.
This encode method is similar to ALBORZ code [7] . In AL-BORZ code, contents of codebooks are the offsets of successive instruction addresses to simplify the encoding algorithm. On the other hand, contents of codebooks are the original destination address values in our method. From these differences, we assume our method can increase the codebook hit rate because the hit rate depends not on the offset but the destination address essentially. As a result, our method can reduce signal transitions more effectively.
B. OXAC Code
In this section, we introduce another address bus encoding method based on Offset-XOR code [6] . TOCAC code can reduce signal transitions when the address value increments or the address value hits to the codebook. However in other situations, it cannot reduce signal transitions effectively because the raw address value is transmitted. To improve the performance, we introduce a new encoding method, OXAC (Offset-XOR with adaptive codebook) code.
Basic concept is very similar to TOCAC code in the sense that both encoding methods use an adaptive codebooks which contain history of JUMPBRANCH destination addresses. In the cases that the address does not hit to the codebook, OXAC encoder transmits an offset between Data(t) and Data(t-1) whereas TOCAC encoder transmits a raw address value(Data(t)). Figure 5 shows the block diagram of OXAC encoder. The following are encoding method of OXAC code:
Calculate an offset between Data(t) and Data(t-1) and encode it
First the Offset between the previous address and the expected address of incrementation is calculated as follows:
In case that an address increments, Offset is zero. To reduce the number of 1's in small negative offsets, LSB-Inv [7] 80000000h FFFFFFFFh function [7] is given to the offset. LSB-Inv function inverts all bits of negative numbers except their MSBs, and positive numbers unchanged. Table I shows examples of LSB-Inv function and as a result the modified offsets of small negative value have less 1's in their binary representation.
Swapping using an adaptive codebook
Then the modified offsets are swapped as the same manner of TOCAC code. In case the address is hit to one of the contents of the codebook, the modified offset is replaced by the corresponding one-hot code (OH[i] ). To keep 2N-to-2N mapping, the modified offset which is equal to OH[i] is also replaced by the corresponding modified offset (Offset[i] ). These operations just swap mappings.
As a result of this step, the modified offsets have less 1's in their binary representation when the address increments or the address hits to the codebook.
3.
XOR operation between previous bus value and modified offset To reduce signal transition, the encoder calculate XOR operation between Bus(t-1) and modified offset and transmit it to the bus. It results in the signal transition reduction of the bus since the modified offsets are assumed that they have less 1s in terms of binary representation.
Update the codebook
Finally the encoder and the decoder update their codebook on the same manner. It is done just same as TOCAC code which is described in the previous section.
C. Decoding Methods of TOCAC Code and OXAC Code
As described above, both TOCAC code and OXAC code use an input address value (Data(t), Data(t-1)) and the previous bus value(Bus(t-1)) and the codebook ({His[i]}). The decoder can also use them and decode to the original value if the codebook of the encoder and the codebook of the decoder are always identical. 
IV. EXPERIMENTAL RESULTS
To evaluate the effectiveness of the proposed encoding methods, we implemented these algorithm in C-language. We used the Simplescalar[l2] architecture simulator to trace the instruction addresses. We generated instruction address traces for 8 of the SPEC 2000 benchmark programs and each trace consists of 100,000,000 instructions and all the simulations have been done using the SPEC 2000 test input parameters. The used benchmark programs are g z i p , g c c , mcf, amp, a r t , b z i p 2 , equake a n d p a r s e r . Table II shows the signal transition reduction ratios for each encoding method. In this table, the signal transition reduction ratio means the ratio between the number of signal transitions of the raw instruction address and the encoded address. In other words, smaller signal transition reduction ratio indicates higher performance of signal transition reduction. We evaluated TO code, TO-C code, Offset-XOR code for comparison in addition to TOCAC code and OXAC code. The last row of this table shows the average signal transition reduction ratio over the all benchmarks for each encoding method. As shown in the table 11, the proposed encoding methods with 8 or 16 codebooks are superior to other encoding methods in terms of signal transition reduction. And the signal transitions are reduced to 15.8% and 11.8% by using TOCAC code and OXAC code, respectively. These results indicate that adaptive codebook based encoding methods are very effective and applicable for low power encoding of instruction address buses. Figure 6 shows signal transition reduction ratio and the hit ratio to the codebook for each codebook size for proposed encoding methods. We used g z i p benchmark for this experiment. As described in the previous section, TOCAC code and OXAC code use the same algorithm for updating codebook. Therefore hitratio of them are identical. From these results, the signal transition reduction ratio highly depends on the codebook hit ratio. And we can assume 16-entry codebook is sufficient for proposed encoding methods since the codebook hit ratio is saturated around there. As compared with ALBORZ code [7] , the hit rate is higher because our methods use codebooks more effectively as described in the previous section. To obtain the same hit rate, our methods require codebooks of half sizes compared to ALBORZ. It indicates that the additional hardware costs of our methods are smaller since the codebook size is one of the parameters which affect the codec circuits. Figure 7 shows the distribution of signal transitions in the encoded stream for each encoding method. In this experiment, the g z i p benchmark is also used for evaluation. TOCAC code and OXAC code used the codebooks which contain 16 words for retaining JUMP/BRANCH destination address. Though there is almost no zero-transition pattern in raw address, around 80% of patterns are zero transition in other encoding methods. It is because over 80% of the instruction addresses increment in g z i p benchmark. Moreover almost 98% of encoded patterns are zero bit or one bit transition in our adaptive codebook encoding methods. It also certifies the performance of our methods.
Finally, we analyzed the power dissipation overhead of the codec circuit of the proposed encoding method. To estimate the power dissipation of OXAC encoder and decoder, we designed them by using Verilog-HDL and we got the physical transistor net-list by using a commercial logic synthesis tool and a commercial P&R tool. The target technology was 1.8-volt 0.18-pm CMOS process. We also used a switch level circuit simulator to estimate the power dissipation with traced instruction address of g z i p benchmark. Figure 8 shows the total power reduction including power dissipations of encoder / decoder circuits for each value of the external address bus capacitance. In this figure, the total power reduction ratio indicates the ratio between the non-encoded raw address stream and the encoded address stream including the power dissipation of codec circuits, where the signal swings of U 0 interfaces are 3.3-volt. Since this experimental result is a first trial of implementation of the proposed methods, the circuit is not well optimized. Still it is clear that our methods are effective for large value of address bus capacitances. In this paper, we proposed new low power encoding methods which were applicable for instruction address buses. Since the proposed methods use codebooks which contain JUMP / BRANCH destination addresses, they can reduce signal transitions on the address buses more effectively than other methods.
These methods can be easily applicable for conventional systems, because they are irredundant encoding methods. Experimental results showed that these methods can reduce signal transitions by 80-95% (average 88%). Moreover circuit simulation results indicate these methods are very attractive when the address bus capacitance is large.
As future works, we are refining the encoding algorithm to decrease hardware overheads of codec circuits. We believe that small modifications of encoding algorithms can reduce the overheads considerably. Moreover we are investigating low power address bus encoding methods based on other profiles of address traces though we only used only JUMP / BRANCH destination address in this paper. We assume that these low power bus encoding scheme will contribute to future low power systems.
