periodically monitor temperature at runtime and dynamically reduce heat dissipation as soon as the temperature exceeds a predefined DTM trigger threshold. However, these techniques inevitably deteriorate the performance of the microprocessor.
The register file is known as one of the hottest functional units in the microprocessor. To tackle the high temperature of the register file, DTM operations have been used, resulting in performance degradation. To reduce heat dissipation while minimizing the performance degradation, we adopt the banked integer register file scheme [2] , which was originally proposed to reduce dynamic power consumption. The banked integer register file scheme is reported to reduce the dynamic energy by 38.6% [2] and the peak temperature of the integer register file by 19°C [3] . In this letter, we compare the impact of the lower heat dissipation of the banked integer register file on performance to that of the non-banked integer register file. We also examine the benefits of the banked integer register file scheme in relation to leakage power consumption. As far as we know, this is the first study that evaluates the impact of the banked register file on performance and leakage considering heat dissipation.
II. Banked Register File Structure
Generally, in a high-performance superscalar microprocessor, access to the register file is one of the timing-critical paths in determining the clock frequency. As the issue width and instruction window size are increased for better performance, the number of ports and the size of the register file also should be increased. However, increasing the numbers of ports and the size of the register file could be a dominant delay factor. To overcome these problems, the banked integer register file scheme can be adopted to reduce the access time by 20% while reducing instructions per cycle (IPC) by less than 5%, though the register file area is increased by 11% [2] . As shown in Fig. 1 , the register file provides a total of eight global read ports and four global write ports using four interleaved register banks. In addition to these global ports, each bank has two local read ports and two local write ports connected to each other by the bit-line. This structure simplifies the local-global port crossbar by connecting one local port on each bank to the global left operand buses and the other to the global right operand buses. This allows any instruction to get both operands from the same bank in one cycle. The register file partitioned with several subword banks handles the inevitable bank conflicts without compromising cycle time or adding excessive complexity by containing wakeup-select loop.
III. Simulation Environments
We first compare the temperature of the banked integer register file [2] to that of the conventional non-banked register file. Then, we evaluate the performance of microprocessors that have a different register file structure when the following DTM techniques are used: local toggling (LTOG), which is also called fetch toggling, and dynamic voltage scaling (DVS). We set the DTM trigger threshold temperature to 90°C. Finally, we evaluate the total processor leakage power consumption, when the banked register file is adopted.
For processor simulations, we used the SimpleScalar [4] simulation toolset. The processor configuration is the same as that used in [1] , and the Alpha 21264 floorplan is used. Due to the additional circuitry overhead, the area of the banked register file is larger than that of the conventional register file [2] . However, for conservative evaluation, we did not consider this area increase. Wattch [5] , SimpleScalar, and HotLeakage [6] were also incorporated into our processor simulator to evaluate the dynamic and leakage power consumption of each functional unit in the microprocessor. We used HotSpot [1] , which was validated against a commercial microprocessor, to convert the power trace into temperature at the architectural level. For simulations, we selected eight applications (crafty, gcc, gzip, mcf, parser, perlbmk, twolf, and vpr) from the SPEC2000 benchmarks suite [7] .
IV. Simulation Results Figure 2 shows the peak temperature of six units located near the integer register file (including the integer register file), assuming that no DTM technique is used. As shown in Fig. 2 , the banked integer register file is cooler than the non-banked integer register file by as much as 12.8°C on average. When DTM techniques are applied, the banked register file invokes far fewer DTM operations, which may result in performance improvement. Note that the other functional units do not cause any DTM operation since their peak temperatures are below 90°C. In addition, the temperature of the units adjacent to the integer register file is also reduced by 1°C to 2°C since heat dissipation from the integer register file to the adjacent units is reduced. Table 1 describes the performance improvement of the banked integer register file. The performance improvement with the banked integer register file is 13.37% on average. The lowered temperature of the banked integer register file reduces the number of cases exceeding a pre-defined DTM trigger threshold, which reduces the number of DTM operations. Consequently, the reduced number of DTM operations leads to performance improvement. Table 2 shows the leakage reduction rate of the functional units near the integer register file including the integer register file and the total processor leakage reduction rate. The banked integer register file reduces the leakage power of the integer register file by as much as 56.36% on average. Since leakage power is dependent on temperature, the decrease in temperature caused by the reduced dynamic power in the banked integer register file leads to the leakage reduction in the banked integer register file. Additionally, the reduction in dynamic power consumption also leads to reduced heat dissipation to the adjacent functional unit, which results in 1% to 5% leakage reduction of the adjacent functional units. Thus, the total processor leakage power consumption is reduced by 10.48% on average by adoption of the banked register file scheme. 
V. Conclusion
In this letter, we adopted the banked integer register file to improve performance and to reduce leakage power consumption by decreasing the temperature of the integer register file. The simulation results show that the banked register file significantly reduces heat dissipation, resulting in an average performance improvement of 13.37%. The lowered temperature also reduces total processor leakage power by an average of 10.49%.
