Abstract-In the domain of High Performance Energy Efficient Computing (HPEEC), the focus of research is shifting toward energy efficient computing or low power VLSI design or green computing. We are applying three different clock gating technique in target design of Gurmukhi Unicode Reader (GUR). These power saving techniques reduce the power dissipation of GUR in significant amount. It has been observed that if we switch to flip flop based clock gating instead of simple clock then we can save 89.6% of clock power and we can save 90.48% of clock power if we switch to latch free clock gating (LFCG) or latch based clock gating (LBCG) in case of all SSTL based logic families. It has been observed that if we switch to flip flop based clock gating instead of simple clock then we can save 23.88% of IO power and we can save 26.59% of IO power if we switch to either LFCG or LBCG in case of SSTL2_II_DCI. It has been observed that if we switch to FFBCG instead of simple clock then we can save 52.38% of total power and we can save 56.67% of total power if we switch to either LFCG or LBCG in case of SSTL2_II_DCI.
Introduction
Clock gating is an energy efficient technique. It is used for power savings by gating off the functional units not required by the currently executing instruction, as determined by the Instruction Decoder unit [1] . 
CLKG
When the idle condition is true, the clock signal is calculated by function f (CG) as shown in Figure 1 . CLK is the system clock and CLKG is the gated clock of the functional unit. According to reference [1] , Power optimization traditionally relegated to the synthesis and circuits level, now shifted to the System Level and RegisterTransfer-Level. This is possible due to clock gating which switch off the inactive units of the design and reduce overall power consumption. There are many clock gating styles in [1] available to optimize power in VLSI circuits [2] . As shown in Figure 2 , there are three clock gating styles. These are Latch Free Clock Gating, Latch Based Clock Gating, Flip-Flop Based Clock Gating. Stub Series Terminated Logic (SSTL) is a group of electrical standards for driving transmission lines commonly used with DRAM based DDR memory IC's and memory modules. The Stub-Series Terminated Logic (SSTL) for 2.5V (SSTL2) and 1.8V (SSTL18) standards are for general purpose memory buses.Virtex-6 FPGA I/O supports both standards for single-ended signaling and differential signaling. This standard requires a differential amplifier input buffer and push-pull output buffer. For energy efficient design, we generally use IO standards. HSTL [8] and LVCMOS I/O standard [6] [7] are used for energy efficient design. We are extending our work to SSTL IO standards with combination of clock gating [1, 2, 5] . SSTL18_II, SSTL15, SSTL2_II_DCI, SSTL18_II_DCI, SSTL2_II, SSTL15_DCI are six different IO Standards available on FPGA as shown in Figure 3 . We are using these 6 different IO for energy efficient Gurmukhi Unicode reader design along with clock gating. The word Gurumukhi means "from the Guru's mouth" [3] and Shahmukhi translating into "from the King's mouth" [4] . This energy efficient Unicode reader design [5] [6] deals with hardware aspects of natural language processing. Table 1 shows the representation of vowels used in Gurmukhi in the hexadecimal form. Similarly, we have hexadecimal codes for consonants, digits and other signs in Gurmukhi.
Table1: Unicodes for various vowels used in Gurmukhi In LFCG, Enable (EN) and Clock (CLK) are two inputs and GATED_CLK is output. We have to hold enable signal high for at least duration of clock period in order to avoid glitches as shown in Figure 4 .
2. Latch-Based Clock Gating Design LBCG has a level sensitive latch in order to hold the enable signal during entire clock period. There are two types of LBCG design. OR gate is used with latch when clock is low. Combination of latch with AND gate when clock is high as shown in Figure 5 . 
Results
There is significant reduction in clock power but only a minor reduction in IOs power possible with clock gating. For further reduction in IOs power, we use IO standard technique. IO standard is a matching of impedance of transmission line and input/output port. In this paper we have observed the clock power, IO power and total power at SSTL IO standard. All the three powers are being analyzed at different standards of SSTL based logic families. In the following table clock power has been at different SSTL based IO standards. It has been observed that if we switch to flip flop based clock gating instead of simple clock then we can save 89.6% of clock power and we can save 90.48% of clock power if we switch to latch free clock gating and latch based clock gating in case of all SSTL based logic families as shown in the Figure 8 and Table 2 . It has been observed that maximum power reduction occurs is in the case of SSTL2_II_DCI logic family in which if we operate the circuit with clock only then total power consumption is of 14.944Watts. But if switch to latch free or latch based clock gating instead of simple clock than there is a consumption of only 10.970 Watts which means there is 26.59% reduction in power and furthermore if we switch on to flip flop based clock gating then we can save 23.88% of power as shown in Figure 9 and Table  3 . In the following table the total power has been analyzed at different SSTL based IO standards. It has been observed that maximum power reduction occurs is in the case of SSTL2_II_DCI logic family in which if we operate the circuit with clock only then total power consumption is of 32.833 Watts. But if switch to latch free or latch based clock gating instead of simple clock than there is a consumption of only 14.221 Watts which means there is 56.67% reduction in power and furthermore if we switch to flip-flop based clock gating then we can save 52.38% of power as shown in Figure 10 and Table  4 . 
Conclusion
Clock gating is the most efficient technique to reduce clock power as well as dynamic power dissipation in compare to power dissipation without clock gating. Latch Free Clock gating technique is the most efficient clock gating technique in compare to FlipFlop based clock gating. It has been concluded that as far as a clock power is being concerned then we can save 90.48% of power in all the SSTL based logic families. Where as in case of IO and total power maximum power reduction occurs in the case of SSTL2_II_DCI which is 26.59% and 56.67% with the help of clock gating.
Future Scope
We have implemented Gurmukhi Unicoder on 40nm technology based Virtex-6 FPGA. There is a scope to implement this Gurmukhi Unicoder on 28nm Artix-7, 28nm Kintex-7 FPGA to make the most energy efficient Gurmukhi Unicoder. We have worked up only on the SSTL based Logic Families in this paper. We can further enhance it by working on other Logic Families such as HSTL, LVDCI, LVCMOS and many more to make our circuit more energy and power efficient.
