INTRODUCTION 1
Low power VLSI devices and circuits have been a subject of keen research interest in today's era of deep submicron technologies. Suppression of the sub-threshold leakage current and control of transistor count per chip, is highly desirable to extend the battery lifetime of highperformance portable applications with long idle periods. For reducing leakage power, different power gating (PG) structures with high performance in the active mode and short wake-up time during standby mode have been proposed in literature [1] [2] [3] . During active mode, runtime leakage is also minimized using PG techniques [4, 5] . But many negative circuit level aspects of PG, such as power supply noise, large instantaneous current through sleep transistor (ST) etc. necessitate a careful understanding of its impact on the behavior of circuits. These aspects can be controlled by controlling the current through the sleep transistor during sleep to active mode transition. Previously different strategies has been developed in order to minimize this kind of noise. Some of the literatures related to this noise in the PG circuit are described next, in this section.
Step-wise Turn-on Approach
Kim et al. [6] have reported the ground bounce noise is reduced by turning the sleep transistors ON in a non-uniform stepwise manner. Based on literatures [7, 8] , a three-step turn-ON strategy is presented to suppress this bounce noise. Kumar et al. [9] stated that delay and bounce noise are controlled in four steps by turning ON the parallelly connected ST gradually.
2. Two-step Turn-on Approach
In, tri-mode PG structure ground bounce noise is suppressed with a two-step transition in the park mode during the sleep to active mode [10] . A dual-switch circuit technique is described in literature [11] with a ground bounce noise suppression methodology. A tri-transistor controlled circuit technique is presented in literature [12] to suppress the ground bounce noise in gated-VDD and ground MTCMOS circuits. A tri-mode PG structure [10] is also proposed in literature [13] for the reduction of noise in sequential MTCMOS circuit.
3. Stacking PG Approach
According to Saxena et al. [14] , high performance stacking PG structure is implemented to reduce ground bounce noise and leakage current. Sreenivasulu and Rao [15] reported that transistor sizing techniques are adopted to reduce ground bounce noise. Based on literature [16] , low leakage multiplier circuit is designed to control ground bounce noise using a stacked sleep transistor with a delayed select signal. Kumar et al. [17] stated that multi-VDD level converter is used in PG technique to reduce the leakage current and ground bounce.
4. Low Dropout Voltage Regulator (LDO)
The fluctuations in the power supply and the ground line also can be minimized by LDO. Over the past years, number of LDOs have been designed having certain advantages as well as shortcomings, one over the other. Wu et al. [18] proposed LDO shows a reduction in the load regulation and dropout voltage with a low PSRR (44dB). A CMOS LDO of 0.4V dropout voltage is designed by Hicham and Qjidaa [19] using wideband and low-current circuit technique. Kugelstadt [20] has introduced a relatively less efficient LDO which is designed with a 1nF internal capacitor. Based on Giustolisi et al. [21] , a robust compensation technique is applied in LDO regulators to reduce the compensation capacitor. A capacitor less LDO regulator architecture was presented by Torres et al. [22] .
5. PG Circuit and Bounce Nois
A new design of PG approach (active power gated (APG) circuit) in active mode has been developed. In this work, effect of ground bounce noise of poposed active power gated (APG) circuit is analyzed using various ground bounce noise minimization approaches [12, 13] under SOC configuration.
LDO regulator has been designed with the target of reducing power supply noise. LDO regulator has been implemented in the proposed PG architecture in order to reduce more noise on the power over other noise minimization technique.
PROPOSED ACTIVE POWER GATING ARCHITECTURE
Architecture of new Active Power Gating (APG) approach is shown in Figure 1 . This architecture differs from literature [5] in sequential part and data retention part of the design. In this design, both the combinational and sequential blocks are power gated by ST at the positive edge of the clock cycle, whereas reported in literatures [4, 5] , only combinational part is power gated. PG of Retention logic is not done for retaining states. Here, PG is applied in the active mode within the clock cycle to minimize the runtime leakage if idle time is present. Three retention logics are used, two of them ('retention logic 1 negedgeclk' (RL1), 'retention logic 2 negedgeclk' (RL2)) are controlled by the negative edge of the clock cycle and another one ('retention logic 3 posedgeclk (RL3)') is controlled by the positive edge of the clock cycle.
The complete timing diagram of the proposed architecture is shown in Figure 2 . During the hold time (Thold) RL1 and RL2 separate the combinational logic from the sequential logic while RL3 passes all the previous states of combinational logic. These states are propagated through the sequential logic and RL1. After hold time, combinational logic and sequential logic are power gated at the remaining part of the positive clock cycle (TPGoff) to reduce leakage power during which no intermediate nodes are passed. At the negative edge of clock cycle (TPGstart), combinational block becomes active and RL3 is turned-OFF. During the evaluation time (Teval), circuit blocks evaluate its state (Teval). After evaluation, RL2 allows the data for evaluation at this stage. Then, the states propagate through the sequential logic and pass through the RL1. During the setup time (Tsetup) inputs change its states. This process is repeated at the next positive edge. 
NOISE MINIMIZATION APPROACHES
Here, parasitic components: resistance, inductance, and capacitance of the 40-pin Dual In-line Package (DIP-40) [23] assumed in the proposed APG design are 0.217Ω, 8.18nH and 5.32pF respectively for capturing more accurate behavior of the circuit which is not considered in litretures [4, 5] . In this paper, Tri-Mode PG (ckt1), DualSwitch PG (ckt2), 3-Transistor Controlled PG high VT (ckt3), 3-Transistor Controlled PG low VT (ckt4) approaches are implemented in APG architecture to show the effect of noise on the proposed architecture. A benchmark circuit s27 [24] is taken to implement these architectures. Noise immune designs are simulated and noise results are reported.
PROPOSED LOW DROPOUT VOLTAGE REGULATOR (LDO)
The basic LDO circuit [20] consists of a pass device (PD), 'voltage reference' circuit (VR) and error amplifier (EA) with loop control logic. In this work, PD ( Figure 3 ) is implemented with four PNP transistors in Darlington pattern which is quite different from other works. This type of configuration is adopted to generate constant load voltage at high load current with low dropout voltage and low quiescent current for high efficiency. Figure 6 shows the optimized LDO architecture with proposed pass device circuitry that generate desired stable output voltage. A feedback type bandgap reference circuit ( Figure 4 ) is designed according to the specifications for producing a fixed voltage irrespective of power supply variations, temperature changes and loading on the device. Bandgap circuit produces a fixed voltage by adding a voltage that increases with temperature to a voltage that decreases with temperature. These types of voltage can be generated by tapping the base-emitter voltage (VBE) and using difference in VBE of bipolar junction transistors. In this bandgap circuit, two current components (I1, I2) are generated which are proportional to VEB1 and VEB2. Current mirror by shorting the gates of M1 and M2, is used to force currents into a pair of bipolar transistors whose base emitter voltage difference is used to establish ΔVBE. To create ΔVBE drop at R3, V1 and V2 should be equal. The OPAMP is so designed that the two node voltages V1 and V2 are always equal. Normally for ideal OPAMP, V1 and V2 are always equal as no current flows into the inputs. To make node voltage V1 and V2 equal, the currents I1, I2 and resistors R1, R2 are fixed at an equal value. R3 value should be smaller than R1, R2 to create ∆V and temperature incisive of VREF. The current equations of M1 & M2 are: where, VGS, VDS, VT and λ are gate to source voltage, drain to source voltage, threshold voltage and channel length modulation parameter respectively. From these two equations of I1 and I2, it is found that:
Here, λ × V DS ≪ 1. So, we can neglect λV DS from the above equation and the new equation will form in the following manner, 
where, k1 and k2 are aspect ratio. The transistors have been chosen as identical, so the W/L ratio (k) of these two transistors are same, i.e. I1=I2. Current flow through Q1 and Q2 are:
since V1=V2=VBE2, if we apply KVL between two transistors Q1 and Q2, the equation will become:
where, ∆ = , VTV=Thermal voltage V1= I1R1, V2=I2R2, since, V1=V2=> I1R1=I2R2. Now, apply KVL from VREF node to ground along R3 and Q2 to determine VREF.
VREF = V2+I2R2
(10)
Hence, the reference circuit produces a temperature incisive voltage as VBE2 decreases and V increases with temperature. A differential amplifier is used as an EA ( Figure 5 ) to compare the scaled-down version (VSO) of the output (produced by resistors using voltage divider rule) with the fixed reference voltage (VREF). As, proposed PD should be optimized, hence, to drive this device, amplifier is also optimized accordingly. This amplifier is designed using current mirror topology. Current through both the transistors are equal. Sizes of parallel transistors are set as identical (M1=M0, M2=M3, M4=M5) to generate equal amount of current.
The output voltage is maintained by proper design of pass device (PD) and adjusting the ratio of two resistors (R3, R2). The PD must be large enough to guarantee the minimum dropout voltage while providing the maximum load current. The output of the differential amplifier (VERROR) drives the emitter terminal of the two PNP transistors (Q7, Q8). 
when, VERROR = VSO -VREF = 0, then, VSO = VREF:
Equation (14) is true when VIN is sufficiently high to keep differential amplifier and pass device in saturation region. In this region, VOUT becomes constant which set VDO to very low voltage.
As part of its regulation, differential amplifier and bandgap reference circuit of LDO can attenuate any spikes in the input voltage which have some deviation from the internal reference. High PSRR LDO can be generated by minimizing dropout voltage. So, spikes on power rail can be suppressed by using the high PSRR LDO in the circuit. If LDO regulator is used in the power line of PG architecture, then only this device can regulate fluctuated supply voltage during PG mode. It can also help the neighboring circuits to function correctly while sharing the same power line.
PROPOSED NOISE MINIMIZATION SCHEME USING LDO IN APG ARCHITECTURE
In this paper, this architecture implemented considering package inductance is termed as ckt5 (Figure 7) . The proposed LDO regulator is attached between the high_VT transistor (ST) and the power rail (VDDL) in APG architecture. Noise on power rail can be defined by:
where,ξ=(VDD-VDDL), VDD = supply voltage, VDDL= voltage drop at power rail. During mode transition of PG circuit, voltage drops on power rail due to the flow of high current leads to increase in noise (ΔV = VDD -VDDL). To overcome this problem, high PSRR with low dropout LDO regulator is proposed and implemented in the PG circuit that can reject spikes on the power rail (VDDL) using its internal differential amplifier, bandgap reference circuit and pass device. LDO reduces variation on the power rail (VDDL) by regulating the voltage of VDDL node. This in turn increases the value of VDDL node near to VDD. Hence, noise (ΔV=VDD -VDDL) on the power rail reduces which leads to decrease in the rate of change of current ( ( , )) flow through the power rail. There is no contribution of noise due to current drawn by the LDO circuit as on chip VDD drives LDO. 
SIMULATION RESULT
The designs of proposed modified APG and ckt1, ckt2, ckt3, ckt4, ckt5 and LDO are implemented taking benchmark circuit-s27. It is observed from Table 1 , that with the reduction of the frequency of operation as idle time increases, APG design becomes more efficient to reduce leakage. Power saving results of this APG design outperforms the results of the previous PG works [3, 4, [25] [26] [27] . Memory access PG reduces the wasted core leakage power by 38.04% [26] . For 16-bit multiplier design at frequency 1MHz, power saving is 29% compared to PG design [4] . Data-Retained PG (DRPG) [27] used to reduce active mode leakage achieves up to 25.7% saving over conventional designs. At 2.5V input voltage, output and dropout voltage of the proposed LDO is 2.00253V and 247.47mV respectively. This regulator produces load current of 50.3343mA and the quiescent current of 117.79μA. The measured PSRR is as high as 73.97dB with an efficiency of 88.79%. LDO maintains around 250mV dropout voltage in between 2-3V input voltage and plot of line regulation is shown in Figure 8 .
It is also seen that LDO maintains output voltage of about 2.3V with the variation of input voltage from 2.25 -2.75V ( Figure 9 ). The variation of output voltage with the changes of load current from 30 to 50mA is shown in Figure 10 . The load regulation of the proposed LDO is observed to be 2.28%. This proposed LDO provides output voltage of about 2V by varying the load resistance (R4) from 30Ω to 100Ω. It is shown that, output voltage is almost constant at higher load resistance (5 -10KΩ) as shown in Figure 11 .
The variation in output voltage with respect to temperature is also observed for the proposed LDO circuit (Figure 12 ). From the Figure 12 , it is seen that output voltage becomes constant after room temperature attainment for the proposed LDO circuit. Then, bounce noise produced by the ckt1, ckt2, ckt3, ckt4, ckt5 in PG circuit during sleep to active mode is discussed. Table 2 , it is observed that noise on the power supply (0.06mV) in ckt5 is lower than the ckt1, ckt2, ckt3, ckt4 over APG. There is a large deviation in the voltage level of power supply with package inductance in APG. Whereas, there is less peak to peak variation in power rail with the noise reduction logic of the PG circuits. Percentage saving of noise (peak to peak variation) over APG is also shown in Figure 13 .
Percentage saving of noise on the power rail in ckt5 is 98% compared to the ckt1, ckt2, ckt3, ckt4. Transient analysis of peak to peak noise produced on the power rail using LDO regulator for ckt5 is shown in Figure 14 . From Table 3 , it is found that, proposed noise reduction approach gives better result compared to other previous works. Using proposed concept voltage fluctuation reduces significantly, but power and area occupancy increases due to the extra circuit component of LDO. Figure 13 . Saving of Noise on Power Rail over APG Figure 14 . Peak to peak noise on power rail for ckt5 
CONCLUSION
In this paper, a new PG architecture is proposed for minimization of noise and power. PG is applied for both the combinational and sequential block in active mode. But, PG circuit also suffers from noise on the power rail during sleep to active mode. Effectiveness of different noise reduction approaches in this proposed architecture have also been analyzed in this paper. All the circuits -APG, ckt1, ckt2, ckt3, ckt4, ckt5 have been designed for reduction of noise on the power supply. Peak to peak voltage fluctuation is reduced more in this scheme of using low dropout voltage regulator in PG architecture compared to the other approaches.
ACKNOWLEDGEMENT
This work is supported by SMDP-C2SD project, sponsored by MeitY, Govt. of India. 
