The most critical arithmetic functional units are digital multipliers. This system overall performance is depends on the throughput, of the multiplier. An n-bit by n-bit multiplier can be employed by an n-bit by n-bit multiplication, which can be carried out iteratively. K cycles are required for completion of a multiplication, which can be terminated earlier, if some of the leading significant bytes are all zeros or ones. This paper proposes a simple scheme of dual modified adaptive, to exchange the two operands dynamically, for reducing more cycles to 32-bit by 32-bit multiplications. The designing of a dual modified adaptive multiplier for 32*32 bit number multiplication is proposed. 
INTRODUCTION
The important task for many microprocessors and DSP processors is to design a cost effective multiplier.
For achieving the major objectives are area, delay and power consumption. Parallel implementations can use more area and consume more power but which result in a fast multiplier. The serial implementations are using less area and consuming less power, but result in a slow multiplier. A 32-bit by 8-bit Boothmultiplieris employed and spends 4 cycles, for carrying a32-bit by 32-bit multiplication. This implementation signifies a balance among power, delay and area. It requires more cycles to perform a multiplication, but it takes much less time for each cycle and requires less area than a 32-bit by 32-bit parallel multiplier.
In order to perform correctly, traditional circuits use critical path delay as the overall circuit clock cycle.
However, the probability that the critical paths are activated is low. In many cases, the path delay is shorter than the critical path.
The critical path delay is used as the overall cycle period, which will result in significant timing waste.
Hence, to reduce the timing waste of traditional circuits, the variable-latency design was introduced. Booth multiplier and thereby minimizes the effect of glitches. The extra delay and power consumption can be minimized, if the exchange logic is made simple. Note that, from the operand source registers to the multiplicand and multiplier, registers the extra delay is added to that path. The normal addition rule is applied in accurate part, whereas a special method of addition takes place in inaccurate part. Output "sum" value is calculated normally, when any one of the operand value of adder is "0". When both operands are"1", "sum" value can be fixed as "1" from that bit position, to least significant bit. This technique is used to minimize the error distance of the adder, in load demand and renewable generation, because of day-ahead forecast error, there exists inevitable deviation between real time operation and day-ahead generation schedules. Especially in the recent years, the increasing penetration of intermittent resources aggravated the situation and confronted the system with high variability and uncertainty. This poses great challenges on the secure and reliable operation of the power systems. One alternative to solve this problem is, to introduce intra-carry generation scheduling (ICGS), which fills the gap between dayahead generation schedules and real-time operation, and serves as an effective tool for optimal dispatch over a look-ahead time horizon. By enforcing ICGS, we can re-schedule the conventional generation units, with the latest forecast information to meet the net load in a more cost-effective way.
LITERATURE SURVEY
When shorter paths are frequently activated, then the average latency in the variable-latency design is better than that of traditional designs. For example, speculation techniques are used by many variable-latency adders with error detection and recovery. In addition, the critical paths are divided into two shorter paths that could be unequal. The clock cycle is set to the delay of the longer one. To improve performance, these research designs were able to reduce the timing waste of traditional circuits, but they did not consider the aging effect. During the runtime, it could not adjust themselves. Each row contains(n − 1) full adder (FA) cells. In the CSA array, each FA has two outputs: 1) the sum bit goes down and 2) the carry bit goes to the lower left FA. The last row is a ripple adder for carry propagation. In the AM the FAs are always active without regarding of input states. In a low-power column-by passing multiplier design, the FA operations are disabled if the corresponding bit in the multiplicand is 0.
Figure 2: Column-Bypassing Multiplier
The above Figure 2 shows a 4×4 column-bypassing multiplier. For example, let us assume the inputs are10102 * 11112, it can be noted that for the FAs in the first and third diagonals, two of the three input bits are 0. Therefore, in both diagonals the output of the adders is 0, and the output sum bit is simply equal to the third bit, which is the sum output of its upper FA.
Figure 3: Existed System Block Diagram
In the existed system architecture, the column-and row-bypassing multipliers can be examined by the number of zeros in either the multiplicand or multiplicator. It is to predict, whether the operation requires one cycle or two cycles to complete. When there are random input patterns, the number of zeros and ones in the multiplicator and multiplicand follows a normal distribution.
Hence, by using similar architecture the two aging-aware multipliers can be implemented. The difference between the two bypassing multipliers, lies in the input signals of the AHL. The input signal of the AHL in the architecture, with the column-bypassing multiplier is the multiplicand, whereas that of the row-bypassing multiplier is the multiplicator.
Razor flip-flops can be used, to detect whether timing violations occur before the next input pattern arrives. The main flip-flop catches the execution result, by using a normal clock signal. The shadow latch catches the execution result, using a delayed clock signal, which is slower than the normal clock signal. If the latched bit of the shadow latch is different from that of the main flip-flop, then the path delay of the current operation exceeds the cycle period and the main flip-flop catches a result which is incorrect. When errors occur, the Razor flip-flop will set the error signal to 1, to notifying the system tore execute the operation and notify the AHL circuit that an error has occurred. Razor flip-flops are used to detect whether an operation, which is considered to be a one-cycle pattern can really finish in a cycle. If not, the operation is re executed with two cycles. Although the re execution may seem costly, if the re execution frequency is low, then overall cost is low. In the aging-ware variable-latency multiplier, the key component is AHL circuit. The AHL circuit consists of an aging indicator, two judging blocks, one mux, and one D flip-flop. The aging indicator used for indicating whether the circuit has suffered significant performance degradation, is due to the aging effect. In a simple counter, the aging indicator is implemented to counts the number of errors over a certain amount of operations and it is reset to zero at the end of those operations. The column-or row-bypassing multiplier is not able to complete these operations successfully, if the cycle period is too short, causing timing violations.
PROPOSED SYSTEM
According to the input logics, the gates of in the Dual Modified adaptive multiplier are always active. The operations in the Dual Modified adaptive multiplier design are disabled, if the corresponding bit in the multiplicand is 0. 
CONCLUSIONS
An aging-aware variable-latency multiplier is designed with the AHL multiplier. It is able to adjust the AHL to mitigate performance degradation, due to increased delay. The experimental results show the proposed architecture with 32*32 column-bypassing multipliers. This paper proposes a simple scheme, to exchange the multiplicand and multiplier dynamically to reduce more execution cycles for 32-bit by 32-bit multiplications, with a small area and delay overhead.
The Dual Modified adaptive can generate the M and N, interconnected blocks. Each block contains gates and row in the architecture. Dual Modified adaptive multiplier is more efficient than the existed system. The required hardware and chip memory is reduced, by the proposed system. It reduces the delay.
