ABSTRACT
The conventional MBE algorithm generates n/2+1 partial product rows rather than n/2 due to the extra partial product bit at the least significant bit position of each partial product row for negative encoding, leading to an irregular partial product array and a complex reduction tree.
Booth encoding is a technique that leads to smaller, faster multiplication circuits, by recoding the numbers that are multiplied. It is the standard technique used in chip design, and significant improvements over the "long multiplication" technique. The widely used Booth algorithm is the radix-4 based modified Booth algorithm proposed by McSorley where it reduces the partial products into half. As the number of partial product required for the compression module, the height of the Wallace tree is also reduced. Operations on the encoded digits performed with multiplier input However, it is important to note that there are two unavoidable consequences of using MBE: sign extension prevention and negative encoding. The combination of these two unavoidable consequences results in the formation of one additional partial product row. So that, of the extra partial product row requires more hardware, and time. To overcome the above consequences, the authors in [2] added the least significant bit of each partial product row with the neg i0 and a carry c i . Note that both Figure 2 depicts the 8 x 8 MBE partial product array generated by the approach proposed in [2] .
Since c i is at the left one bit position of However , the approach does not remove the additional partial product row. The conventional MBE algorithm generates n/2+1 partial product rows rather than n/2 due to the extra partial product bit at the least significant bit position of each partial product row for g to an irregular partial product array and a complex reduction tree.
Booth encoding is a technique that leads to smaller, faster multiplication circuits, by recoding the numbers that are multiplied. It is the standard technique used in chip design, and significant improvements over the "long multiplication" technique. The widely used Booth 4 based modified Booth algorithm proposed by McSorley where it reduces the partial products into half. As the number of partial products reduces the number of CSAs required for the compression module, the height of the Wallace tree is also reduced. In Figure. 1.Modified Booth algorithm's basic idea is that the bits Yiand Y i-1 are recoded into serves as reference bit. In a separate step, Y i-2 and Y i-3 recoded into serving as reference bit. This signifies that the modified Booth's encoding partitions bits with 1-bit overlap and generates the following five signed digits, 2, 2. Encoding on the each group reduces the number of partial products by factor of 2.
Operations on the encoded digits performed with multiplier input X is illustrated in Table 1 .
is important to note that there are two unavoidable consequences of using MBE: sign extension prevention and negative encoding. The combination of these two unavoidable consequences results in the formation of one additional partial product row. So that, of the extra partial product row requires more hardware, and time. To overcome the above consequences, the authors in [2] added the least significant bit of each neg bit of corresponding row to obtained a new least significant bit . Note that both i0 and c i are generated no later than other partial product bits. Figure 2 depicts the 8 x 8 MBE partial product array generated by the approach proposed in [2] .
is at the left one bit position of neg i, the required additions in the reduction tree However , the approach does not remove the additional partial product row. The conventional MBE algorithm generates n/2+1 partial product rows rather than n/2 due to the extra partial product bit at the least significant bit position of each partial product row for g to an irregular partial product array and a complex reduction tree.
Booth encoding is a technique that leads to smaller, faster multiplication circuits, by recoding the numbers that are multiplied. It is the standard technique used in chip design, and provides significant improvements over the "long multiplication" technique. The widely used Booth 4 based modified Booth algorithm proposed by McSorley where it reduces s reduces the number of CSAs are recoded into recoded into Z i-2 and serving as reference bit. This signifies that the modified Booth's encoding partitions and generates the following five signed digits, 2, 2. Encoding on the each group reduces the number of partial products by factor of 2. Table 1 . is important to note that there are two unavoidable consequences of using MBE: sign extension prevention and negative encoding. The combination of these two unavoidable consequences results in the formation of one additional partial product row. So that, accumulation To overcome the above consequences, the authors in [2] added the least significant bit of each bit of corresponding row to obtained a new least significant bit are generated no later than other partial product bits. Figure 2 depicts the 8 x 8 MBE partial product array generated by the approach proposed in [2] . the required additions in the reduction tree reduced. The conventional method (complement a binary number and add 1 to the complemented number) will not work properly because the propagation delay of the carry linearly increases with the word size and it would be much greater than the delay to generate the partial products. Therefore, he proposed an extension of the well-known algorithm where all the bits after the rightmost "1" in the word are complemented but all the other bits are unchanged. The two's complement of a binary number 001010 2 is 110110 2 . For this number, the rightmost "1" happens in bit position 1. Therefore, values in bit positions 2 to 5 can simply be complemented, while values in bit positions 0 and 1 are kept unchanged. Therefore, two's complementation now comes down to finding the conversion signals that are used for selectively complementing some of the input bit.
If the conversion signal at any position is "0", then the value is kept unchanged and, if the conversion signal is "1", then the value is complemented. The conversion signals after the rightmost "1" are always 1. They are 0 otherwise. Once a lower order bit has been found to be a "1," the conversion signals for the higher order bits to the left of that bit position should all be "1."
However, this searching for the rightmost "1" could be as time consuming as rippling a carry through to the MSB since the previous bits information must be transferred to the MSB Therefore, they find a method to expedite this detection of the rightmost "1."
As we will see, this search for the rightmost "1" can be achieved in logarithmic time using a binary search tree-like structure. They first find the conversion signals for a 2-bit group by grouping two consecutive bits (the grouping always starts from the LSB) from the input and finding the conversion signals in each group, as shown in Figure. 3. Then, they find the conversion signals for a 4-bit group (formed by two consecutive 2-bit groups). Then, they find the conversion signals for a 8-bit group (formed by two consecutive 4-bit groups). This divide-and-conquer approach is pursued until the whole input has been covered. However, the approach must additionally develop and design the 2's complement logic. It possibly enlarges the area and delay of the partial product generator and less contributed by removing the extra row To remove the extra partial product row the authors in [1] extends the methods proposed in [2] and [3] . To remove the extra partial product row pp i=n/2-1 with the partial product bit can be incorporated into the sign extension bits of pp written as
And the carry bit is incorporating into the sign extension bits of first partial product row. The maximal value of these sign extension bits are 100 so the addition of carry bit to sign ext bits will never produce an overflow.
PROPOSED WORK
For recording of MBE, We need at least three signal to represent the digit set { 2X}. Many different ways have been developed. But the proposed selector circuit in [15] is not efficient to design the multiplier because it can not satisfy all the encoding conditions. For that purpose we redesign the selector circuit. The Booth encoder and selector circuits for the implementation of proposed MBE multiplier are shown in Figure. 4, To have a more regular least significant part of each partial product row , the authors in [2] added the least significant bit i0 with carry i . However, the approach does not remove the additional partial product row. i.e. , n/2+1 row still present. To remove this extra row, we are using the advantages of [3] . For generating a 2's complement of a number generally we have two method bit of a binary number and add '1' to the least significant bit. And the second method is that search for a right most binary '1' and all the bits after right most '1' in the word are complemented but all the other b However, the approach must additionally develop and design the 2's complement logic. It possibly enlarges the area and delay of the partial product generator and lessens the benefit contributed by removing the extra row.
To remove the extra partial product row the authors in [1] extends the methods proposed in [2] and [3] . To remove the extra partial product row pp n/2 due to c 3 in Figure 2 , they combine the c 1 with the partial product bit p i1 to produce a new partial product bit i1 and a new carry can be incorporated into the sign extension bits of pp 0 . The logic expressions of i1 and
For recording of MBE, We need at least three signal to represent the digit set {-2X, 2X}. Many different ways have been developed. But the proposed selector circuit in [15] is not efficient to design the multiplier because it can not satisfy all the encoding conditions. For that purpose we redesign the selector circuit. The Booth encoder and selector circuits for the implementation of proposed MBE multiplier are shown in Figure. 4, Figure. 5 respectively.
To have a more regular least significant part of each partial product row , the authors in [2] added i in advance and obtained a new least significant bit . However, the approach does not remove the additional partial product row. i.e. , n/2+1 row still present. To remove this extra row, we are using the advantages of [3] . For generating a 2's complement of a number generally we have two methods. The first method is converting each bit of a binary number and add '1' to the least significant bit. And the second method is that search for a right most binary '1' and all the bits after right most '1' in the word are complemented but all the other bits are unchanged. 
ቃ
And the carry bit is incorporating into the sign extension bits of first partial product row. The maximal value of these sign extension bits are 100 so the addition of carry bit to sign extension 2X, -1X, 0, 1X, 2X}. Many different ways have been developed. But the proposed selector circuit in [15] is not efficient to design the multiplier because it can not satisfy all the encoding conditions. For that purpose we redesign the selector circuit. The Booth encoder and selector circuits for the Figure. 5 respectively.
To have a more regular least significant part of each partial product row , the authors in [2] added in advance and obtained a new least significant bit i0 and a . However, the approach does not remove the additional partial product row. i.e. , n/2+1 row still present. To remove this extra row, we are using the advantages of [3] . For generating a s. The first method is converting each bit of a binary number and add '1' to the least significant bit. And the second method is that search for a right most binary '1' and all the bits after right most '1' in the word are Figure 5 . MBE encoder proposed in [15] In this brief, for our proposed MBE multiplier, we combine the two methods for generating the last partial product row. For the generation of the first two least significant bits of last partial product row, we are using second method i.e. , searching for significant bits and replace the bits according the Table 2 . The corresponding circuits to generate . MBE encoder proposed in [15] In this brief, for our proposed MBE multiplier, we combine the two methods for generating the last partial product row. For the generation of the first two least significant bits of last partial product row, we are using second method i.e. , searching for right most '1' in the two least significant bits and replace the bits according the Table 2 . The corresponding circuits to generate for the proposed multiplier are depicted in Figure 6 , Figure 7 . In this brief, for our proposed MBE multiplier, we combine the two methods for generating the last partial product row. For the generation of the first two least significant bits of last partial right most '1' in the two least significant bits and replace the bits according the Table 2 . The corresponding circuits to generate Where ε i is the carry bit and =n/2 is equal to the weight of sign extension bit sign extension bits of first partial product row. So that the carry bit do not propagate up to the 2n bit position. Since the maximal valu is 100 so that the addition of these bits with conversion of sign extension bits proposed in [1] is shown in Table 3 and corresponding given in Figure 8 . 
SIMULATION RESULTS
The simulation results for different numbers of regular partial product array is shown in Figure. 9, 10, 11 and 12 below. bit proposed multipliers for generating regular partial product array.
The partial products of multiplier MBE algorithm [7] [8] [9] . The latter has widely been adopted in parallel multipliers since it can reduce the number of partial product rows to be added by half, thus reducing the size and enhancing the speed of the reduction tree. However, the conventional MBE algorithm generates n/2 + 1 partial product rows rather than 
=n/2-1. If any carry is generated, since the weight of ε is equal to the weight of sign extension bit 0 at bit position n, ε i can be incorporated with the sign extension bits of first partial product row. So that the carry bit do not propagate up to the 2n bit position. Since the maximal value of sign extension bits 0 s 0 s 0 in the first partial product row is 100 so that the addition of these bits with ε i will never produce an overflow. The total conversion of sign extension bits proposed in [1] is shown in Table 3 and corresponding   Table 3.Truth table for 
ESULTS AND DISCUSSION
The simulation results for different numbers of bits for proposed modified Booth's multipliers for regular partial product array is shown in Figure. 9, 10, 11 and 12 below. Simulation results for 8 bit proposed multipliers for generating regular partial product array.
The partial products of multipliers are generally generated by using two-input AND gates or a [9] . The latter has widely been adopted in parallel multipliers since it can reduce the number of partial product rows to be added by half, thus reducing the size and he speed of the reduction tree. However, the conventional MBE algorithm generates 2 + 1 partial product rows rather than n/2 due to the extra partial product bit (negative 1. If any carry is generated, since the weight of ε i is 2 n, which can be incorporated with the sign extension bits of first partial product row. So that the carry bit do not propagate up to the 2n th in the first partial product row will never produce an overflow. The total conversion of sign extension bits proposed in [1] is shown in Table 3 and corresponding circuit is bits for proposed modified Booth's multipliers for Simulation results for 8-input AND gates or a [9] . The latter has widely been adopted in parallel multipliers since it can reduce the number of partial product rows to be added by half, thus reducing the size and he speed of the reduction tree. However, the conventional MBE algorithm generates negative bit) at the least significant bit position of each partial product row for negative encoding, leading to an irregular partial product array and a complex reduction tree. Some approaches [1] [2] [3] have been proposed to generate more regular partial product arrays, for the MBE multipliers. The authors in [3] added the least significant bit of each partial product row with negative bit in advance, but the method does not remove the additional partial product row. In the proposed work in [2] the authors proposed a method to directly generates the two's complements of a negative row, but it requires extra hardware. In [1] author almost overcome the above problems by extending the methods proposed in [2] and [3] . The carry bit generated at the last partial product row in [3] is incorporating into the sign extension bits of first partial product row. But the proposed circuits were not satisfying all the conditions. So we redesigned the circuits by a simple method for reduction of last negative bit. For comparison, we have implemented the multiplier proposed in [1] . For the implementation of the other partial product except for a few of partial product bits that are generated by different schemes to regularize the partial product array, the other partial product bits are generated using the proposed MBE selector circuits for all the multipliers. These multipliers are modelled in verilog HDL and synthesized by using synopsys design compiler with Artisan TSMC 90nm technology. The implementation results in Table 5 demonstrated that the improved reduction in area 5.9%, power 3.2%, and the delay is 0.5% as compared to proposed multiplier in [1] for8 x 8 multiplier. We can also observe that achievable improvement for 16 x 16 multipliers in area, power, and delay are 4.0%, 2.3%, and 0.3% respectively. For the comparison purpose for the generation of partial products for higher order bits we implemented up to 64 x 64 bit multiplier. As shown in Table 4 . It shows that for higher order multipliers the percentage improvement with respect to the area, delay, and power is respectable. 
CONCLUSION
Multiplication is a frequently encountered operation, especially in signal processing applications. So the development of a multiplier is vital for applications in portable mobile devices such as personal multimedia players, cellular phones, digital cam coders and digital cameras. Many designs have been proposed for Booth encoder and selector logic using CMOS over the past decades. But those designs when implemented in CMOS resulted in higher transistor count. In our research, Booth encoder and selector logic occupies one third of the entire multiplier architecture. So careful optimization of these logic parts will result in a considerable reduction of hardware.
We proposed new circuits for removing the extra partial product array because of negative compensation bit for negative coding. The results obtained here are compared to proposed results in [1] . It shows that proposed multipliers are well regularly structured and complexity of the design is reduced. Also observed that the reduction in area, power, delay are 5.9%, 3.2%, 0.5% respectively for 8 x 8 multipliers and it is 4.0%, 2.3%, 0.3% for 16 x 16 bit multipliers. Apart from above we also presented the new circuit for MBE selector.
