# Instituto Tecnológico y de Estudios Superiores de Occidente 

Repositorio Institucional del ITESO
rei.iteso.mx

Departamento de Electrónica, Sistemas e Informática

# Comparison Among Booth's and Pekmestzi's Algorithms for the Multiplication of Two Numbers 

Rojas-Laguna, Roberto; Villalón-Turrubiates, Iván E.; Serrano-Arellano, Santiago; Alvarado-Méndez, Edgar; Estudillo-Ayala, Juan M.; Vite-Chávez, Osvaldo

Iván E. Villalón-Turrubiates et al., "Comparison Among Booth's and Pekmestzi's Algorithms for the Multiplication of Two Numbers", en Proceedings of the International Conference on Devices, Circuits and Systems (CIDCSVER), Veracruz México, 2003.

Enlace directo al documento: http://hdl.handle.net/11117/3325

Este documento obtenido del Repositorio Institucional del Instituto Tecnológico y de Estudios Superiores de Occidente se pone a disposición general bajo los términos y condiciones de la siguiente licencia:
http://quijote.biblio.iteso.mx/licencias/CC-BY-NC-2.5-MX.pdf

# COMPARISON AMONG BOOTH'S AND PEKMESTZI'S ALGORITHM FOR THE MULTIPLICATION OF TWO NUMBERS 

R. Rojas-Laguna, I. E. Villalón Turrubiates ${ }^{*}$, S. Serrano-Arellano*, E. Alvarado-Méndez, J. M. Estudillo-Ayala and O. Vite-Chávez ${ }^{+}$<br>Universidad de Guanajuato.<br>Facultad de Ingeniería Mecánica, Eléctrica y Electrónica.<br>Maestría en Ingeniería Eléctrica, Opción: Instrumentación y Sistemas Digitales. Prolongación Tampico No. 912, Tel.: 014646480911 ext. 119, Col. Bellavista. 36730 Salamanca, Gto., México.


#### Abstract

A comparison between two different methods of multiplication of two 8 -bit numbers is presented. This methods are the Booth's algorithm and the algorithm proposed by Kiamal Z. Pekmestzi [1]. The general objective is to show the benefits and the advantages obtained if it's used one of this algorithms over the other. This multipliers have low circuit complexity permitting high-speed operations and the interconnections of the cells are regular. This is the reason why the results shown was obtained using VHDL realization on a FPGA XC4010XL by Xilinx.


## 1. INTRODUCTION

Multiplication is the most critical operation in every computational system. Innumerable schemes have been proposed for the realization of this operation. In the early multiplier schemes proposed by Hoffman [2], Burton and Noaks [3], De Mori [4] and Guilt [5] for positive numbers, and by Baugh and Wooley [6] and Hwang [7] for numbers in two's complement form, the effort was on implementations using iterative circuits with uniform interconnections pattern. Also, on the same base, schemes using Booth's algorithm [8], [9] were presented.

The Booth's algorithm serves two purposes:

1. Fast multiplication (when there are consecutive 0 's or 1 's in the multiplier).
2. Signed multiplication.

The Pekmestzi's multiplication algorithm is based on a different mechanism. At each step, one bit of the multiplier and one bit of the multiplicand are processed. So, the algorithm is symmetric; this means that multiplier and multiplicand can be interchanged.

The main objective of this investigation is to find which of those algorithms provides the best results under a VHDL implementation in a FPGA XC4010XL of Xilinx ${ }^{\text {© }}$, and with this, ensure the best option at the moment of its use on a certain application.

## 2. BOOTH'S ALGORITHM

The goal of the algorithm is to recode the multiplier to reduce the number of additions done. The key idea is that when we have a string of 1's in the multiplier, representing a string of multiplicand adds, we can get the same effect with a single add and subtract. This is because:

$$
2^{\mathrm{m}}+2^{\mathrm{m}-1}+\ldots+2^{\mathrm{n}+1}+2^{\mathrm{n}}=2^{\mathrm{m}+1} \quad-2^{\mathrm{n}}
$$

For example:

$$
\begin{aligned}
01110 & =10000-00010 \\
14 & =16-2
\end{aligned}
$$

We can implement the multiplier by looking at the least significant 2 bits of Q . If they are:

$$
\begin{array}{lll}
\mathbf{0} & \mathbf{0} & \text { we shift only } \\
\mathbf{0} & \mathbf{1} & \text { we add M and shift } \\
\mathbf{1} & \mathbf{0} & \text { we subtract M and shift } \\
\mathbf{1} & \mathbf{1} & \text { we shift only }
\end{array}
$$

Let's see how this is done with an example: $9 * 14$ (for $\mathrm{n}=5$ ):

| M | A $\quad$ Q Qprev |  |
| :---: | :---: | :--- |
| 01001 | 00000011100 | Load |
| 01001 | 00000001110 | Shift only |
| 01001 | 11011 | $10011 \mathbf{1}$ |
| 01001 | $1110111001 \mathbf{1}$ | Subtract and shift |
| 01001 | 11110111001 | Shift only |
| 01001 | 00011 | 111100 |

Booth's algorithm also works with negative numbers like 2 * -3 :

| M | A Q Qprev |  |
| :---: | :---: | :--- |
| 00010 | 00000111010 | Load |
| 00010 | 11111011101 | Subtract and shift |
| 00010 | 00000101110 | Add and shift |
| 00010 | 11111010111 | Subtract and shift |
| 00010 | 11111101011 | Shift Only |
| 00010 | 11111110101 | Shift only |

[^0]
## 3. PEKMESTZI'S ALGORITHM

Consider two positive integer numbers X and Y :

$$
\begin{align*}
& X=x_{n-1} x_{n-2} \ldots x_{0}=\sum_{j=0}^{n-1} x_{j} 2^{j}  \tag{1}\\
& Y=y_{n-1} y_{n-2} \ldots . y_{0}=\sum_{j=0}^{n-1} y_{j} 2^{j} \tag{2}
\end{align*}
$$

We define $X_{n-1}$ and $Y_{n-1}$ as the numbers that remain after truncation of bits $\mathrm{x}_{\mathrm{n}-1}$ and $\mathrm{y}_{\mathrm{n}-1}$. So:

$$
\begin{gather*}
X_{n-1}=x_{n-2} x_{n-3} \ldots x_{0}=\sum_{j=0}^{n-2} x_{j} 2^{j} \quad \text { and } \quad X=X_{n-1}+2^{n-1} x_{n-1}  \tag{3}\\
Y_{n-1}=y_{n-2} y_{n-3} \ldots y_{0}=\sum_{j=0}^{n-2} y_{j} 2^{j} \quad \text { and } \quad Y=Y_{n-1}+2^{n-1} y_{n-1} \tag{4}
\end{gather*}
$$

The product of the numbers X and Y , according to (3) and (4), can be computed as follows:

$$
\begin{equation*}
P=2^{2 n-2} x_{n-1} y_{n-1}+2^{n-1}\left\{x_{n-1} Y_{n-1}+y_{n-1} X_{n-1}\right\}+X_{n-1} Y_{n-1} \tag{5}
\end{equation*}
$$

Let us define $P_{n-1}=X_{n-1} Y_{n-1}$ and generally $P_{j}=X_{j} Y_{j}$ (6), where $X_{j}$ and $Y_{j}$ are the numbers formed by the $j$ least significant bits of $X$ and $Y$, respectively. The product $P_{j}$ can be computed by the following recursive equation:

$$
\begin{equation*}
P_{j}=X_{j} Y_{j}=2^{2 j-2} x_{j-1} y_{j-1}+2^{j-1}\left\{x_{j-1} Y_{j-1}+y_{j-1} X_{j-1}\right\}^{2}+P_{j-1} \tag{7}
\end{equation*}
$$

According to the above relations, the product $\mathrm{P}=\mathrm{XY}$ can be computed by the equation:

$$
\begin{equation*}
P=\sum_{j=0}^{n-1} x_{j} y_{j} 2^{2 j}+\sum_{j=1}^{n-1}\left\{x_{j} Y_{j}+y_{j} X_{j}\right\} 2^{j} \tag{8}
\end{equation*}
$$

The main computation concerns the addition of the terms:

$$
\begin{equation*}
Z_{j}=x_{j} Y_{j}+y_{j} X_{j} \tag{9}
\end{equation*}
$$

These terms must be added, properly weighted, and the product is given by the next relation:

$$
\begin{equation*}
P=\sum_{j=0}^{n-1} x_{j} y_{j} 2^{2 j}+\sum_{j=1}^{n-1} Z_{j} 2^{j} \tag{10}
\end{equation*}
$$

The values that quantity $\mathrm{Z}_{\mathrm{j}}$ can take depend on the values of the logical variables $x_{j}$ and $y_{j}$ and are shown in table 1. The only case where $\mathrm{Z}_{\mathrm{j}}$ requires computation is when the two bits of the multiplied numbers have value 1 . At each step $j$, only $s_{j}$ and $c_{j+1}$ are new. The rest of the bits of $S_{j}$ have been formed in the previous $\mathrm{j}-1$ steps according to the relation:

$$
\begin{equation*}
S_{j}=S_{j-1}+x_{j-1}+y_{j-1} \tag{11}
\end{equation*}
$$

The sum $\mathrm{S}=\mathrm{X}+\mathrm{Y}$ can be computed once. During the $j$ th iteration of the algorithm, the $j$ least significant bits of $S$ with $c_{j}$ as the most significant bit form the quantity $\mathrm{S}_{\mathrm{j}}$ which, in turn, is used only if $\mathrm{x}_{\mathrm{j}}=\mathrm{y}_{\mathrm{j}}=1$. In this case, however, $\mathrm{s}_{\mathrm{j}+1}=\mathrm{c}_{\mathrm{j}}$ and, in the proposed algorithm, the quantity $\mathrm{S}_{\mathrm{j}}$ can be computed equivalently by both of the following relations:

$$
\begin{equation*}
S_{j}=c_{j} S_{j-1} s_{j-2} \ldots S_{0} \quad \text { or } \quad S_{j}=S_{j} S_{j-1} S_{j-2} \ldots . S_{0} \tag{12}
\end{equation*}
$$

Table 1
Values of $\mathrm{Z}_{\mathrm{j}}$ TO DETERMINE THE PRODUCT $\mathrm{P}=\mathrm{XY}$

| $\mathbf{X}_{\mathbf{J}}$ | $\mathbf{Y}_{\mathbf{J}}$ | $\mathbf{Z}_{\mathbf{J}}$ |
| :---: | :---: | :---: |
| 0 | 0 | 0 |
| 0 | 1 | $\mathrm{X}_{\mathrm{i}}$ |
| 1 | 0 | $\mathrm{Y}_{\mathrm{i}}$ |
| 1 | 1 | $\mathrm{X}_{\mathrm{i}}+\mathrm{Y}_{\mathrm{i}}=\mathrm{S}_{\mathrm{i}}$ |

The proposed multiplexer-based parallel multiplier is shown in Fig. 1. The cells are described in Fig. 2, 3 and 4. The implementation of a two's complement multiplier based on the proposed technique can be done on an array similar to the array of Fig. 1. Let us consider two integer numbers X and Y in this form:

$$
\begin{align*}
& X=x_{n-1} x_{n-2} \ldots x_{0}=-2^{n-1} x_{n-1}+\sum_{j=0}^{n-2} x_{j} 2^{j}=-2^{n-1} x_{n-1}+X_{n-1}  \tag{13}\\
& Y=y_{n-1} y_{n-2} \ldots . y_{0}=-2^{n-1} y_{n-1}+\sum_{j=0}^{n-2} y_{j} 2^{j}=-2^{n-1} y_{n-1}+Y_{n-1} \tag{14}
\end{align*}
$$

$X_{n-1}$ and $Y_{n-1}$ are the numbers that remain after truncation of the bits $X_{n-1}$ and $\mathrm{y}_{\mathrm{n}-1}$ from X and Y , respectively. The product $P$ of the numbers X and Y , according to (13) and (14), is equal to:

$$
\begin{equation*}
P=X Y=2^{2 n-2} x_{n-1} y_{n-1}-2^{n-1}\left\{x_{n-1} Y_{n-1}+y_{n-1} X_{n-1}\right\}+X_{n-1} Y_{n-1} \tag{15}
\end{equation*}
$$

The above relation differs from the corresponding (5) for positive numbers only in the sign of the term $Z_{n-1}=x_{n-1} Y_{n-1}+y_{n-1} X_{n-1}$. Now, the above term must be subtracted instead of added. The algorithm for all other terms remains unchanged and the product $\mathrm{P}=\mathrm{XY}$ can be computed by the next equation:

$$
\begin{equation*}
P=\sum_{j=0}^{n-1} x_{j} y_{j} 2^{2 j}+\sum_{j=1}^{n-2} Z_{j} 2^{j}-Z_{n-1} 2^{n-1} \tag{16}
\end{equation*}
$$

Quantity $\mathrm{Z}_{\mathrm{n}-1}$ is generated of the left boundary cells and it is subtracted by inverting the $\mathrm{S}_{\text {out }}$ bits of these cells. Also, the two additive inputs of the next to the leftmost top cells must be set to one. The final array implementing the multiplication of numbers in two's complement form is shown in Fig. 5.


Fig. 1. General diagram used by the proposed algorithm to calculate the product between two numbers


Fig. 2. Diagram of the rectangular cells (not filled from Fig. 1)


Fig. 3. Diagram of the rectangular cells with a circle inside (see Fig. 1)


Fig. 4. Diagram of the rectangular cells with a rectangle inside (see Fig. 1)

## 4. DESCRIPTION IN VHDL CODE

The Booth's algorithm proposed in the section 2, as well as the Pekmestzi's algorithm for positive numbers and in complement at two proposed in the section 3, are now coded in the descriptive language VHDL. These three algorithms are coded in combinational logic, whose main characteristics are:

1. The exit functions depends only on the state of the entrances.
2. It doesn't contain memory elements.
3. It has two level realization of logical gates.

## 5. SYNTHESIS AND SIMULATION

Once compiled the three codes, we proceeds to carry out the simulation of the same ones. For it, the software Foundation by Xilinx ${ }^{\circ}$ is used, carrying out the synthesis in a FPGA XC4010XL also by Xilinx ${ }^{\circ}$.


Fig. 5. General diagram of the proposed multiplexer-based two's complement parallel multiplier


Fig. 6. Positive numbers multiplication using Booth's Algorithm

First, we are going to simulate only positive numbers multiplications. Taking A and B as our inputs numbers ( 8 bits). The Fig. 6 shown the results using Booth's algorithm. We can see that the results are shown in hexadecimal numbers, and is divided in two parts (low and high). The final number is the concatenation of these numbers. The Fig. 7 shown the results using Pekmestzi's algorithm. It's clear that the results are shown directly by the output P , in decimal numbers. Now, to show the two's complement multiplication, let's use an example. Let us consider $\mathrm{A}=-19_{10}$ and $\mathrm{B}=+22_{10}$. Now, the number A in two's complement is $\mathrm{A}=11101101_{2}$, and $\mathrm{B}=00010110_{2}$. The multiplication in decimal of these numbers is $\mathrm{P}=-418_{10}$, or $\mathrm{P}=1111111001011110_{2}$. Changing to hexadecimal, the product is $\mathrm{P}=\mathrm{FE} 5 \mathrm{E}_{16}$. The Fig. 8 shown the simulation using Booth's algorithm, where the result is shown divided in two (high and low) again, but the concatenation of these values is the expected result. The Fig. 9 shown the simulation using Pekmestzi's algorithm. The result is direct and is equal to the expected value.

## 6. COMPARISON AMONG THE TWO ALGORITHMS

Now, we proceed to compare the obtained results of the simulations of both algorithms. Of this comparison, we obtain the following remarks:

1. Both algorithms carry out the multiplication of positive numbers and numbers in two's complement.
2. Both algorithms can be coded in the descriptor language VHDL in combinational logic.
3. The operation speed is appropriate.

However, certain advantages and disadvantages exist when using one of these codes instead of the other one:

1. For the Pekmestzi's algorithm, it is necessary a code for positive numbers multiplications and other similar but with certain differences for the multiplication with numbers in two's complement, what takes us to have two extensive codes. For the Booth's algorithm, we have a single and smallest code that carries out both operations.
2. The number of logical gates on the FPGA XC4010XL is 10,000 . The Pekmestzi's code for positive numbers (only) uses a total of 1195 equivalent gates for design (11.95\%). The Pekmestzi's algorithm for numbers in two's complement uses a total of 1236 equivalent gates for design ( $12.36 \%$ ). The Booth's algorithm uses a total of 1287 equivalent gates for design ( $12.87 \%$ ).
3. The VHDL code for the Pekmestzi's algorithm is of around 300 lines, while for the Booth's algorithm decreases to only 100 .
4. The Pekmestzi's algorithm gives us an alone number as a result. However, the Booth's algorithm gives us two numbers that should be concatenated to obtain the result.


Fig. 7. Positive numbers multiplication using Pekmestzi's Algorithm


Fig. 8. Two's complement multiplier using Booth's Algorithm


Fig. 9. Two's complement multiplier using Pekmestzi's Algorithm

## 7. CONCLUSION

The multiplication is one of the most critical operations in every computational systems. In this work two different algorithms are shown for the calculation of multiplications between positive numbers and two's complement numbers, which are the Booth's algorithm and the Pekmestzi's algorithm.

Once we carries out the comparison between both methods, can conclude that these algorithms complete their multiplicative work indeed. However, some advantages exist of using one of them instead of the other one, like it was described in section 6 .

If is needed to use this multiplier in a complex circuit, it is convenient to use the Booth's algorithm, because the code is less extensive and it carries out both operations. However, if only multiplications are going to be carried out, the Pekmestzi's algorithm is effective because in spite of being extensive in code is simple of to understand and to use.

Something very important is the number of gates used for design. It is possible to see that the Pekmestzi's algorithm for positive numbers (only) is the best choice, because it uses less quantity of gates. However, the algorithm has limitations (no
negative numbers). In the end, the selection of the best option will depend on the application in which will be used.

As a personal opinion, the Booth's algorithm can provide good results on a small code and using a very similar number of gates to those that are used by the Pekmestzi's algorithms. So, Booth's algorithm is a very good choice for application problems.

It is necessary to point out that the Foundation software by Xilinx ${ }^{\mathbb{C}}$ did not allow to carry out a single code that carried out both operations in the case of the Pekmestzi's algorithm (due to the extensive code). Also, it didn't allow to carry out the direct concatenation of the exit in the case of the Booth's algorithm.

## 8. ACKNOWLEDGEMENTS

This work was supported by CONACyT under projects J35313-A, J32018-A, J35303-E and the project "Apoyo a Actividades Académicas de los Progrmas de Postgrado de Excelencia de CONACyT".

This project was also supported by Xilinx©, providing the software and hardware (FPGA XC4010XL) for the realization, under their "University Program" for support.

## 9. REFERENCES

[ 1] K. Z. Pekmestzi, "Multiplexer-Based Array Multipliers", IEEE Trans. Computers, vol. 48, no. 1, pp. 15-23, Jan. 1999.
[2] J. Hoffman, G. Lacaze, P. Csillag, "Iterative Logical Network for Parallel Multiplication", Electronics Letters, vol. 4, p. 178, 1986.
[ 3 ] P. Burton, D. R. Noaks, "High-Speed Iterative Multiplier", Electronics Letters, vol. 4, p. 262, 1968.
[ 4 ] R. De Mori, "Suggestion for an IC Fast Parallel Multiplier", Electronics Letters, vol. 5, pp. 50-51, Feb. 1969.
[5] H. Guilt, "Fully Iterative Fast Array for Binary Multiplication", Electronics Letters, vol. 5, p. 269, 1969.
[6] R. Baugh, B. A. Wooley, "A Two's Complement Parallel Array Multiplication Algorithm", IEEE Trans. Computers, vol. 22, no. 12, pp. 1045-1059, Dec. 1973.
[7] K. Hwang, "Global and Modular Two's Complement Array Multipliers", IEEE Trans. Computers, vol. 28, no. 4, pp. 300-306, Apr. 1979.
[ 8 ] A. Booth, "A Signed Binary Multiplication Technique", Quarterly J. Mechanics of Applied Math, vol. 4, pp. 236-240, 1951.
[9] L. MacSorley, "High Speed Arithmetic in Binary Computers", Proc. IRE, vol. 49, Jan. 1961.
[10] http://ivs.cs.unimagdeburg.de/EuK/Lehre/booth.html, "Booth's Algorithm".


[^0]:    * CONACyT Scholarship
    + CONCyTEG Scholarship

