FFT method with the complexity of O(n log n log log n), where n stands for bit length of operands. Karatsuba method is employed in multiplication of hundreds to thousands bits, whereas FFT method is used for millions bit multiplication.
In this paper, we present two design choices of the hardware Karatsuba multiplier: RKM (Recursive Karatsuba Multiplier) and IKM (Iterative Karatsuba Multiplier). At the same time hardware designs of RKM and IKM are described and these VLSI realization are evaluated in terms of the cost, performance, and energy consumption. From Fig. 1 we found that RKM using 0.18µm process has less area cost than WTM (Wallace Tree Multiplier) for bit length larger than 2 9 with area cost of 30 mm 2 . Critical path delay of RKM is always larger than that of WTM. Therefore we should use WTM instead of RKM as combinational circuits for IKM to have better cost performance for practical purposes.
Designs of IKM are denoted by R1IKM, R2IKM, and R3IKM referring to the number of applications of Karatsuba algorithm. From Figs. 2 and 3 a version of IKM can perform 1024-bit multiplications 30 times faster than software Ver.20050709 and used Pentium 4 1.7 GHz CPU with the same design rule as our hardware implementation. Computation energy was found to be nearly 1/600 of that consumed by general purpose processor which executes the software version.
The results obtained by this study will help system designers for applications requiring multi-digit multiplication to select design alternatives including VLSI realization. Multi-digit multiplication is widely used for various applications in recent years, including numerical calculation, chaos arithmetic, primality testing. Systems with high performance and low energy consumption are demanded, especially for image processing and communications with cryptography using chaos. Karatsuba algorithm with computational complexity of O(n 1.58 ) has been employed in software for multiplication of hundreds to thousands bits, where n stands for bit length of operands. In this paper, hardware design of multi-digit integer multiplication based on Karatsuba algorithm is described and its VLSI realization is evaluated in terms of the cost, performance, and energy consumption. We present two design choices of the Karatsuba hardware: RKM (Recursive Karatsuba Multiplier) and IKM (Iterative Karatsuba Multiplier). We found that RKM has less area cost than WTM (Wallace Tree Multiplier) for bit length larger than 2 9 with area cost of 30mm 2 . Critical path delay of RKM is always larger than that of WTM. Therefore, we should use WTM as combinational circuits for IKM to have better cost performance. We also found that a version of IKM using 0.18µm process can perform 1024-bit multiplications 30 times faster than software at the area cost of 10.9mm 2 . Energy for the computation by the IKM version was found to be nearly 1/600 of that consumed by general purpose processor which executes the software. The results obtained by this study will help system designers for applications requiring multi-digit multiplication to select design alternatives including ASIC realization. 
1.

ASIC (Application Specific Integrated
2 Carry-Save RKM 3 Binary RKM Table 8 . Scheduling of operations performed by ACC.
5(a)
RKM 1.73 Karatsuba O(n 1.58 ) WTM 1.90 O(n 2 ) 5(a) 2 9 WTM RKM RKM WTM 30mm 2 5(b) RKM WTM O(log n) RKM WTM 2 9 RKM RKM WTM 2 9 30mm 2 WTM IKM WTM IKM 5. I K M IKM 5 1 Karatsuba 2 IKM 5 2 IKM Karatsuba (4) 5 1 n Karatsuba 2 4n IKM n 4n A B n 4 A = a 3 2 3n + a 2 2 2n + a 1 2 n + a 0 · · · · · · · · · · · · (15) B = b 3 2 3n + b 2 2 2n + b 1 2 n + b 0 · · · · · · · · · · · · · (16) A B P P = p 7 2 7n + p 6 2 6n + p 5 2 5n + p 4 2 4n +p 3 2 3n + p 2 2 2n + p 1 2 n + p 0 · · · · · · · · · · ·(17) A 1 = a 3 2 n +a 2 A 0 = a 1 2 n +a 0 B 1 = b 3 2 n +b 2 B 0 = b 1 2 n + b 0 A = A 1 2 2n + A 0 , B = B 1 2 2n + B 0 P P = AB = A 1 B 1 2 4n + A 10 B 10 − A 1 B 1 − A 0 B 0 2 2n + A 0 B 0 · · · · · · · · · · · · · · · · · · · · · · · · · · (18) A 10 = A 1 + A 0 B 10 = B 1 + B 0 2 (18) A 1 B 1 A 0 B 0 A 10 B 10 A 1 B 1 = (a 3 2 n + a 2 )(b 3 2 n + b 2 ) = a 3 b 3 2 2n + (a 32 b 32 − a 3 b 3 − a 2 b 2 )2 n + a 2 b 2 · · · · · · · · · · · · · · · · · · · · (19) A 0 B 0 = (a 1 2 n + a 0 )(b 1 2 n + b 0 ) = a 1 b 1 2 2n + (a 10 b 10 − a 1 b 1 − a 0 b 0 )2 n + a 0 b 0 · · · · · · · · · · · · · · · · · · · ·(L +− − + H + − − + pp1 a1b1 L − + + − H − + + − pp2 a2b2 L − + + − H − + + − pp3 a3b3 L + − − + H + − − + pp4 a10b10 L − + H − + pp5 a20b20 L − + H − + pp6 a31b31 L + − H + − pp7 a32b32 L + − H + − pp8 a3120b3120 L + H + (19)(20)(21) (18) P = 2 6n a 3 b 3 +2 5n a 32 b 32 − a 3 b 3 − a 2 b 2 +2 4n a 31 b 31 − a 3 b 3 − a 1 b 1 + a 2 b 2 +2 3n a 3120 b 3120 − a 31 b 31 − a 20 b 20 −a 32 b 32 + a 3 b 3 + a 2 b 2 +2 2n a 20 b 20 − a 2 b 2 − a 0 b 0 + a 1 b 1 +2 n a 10 b 10 − a 1 b 1 − a 0 b 0 +a 0 b 0 · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · ·(22) (17) p7,. . . ,p0 (22) 9 a 0 b 0 , . . . , a 3 b 3 , a 10 b 10 , . . . , a 32 b 32 , a 3120 b 3120 pp0,. . . ,pp8 p 7 ,. . . ,p 0 p 0 p 7 p 7 ,. . . ,p 0 p 7 ,. . . ,p 0 6 p 7 ,. . . ,p 0 L H n (17) P p 7 ,. . . ,p 0 p 7 ,. . . ,p 0 n p 7 ,. . . ,p 0 Karatsuba 2 6 IKM8 − a31, b31 a0b0(= pp0) − a2031 − b2031 9 − a32, b32 a1b1(= pp1) − − − − 10 − a2031, b2031 a2b2(= pp2) − − − − 11 − − a3b3(= pp3) − − − − 12 − − a10b10(= pp4) − − − − 13 − − a20b20(= pp5) − − − − 14 − − a31b31(= pp6) − − − − 15 − − a32b32(= pp7) − − − − 16 − − a2031b2031(= pp8) − − − − 8 ACC
Adder-Subtracter1
Adder-Subtracter2 Adder-Subtracter3 Adder-Subtracter4
Step 
