Abstract-This paper and its companion (Part I) are devoted to to the analysis of the application of a chaotic piecewise-linear one-dimensional (PL1D) map as Random Number Generator (RNG). In Part I, we have mathematically analyzed the information generation process of a class of PL1D maps. In this paper, we find optimum parameters that give an RNG with lowest redundancy and maximum margin against parasitic attractors. Further, the map is implemented in a 0.8 m standard CMOS process utilizing switched current techniques. Post-layout circuit simulations of the RNG indicate no periodic attractors over variations in temperature, power supply and process conditions, and maximum redundancy of 0.4%. We esimate that the output bit rate of our RNG is 1 Mbit/s, which is substantially higher than the output bit rate of RNGs available on the market.
I. INTRODUCTION
To construct a random number generator (RNG) based on chaos, in this paper and its companion [1] we exploit the double nature of chaos, deterministic in microscopic space, and random in macroscopic space. In Part I, we have shown that our chaos-based RNG is mathematically proven to act as an information source and is not prone to silent breakdowns. In this paper, we address the questions of practical realization of our RNG. In particular, we find optimum parameters of RNG and show how it can be efficiently implemented on-chip. Here is the layout of our paper. Section II shows the way a RNG based on a chaotic map can be optimized. Section III benefits from the results presented in Section II, and gives the design of a chaotic RNG based implemented by virtue of switched current circuits. Concluding remarks are given in Section IV.
II. OPTIMIZATION OF CHAOS-BASED RNGS
In Part I, we have analyzed the information generation process of the following PL1D maps: xn+1 = f (xn) = q 1 + k 1 x n ; for x n < 0 01 + k 2 x n ; for x n 0 (1) in the region P = f(k 1 ; k 2 ; q 1 ) j 1 < k 1 < 2; k 1 k 2 < 2; k 1 0 1 < q1 < 1=(k2 0 1)g of the (three-dimensional) 3-D parameter space k1 2 k2 2 q1 . In this section, we optimize our chaos-based RNG.
A. Redundancy Reduction Techniques
As shown in Part I, larger k 1 and k 2 mean smaller redundancy and a better RNG, but they also mean a higher risk of appearance of periodic attractors and of breakdown of the RNG. Therefore, k 1 and k 2 must Manuscript received January 7. 2000; revised September 17, 2000. This work was supported in part by the Army Research Office under Grant DAAG55-98-1-0269, in part by the Department of Electronics, under Grant DE-FG03-95ER14516, and in part, by the National Science Foundation under Grant NCR-9612250. This paper was recommended by Associate Editor C. K. Tse.
T. Stojanovski is with the Expert Information Services, Melbourne 3027, Australia (e-mail: tonis@expert.com.au).
J. Pihl is with NTNU, Department of Physical Electronics, N-7034 Trondheim, Norway ( e-mail: Johnny.Pihl@fysel.ntnu.no).
L. Kocarev is with the Institute for Nonlinear Science, University of California, San Diego, La Jolla, CA 92093-0402 USA (e-mail: kocarev@heisen-berg.ucsd.edu).
Publisher Item Identifier S 1057-7122(01)01400-3.
be small enough to assure chaotic behavior of (1) across all temperature and power supply fluctuations. Increased redundancy for smaller k 1 and k 2 must be lowered via processing the output bits. Redundancy in an information source can be due to two sources, diferrence in the probabilities of the two binary symbols, and memory of an information source. A good redundancy reduction technique must affect both sources of randomness. The two simplest redundancy reduction techniques, which can be on-chip implemented with a very simple circuitry, are bit skipping [2] , [3] and bit counting.
In bit skipping only every p-th bit from the original binary sequence is used. For example, if the original sequence is X 0 ; X 1 ; X 2 ; . . . ; then bit skipping where only every p-th bit is used will produce the sequence X0; Xp; X2p; . . .. Skipping bits reduces only the redundancy due to the memory of an information source, but it does not reduce the difference in the probabilities of the two binary symbols. When p ! 1, the redundancy tends to 1 0 HB (P fX = 0g) where P fX = 0g is probability of binary symbol 0.
In bit counting, bits from the original binary sequence are grouped in blocks of p bits and summed up modulo 2 to produce an output bit. For example, if the original sequence is X 0 ; X 1 ; X 2 ; . . . ; then bit counting with blocks of p bits will produce the sequence therefore their entropies and redundancies are identical. Which redundancy reduction technique is preferred between these two, depends on the ease of practical implementation. They can both be implemented with a one-stage binary counter, the only difference being that the binary counter is set to 0 at the start of every block of p bits. Contrary to bit skipping, bit counting affects both sources of redundancy. This is the reason behind the superiority of bit counting with respect to bit skipping, and this will be pointed out throughout following sections. We show that bit counting is superior to bit skipping in the sense that it is more robust to the inevitable fluctuations of the parameter values from the nominal ones, and provides lower redundancy.
Both bit counting and bit skipping reduce the output bit generation rate p times, and one must compromise between reduction in the redundancy and reduction in the bit generation rate. Therefore we give results only for moderate values of p 6. Even for p 6 redundancies are very small, and further reduction in the bit generation rate by choosing larger p can not be justified.
B. Redundancy Reduction in 1-D Regions
Figs. 1 and 2 show redundancies for bit skipping and bit counting in the 1-D regions P 1 and P 2 , which are defined as P1 = f(k1;k2;q1)j 1 < k2 = k1 < 2; q1 = 1g and P 2 = f(k 1 ; k 2 ; q 1 ) j 1 < k 1 = k 2 < 2; q 1 = k 1 0 1g. Common value of k 1 and k 2 is denoted k in both P 1 and P 2 . For P 1 , where both binary symbols are equiprobable, bit skipping and bit counting are with comparable performances. For P 2 bit counting is substantially superior to bit skipping. In P 2 , binary symbols are not equiprobable, and bit skipping offers no cure against such a source of redundancy. As illustrated in Fig. 2(b) , redundancies for bit skipping go very quickly down to the lower limit 1 0 H B (P fX = 0g), which is determined by the differences in probabilities of 0s and 1s.
There are local minima of the redundancy both in P1 and P2, that is, larger p and/or higher slope k do not always mean smaller redundancy, and smaller redundancy does not necessarily mean lower margins L. We observed local minima of the redundancy also in the other 1-D regions which we examined. For bit counting in P 1 ; p = 5 is better than p = 6, and p = 3 is better than p = 4 for moderate values of k < 1:85. This is a consequence of the fact that P fY = 0g = P fY = 1g only for odd p, while P fY = 0g 6 = P fY = 1g for even p contributes to higher redundancy. Figures also show that width of local minima decrease for higher p. Thus, a local minima for higher p can be difficult to achieve with real circuits where parameter fluctuations are inevitable. This is an additional reason against usage of higher p.
On the basis of previous results and analysis carried out also in several other 1-D regions of the parameter space, we conjecture that the conclusions on the existence of local minima of the redundancy, on the narrower width of local minima for higher p, and on the superiority of bit counting to bit skipping in sense of assuring less redundancy are valid also in the 3-D parameter space P. The existence of local minima creates possibility for optimization of the choice of parameters, that is, smaller k 1 ; k 2 and p can be chosen to provide smaller redundancy and higher margin L.
C. Optimum Choice of Parameters
What is the optimum choice of parameters? How can we exploit results from this subsection to find the optimum choice? When one uses map (1) as an RNG, then one wants to be secure against appearance of parasitic attractors. When designing an RNG from the circuit implementation, one can compute the fluctuations in I+ and I0 due to temperature, power supply, and fabrication fluctuations, and then one can specify a minimum required margin Lmin against appearance of parasitic attractors. The next requirement is that the bit generation rate is higher than a certain value v b , which for a given clock frequency vc transforms into a requirement that p pmax = dvc=v b e, where dxe denotes the largest integer smaller than or equal to x. For given L min and p max , we define the optimum parameters as the set of parameters (k1; k2; q1; p)opt which minimizes redundancy amongst all sets of parameters (k 1 ; k 2 ; q 1 ; p) which satisfy (k 1 ; k 2 ; q 1 ) 2 P;p p max ; L L min , that is, 
Brute-force searching of optimum parameters in P is a formidable task, and the following discussion relieves it. We show that optimization in the 1-D region P 1 provides results that are almost as good as those obtained by optimization in the 3-D region P.
Instead of searching P, we restrict our attention to a 3-D region P 3 = f(k 1 ; k 2 ; q 1 ) j (k 1 2 (1:6; 1:9); k 2 2 (k 1 ; 1:9); q 1 2 (0:9; 1:1)g. Very small slopes give small h entropy, while very large ones provide small margin L. For larger jq 1 01j, the map becomes increasingly asymmetrical and the difference in probabilities of 0s and 1s increases.
It is possible that a local minimum of a redundancy curve in P 1 (see Fig. 1 ) is not a local minimum in P. Then, a set of parameters lying very close to P1 is a local minimum in P. This was the motivation to examine 1% and 2% neighborhoods of P 1 ; P 1;1% = f(k 1 ; k 2 ; q 1 ) j k 1 This way, we examined behavior of redundancy in the 1% neighborhood of P1; P 1;1% = f(k1;k2;q1) j k1 k2 1:01k1; 0:99 q1 1:01g, We repeated this procedure for a 2% neighborhood P 1 .
On the basis of previous computations, we performed optimization (2) in P1; P3, and P 1;1% [ P 1;2% for 0 < Lmin < 0:45 and 2 p max 6. We present results only for p max = 2 and p max = 6, see Fig. 3 . For large L min (larger than approximately 0.15), optimization in P1 provides very close results to optimization in P3 and P 1;1% [P 1;2% , while for smaller L min optimizing in P 1 is superior to optimizing in P 3 and P 1;1% [ P 1;2% . The latter superiority is due to the small width of minima for k1; k2 2, and the chance is small that these minima are hit by the finite number of points analyzed in P 3 ; P 1;1% , and P 1;2% . A twice finer grid of points in P3 and P 1;1% would increase the number of points 8 times. In contrast, in P10D we analyzed only 574 points for which is a Markov partition with order r < 12. Motivated by (i) the substantially smaller computational complexity of the analysis and optimization in P 1 than in P 3 ; P 1;1% , and P 1;2% ; and (ii) the comparable or superior quality of the optimization in P 1 compared to the optimization in P3; P 1;1% , and P 1;2% we suggest only an optimization in P 1 is done when bit counting is employed.
III. PRACTICAL REALISATION

A. Circuit Design
Parameter variations due to implementation imprecision and external influences (temperature, power supply etc.) need to be estimated. As they are slowly varying compared to the iteration speed of the map, their temporal changes can be neglected and it can be approximated that the parameters are constant in time, though mismatched from the nominal ones. 
may be implemented in VLSI technology. Fig. 4 shows a VLSI implementation of map (3), in a standard 0.8 m CMOS process [5] .
The implementation is a switched-current based circuit based on [4] . MHz, which together with bit counting with p = 5 yields a total RNG bit rate of 1 Mbit/s. This is substantially higher than the output bit rate of the RNGs available on the market: from 7600 bits/s to 76 000 bits/s. Furthermore, our RNG requires no software postprocessing.
IV. CONCLUSION
We have addressed the practical issues of our chaos-based RNG. Performances of simple redundancy reduction techniques such as bit counting and bit skipping, which due to their simplicity can be implemented on-chip, are analyzed. Bit counting is superior to bit skipping as it yields smaller redundancy, provides higher robustness of redundancy to fluctuations in parameters, and optimization of parameters can be done in a much simpler mode. Post-layout circuit simulations of the RNG indicate maximum redundancy of 0.4% for bit counting with p = 5, and no parasitic attractors over variations in temperature, power supply and process conditions. Maximum operating clock frequency is estimated to 5 MHz, which together with bit counting with p = 5 yields a total RNG bit rate of 1 Mbit/s. This is substantially higher than the output bit rate of the RNGs available on the market.
