Abstract: Stochastic computing is an old and unconventional computing technique that is finding promising new applications in image processing and the handling of complex error-correcting codes. Stochastic circuits offer an alternative to conventional digital circuits because of their extremely small size and inherent noise tolerance. They are also well-suited to meeting the requirements of emerging nanoscale technologies where non-deterministic behavior due to manufacturing defects and soft errors cannot be ignored. Error analysis of stochastic circuits, however, has received little attention and remains a largely open problem, especially when multiple errors affecting both the data sources and the stochastic circuits can occur in the course of a computation. This paper attempts to analyze stochastic circuits under various error conditions, and to compare their behavior to that of conventional circuits under similar error conditions. We use probabilistic transfer matrices for this analysis, complemented by circuit simulation. Our results indicate that stochastic circuits provide significantly better error tolerance under severe error conditions.
Introduction
Non-deterministic behavior is becoming ubiquitous in digital circuits implemented using conventional CMOS transistors or various novel nanotechnologies [5] . Because of their small size, these circuits are easily affected by small manufacturing defects and by transient errors due to environmental noise, both of which tend to be nondeterministic in nature. For example, carbon nanotube field-effect transistors (CNFETs), an emerging alternative to CMOS, exhibit behavioral variations that are difficult to identify and control [19] . Methods of designing circuits that tolerate errors are also of increasing interest. Von Neumann took the first steps in designing reliable circuits using unreliable switches in the 1950s [21] . Since then, many error-tolerant techniques have been proposed, ranging from error-correcting codes [15] , to replication of hardware, software and/or data [12, 17] . Most of these approaches impose high circuit overhead, and tend to be used only in the most cost-insensitive applications.
Stochastic computing (SC) processes data in the form of (pseudo) random sequences of 0 s and 1 s [2, 7] . These bit-streams are referred to as stochastic numbers (SNs) and are interpreted as probabilities. In its basic "unipolar" form, a bit-stream of length containing 1 1 s denotes the probability = 1 / , i. e., the probability of a randomly-chosen bit of being 1. The same value is represented by many different bit-streams, so this number format is highly redundant. For example, 0010, 01000001 and 00001100 all denote the same number 1/4. Unipolar SNs directly approximate numbers lying in the interval [0, 1], but they can be scaled to other ranges such as the interval [−1, 1] used for signed numbers (the so-called "bipolar" SNs) [2] .
The key advantage of SC is that arithmetic operations can be performed by extremely simple and standard logic circuits using any desired hardware implementation technology. Figure 1 shows stochastic circuits for multiplication, addition, and number format conversion. In this figure (and in the rest of the paper), circuit wires and the signals (Boolean variables) they carry are denoted by small letters , , ,. . . , bit-streams or SNs are denoted by big letters , , ,. . . , and their numerical values are denoted by , , , etc. For example, the two-input AND gate performs the multiplication × in clock cycles, with the caveat that the input bit-streams and must be statistically independent (uncorrelated) and sufficiently long to provide acceptable precision. Addition is implemented by the two-way multiplexer in the scaled form 0.5( + ), which ensures that the sum always lies in the probability interval [0, 1]. Observe that the scaling factor 0.5 is supplied by a stochastic number of constant value 0.5, i. e., a purely random sequence of 0 s and 1 s. The remaining circuits convert numbers between conventional binary and stochastic forms. They serve to interface standard and stochastic digital circuits, and to re-randomize SNs that have become unduly correlated. The (pseudo) random number generator in Figure 1c is generally implemented by a linear feedback shift register (LFSR). Number conversion circuits sometimes constitute the largest part of an SC system. SC has several features that limit its usefulness, including long bit-streams and computing times, inaccuracies caused by random bit fluctuations, and awkward scaling requirements. However, because of their inherently re- dundant encoding format, errors of the bit-flip type have little effect on the value of an SN. For example, a single bit-flip occurring in an n-bit SN changes the output value by 1/ , a relatively small error whose significance diminishes as increases. Furthermore, if the errors are bidirectional, i. e., if 0-to-1 and 1-to-0 bit-flips are equally likely, then the errors tend to cancel one another. This suggests that stochastic computing can outperform binary in applications such as noisy image processing [3, 14] and complex error-correcting decoding based on probability calculations [6, 9, 16] . Figure 2 compares the behavior of stochastic and conventional image-processing circuits in the presence of different levels of noise in the input data. It shows that when noise in the image data causes conventional image processing to fail, stochastic circuits can still produce acceptable results.
Although the error-tolerance property of stochastic circuits has long been recognized, it has never been thoroughly analyzed. In [18] , a soft error analysis of SC circuits is carried out that is limited to bit-flips occurring in the circuit's input data; the stochastic circuits themselves are assumed to be fault-or error-free. In this work, we aim to provide a more general error analysis for stochastic circuits, especially in the presence of high error rates such as are encountered in avionics or spacecraft instrumentation [8] .
We first develop a mathematical framework for the analysis using Bernoulli random variables (BRVs) and probabilistic transfer matrices (PTMs). BRVs are widely employed in probability studies [20] , and PTMs are used to determine probabilistic properties of logic circuits, such as their reliability or soft-error rate [13] . We also present a case study in image processing, which shows that stochastic circuits can outperform conventional ones under severe error conditions. This section discusses stochastic computations in terms of random variable theory, and reviews the key ideas of PTM algebra.
Stochastic numbers
A stochastic number , as defined in [2] , is a bit-stream carrying a probability value that denotes 1 (success) or 0 (failure). It can therefore be viewed as a set of samples from a real-valued random variable (RV) with a Bernoulli distribution in which the probability of success is [20] . Since probabilistic behavior can be easily modeled and analyzed in terms of Bernoulli RVs, we now use them to give a formal definition of a stochastic number that abstracts away from bit-stream formatting issues: a stochastic number is a Bernoulli random variable with parameter .
When dealing with RVs, we usually need to sample them in order to estimate their values. The sampling process is, in fact, a very basic form of stochastic computing. 
Thus the MSE of an SN estimate can be reduced by increasing the number of samples i. e., the bit-stream length . Also note that is a function of and only, implying that no matter what the circuit is (whether the AND of Figure 3 or any other circuit), once the expected rate of success at the output is calculated, we can use Equation (1) to calculate its MSE.
Besides the random fluctuations inherent in the selection of a particular bit-stream to represent in a stochastic circuit , various non-deterministic physical phenomena associated with itself and its environment affect the sampling process and distort the expected values of . It is convenient to lump such effects into a bit-flip error e that occurs with some probability . For example, it is often assumed in the literature [4] that causes bit-flips in Z, which affect 0 s and 1 s with equal probability . Whatever, the error behavior assumed, two basic questions should be addressed: How do we model the impact of on , and how do we introduce into a previously error-free stochastic circuit ? First, we assume the error to be a Bernoulli RV with parameter (the bit-flip rate), so it can be treated like another SN associated with . The circuit's fault-free output then changes to an erroneous function * , as illustrated in Figure 3a for the AND-gate stochastic multiplier. For simulation purposes, it is convenient to have a mechanism for injecting the error in a way that flips the normal signal with probability , resulting in the erroneous output * . Figure 3b shows how to do this by inserting an XOR gate with input into 's output line. For example, a bit-flip rate of = 0.05 with input values = 0.2 and = 0.3 changes the expected success rate at the output of the AND multiplier from 0.06 to 0.104. An analytical method of calculating the expected value * and its MSE will be developed next. 
Probabilistic transfer matrices
A convenient tool for analyzing the probabilistic behavior of logic circuits is the probabilistic transfer matrix (PTM) and its associated algebra [13] . PTMs were introduced to analyze the reliability of conventional logic circuits. Their use may be limited by the fact that PTM size grows exponentially with circuit size. This is less of a problem with stochastic circuits, however, which typically consist of just a handful of gates. In the PTM formulation, an -input -output combinational circuit's behavior is represented by a 2 ×2 zeroone matrix whose rows correspond to all input combinations and whose columns correspond to all output combinations. This matrix, which is referred to an ideal transfer matrix (ITM), is a slightly modified truth 
where the rows correspond to xy = 00, 01, 10, 11 and the columns correspond to = 0, 1. A general PTM is obtained from the ITM by allowing the entry in row and column to become any real number in the interval [0, 1] that denotes the conditional probability of producing output in response to input . In the AND gate's ITM AND shown in (2), the top row tells us that the AND produces output = 0 with probability 1, and output = 1 with probability 0, in response to the input xy = 00. By choosing suitable probability values, PTMs can be constructed to represent a remarkably wide range of error scenarios [13] . For example, the effect of a bit-flip error with rate on the output of the AND gate model in Figure 3a , is represented by the PTM
Input-dependent bit-flips can be modeled by associating a different value with every row. Observe that a PTM must satisfy the stochastic requirement that all entries in each row sum to 1, and that the ITM is just the PTM for the error-free case.
Circuit PTMs can be manipulated by means of a welldefined algebra which loosely resembles linear algebra. Every element or circuit is represented by a PTM that describes 's logic function and error status; see Figure 4 . An -output fanout gate copies an input signal to its outputs. Figure 4c shows the ITM for the 2-output fanout gate 2 . Wire permutations such as crossing wires are represented by the swap or crossover gate. The ITM for an adjacent wire swap is shown in Figure 4d ; any permutation of wires can be modeled by a series of adjacent swaps. PTMs can be combined in two basic ways corresponding to the two basic circuit interconnection structures, series and parallel. The PTM of two circuits 1 and 2 connected in series is the ordinary matrix product of their PTMs, i. e., 1 × 2 . The PTM of two circuits connected in parallel is the tensor product of the PTMs, denoted 1 ⊗ 2 . In the tensor product, each element of the first matrix 1 is multiplied by the entire second matrix 2 , which leads to rapid growth in matrix size. A wire corresponds to a 2 × 2 PTM; the ITM case is just the identity matrix. From this, we conclude that * , i. e., the probability of seeing a 1 in *, is 0.104. Note that the PTM AND of the AND gate implicitly incorporates the bit-flip error, so there is no need for the XOR gate of Figure 3b , as is also the case in (3).
Error analysis
Next, we discuss the impact of bit-flips on stochastic numbers and stochastic circuits. We present a general analytical approach which can be implemented with PTMs to handle larger circuits.
Impact of soft errors on stochastic numbers
Consider a stochastic number with the expected value [ ] = . In a noisy environment, if is affected by bitflip error with expected value , the SN becomes * = ⊕ . We therefore have
Besides the expected value of * , we are interested in * , the mean square error of * , which denotes the average error occurring in a stochastic circuit, i. e., the average difference between the estimated valuê * and the exact Note that * reflects both the random fluctuations of the bit-stream representation and the error due to bit-flips. As mentioned earlier, * is a Bernoulli RV defined by its expected value, so using only (5), we should be able to find * and hence * analytically. Assuming the estimated valuê * = 1/ ∑ =1 * obtained by summing independent samples of * , we get
There are two terms in the final result of (6). The first term is the difference between the expected values of and * , and its only cause is the bit-flip . The second term is a random fluctuation error that diminishes with increasing . We can re-write * in terms of and by substituting (5) into (6) thus:
Observe that the MSE error depends on both and . For sufficiently large , * becomes 0 when = 1/2, while it becomes for = 1.
In a similar way, we can analyze the effect of bit-flip errors on conventional (non-stochastic) binary numbers. An -bit binary number , affected by independent and identically distributed bit-flips on each bit becomes * , which can potentially be any -bit number with some probability. The error of * and its probability of occurrence depend on the number of bit-flips . To find the MSE * in this case, we calculate the weighted average error over all possible
Using the above equations, we compare the effect of bitflips on a 5-bit binary number and an SN of length 32. Figure 5a shows the MSEs * and * at different bit-flips rates. Initially, the SN has a higher error due to its random fluctuations. However, as increases, SN outperforms the binary number with respect to error tolerance. Figure 5b shows the MSEs * and * at different values of ; the MSEs in this case are averaged over several bit-flip rates ranging from = 0.001 to 0.25. As can be seen, * is approximately 50% less than * . These analytical results are also confirmed in Figure 5 by Monte Carlo simulation.
Impact of errors on stochastic circuits
A key feature of our error analysis is the use of PTMs to estimate the impact of errors on stochastic behavior. PTMs can be used to calculate the probability distribution of all output combinations of a stochastic circuit . Given specific input signal probabilities, the input vector is multiplied by the PTM of the circuit to obtain the output probabilities. For example, consider again the AND gate of Figure 3 which is multiplying two SNs and and has output error . Generalizing (4) gives the 1 × 4 input vector
Now, let us consider two cases: first, the AND gate is errorfree, and second, it contains the error defined by the PTM in (3). In the error-free case, the output vector is
indicating that the probability of output 1 is . If an error is present, then using AND from (3), the output becomes
where denotes matrix transposition (used to save space).
This implies that the expected value of the output is * = ((1− )(1− )+(1− ) + (1− ))+(1− ) .
The MSE * can now be calculated from Equation (6) . We can readily generalize the above technique to arbitrary stochastic circuits to analyze their stochastic behavior under single or multiple errors. First, generate the PTMs and ITMs for each individual logic or wiring gate. Then, apply the ordinary and tensor products repeatedly to calculate the PTM and ITM for the entire circuit [13] . Again, if the circuit has inputs and outputs, its final PTM and ITM will both be 2 × 2 matrices.
Besides using the PTM method to analyze the behavior of a stochastic circuit in the presence of errors, we can employ gate-level circuit simulation to achieve the same goal. We inject the bit-flips into a gate via an XOR gate that flips the output signal of with probability , resulting in a new erroneous signal * . For a circuit containing multiple gates, the error is injected into every gate. Consider, for example, the stochastic realization of scaled addition. This operation can be implemented either by a majority circuit or a multiplexer [1] , as shown in Figure 6a-b . The special input is a constant scaling factor of value 0.5. The corresponding circuits with XOR gates added for error injection are shown in Figure 6c -d. To focus on the behavior of the computational hardware (the logic gates) in the presence of errors, we assume the data sources are not affected by errors. Figure 7 presents error data obtained by PTM analysis and circuit simulation for the two basic gate types AND and NOT, as well as the scaled adder circuits of Figure 6 . The error rates of all gates are assumed to be the same
= ), but they are generated from independent random sources. We simulated the circuit with and without the added XOR gates to get the expected errorfree values and the values affected by soft errors. The MSE is given by * = [(̂ * − ) 2 ]. As Figure 7 shows, the analytical and simulation results are quite consistent. We also constructed PTMs MAJ and MUX for the circuits of Figure 6a -b level by level from the PTMs of their component gates, including wiring gates, according to the method of [13] . The vertical lines on the figure separate the circuits into levels. In high-level symbolic form, we obtain the PTM expressions Each parenthesized term in these equations corresponds to a circuit level, and the overall PTM of a circuit is obtained by multiplying the PTMs of all its levels. For example, the first level 1 of the majority circuit consists of three 2 gates in parallel, so 1 = ( 2 ⊗ 2 ⊗ 2 ). Similarly, we obtain 2 = ( ⊗swap⊗swap⊗ ), 3 = (AND2 ⊗AND2 ⊗ AND2 ), and 4 = (OR3 ). The PTM for the majority gate is then MAJ = 1 × 2 × 3 × 4 . We use the same method to construct the PTM for the multiplexer.
Fully expanded to binary form, MAJ and MUX become 8 × 2 matrices, which we derived from the above equations with the aid of GNU Octave [10] . The ITMs MAJ Random input r = 0.5
Subtrator Abs. value circuit When errors are present, the 0-1 entries of MAJ and MUX must be replaced by complex polynomial expressions involving the variable to obtain MAJ and MUX in expanded form.
Knowing both the erroneous PTMs and the ITMs, we can calculate the corresponding MSEs; see Figure 7 . Again, the analytical results confirm the circuit simulations. In other words, both circuit simulation and PTM manipulation are valid methods for estimating soft-error effects in stochastic circuits. These results also show that when multiple errors are present, the errors accumulate. Hence, when the error rate is low, the multi-gate adders have worse MSE than single gates. When the error rate is high, for example, near 0.5, the behavior of all the circuits tends to appear random, so that they all have approximately the same MSE.
Case study: image edge detection
Edge detection is a fundamental operation in image processing and computer vision. Its goal is to identify significant local changes of intensity in digital images. Stochastic edge detectors have been shown to be significantly smaller, faster, more power-efficient, and more noisetolerant than conventional ones in real-time image processing; see Figure 8 [3] . These designs, which are based on the Roberts cross algorithm [10] , compute a moving average across a pixel window of size 2 × 2 for each pixel Figure 8b shows the corresponding binary design which contains several large arithmetic blocks, including addition, subtraction and absolute-value circuits. The stochastic edge detector is about two orders of magnitude smaller than the conventional binary design.
We now use PTMs to analyze the behavior of these circuits of under noisy conditions. The effect of a bit-flip rate of on the output of every gate in the circuits is represented by a suitable PTM. Suppose the PTMs for the stochastic and conventional edge detectors are sc and conv , respectively. For each pixel and its 2 × 2 window, we generate the corresponding input vectors in and in for the stochastic and conventional edge detectors, respectively. The result of the edge-detection operation is then calculated as in × sc and in × conv . In this example, we assume that 5-bit precision is required, so the bitstream length is 2 5 = 32 for the stochastic design. We do not consider any additional circuits that might be needed for number conversion between the binary and stochastic formats. Figure 9 compares the MSEs of the stochastic and conventional designs. As expected, when the error rate is low, the stochastic circuit is more affected by random fluctuation errors and performs worse than the conventional one. However, as the error rate increases, the MSE of the conventional design increases rapidly. When the error rate is very high, all the signal values become essentially random in both designs, so the MSEs coverage to the same value. Note that this result is consistent with the results shown in Figure 7 . Figure 10 compares the output image quality of the two edge detectors of Figure 8 in the presence of errors injected into them to simulate the impact of soft errors on the edge-detection hardware. It shows that when noise causes the output of the conventional circuit to become almost unrecognizable (at around = 2%), the stochastic circuit still produces acceptable results. Note that this experiment is different from the one shown in Figure 2 , in which noise was only injected into the input image signals and the image-processing circuitry was assumed to be error-free. In this later experiment, noise is injected into the stochastic circuits themselves to demonstrate their fault-tolerant behavior. Together, these two experiments show that when the conventional design fails to produce recognizable results, stochastic computing can produce good results in the presence of severe noise that affects both the input image and the edge-detection circuit.
Conclusions
Stochastic circuits are finding major new applications because of their small size and error tolerance. We have presented a quantitative study of error tolerance that considers multiple error effects, and accounts for the inherent fluctuations in stochastic data as well as externally induced bit-flip errors affecting the data-processing circuits. We successfully used two complementary approaches: algebraic analysis with probabilistic transfer matrices (PTMs), and Monte Carlo circuit simulation. The algebraic analysis is more accurate than the simulation approach, but has the disadvantage of being infeasible for large circuits. Since stochastic circuits are very simple by nature, the algebraic approach is generally feasible for them, and is hence preferred. The simulation approach gives reasonably accurate results, and is feasible for circuits of any size. The experimental results we presented show that stochastic circuits have far better error tolerance than conventional binary circuits, especially at higher error rates.
