Abstract-Thermodynamic-driven filament formation in redox-based resistive memory and the impact of thermal fluctuation on switching probability of emerging magnetic memory are probabilistic phenomena in nature. Therefore, process of binary switching in these nonvolatile memories are considered stochastic that varies from switching-to-switching. Moreover, position-dependent, spatially correlated, and distancedependent variation in these electron devices, like advanced CMOS processes, provide rich in-situ spatiotemporal stochastic characteristics. Based on a partial characterization of the switching variation, this preliminary work presents highly scalable neuromorphic hardware based on crossbar array of 1-bit resistive elements as distributed stochastic synapses. The network shows the ability to emulate selectivity of synaptic potentials in neurons of primary visual cortex to the orientation of a visual image. The proposed model could be configured to accept a wide range of emerging nonvolatile memory technologies.
I. Introduction
Cognitive computing with nanoelectronics is an emerging field of research that aims to fill the gap between CPU's performance and mammalian brains. CPUs outperform human brain in almost all tasks that involve deterministic computation. They, however, lack a very important feature, which is the ability to learn and work with unreliable building blocks. This unique characteristic of mammalian brains has attracted attention of scientists from different fields including electronic engineering. In the late 1980s, Caver Mead, and later Eric Vittoz, envisioned the use of very-large-scale integration (VLSI) systems to mimic neuro-biological architectures of nervous system [1] , [2] .
It is believed that the hippocampus of a mammalian brain is responsible for memory and learning. It consists of neurons and chemical synapses [3] . On the contrary to the common belief that neural systems, including hippocampus, are analog systems, there are several neurobiological evidence for existence of discrete changes of synaptic strength at least in some subfields in hippocampus [4] . There are also a number of research that suggest efficient learning with digital synapses [5] , [6] . Therefore, even though an analog approach promises more power efficiency, a digital implementation improves design simplicity, it does not have limited scalability of analog implementations, it is less costly, small, and by far more integrable, hence, practically the best way, if not the only way, to implement highly scalable neuromorphic systems [7] . For instance, analog designs allow low-voltage operation, which then requires larger sensing circuitry and also does not necessarily support reasonable throughputs (spikes/s). It is expected that massively parallel, reliable, scalable, and potentially energy efficient cognitive systems could be implemented using 1-bit stochastic synapses. A deterministic approach, has its focus on 'more number of bits per synapse' to allow a more precise implementation of a learning rule, and generally suffers multiple data conversion. On the other side, emerging artificial synapses, such as RRAM (resitive RAM), CBRAM (conductive bridge RAM), PCM (phase change memory) and atomic switches, in their analog operating mode, have issues like (i) complicated programming pulse-schemes to achieve more intermediate states, and (ii) the problem of resistance drift or state retention [8] , [9] . Technologies like STT-MRAM (spin-transfer torque magnetoresistive RAM) and MeRAM (magnetoelectric RAM) also show existence of multi-stable states, but their low signal-to-noise properties suggests them impractical for conventional analog neuromorphic computing [10] . A workaround to these problems is to make the artificial synaptic transmission a probabilistic process, which can be achieved either in-situ or ex-situ [8] , [11] - [14] .
This work uses the in-situ stochastic switching characteristic of RRAM devices to present a technology independent approach to implement a highly scalable neuromorphic hardware that is capable of emulating sensitivity of synaptic potentials in neurons of primary visual cortex to the orientation of a visual image. Considering the underlying physical mechanisms of binary switching in RRAM, CBRAM and PCM it can be envisioned that their yield-loss and variation remain substantially larger than their CMOS counterparts. Therefore, a probabilistic approach is one of the most practical and highly scalable designs for neuromorphic hardware and logic operations with nanoelectronics.
II. Stochastic nano-Synapse

A. Stochastic finite state machines
Stochastic binary switching is common between the emerging memory technologies. This behavior normally occurs under certain condition, which is called "weak" programming. Although, the weak programming condition is different across technologies, a technology independent approach is presented to pave the way for further unification of system-level behavior based on above mentioned technologies.
Stochastic switching of a nanodevice could be treated as a black-box; a generalized model, which is identified by a set of inputs and outputs. Fig. 1 provides an overview of the model. Outputs x (n+1) and G (n+1) represent next state and next conductance of the device, which both are probabilistic and their values are defined by the functions f (·) and g(·), respectively. Once x (n+1) is determined, G (n+1) is calculated using an experimentally verified distribution (in this paper, log-normal) of ON and OFF state resistances. The function f (·) is a function of voltage/current, time (here pulse-width) and operating temperature. Small updates in g(·) is independent of function f (·) output, which means if x n = x (n+1) , G n and G (n+1) may not be equal. , I ), operating temperature, T , signal pulse-width, t, initial state xn and initial conductance Gn. Device type and relevant fitting parameters are given by D. The model accepts cross-point yield as a defect rate input F (not to be confused with the material's point defect). Function f (·) defines switching probability and function g(·) defines conductance value randomness under a given C2C variation distribution. Switching probabilities are defined as Pa and P b for switching from logic state '0', S0, to logic state '1', S1, and from logic state '1' to logic state '0', respectively. Outputs are current and voltage responses, next state (x (n+1) ), next conductance (G (n+1) ), as well as total consumed energy J of the whole operation to performing a programming or sensing, regardless of its success.
B. Probabilistic plasticity
Experimental data on voltage, pulse-width and temperature dependency of a resistive memory systems is gathered from [8] , [12] - [18] . Device switching probability is defined as a strong function of applied voltage, ∆V , and time, ∆t. For simplicity, temperature dependency and switching threshold variation of the device are considered to have an overall impact on the switching probability, therefore, their impact are not discussed as independent variables. Fig. 2 demonstrates switching probability of the SET (S 0 GA S 1 ) and RESET (S 1 GA S 0 ). Under similar condition, a SET is more likely than a RESET. Given that the S 0 GA S 1 seems more likely, it is possible to modulate the probabilistic LTP using a complicated learning rule, ∆W , where W represents synaptic weight that is bounded between a maximum and a minimum limits, through a different programming pulse-scheme for LTP, which makes the design more complex. Experimentally verified information regarding the switching probability model are given in [13] . The switching probability is achieved by integrating a Poisson-like distribution, which is in an excellent agreement with the hypothesis of thermodynamic activation during filament formation over a dominant energy barrier in RRAM devices. The mentioned experimental studies suggest that the switching probability of redox-based memory is a cumulative phenomenon, which links the digital and analog characteristic of the device. The probabilistic SET and RESET, under such circumstance, can therefore be shown as a function of number of applied pulses with a given amplitude-time combination to model the history dependent effects. One could argue that such spike-historydependent-plasticity provides further phenomenological similarities between these artificial synapses and the chemical synapses, where history dependent effects also play an important role [19] .
This stochastic behavior in combination with ∆W results a probabilistic LTP (long-term potentiation) and LTD (long-term depression), which is a function of time difference (∆t) between a pre-and a post-synaptic spike (probabilistic spike timing-dependent plasticity, pSTDP) or their difference in voltage (∆V ). Stochastic change of the device state can also occur under a sequence of applied pulses with a fixed t p . The probabilistic plasticity, which defines the probability of change in the device state, is shown in Fig. 3 . Here ∆W is defined as
It can be shown that a simple form of ∆W , like Eq. (1), is enough to successfully implement a robust and reliable selectivity task. There is a significant cycle-to-cycle resistance variation. This uncertainty is defined and verified using the function g(·). The RRAM and CBRAM devices have experimentally shown log-normal distribution of ON and OFF state resistances. Fig. 4 illustrates high and low resistance variation of a RRAM device for more than 4000 switching cycles. In some devices, like [12] , OFF state resistances may shows wider variation due to the uncontrolled metallic filament dissolution. III. Orientation Sensitivity A two layer neural network is implemented with 64×64-pixel binary visual images. Inputs were generated using a Gabor-like function to produce more than a thousand, randomly centered, receptive fields. Each input pixel drives an input neuron that spikes if the pixel represents '1'. There are nine output neurons that are fully connected to all input neurons through one-bit resolution stochastic RRAM connections. Therefore, 36, 864 synapses are available that are randomly programed and some are faulty. Faulty device patterns, random initial state, and some of the visual inputs are shown in Fig. 5 . Output neurons are integration-and-fire neurons that are connected to each other through inhibitory connections, which form a winner-takes-all network. Output neurons could be configured to implement Hebbian, anti-Hebbian, STDP, and anti-STDP learning with a support for a programmable refractory period. The system implements an unsupervised competitive learning that is capable of accepting a wide range of input patterns, including visual and auditory with or without preprocessing. During training, inputs were shown to the network and each was repeated several times. After training, as shown in Fig. 6a , output neurons become sensitive to several orientations. Fig. 6b shows that the system was not sensitive to any angle prior to learning. The system sensitivity to particular angles of the visual input patterns is demonstrated in Fig. 6c . One of the main parameters of designing the system is the dynamic tuning of the output neurons' firing threshold, which is not the focus of this paper. The training process were repeated with different threshold tuning approaches and a range of initial conditions, device variation, and defect patterns. The system continues to perform reliably under extreme yield (defect rate is > 20%) and variability assumptions, where the function f (·) output varies within 0.1 probability (3σ) around its nominal value. Implementing such system with low-power synapses would produce an ultra-high energy efficient and highly scalable system with an energy consumption of < 1 pJ/spike. Test input angle 
IV. Conclusion
The implemented neuromorphic system promises a highly scalable, easily mappable, variation-aware, reliable, lesser expensive (works without selector devices), potentially energy efficient, and suitable for high throughput applications. Each training process, in this case, demands for an endurance requirement in the order of > 20k cycles, which is well within the current reported endurances of RRAM, CBRAM, PCM, STT-MRAM, MeRAM, and atomic switches. Cost-wise, implementing such network with RRAM, CBRAM, PCM, and atomic switches may be cheaper relative to STT-MRAM and MeRAM. However, rapid progress of these technologies promises high energy efficient memory systems, which makes the cost-energy trade-off of these devices more attractive in the near future. Modeling accuracy can be further improved through systematic studies like [13] . The information gained from these simulations will underpin intense ongoing research aimed at characterizing, formulation and modeling of the rich in-situ spatiotemporal variation, including positiondependent, spatially correlated, and distance-dependent variation characterization.
