All-spin logic devices are promising candidates to augment and complement beyond-CMOS integrated circuit computing due to non-volatility, ultra-low operating voltages, higher logical efficiency, and high density integration. However, the path to reach lower energy-delay product performance compared to CMOS transistors currently is not clear. We show that scaling and engineering the nanoscale magnetic materials and interfaces is the key to realizing spin logic devices that can surpass energy-delay performance of CMOS transistors. With validated stochastic nano-magnetic and vector spin transport numerical models, we derive the target material and interface properties for the nanomagnets and channels. We identified promising new directions for material engineering/discovery focusing on systematic scaling of magnetic anisotropy (H k ) with saturation magnetization (M s ), use of perpendicular magnetic anisotropy, and interface spin mixing conductance of ferromagnet/spin channel interface (G mix ). We provide systematic targets for scaling spin logic energy-delay product toward a 2 aJ.ns energy-delay product, comprehending the stochastic noise for nanomagnets.
The control and manipulation of nanoscale spin transport and magnetism is one of the most promising approaches for beyond CMOS-logic, memory and analog applications [1, 2, and 3] .
Spintronic logic devices [4] [5] [6] [7] [8] [9] [10] are promising candidates for scaling the energy efficiency of computing devices due to the promise of 1) non-volatility [2] ; 2) higher logical efficiency [11] ; and 3) high density integration [12, 13] . One leading device, the All Spin Logic (ASL) device [5, 6, 9, 11] , processes information using only spin for the signal state variable, thus avoiding repeated state variable conversion and making it an excellent candidate for spin logicfor computation. Impressive experimental [14] [15] [16] [17] breakthroughs have been achieved in the recent years accelerating the development of spintronics as a viable logic and memory technology.
However, existing nanomagnet material properties and spin transfer channel properties fall short of the energy and delay targets [3] dictated by modern advanced CMOS devices [3] . Hence, it is of great interest to identify and pursue material and interface target parameters with which ASL devices can be scaled to achieve high efficiency.
In this letter, we show that scaling and engineering the nanoscale magnetic materials and interfaces is the key to realizing spin logic devices that can replace/augment CMOS devices.
Using validated numerical models we derive the target material and interface properties for the nanomagnets and channels. We identify promising new vectors for material exploration focusing on a) Systematic scaling of magnetic anisotropy (H k ) with saturation magnetization (M s ); b) Use of perpendicular magnetic anisotropy; and c) Interface mixing conductance of ferromagnet/spin channel interface (G mix ).
We first describe a representative nanoscale spin logic device operating via interaction of spin currents with nanomagnets. An inverting/non-inverting spin logic device with directionality of signal (information) flow is shown in Figure 1A . The spin logic device is comprised of two nanomagnets sharing a spin conduction channel. Each of the nanomagnets is shared between two spin conduction channels for propagating and regenerating the signal information. More complex circuits, such as majority gates, can be created by having more input magnets sharing the conduction channel. The all spin logic device is also amenable to 3D integration which provides an additional means for logic density scaling ( Figure 1B) . The example materials for forming the nanomagnets can be CoFeB, Pd, NiFe or L10 metals (FePt, FePd). Example channel materials are Ag [14] , Cu [17] and Graphene [15] which exhibit excellent room temperature spin flip lengths.
The ASL device operates by the asymmetric injection of spin currents into the spin conduction channel from the nanomagnets, which are sharing the spin conduction channel. The asymmetry may be created either from different (asymmetric) overlap of the magnets with the input and output channels, or different spin injection efficiency. In this paper, we consider asymmetric overlap ( Figure 1A ). When the ASL device is operated with a negative (rather than positive) supply voltage, the input and output nanomagnets inject electrons into the spin conduction channel where the spin direction is decided by the orientation of the nanoscale ferromagnets (FM) (example FM with spin pointing to the +X direction injects electrons oriented in the +X direction). The dominating magnet therefore sets up a larger spin polarization in its orientation in the channel. A spin current flows from the higher spin potential [6] to the weaker magnet (FM2) creating a spin torque to orient the weaker magnet to FM1. For positive supply voltage, a converse process happens creating the FM2 to align antiparallel to the FM1.
We model the spin logic device using an equivalent vector spin circuit for the spin logic device based on vector spin circuit theory [6, 18, and 19] . The functional form of the spin matrices are arrived at using a Landauer-Büttiker formalism with spin transport [19] applied to metallic circuits. The spin equivalent circuit modeling and the conduction elements for the ferromagnet (FM) to normal magnet (NM) interface are described in Ref [6] . Figure 2 shows the result of spin circuit simulations of the spin logic device with coupled stochastic nanomagnet dynamics and spin transport through the metallic channel. The operating dynamics of the device for nominal material conditions, with a positive applied voltage is shown in Figure 2 . Figure 2A , shows the response of the device with a 10 mV positive supply voltage. The Joule power dissipated by the supply is shown in Figure 2B . The large quiescent power due to the metallic device nature can be observed which shows the need for novel power management to take advantage of the nonvolatility to reduce this leakage power. The injected spin current in the +/-X direction is shown in Figure 2C . Figure 2D 
The dynamics of the spin device are solved self-consistently with the spin transport in the equivalent circuit models. The LLG solvers pass the condition of the magnets to the spin circuit and the spin circuit solver passes the spin vector current to the LLG solver at each pass of the self-consistent loop until a solution is reached. The thermal noise due to microscopic degrees of freedom of the nanomagnets plays a critical role in the dynamics of the nanomagnets and manifests itself as fluctuations to the internal anisotropic H eff (T) magnetic field [21] . A description of the noise properties and numerical methods used for stochastic LLG equations is provided in [6] .
We now evaluate the energy-delay product of nominal the ASL device (comprising magnets with 40kT energy barrier corresponding to 7-year retention time) with nominal existing material
choices. Materials with existing proof of concept integration in CMOS are chosen. Nominal material parameters for the materials are shown in Table 1 . At nominal operating voltage of 10 mV, the ASL device operates with a response time ~0.5 ns dominated by the stochastic delay of switching where the nanomagnet is acquiring the stochastic phase to start the nanomagnet dynamics. The estimated energy-delay product of the device is given by the total joule energy supplied by the supply voltage. The energy-delay product of the in-plane ASL is 3 to 6 fJ.ns Now we will describe the energy-delay reduction scaling of ASL using perpendicular magnetic anisotropy (PMA) material. The perpendicular anisotropy may be achieved by means of surface anisotropy (CoFeB with material thickness less than 1.2 nm) [23] , L10 materials (FePt, FePd) [24] , or by magnetic superlattices (Co/Pd) [25] . The perpendicular anisotropy significantly reduces the critical current for switching magnetic materials. We numerically simulate the effect of PMA on the ASL device and show significant reduction in its energy-delay product. We estimate it for a CoFeB PMA ferromagnet to be 170 aJ.ns using existing surface PMA materials, Figure 3 . The output nanomagnet dynamics are shown in Figure 3B and 3C comparing the nanomagnet dynamics of in-plane and PMA ASL.
We also show that spin diffusion length and Gilbert damping of the magnets have limited effects on improving the energy-delay of the spin logic devices. Above a certain spin diffusion length (when the spin diffusion length is longer than the channel length) the beneficial effect of better spin channels for logic device operation decreases, Figure 4 . We describe the effect of the spin diffusion length of the channel material on the energy-delay performance of the ASL device in Figure 4A where the channel length is 100 nm. The effect of varying the Gilbert damping of the magnet is shown in Figure 4B . The Gilbert damping is related to the noise of the magnetization caused by stochastic magnetic fields. Thus we conclude that any energy-delay reduction scaling expected from spin diffusion length improvement and Gilbert damping improvement is minor compared to improving the anisotropy.
We now describe a second energy-delay scaling technique for spin logic devices by material and nanomagnet engineering where the energy-delay can be scaled by a linear trade-off of material magnetization saturation (M s ) with magnetic anisotropy field (H k ) of the nanomagnets. The effect of scaling up the anisotropy field while reducing the saturation magnetization of the nanomagnets by the same factor is shown in Figure 5 . The nanomagnet volume and the effective thermal energy barrier remain constant. Hence, novel material discovery with a systematic control over the magnetization and H k can provide significant improvements in energy-delay product. A factor of 4 scaling of Ms and H k from the benchmark material (CoFeB) shows that the energy-delay product can be smaller than that for the 22 nm CMOS device by a factor of 3. This will provide active energy-delay product improvements equivalent to 8 generations of CMOS scaling (CMOS in 2028, [3] ). An energy-delay product reduction is expected both for in-plane and PMA ASL. The optimized PMA-ASL provides energy-delay product comparable to the 22 nm CMOS device node ( Figure 5B ) with the added advantages of non-volatility and low supply voltage operation.
Next we will describe a third technique for energy-delay reduction scaling using the optimization of an interface property of the NM-FM interface -the spin-mixing conductance -to optimize the active energy-delay of ASL devices. The spin mixing conductance of the NM-FM interface is responsible for the responsivity of the nanomagnet to the applied spin voltage. The non-collinear spin current injected into the FM for an applied voltage can be written as: (3) where G SL is the spin mixing conductance of the NM-FM interface. The spin mixing conductance is described in terms of spin reflection at the interface [19] as: (4) where ⁄ is the conductance per spin of a ballistic channel with ideal contacts [19] ;
are the reflection coefficients of the up and down spin electrons at the FM-NM interface; n is the number of modes in the NM, m is the number of modes in the FM. The number of modes in a normal metal NM can in-turn be written from the metal's Fermi wave vector k f [26] . It has been argued that are close to zero for many material systems [19] , which simplifies the spin torque conductance to as shown in (4) . A NM-FM interface is shown in Figure 6A where a vector spin current (comprising of spin with a general orientation with respect to to the magnets) can be injected into the device. Figure 6B , shows a 4-component spin conductance tensor matrix for a NM-FM interface where the vector spin current is generated Finally we show that scaled magnetic materials with optimized interfaces can provide significant energy-delay product improvements for ASL devices. We compare the energy-delay product of 4 scaled ASL devices to the energy-delay of low-power 22nm CMOS in Figure 7 . The energydelay performance of the ASL devices (~6.4 fJ.ns) with the nominal existing materials and CoFeB FM and Cu channels falls short of the CMOS energy-delay product as discussed earlier.
However, a perpendicular material configuration such as CoFeB with surface anisotropy can significantly reduce the energy-delay product to a few hundreds of aJ.ns. With a factor 4 improvements in material parameters, see [27, 28] , by scaling the M s down to 0.25·10 6 A/m and increasing the H k to 1.6·10 5 A/m, the energy-delay product of ASL may surpass future state of the art CMOS technology. For example, we note that tetragonally distorted Huesler compounds such as Mn 3 Ga exhibit low magnetization (M s =0.11·10 6 A/m), reasonably large spin polarization of 58% and extremely high anisotropy fields exceeding 10 7 A/m [28] . A variety of material choices remain to be explored for spin logic and embedded memory providing the opportunity for more improvements in energy-delay as well as device density [27, 30] .
To provide even further energy-delay product improvement, we propose the plausible case of an engineered interface with an enhancement in spin mixing conductance such that the energy-delay product can be reduced to as low as 2 aJ.ns, thus providing superior energy-delay product combined with non-volatile logic operation. are not competitive to state-of-art CMOS, at high activity factors. However, improved energydelay product using the introduced material scaling methods, combined with exploiting of nonvolatility, can lead to better effective energy-per -bit at lower activity factors. In conclusion, we outline a scaling path for development of materials and nanomagnetic interfaces for spin based logic using lateral spin valves for higher efficiency spin logic devices. Given the strong need for high efficiency mobile and high performance computing, 3D non-volatile spin logic with scaled magnetic materials can enable radical improvements in computing throughput and energy efficiency. 
