2 immersed in the etchant (H 2 O 2 : H 3 PO 4 : H 2 O=1:5:50 v/v, 6~7 hr). The template was slowly dissolved to release the Pt nanowires to the substrate. The Pt nanowires served as an etch mask for directional RIE to convert the underlying thin film into a nanowire array.
Template-assisted assembly technique
The as-purchased FeO QDs (Ocean NanoTech, Arkansas) was in solution form, in which chloroform was used as the solvent. The solution was diluted by toluene to a concentration of 1 mg/ml, followed by 1 hr sonication. The resulting solution should be homogeneous without any precipitates.
Before starting the template-assisted assembly, the memory chip was cooked in Remover PG (MicroChem, Massachusetts) at 150 °C for 30 min to remove any organic contamination on the surface. The clean chip was then baked (150 °C, 15 min), followed by surface functionalization with hexamethyldisilazane (HMDS). The functionalization was performed by exposing the chip to HMDS vapour in a sealed box for 15 min. The functionalized chip was soaked in the FeO QD solution and slowly withdrawn via a syringe pump (NE-1000 syringe pump, New Era pump Systems). The wellcontrolled speed was optimized for~ 2 mm/min to reach the best assembly result. Figure S1 . The device fabrication scheme. A) SNAP Pt NWs are released onto the pre-doped SOI substrate. B) An additional Pt mask is patterned over the Pt NWs to protect the micro-scale Si contacts. The mask pattern is transferred into the underlying Si with directional RIE. C) The transferred and sectioned pattern consists of lightly-doped SiNWs connected to highly-doped Si contacts. The entire pattern is fabricated from the same single crystal Si film. D) After surface treatment and source/drain metallization, a thin layer of HfO 2 is deposited as the tunnel oxide. E) The template-assisted assembly technique is carried out to align the FeO QDs into the SiNW trenches. F) The device is finished after patterning the metal gates atop the HfO 2 control oxide.
Device fabrication
Both n-and p-type memory devices were simultaneously fabricated on 6" (100) SOI wafers with a ~30-nm (100) Si film atop 250-nm buried SiO 2 . The non-self-aligned doping procedure can achieve ~0.1-µm accuracy via the Moiré technique. [1] Two ion implantations were sequentially carried out to heavily dope the n-and p-type source and drain regions (phosphorus dose of 3.2×10 14 cm -2 at 10 keV; boron dose of 3.8×10 14 cm -2 at 5 keV, Core Systems). A 1-µm channel length was defined by the spacing between the adjacent source and drain. After cleaning, the implanted wafers were annealed under N 2 at 900 °C for 10s to activate the dopants. Next, the SNAP method [2, 3] was used to prepare highly ordered arrays of 2-mm-long, 10-nm-wide SiNWs on the pre-doped wafers. The Pt NWs were initially produced as an etch mask by using SNAP ( Figure S1A ). To fabricate low-resistivity micronscale ohmic contacts, the Pt contact mask was made by using photolithography and lift-off process ( Figure S1B ). [4] Then the NW/micropatterned mask was transferred into the underlying silicon with a ~4-nm thermal silicon dioxide (SiO 2 ) was grown to passivate the Si surface. This surface treatment can dramatically reduce the SiNW surface state density [5, 6] and significantly improve the performance metrics of SiNW FETs. [7] In addition, growing a high-quality SiO 2 on Si surface is also a key step to integrate hafnium oxide (HfO 2 ) gate dielectrics. [8] [9] [10] The HF-resistant Cr/Au (10-nm/50nm) interconnect was used to sustain the frequently-used HF etching process, which was used to etch HfO 2 .
To make good ohmic contacts, the interconnect was designed to directly contact the micro-scale Si pads ( Figure S1D ), on which the native oxide was removed aforehand via buffered oxide etch (BOE).
In our memory devices, the high-κ HfO 2 was utilized as the tunnel and control oxide. Compared to SiO 2 , HfO 2 has two major advantages. First, the lower electron/hole barrier height to SiNWs can enhance the tunnel current to accelerate the program operation. [11, 12] Second, the larger dielectric constant allows the HfO 2 tunnel oxide to have much thicker physical thickness while keeping the same electrical equivalent oxide thickness (EOT). [11] [12] [13] . A thicker HfO 2 tunnel oxide can improve the data retention performance by reducing the gate leakage current. Herein a ~6.4nm-thick HfO 2 was conformally coated onto the substrate using atomic layer deposition (ALD). Figure S1E illustrates the template-assisted assembly technique [14] [15] [16] that was used to align FeO QDs into the SiNW trenches. The substrate was first baked to remove the surface moisture, followed by the chemical functionalization with hexamethyldisilazane (HMDS). The entire substrate was then soaked into the FeO QD solution (FeO QDs in chloroform, Ocean NanoTech) and slowly withdrawn via a syringe pump. The FeO QDs were self-assembled to form a close-packed matrix (Figure 1(c)) inside the SiNW trenches via interfacial capillary force and geometric confinement [15] . Both solution concentration and withdraw speed were carefully optimized to get a well-packed FeO matrix. This assembly technique can apply to varying-sized FeO QDs to reach a controllable packing density ( Figure 3A) . The assembled FeO QDs can offer numerous available DOS to store the injected charges.
A thick layer of HfO 2 (~18.9-nm) was then deposited as the control oxide to block the injected charges. Before the device characterization, the HfO 2 atop the probing pads was completely removed by repeating the HF wet etching. Thus, the HF-resistant Cr/Au was used as the interconnect to avoid the metal corrosion. The device fabrication was finalized by patterning the top metal gates, as shown in Figure S1F .
The charge transport mechanism of the memory devices
The current density of Fowler-Nordheim (F-N) tunneling has the form [17] =
where E is the electric field, and A and B are constants in terms of effective carrier mass and energy barrier height. Equation (1) can be rewritten as
A plot of equation (2) To determine the transport mechanism of our memory devices, the gate current as a function of gate voltage is measured. The gate current density can be obtained by dividing the gate current by the SiNW surface area, as shown in Figure S2 . The current density starts to increase rapidly when the absolute gate voltage is greater than 10V. The inset shows the device F-N plot biased at high gate voltage, ranging from 9.5V-12V. The result exhibits the linear dependency on 1/E, which demonstrates F-N tunneling is the transport mechanism of our memory devices. Figure S3 . The transfer characteristics of n-type memory devices biased at ±8V (A) and ±6V (B) gate voltage. Compared to the p-type counterparts, the n-type devices have smaller memory window under the same gate bias. Furthermore, they are more vulnerable to gate voltage stress, which can be represented by the increasing subthreshold swing (SS). A large SS degradation is observed after one program/erase operation is performed. 
Calculation of the capacitive coupling ratio of a memory device
The capacitive coupling ratio of the memory device can be calculated based on the simplified model shown in Figure S6 . The equivalent capacitor of each layer is modeled by its physical thickness (t) and the dielectric constant (ε r ). The following is the parameters used in this calculation. Control HfO 2 : t = 18.9 nm, ε r = 15.9; FeO QD: t = 10 nm, ε r = 14.2; Tunnel HfO 2 : t = 6.4 nm, ε r = 15.9; Passivation SiO 2 : t = 4 nm, ε r = 3.9. The calculated capacitor ratio is as follows: C C-HfO2 : C QD : C T-HfO2 : C SiO2 = 0.863 : 1.456 : 2.548 : 1.
To keep charge conservation, the ratio of voltage drop across each capacitor is inversely proportional to its capacitor value. Thus, the ratio of the voltage drop across the SiO 2 to the applied gate voltage is given by
Substitute the capacitor ratio into equation (1), the value of γ is ~0.31. For the device that was tested at a gate voltage of +/-21V, a 6.5-V voltage drop across the 4-nm SiO 2 can result in an electric field (~1.6 × 10 7 V/cm) higher than its dielectric strength (10 7 V/cm). We can reduce γ by using thicker control oxide (smaller C C-HfO2 ) or larger FeO QDs (smaller C QD ).
Calculation of the average number of stored charges per FeO QD
To estimate the total stored charges in the memory device, we first need to know the total capacitance of the control oxide. It can be approximated [18] by
where N eff is total number of QDs, r QD is the radius of FeO QDs, t C-HfO2 is the control oxide thickness, 
From our statistic data, the average ∆V TH of 18-and 5-nm FeO QDs are -0.418 and -0.156 V, respectively. Substitute the following parameters (t C-HfO2 = 14.2 nm, ε HfO2 = 15.9, ε QD = 14.2) into Figure S6 . The simplified capacitor model of the memory device.
equation (6) , the calculation shows each18-nm FeO QD holds ~7.7 unit charges but each 5-nm FeO QD only holds 0.3 unit charges. The result implies the charging energy of FeO QDs, which can be expressed as q 2 /(8π×ε HfO2 ×r QD ), [18] dominates the charge transport of this type of memory devices.
