2,192 research outputs found

    High-Performance Energy-Efficient and Reliable Design of Spin-Transfer Torque Magnetic Memory

    Get PDF
    In this dissertation new computing paradigms, architectures and design philosophy are proposed and evaluated for adopting the STT-MRAM technology as highly reliable, energy efficient and fast memory. For this purpose, a novel cross-layer framework from the cell-level all the way up to the system- and application-level has been developed. In these framework, the reliability issues are modeled accurately with appropriate fault models at different abstraction levels in order to analyze the overall failure rates of the entire memory and its Mean Time To Failure (MTTF) along with considering the temperature and process variation effects. Design-time, compile-time and run-time solutions have been provided to address the challenges associated with STT-MRAM. The effectiveness of the proposed solutions is demonstrated in extensive experiments that show significant improvements in comparison to state-of-the-art solutions, i.e. lower-power, higher-performance and more reliable STT-MRAM design

    Performance Analysis of NAND Flash Memory Solid-State Disks

    Get PDF
    As their prices decline, their storage capacities increase, and their endurance improves, NAND Flash Solid-State Disks (SSD) provide an increasingly attractive alternative to Hard Disk Drives (HDD) for portable computing systems and PCs. HDDs have been an integral component of computing systems for several decades as long-term, non-volatile storage in memory hierarchy. Today's typical hard disk drive is a highly complex electro-mechanical system which is a result of decades of research, development, and fine-tuned engineering. Compared to HDD, flash memory provides a simpler interface, one without the complexities of mechanical parts. On the other hand, today's typical solid-state disk drive is still a complex storage system with its own peculiarities and system problems. Due to lack of publicly available SSD models, we have developed our NAND flash SSD models and integrated them into DiskSim, which is extensively used in academe in studying storage system architectures. With our flash memory simulator, we model various solid-state disk architectures for a typical portable computing environment, quantify their performance under real user PC workloads and explore potential for further improvements. We find the following: * The real limitation to NAND flash memory performance is not its low per-device bandwidth but its internal core interface. * NAND flash memory media transfer rates do not need to scale up to those of HDDs for good performance. * SSD organizations that exploit concurrency at both the system and device level improve performance significantly. * These system- and device-level concurrency mechanisms are, to a significant degree, orthogonal: that is, the performance increase due to one does not come at the expense of the other, as each exploits a different facet of concurrency exhibited within the PC workload. * SSD performance can be further improved by implementing flash-oriented queuing algorithms, access reordering, and bus ordering algorithms which exploit the flash memory interface and its timing differences between read and write requests

    Integration of Non-volatile Memory into Storage Hierarchy

    Get PDF
    In this dissertation, we present novel approaches for integrating non-volatile memory devices into storage hierarchy of a computer system. There are several types of non- volatile memory devices, such as flash memory, Phase Change Memory (PCM), Spin- transfer torque memory (STT-RAM). These devices have many appealing features for applications; however, they also offer several challenges. This dissertation is focused on how to efficiently integrate these non-volatile memories into existing memory and disk storage systems. This work is composed of two major parts. The first part investigates a main-memory system employing Phase Change Memory instead of traditional DRAM. Compared to DRAM, PCM has higher density and no static power consumption, which are very important factors for building large capacity memory systems. However, PCM has higher write latency and power consumption compared to read operations. Moreover, PCM has limited write endurance. To efficiently integrate PCM into a memory system, we have to solve the challenges brought by its expensive write operations. We propose new replacement policies and cache organizations for the last-level CPU cache, which can effectively reduce the write traffic to the PCM main memory. We evaluated our design with multiple workloads and configurations. The results show that the proposed approaches improve the lifetime and energy consumption of PCM significantly. The second part of the dissertation considers the design of a data/disk storage using non-volatile memories, e.g. flash memory, PCM and nonvolatile DIMMs. We consider multiple design options for utilizing the nonvolatile memories in the storage hierarchy. First, we consider a system that employs nonvolatile memories such as PCM or nonvolatile DIMMs on memory bus along with flash-based SSDs. We propose a hybrid file system, NVMFS, that manages both these devices. NVMFS exploits the nonvolatile memory to improve the characteristics of the write workload at the SSD. We satisfy most small random write requests on the fast nonvolatile DIMM and only do large and optimized writes on SSD. We also group data of similar update patterns together before writing to flash-SSD; as a result, we can effectively reduce the garbage collection overhead. We implemented a prototype of NVMFS in Linux and evaluated its performance through multiple benchmarks. Secondly, we consider the problem of using flash memory as a cache for a disk drive based storage system. Since SSDs are expensive, a few SSDs are designed to serve as a cache for a large number of disk drives. SSD cache space can be used for both read and write requests. In our design, we managed multiple flash-SSD devices directly at the cache layer without the help of RAID software. To ensure data reliability and cache space efficiency, we only duplicated dirty data on flash- SSDs. We also balanced the write endurance of different flash-SSDs. As a result, no single SSD will fail much earlier than the others. Thirdly, when using PCM-like devices only as data storage, itโ€™s possible to exploit memory management hardware resources to improve file system performance. However, in this case, PCM may share critical system resources such as the TLB, page table with DRAM which can potentially impact PCMโ€™s performance. To solve this problem, we proposed to employ superpages to reduce the pressure on memory management resources. As a result, the file system performance is further improved

    ์ „์ž ์žฅ์น˜ ๋‚ด ๊ตญ๋ถ€์  ์ „๊ณ„ ํ–ฅ์ƒ์„ ์œ„ํ•œ ๋‚˜๋…ธ ๊ตฌ์กฐ์ฒด

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ํ™”ํ•™์ƒ๋ฌผ๊ณตํ•™๋ถ€, 2021.8. ์กฐ์žฌ์˜.The goal of this dissertation is to investigate effect of nanostructures for local electric field enhancement in electronic devices and to provide experimental and theoretical bases for their practical use. Resistive random access memory (RRAM) is a data storage device that can be modulated its resistance states by external electrical stimuli. The electric field generated by the applied potential difference between the two electrodes acts as the driving force to switch the resistance states, so controlling the electric field within the device can lead to improved operational performance and reliability of the device. Even though considerable progress has been made through significant efforts to control the electric field within the device, selectively enhancing the electric field in the intended position for stable and uniform resistive switching behavior is still challenging. Engineered metal structures in the RRAM can efficiently manipulate the electric field. As the radius of the metal structures decreases, the charge density increases, generating electric field enhancements in confined region. To minimize the radius of the metal structure and thus to greatly increase the electric field in a local area, we introduced a nanoscale metal structure into the RRAM. First, pyramid-structured metal electrode with a sharp tip was used to achieve a tip-enhanced electric field, and the effect of the enhanced electric field on the resistive switching behaviors of the device was investigated. Based on numerical simulation and experimental results, we confirmed that pyramidal electrode with a tip radius of tens of nanometers can selectively enhance the electric field at the tip. The tip-enhanced electric field can facilitate the thermochemical reaction in transition metal oxide-based RRAMs and efficiency of charge injection and transport in organic-based RRAMs, as well as provide position selectivity during formation of conductive filament. The resulting RRAM exhibited reliable resistive switching behavior and highly improved device performance compared with conventional RRAM with planar electrode. As another approach to enhance the electric field within the resistive switching layer, we prepared spherical nanostructures via self-assembled block copolymer (BCP)/metal compound micelles. BCP and metal precursors were dissolved in aqueous media for use as BCP/metal compound micelles. These micelles were used as complementary resistive switch (CRS) layers of the memory device and the mechanism of CRS behavior was investigated. The spherical metal nanostructures can improve the electric fields, promoting a resistive switching mechanism based on electrochemical metallization. The resulting CRS memory exhibited reliable resistive switching behavior with four distinct threshold voltages in both cycle-to-cycle and cell-to-cell tests. Also, the conduction and resistive switching mechanism are experimentally demonstrated through the the analysis of the currentโ€“voltage data plot and detemination of the temperature coefficient of resistance. Overall, we pursued efficient engineering of metal nanostructures capable of manipulating electric fields for improving the operational performance and reliability of memory devices. There is no doubt that the commercialized RRAM will become popular in the near future after overcoming all the challenges of RRAM through continuous interest and research. We believe that these results will not only contribute to the significant advancement of all electronic devices, including RRAM, but will also help promote research activities in the electronic device field.๋ณธ ๋…ผ๋ฌธ์˜ ๋ชฉ์ ์€ ๋‚˜๋…ธ ๊ตฌ์กฐ์ฒด๋ฅผ ํ†ตํ•œ ์ „์ž ์žฅ์น˜ ๋‚ด ๊ตญ๋ถ€์  ์ „๊ณ„ ํ–ฅ์ƒ ํšจ๊ณผ๋ฅผ ์กฐ์‚ฌํ•˜๊ณ , ์ด์˜ ์‹ค์ œ ์‚ฌ์šฉ์„ ์œ„ํ•œ ์‹คํ—˜ ๋ฐ ์ด๋ก ์  ๊ธฐ๋ฐ˜์„ ์ œ๊ณตํ•˜๋Š” ๊ฒƒ์ด๋‹ค. ์ €ํ•ญ๋ณ€ํ™”๋ฉ”๋ชจ๋ฆฌ (resistive random access memory) ๋Š” ์™ธ๋ถ€ ์ „๊ธฐ ์ž๊ทน์— ์˜ํ•ด ์ €ํ•ญ ์ƒํƒœ๋ฅผ ๋ณ€ํ™” ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ๋ฐ์ดํ„ฐ ์ €์žฅ ์žฅ์น˜์ด๋‹ค. ๋‘ ์ „๊ทน ์‚ฌ์ด์— ์ธ๊ฐ€๋œ ์ „์œ„์ฐจ์— ์˜ํ•ด ์ƒ์„ฑ๋œ ์ „๊ธฐ์žฅ์€ ์ €ํ•ญ ์ƒํƒœ๋ฅผ ์ „ํ™˜์‹œํ‚ค๋Š” ๊ตฌ๋™๋ ฅ์œผ๋กœ์จ ์ž‘์šฉํ•˜๋ฏ€๋กœ, ์ „์ž ์žฅ์น˜ ๋‚ด์—์„œ ์ „๊ธฐ์žฅ์„ ์ œ์–ดํ•˜๋ฉด ์žฅ์น˜์˜ ์„ฑ๋Šฅ๊ณผ ์‹ ๋ขฐ์„ฑ์„ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋‹ค. ์žฅ์น˜ ๋‚ด์—์„œ ์ „๊ธฐ์žฅ์„ ์ œ์–ดํ•˜๋ ค๋Š” ๋งŽ์€ ๋…ธ๋ ฅ์„ ํ†ตํ•ด ์ƒ๋‹นํ•œ ์ง„์ „์ด ์žˆ์—ˆ์ง€๋งŒ, ์•ˆ์ •์ ์ด๊ณ  ๊ท ์ผํ•œ ์ €ํ•ญ ๋ณ€ํ™” ๊ฑฐ๋™์„ ์œ„ํ•ด ์˜๋„๋œ ์œ„์น˜์—์„œ ์ „๊ธฐ์žฅ์„ ์„ ํƒ์ ์œผ๋กœ ํ–ฅ์ƒ์‹œํ‚ค๋Š” ์ผ์€ ์•„์ง ๋„์ „์  ๊ณผ์ œ์ด๋‹ค. ๊ตฌ์กฐํ™”๋œ ๊ธˆ์†์„ ์ €ํ•ญ๋ณ€ํ™”๋ฉ”๋ชจ๋ฆฌ์— ์ ‘๋ชฉ์‹œํ‚ด์œผ๋กœ์จ ์ „๊ธฐ์žฅ์„ ํšจ์œจ์ ์œผ๋กœ ์กฐ์ž‘ํ•  ์ˆ˜ ์žˆ๋‹ค. ๊ธˆ์† ๊ตฌ์กฐ์ฒด์˜ ๋ฐ˜๊ฒฝ์ด ๊ฐ์†Œํ•จ์— ๋”ฐ๋ผ ์ „ํ•˜ ๋ฐ€๋„๊ฐ€ ์ฆ๊ฐ€ํ•˜์—ฌ ๊ตญ๋ถ€์  ์˜์—ญ์—์„œ ์ „๊ธฐ์žฅ์ด ํ–ฅ์ƒ๋œ๋‹ค. ์ด ๋…ผ๋ฌธ์—์„œ๋Š” ๊ธˆ์† ๊ตฌ์กฐ์ฒด์˜ ๋ฐ˜๊ฒฝ์„ ์ตœ์†Œํ™”ํ•˜์—ฌ ๊ตญ๋ถ€์ ์œผ๋กœ ์ „๊ธฐ์žฅ์„ ํฌ๊ฒŒ ํ–ฅ์ƒ์‹œํ‚ค๊ธฐ ์œ„ํ•ด ์ €ํ•ญ๋ณ€ํ™”๋ฉ”๋ชจ๋ฆฌ์— ๋‚˜๋…ธ์Šค์ผ€์ผ์˜ ๊ธˆ์† ๊ตฌ์กฐ์ฒด๋ฅผ ๋„์ž…ํ•˜์˜€๋‹ค. ์ฒซ ๋ฒˆ์งธ๋กœ, ํŒ ๊ฐ•ํ™” (tip-enhanced) ์ „๊ธฐ์žฅ ํšจ๊ณผ๋ฅผ ๋‹ฌ์„ฑํ•˜๊ธฐ ์œ„ํ•ด ๋‚ ์นด๋กœ์šด ํŒ์„ ๊ฐ€์ง€๋Š” ํ”ผ๋ผ๋ฏธ๋“œ ๊ธˆ์† ๊ตฌ์กฐ์ฒด๋ฅผ ์ „๊ทน์œผ๋กœ ์‚ฌ์šฉํ•˜์˜€์œผ๋ฉฐ, ๊ฐ•ํ™”๋œ ์ „๊ธฐ์žฅ์ด ์†Œ์ž์˜ ์ €ํ•ญ ๋ณ€ํ™” ๊ฑฐ๋™์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ์กฐ์‚ฌํ•˜์˜€๋‹ค. ์œ ํ•œ์š”์†Œ๋ชจ๋ธ๋ง๊ณผ ์‹คํ—˜๊ฒฐ๊ณผ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ, ์ˆ˜์‹ญ ๋‚˜๋…ธ ๋ฏธํ„ฐ์˜ ํŒ ๋ฐ˜๊ฒฝ์„ ๊ฐ€์ง€๋Š” ํ”ผ๋ผ๋ฏธ๋“œ ๊ตฌ์กฐ์ฒด ์ „๊ทน์ด ํŒ ๋ถ€๊ทผ์—์„œ ์ „๊ธฐ์žฅ์„ ๊ตญ์†Œ์ ์œผ๋กœ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ์Œ์„ ํ™•์ธํ•˜์˜€๋‹ค. ํŒ ๊ฐ•ํ™” ์ „๊ธฐ์žฅ์€ ์ „์ด ๊ธˆ์† ์‚ฐํ™”๋ฌผ-๊ธฐ๋ฐ˜ ์ €ํ•ญ๋ณ€ํ™”๋ฉ”๋ชจ๋ฆฌ์—์„œ ์—ดํ™”ํ•™ (thermochemical) ๋ฐ˜์‘์„ ์ด‰์ง„์‹œํ‚ค๊ณ  ์œ ๊ธฐ-๊ธฐ๋ฐ˜ ์ €ํ•ญ๋ณ€ํ™”๋ฉ”๋ชจ๋ฆฌ์—์„œ ์ „ํ•˜ ์ฃผ์ž… (charge injection) ๋ฐ ์ˆ˜์†ก (transport) ํšจ์œจ์„ฑ์„ ํ–ฅ์ƒ์‹œํ‚ฌ ๋ฟ ์•„๋‹ˆ๋ผ, ์„ ํƒ์ ์ธ ์œ„์น˜์—์„œ๋งŒ ์ „๋„์„ฑ ํ•„๋ผ๋ฉ˜ํŠธ (conductive filament)๋ฅผ ํ˜•์„ฑ์‹œํ‚ฌ ์ˆ˜ ์žˆ์—ˆ๋‹ค. ๊ทธ ๊ฒฐ๊ณผ ํ”ผ๋ผ๋ฏธ๋“œ ๊ตฌ์กฐ์ฒด ์ €ํ•ญ๋ณ€ํ™”๋ฉ”๋ชจ๋ฆฌ๋Š” ์ข…๋ž˜์˜ ํ‰ํŒ ๊ตฌ์กฐ์ฒด ์ €ํ•ญ๋ณ€ํ™”๋ฉ”๋ชจ๋ฆฌ์— ๋น„ํ•ด ์•ˆ์ •์ ์ธ ์ €ํ•ญ ๋ณ€ํ™” ๊ฑฐ๋™๊ณผ ํ–ฅ์ƒ๋œ ์žฅ์น˜ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ์—ˆ๋‹ค. ์ €ํ•ญ ๋ณ€ํ™” ์ธต ๋‚ด์˜ ์ „๊ธฐ์žฅ์„ ํ–ฅ์ƒ์‹œํ‚ค๊ธฐ ์œ„ํ•œ ๋˜ ๋‹ค๋ฅธ ์ ‘๊ทผ๋ฒ•์œผ๋กœ, ์ž๊ธฐ์กฐ๋ฆฝ (self-assembled)๋œ ๋ธ”๋ก๊ณต์ค‘ํ•ฉ์ฒด (block copolymer)/๊ธˆ์† ๋ณตํ•ฉ์ฒด ๋ฏธ์…€ (micelle)์„ ์ด์šฉํ•˜์—ฌ ๊ตฌํ˜•์˜ ๋‚˜๋…ธ๊ตฌ์กฐ์ฒด๋ฅผ ์†Œ์ž์˜ ์ค‘๊ฐ„์ธต์œผ๋กœ ๋„์ž…ํ•˜์˜€๋‹ค. ๋ธ”๋ก๊ณต์ค‘ํ•ฉ์ฒด ๋ฐ ๊ธˆ์†์ „๊ตฌ์ฒด๋ฅผ ๋ณตํ•ฉ์ฒด ๋ฏธ์…€๋กœ ์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•ด ์„ ํƒ์  ์šฉ๋งค์— ์šฉํ•ด์‹œ์ผฐ๋‹ค. ํ•ด๋‹น ๋ฏธ์…€์„ ๋ฉ”๋ชจ๋ฆฌ ์†Œ์ž์˜ ์ƒ๋ณด์  ์ €ํ•ญ ๋ณ€ํ™” (complementary resistive switch) ์ธต์œผ๋กœ ์‚ฌ์šฉํ•˜์˜€์œผ๋ฉฐ, ์ƒ๋ณด์  ์ €ํ•ญ ๋ณ€ํ™” ๊ฑฐ๋™์˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์กฐ์‚ฌํ•˜์˜€๋‹ค. ๊ตฌํ˜•์˜ ๊ธˆ์† ๋‚˜๋…ธ๊ตฌ์กฐ์ฒด๋Š” ์ „๊ธฐ์žฅ์„ ํ–ฅ์ƒ์‹œ์ผœ ์ „๊ธฐํ™”ํ•™์  ๊ธˆ์†ํ™” (electrochemical metallization)์— ๊ธฐ๋ฐ˜ํ•œ ์ €ํ•ญ ๋ณ€ํ™” ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์ด‰์ง„์‹œํ‚ฌ ์ˆ˜ ์žˆ์—ˆ๋‹ค. ๊ทธ ๊ฒฐ๊ณผ ์ƒ๋ณด์  ์ €ํ•ญ ๋ณ€ํ™” ๋ฉ”๋ชจ๋ฆฌ๋Š” ์‚ฌ์ดํด ๋ฐ ์…€๊ฐ„ ๋ฐ˜๋ณต ์‹œํ—˜ ๋ชจ๋‘์—์„œ 4๊ฐœ์˜ ์ž„๊ณ„ ์ „์••์œผ๋กœ ์•ˆ์ •์ ์ธ ์ €ํ•ญ ๋ณ€ํ™” ๋™์ž‘์„ ๋‚˜ํƒ€๋‚ด์—ˆ๋‹ค. ๋˜ํ•œ ์ „๋ฅ˜-์ „์•• ์ž๋ฃŒ ํ”Œ๋กฏ (plot) ๋ถ„์„๊ณผ ์ €ํ•ญ์˜ ์˜จ๋„ ๊ณ„์ˆ˜ ๊ฒฐ์ •์„ ํ†ตํ•ด ์žฅ์น˜์˜ ์ „๋„ ๋ฐ ์ €ํ•ญ ๋ณ€ํ™” ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ ์‹คํ—˜์ ์œผ๋กœ ์ž…์ฆํ•˜์˜€๋‹ค. ์ „๋ฐ˜์ ์œผ๋กœ ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์žฅ์น˜ ๋‚ด ์ „๊ธฐ์žฅ์„ ์ฆํญ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ๊ธˆ์† ๋‚˜๋…ธ๊ตฌ์กฐ์ฒด์˜ ํšจ์œจ์ ์ธ ์—”์ง€๋‹ˆ์–ด๋ง์„ ํ†ตํ•ด ๋ฉ”๋ชจ๋ฆฌ ์žฅ์น˜์˜ ์„ฑ๋Šฅ๊ณผ ์‹ ๋ขฐ์„ฑ ํ–ฅ์ƒ์„ ์ถ”๊ตฌํ•˜์˜€๋‹ค. ์ง€์†์ ์ธ ๊ด€์‹ฌ๊ณผ ์—ฐ๊ตฌ๋ฅผ ํ†ตํ•ด ์ €ํ•ญ๋ณ€ํ™”๋ฉ”๋ชจ๋ฆฌ์˜ ๋ชจ๋“  ๊ณผ์ œ๋ฅผ ๊ทน๋ณตํ•œ ํ›„, ์ƒ์šฉํ™”๋œ ์ €ํ•ญ๋ณ€ํ™”๋ฉ”๋ชจ๋ฆฌ๊ฐ€ ๊ฐ€๊นŒ์šด ๋ฏธ๋ž˜์— ๋Œ€์ค‘ํ™”๋  ๊ฒƒ์ž„์„ ๋ฏฟ์–ด ์˜์‹ฌ์น˜ ์•Š๋Š”๋‹ค. ์šฐ๋ฆฌ๋Š” ์ด ๊ฒฐ๊ณผ๊ฐ€ ์ €ํ•ญ๋ณ€ํ™”๋ฉ”๋ชจ๋ฆฌ๋ฅผ ํฌํ•จํ•œ ๋ชจ๋“  ์ „์ž ์žฅ์น˜์˜ ํš๊ธฐ์ ์ธ ๋ฐœ์ „์— ๊ธฐ์—ฌํ•  ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ์ „์ž ์žฅ์น˜ ๋ถ„์•ผ์˜ ์—ฐ๊ตฌ ํ™œ๋™์„ ์ด‰์ง„ํ•˜๋Š” ๋ฐ์—๋„ ๋„์›€์ด ๋  ๊ฒƒ์ด๋ผ๊ณ  ๋ฏฟ๋Š”๋‹ค.Chapter 1. Introduction 1 1.1. Background 1 1.1.1. Necessity of new memory devices 1 1.1.2. Resistive random access memory 2 1.2. Motivation 4 1.3. Dissertation Overview 6 1.4. References 9 Chapter 2. Tip-Enhanced Electric Field-Driven Efficient Charge Injection and Transport in Organic Material-Based Resistive Memories 19 2.1. Introduction 21 2.2. Experimental 24 2.3. Results and Discussion 27 2.4. Conclusions 37 2.5. References 38 Chapter 3. Facilitation of the Thermochemical Mechanism in NiO-Based Resistive Switching Memories via Tip-Enhanced Electric Fields 52 3.1. Introduction 54 3.2. Experimental 57 3.3. Results and Discussion 60 3.4. Conclusions 66 3.5. References 67 Chapter 4. Facile Achievement of Complementary Resistive Switching Behaviors via Self-Assembled Block Copolymer Micelles 82 4.1. Introduction 83 4.2. Experimental 86 4.3. Results and Discussion 89 4.4. Conclusions 96 4.5. References 97 Chapter 5. Conclusion 109 Abstract in Korean 112๋ฐ•

    Efficient Methods for Unsupervised Learning of Probabilistic Models

    Full text link
    In this thesis I develop a variety of techniques to train, evaluate, and sample from intractable and high dimensional probabilistic models. Abstract exceeds arXiv space limitations -- see PDF

    Vector support for multicore processors with major emphasis on configurable multiprocessors

    Get PDF
    It recently became increasingly difficult to build higher speed uniprocessor chips because of performance degradation and high power consumption. The quadratically increasing circuit complexity forbade the exploration of more instruction-level parallelism (JLP). To continue raising the performance, processor designers then focused on thread-level parallelism (TLP) to realize a new architecture design paradigm. Multicore processor design is the result of this trend. It has proven quite capable in performance increase and provides new opportunities in power management and system scalability. But current multicore processors do not provide powerful vector architecture support which could yield significant speedups for array operations while maintaining arealpower efficiency. This dissertation proposes and presents the realization of an FPGA-based prototype of a multicore architecture with a shared vector unit (MCwSV). FPGA stands for Filed-Programmable Gate Array. The idea is that rather than improving only scalar or TLP performance, some hardware budget could be used to realize a vector unit to greatly speedup applications abundant in data-level parallelism (DLP). To be realistic, limited by the parallelism in the application itself and by the compiler\u27s vectorizing abilities, most of the general-purpose programs can only be partially vectorized. Thus, for efficient resource usage, one vector unit should be shared by several scalar processors. This approach could also keep the overall budget within acceptable limits. We suggest that this type of vector-unit sharing be established in future multicore chips. The design, implementation and evaluation of an MCwSV system with two scalar processors and a shared vector unit are presented for FPGA prototyping. The MicroBlaze processor, which is a commercial IP (Intellectual Property) core from Xilinx, is used as the scalar processor; in the experiments the vector unit is connected to a pair of MicroBlaze processors through standard bus interfaces. The overall system is organized in a decoupled and multi-banked structure. This organization provides substantial system scalability and better vector performance. For a given area budget, benchmarks from several areas show that the MCwSV system can provide significant performance increase as compared to a multicore system without a vector unit. However, a MCwSV system with two MicroBlazes and a shared vector unit is not always an optimized system configuration for various applications with different percentages of vectorization. On the other hand, the MCwSV framework was designed for easy scalability to potentially incorporate various numbers of scalar/vector units and various function units. Also, the flexibility inherent to FPGAs can aid the task of matching target applications. These benefits can be taken into account to create optimized MCwSV systems for various applications. So the work eventually focused on building an architecture design framework incorporating performance and resource management for application-specific MCwSV (AS-MCwSV) systems. For embedded system design, resource usage, power consumption and execution latency are three metrics to be used in design tradeoffs. The product of these metrics is used here to choose the MCwSV system with the smallest value

    The design and analysis of novel integrated phase-change photonic memory and computing devices

    Get PDF
    The current massive growth in data generation and communication challenges traditional computing and storage paradigms. The integrated silicon photonic platform may alleviate the physical limitations resulting from the use of electrical interconnects and the conventional von Neuman computing architecture, due to its intrinsic energy and bandwidth advantages. This work focuses on the development of the phase-change all-photonic memory (PPCM), a device potentially enabling the transition from the electrical to the optical domain by providing the (previously unavailable) non-volatile all-photonic storage functionality. PPCM devices allow for all-optical encoding of the information on the crystal fraction of a waveguide-implemented phase-change material layer, here Ge2Sb2Te5, which in turn modulates the transmitted signal amplitude. This thesis reports novel developments of the numerical methods necessary to emulate the physics of PPCM device operation and performance characteristics, illustrating solutions enabling the realization of a simulation framework modelling the inherently three-dimensional and self-influencing optical, thermal and phase-switching behaviour of PPCM devices. This thesis also depicts an innovative, fast and cost-effective method to characterise the key optical properties of phase-change materials (upon which the performance of PPCM devices depend), exploiting the reflection pattern of a purposely built layer stack, combined with a smart fit algorithm adapting potential solutions drawn from the scientific literature. The simulation framework developed in the thesis is used to analyse reported PPCM experimental results. Numerous sources of uncertainty are underlined, whose systematic analysis reduced to the peculiar non-linear optical properties of Ge2Sb2Te5. Yet, the data fit process validates both the simulation tool and the remaining physical assumptions, as the model captures the key aspects of the PPCM at high optical intensity, and reliably and accurately predicts its behaviour at low intensity, enabling to investigate its underpinning physical mechanisms. Finally, a novel PPCM memory architecture, exploiting the interaction of a much-reduced Ge2Sb2Te5 volume with a plasmonic resonant nanoantenna, is proposed and numerically investigated. The architecture concept is described and the memory functionality is demonstrated, underlining its potential energy and speed improvement on the conventional device by up to two orders of magnitude.Engineering and Physical Sciences Research Council (EPSRC

    Flash Memory Devices

    Get PDF
    Flash memory devices have represented a breakthrough in storage since their inception in the mid-1980s, and innovation is still ongoing. The peculiarity of such technology is an inherent flexibility in terms of performance and integration density according to the architecture devised for integration. The NOR Flash technology is still the workhorse of many code storage applications in the embedded world, ranging from microcontrollers for automotive environment to IoT smart devices. Their usage is also forecasted to be fundamental in emerging AI edge scenario. On the contrary, when massive data storage is required, NAND Flash memories are necessary to have in a system. You can find NAND Flash in USB sticks, cards, but most of all in Solid-State Drives (SSDs). Since SSDs are extremely demanding in terms of storage capacity, they fueled a new wave of innovation, namely the 3D architecture. Today โ€œ3Dโ€ means that multiple layers of memory cells are manufactured within the same piece of silicon, easily reaching a terabit capacity. So far, Flash architectures have always been based on "floating gate," where the information is stored by injecting electrons in a piece of polysilicon surrounded by oxide. On the contrary, emerging concepts are based on "charge trap" cells. In summary, flash memory devices represent the largest landscape of storage devices, and we expect more advancements in the coming years. This will require a lot of innovation in process technology, materials, circuit design, flash management algorithms, Error Correction Code and, finally, system co-design for new applications such as AI and security enforcement

    Domain specific high performance reconfigurable architecture for a communication platform

    Get PDF
    • โ€ฆ
    corecore