2,303 research outputs found

    Numerical solutions of differential equations on FPGA-enhanced computers

    Get PDF
    Conventionally, to speed up scientific or engineering (S&E) computation programs on general-purpose computers, one may elect to use faster CPUs, more memory, systems with more efficient (though complicated) architecture, better software compilers, or even coding with assembly languages. With the emergence of Field Programmable Gate Array (FPGA) based Reconfigurable Computing (RC) technology, numerical scientists and engineers now have another option using FPGA devices as core components to address their computational problems. The hardware-programmable, low-cost, but powerful “FPGA-enhanced computer” has now become an attractive approach for many S&E applications. A new computer architecture model for FPGA-enhanced computer systems and its detailed hardware implementation are proposed for accelerating the solutions of computationally demanding and data intensive numerical PDE problems. New FPGAoptimized algorithms/methods for rapid executions of representative numerical methods such as Finite Difference Methods (FDM) and Finite Element Methods (FEM) are designed, analyzed, and implemented on it. Linear wave equations based on seismic data processing applications are adopted as the targeting PDE problems to demonstrate the effectiveness of this new computer model. Their sustained computational performances are compared with pure software programs operating on commodity CPUbased general-purpose computers. Quantitative analysis is performed from a hierarchical set of aspects as customized/extraordinary computer arithmetic or function units, compact but flexible system architecture and memory hierarchy, and hardwareoptimized numerical algorithms or methods that may be inappropriate for conventional general-purpose computers. The preferable property of in-system hardware reconfigurability of the new system is emphasized aiming at effectively accelerating the execution of complex multi-stage numerical applications. Methodologies for accelerating the targeting PDE problems as well as other numerical PDE problems, such as heat equations and Laplace equations utilizing programmable hardware resources are concluded, which imply the broad usage of the proposed FPGA-enhanced computers

    Declaratively programmable ultra-low latency audio effects processing on FPGA

    Get PDF
    WaveCore is a coarse-grained reconfigurable processor architecture, based on data-flow principles. The processor architecture consists of a scalable and interconnected cluster of Processing Units (PU), where each PU embodies a small floating-point RISC processor. The processor has been designed in technology-independent VHDL and mapped on a commercially available FPGA development platform. The programming methodology is declarative, and optimized to the application domain of audio and acoustical modeling. A benchmark demonstrator algorithm (guitar-model, comprehensive effects-gear box, and distortion/cabinet model) has been developed and applied to the WaveCore development platform. The demonstrator algorithm proved that WaveCore is very well suited for efficient modeling of complex audio/acoustical algorithms with negligible latency and virtually zero jitter. An experimental Faust-to-WaveCore compiler has shown the feasibility of automated compilation of Faust code to the WaveCore processor target

    A Compact CMOS Memristor Emulator Circuit and its Applications

    Full text link
    Conceptual memristors have recently gathered wider interest due to their diverse application in non-von Neumann computing, machine learning, neuromorphic computing, and chaotic circuits. We introduce a compact CMOS circuit that emulates idealized memristor characteristics and can bridge the gap between concepts to chip-scale realization by transcending device challenges. The CMOS memristor circuit embodies a two-terminal variable resistor whose resistance is controlled by the voltage applied across its terminals. The memristor 'state' is held in a capacitor that controls the resistor value. This work presents the design and simulation of the memristor emulation circuit, and applies it to a memcomputing application of maze solving using analog parallelism. Furthermore, the memristor emulator circuit can be designed and fabricated using standard commercial CMOS technologies and opens doors to interesting applications in neuromorphic and machine learning circuits.Comment: Submitted to International Symposium of Circuits and Systems (ISCAS) 201

    Best practices for building hardware designs for living computational science applications

    Get PDF
    Scientific computing or Computational science, is a field of study where engineers and scientists use computer simulations to solve equations that model the physical world. In some cases, these equations come from the first principles of physics. In the past, these simulations were run on a single processor machine. However, due to various technological reasons, the performance of these machines are not likely to improve at the same rate as in the past. In order to improve the performance per watt of these simulations, special-purpose hardware accelerators can be used. This work mainly focuses on using FPGA-based hardware accelerators. In order to run these simulations on an FPGA accelerator, the application code needs to be re-factored into software and hardware sections. These faster simulations have motivated scientists to capture more behavior of the physical world. As additional behavior is captured, the application code needs to be re-factored each time, and a significant effort is required to re-build the design. Unfortunately, these multiple cycles of re-design reduces the overall productivity of scientists and engineers. This work proposes a set of hardware design guidelines for changing computational science codes or living computational science codes. These guidelines co-evolve the hardware with the software, reducing the overall effort of re-design and improving productivity. The design guidelines are evaluated for effectiveness, communicability, and broad applicability. Experimental results have shown that the overall re-design effort is reduced, and these guidelines are broadly applicable to a wide variety of scientific computing applications

    Reconfigurable photonic crystal

    Get PDF
    Tunability and programmability are highly demanded for silicon photonic integrated circuits (PICs) to expand their applications in the next-generation photonics. The main objective of this thesis is to develop several reconfigurable and programmable photonic crystal (PC) devices. In Chapter 2, we developed a relatively general nanofabrication process for integrating PC devices with movable mechanical components on silicon-on-insulator (SOI) wafers. We also investigated grating coupling technology, to facilitate coupling lights into and out of PC devices. In Chapter 3, we developed an all-optical programmable PC device that integrates digital micromirror device (DMD), photo-responsive LC, and PC technologies. We demonstrated the functionality and programmability of the device, by forming a point-defect cavity, a straight waveguide, and a waveguide bend on the single device. In Chapter 4, we developed two types of reconfigurable PC devices by leveraging the strengths of optical nanobeam and nano-electro-mechanical systems (NEMS) technologies. The first device consists of an array of movable nanobeams. Each nanobeam is an electrostatically tunable photonic element in a PC waveguide. We demonstrated the capability of the device to engineer different photonic bandgaps, by tuning one unit in group of two neighboring nanobeam units, tuning one or two in group of three units, and forming two reconfigurable PCs, on the single device. To achieve a higher-level integration, we also theoretically studied another reconfigurable PC integrating an array of mechanical tunable nanobeams with an array of fixed pillars into the top silicon layer of a SOI wafer. In Chapter 5, we developed two tunable photonic crystal-cantilever cavity (PC3) resonators. The first device has an NEMS cantilever embedded into a L6 cavity in a PC slab. The second device has a similar cantilever to insert into a nanobeam-base waveguide. We studied bending characteristics of the cantilever and optical characteristics of these two devices at different applied voltages. In Chapter 6, we conducted theoretical investigation on a nano-opto-mechanical reconfigurable PIC device consisting of an array of silicon plugs and a 2D PC slab. We theoretically demonstrated that a point-defect cavity, a line-defect waveguide, and a waveguide bend can be configured in the PC slab, by inserting different plugs into an air hole, a straight line of holes, and an L-shape line of holes

    Deep Learning Reveals Underlying Physics of Light-matter Interactions in Nanophotonic Devices

    Full text link
    In this paper, we present a deep learning-based (DL-based) algorithm, as a purely mathematical platform, for providing intuitive understanding of the properties of electromagnetic (EM) wave-matter interaction in nanostructures. This approach is based on using the dimensionality reduction (DR) technique to significantly reduce the dimensionality of a generic EM wave-matter interaction problem without imposing significant error. Such an approach implicitly provides useful information about the role of different features (or design parameters such as geometry) of the nanostructure in its response functionality. To demonstrate the practical capabilities of this DL-based technique, we apply it to a reconfigurable optical metadevice enabling dual-band and triple-band optical absorption in the telecommunication window. Combination of the proposed approach with existing commercialized full-wave simulation tools offers a powerful toolkit to extract basic mechanisms of wave-matter interaction in complex EM devices and facilitate the design and optimization of nanostructures for a large range of applications including imaging, spectroscopy, and signal processing. It is worth to mention that the demonstrated approach is general and can be used in a large range of problems as long as enough training data can be provided
    • …
    corecore