2,303 research outputs found
Numerical solutions of differential equations on FPGA-enhanced computers
Conventionally, to speed up scientific or engineering (S&E) computation programs
on general-purpose computers, one may elect to use faster CPUs, more memory, systems
with more efficient (though complicated) architecture, better software compilers, or even
coding with assembly languages. With the emergence of Field Programmable Gate
Array (FPGA) based Reconfigurable Computing (RC) technology, numerical scientists
and engineers now have another option using FPGA devices as core components to
address their computational problems. The hardware-programmable, low-cost, but
powerful “FPGA-enhanced computer” has now become an attractive approach for many
S&E applications.
A new computer architecture model for FPGA-enhanced computer systems and its
detailed hardware implementation are proposed for accelerating the solutions of
computationally demanding and data intensive numerical PDE problems. New FPGAoptimized
algorithms/methods for rapid executions of representative numerical methods
such as Finite Difference Methods (FDM) and Finite Element Methods (FEM) are
designed, analyzed, and implemented on it. Linear wave equations based on seismic
data processing applications are adopted as the targeting PDE problems to demonstrate
the effectiveness of this new computer model. Their sustained computational
performances are compared with pure software programs operating on commodity CPUbased
general-purpose computers. Quantitative analysis is performed from a hierarchical
set of aspects as customized/extraordinary computer arithmetic or function units, compact but flexible system architecture and memory hierarchy, and hardwareoptimized
numerical algorithms or methods that may be inappropriate for conventional
general-purpose computers. The preferable property of in-system hardware
reconfigurability of the new system is emphasized aiming at effectively accelerating the
execution of complex multi-stage numerical applications. Methodologies for
accelerating the targeting PDE problems as well as other numerical PDE problems, such
as heat equations and Laplace equations utilizing programmable hardware resources are
concluded, which imply the broad usage of the proposed FPGA-enhanced computers
Declaratively programmable ultra-low latency audio effects processing on FPGA
WaveCore is a coarse-grained reconfigurable processor architecture, based on data-flow principles. The processor architecture consists of a scalable and interconnected cluster of Processing Units (PU), where each PU embodies a small floating-point RISC processor. The processor has been designed in technology-independent VHDL and mapped on a commercially available FPGA development platform. The programming methodology is declarative, and optimized to the application domain of audio and acoustical modeling. A benchmark demonstrator algorithm (guitar-model, comprehensive effects-gear box, and distortion/cabinet model) has been developed and applied to the WaveCore development platform. The demonstrator algorithm proved that WaveCore is very well suited for efficient modeling of complex audio/acoustical algorithms with negligible latency and virtually zero jitter. An experimental Faust-to-WaveCore compiler has shown the feasibility of automated compilation of Faust code to the WaveCore processor target
A Compact CMOS Memristor Emulator Circuit and its Applications
Conceptual memristors have recently gathered wider interest due to their
diverse application in non-von Neumann computing, machine learning,
neuromorphic computing, and chaotic circuits. We introduce a compact CMOS
circuit that emulates idealized memristor characteristics and can bridge the
gap between concepts to chip-scale realization by transcending device
challenges. The CMOS memristor circuit embodies a two-terminal variable
resistor whose resistance is controlled by the voltage applied across its
terminals. The memristor 'state' is held in a capacitor that controls the
resistor value. This work presents the design and simulation of the memristor
emulation circuit, and applies it to a memcomputing application of maze solving
using analog parallelism. Furthermore, the memristor emulator circuit can be
designed and fabricated using standard commercial CMOS technologies and opens
doors to interesting applications in neuromorphic and machine learning
circuits.Comment: Submitted to International Symposium of Circuits and Systems (ISCAS)
201
Best practices for building hardware designs for living computational science applications
Scientific computing or Computational science, is a field of study where engineers and scientists use computer simulations to solve equations that model the physical world. In some cases, these equations come from the first principles of physics. In the past, these simulations were run on a single processor machine. However, due to various technological reasons, the performance of these machines are not likely to improve at the same rate as in the past. In order to improve the performance per watt of these simulations, special-purpose hardware accelerators can be used. This work mainly focuses on using FPGA-based hardware accelerators. In order to run these simulations on an FPGA accelerator, the application code needs to be re-factored into software and hardware sections. These faster simulations have motivated scientists to capture more behavior of the physical world. As additional behavior is captured, the application code needs to be re-factored each time, and a significant effort is required to re-build the design. Unfortunately, these multiple cycles of re-design reduces the overall productivity of scientists and engineers.
This work proposes a set of hardware design guidelines for changing computational science codes or living computational science codes. These guidelines co-evolve the hardware with the software, reducing the overall effort of re-design and improving productivity. The design guidelines are evaluated for effectiveness, communicability, and broad applicability. Experimental results have shown that the overall re-design effort is reduced, and these guidelines are broadly applicable to a wide variety of scientific computing applications
Reconfigurable photonic crystal
Tunability and programmability are highly demanded for silicon photonic integrated circuits (PICs) to expand their applications in the next-generation photonics. The main objective of this thesis is to develop several reconfigurable and programmable photonic crystal (PC) devices.
In Chapter 2, we developed a relatively general nanofabrication process for integrating PC devices with movable mechanical components on silicon-on-insulator (SOI) wafers. We also investigated grating coupling technology, to facilitate coupling lights into and out of PC devices.
In Chapter 3, we developed an all-optical programmable PC device that integrates digital micromirror device (DMD), photo-responsive LC, and PC technologies. We demonstrated the functionality and programmability of the device, by forming a point-defect cavity, a straight waveguide, and a waveguide bend on the single device.
In Chapter 4, we developed two types of reconfigurable PC devices by leveraging the strengths of optical nanobeam and nano-electro-mechanical systems (NEMS) technologies. The first device consists of an array of movable nanobeams. Each nanobeam is an electrostatically tunable photonic element in a PC waveguide. We demonstrated the capability of the device to engineer different photonic bandgaps, by tuning one unit in group of two neighboring nanobeam units, tuning one or two in group of three units, and forming two reconfigurable PCs, on the single device. To achieve a higher-level integration, we also theoretically studied another reconfigurable PC integrating an array of mechanical tunable nanobeams with an array of fixed pillars into the top silicon layer of a SOI wafer.
In Chapter 5, we developed two tunable photonic crystal-cantilever cavity (PC3) resonators. The first device has an NEMS cantilever embedded into a L6 cavity in a PC slab. The second device has a similar cantilever to insert into a nanobeam-base waveguide. We studied bending characteristics of the cantilever and optical characteristics of these two devices at different applied voltages.
In Chapter 6, we conducted theoretical investigation on a nano-opto-mechanical reconfigurable PIC device consisting of an array of silicon plugs and a 2D PC slab. We theoretically demonstrated that a point-defect cavity, a line-defect waveguide, and a waveguide bend can be configured in the PC slab, by inserting different plugs into an air hole, a straight line of holes, and an L-shape line of holes
Deep Learning Reveals Underlying Physics of Light-matter Interactions in Nanophotonic Devices
In this paper, we present a deep learning-based (DL-based) algorithm, as a
purely mathematical platform, for providing intuitive understanding of the
properties of electromagnetic (EM) wave-matter interaction in nanostructures.
This approach is based on using the dimensionality reduction (DR) technique to
significantly reduce the dimensionality of a generic EM wave-matter interaction
problem without imposing significant error. Such an approach implicitly
provides useful information about the role of different features (or design
parameters such as geometry) of the nanostructure in its response
functionality. To demonstrate the practical capabilities of this DL-based
technique, we apply it to a reconfigurable optical metadevice enabling
dual-band and triple-band optical absorption in the telecommunication window.
Combination of the proposed approach with existing commercialized full-wave
simulation tools offers a powerful toolkit to extract basic mechanisms of
wave-matter interaction in complex EM devices and facilitate the design and
optimization of nanostructures for a large range of applications including
imaging, spectroscopy, and signal processing. It is worth to mention that the
demonstrated approach is general and can be used in a large range of problems
as long as enough training data can be provided
- …