6 research outputs found
Recommended from our members
Large Scale Simulations of a Ship Power System with Energy Storage and Multiple Directed Energy Loads
A large scale Simulink® simulation model of the electrical power system of a ship is described. The model includes the major systems onboard, from prime movers to the actual loads, and incorporates several intermittent duty loads along with continuous duty loads. Three types of energy storage systems have been modeled: flywheels, batteries, and capacitors. Therefore, critical issues like stability, reconfigurability, fault management, and minimum rating of energy storage units can be studied. The presence of energy storage has also allowed the study of how these systems can be used to improve the overall performance of the ship. Typical functions, for example, would include load leveling of the power bus, an uninterruptible power supply function for sections of the ship, and the potential for fuel efficiency improvement by reducing the number of turbines being run at fractional loads to fewer being run closer to their optimal specific fuel efficiency point. Typical outputs of the simulations are presented and discussed. In addition, several challenges presented by the scale of the simulations, the software platform used, and the underlying modeling philosophy are discussed with an outlook toward future improvements both in the computing hardware and in the programming methods.Center for Electromechanic
Fast Simulation of Programmable Network Forwarding Plane Devices
With the evolution of the Internet, the processing of packets at the routers while providing
flexibility in deploying new protocols and services at the same time has become a major concern.
Programmable forwarding elements with high processing capability have emerged as a solution.
But the main challenge is to find the optimal hardware architecture while taking into account
constraints such as different packet processing functions, task scheduling options, electrical
power consumption and providing quality-of-service (QoS) guarantees. Therefore, it is essential
to investigate methods that help in identifying limitations and bottlenecks before physical
fabrication. Having an appropriate model provides designers a progressive path to narrow the
design space and establish credible and feasible alternatives before deciding on an
implementation.
In this thesis, we propose a flexible and fast instruction accurate host-compiled simulator to
make it possible to explore wide ranges of architectures and application scenarios to find the
optimal configuration that meets given performance, throughput and latency for programmable
forwarding elements. Application developers can use the simulator as a virtual prototype to
simulate and debug their applications before hardware availability. Moreover, forwarding device
architects can use simulator to evaluate the trade-offs between different hardware/software
design decisions
Cycle-accurate modeling of multicore processors on FPGAs
Thesis (Ph. D.)--Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2013.Cataloged from PDF version of thesis.Includes bibliographical references (pages 169-176).We present a novel modeling methodology which enables the generation of a high-performance, cycle-accurate simulator from a cycle-level specification of the target design. We describe Arete, a full-system multicore processor simulator, developed using our modeling methodology. We provide details on Arete's resource-efficient and high-performance implementation on multiple FPGA platforms, and the architectural experiments performed using it. We present clear evidence that the use of simplified models in architectural studies can lead to wrong conclusions. Through two experiments performed using both cycle-accurate and simplified models, we show that on one hand there are substantial quantitative and qualitative differences in results, and on the other, the results match quite well.by Asif Imtiaz Khan.Ph.D
Speeding up dynamic compilation: concurrent and parallel dynamic compilation
The main challenge faced by a dynamic compilation system is to detect and
translate frequently executed program regions into highly efficient native code
as fast as possible. To efficiently reduce dynamic compilation latency, a dynamic
compilation system must improve its workload throughput, i.e. compile
more application hotspots per time. As time for dynamic compilation
adds to the overall execution time, the dynamic compiler is often decoupled
and operates in a separate thread independent from the main execution loop
to reduce the overhead of dynamic compilation.
This thesis proposes innovative techniques aimed at effectively speeding
up dynamic compilation. The first contribution is a generalised region
recording scheme optimised for program representations that require dynamic
code discovery (e.g. binary program representations). The second contribution
reduces dynamic compilation cost by incrementally compiling several
hot regions in a concurrent and parallel task farm. Altogether the combination
of generalised light-weight code discovery, large translation units,
dynamic work scheduling, and concurrent and parallel dynamic compilation
ensures timely and efficient processing of compilation workloads. Compared
to state-of-the-art dynamic compilation approaches, speedups of up to 2.08
are demonstrated for industry standard benchmarks such as BioPerf, Spec
Cpu 2006, and Eembc.
Next, innovative applications of the proposed dynamic compilation scheme
to speed up architectural and micro-architectural performance modelling are
demonstrated. The main contribution in this context is to exploit runtime
information to dynamically generate optimised code that accurately models
architectural and micro-architectural components. Consequently, compilation
units are larger and more complex resulting in increased compilation
latencies. Large and complex compilation units present an ideal use case for
our concurrent and parallel dynamic compilation infrastructure. We demonstrate
that our novel micro-architectural performance modelling is faster than
state-of-the-art Fpga-based simulation, whilst providing the same level of
accuracy