Developing Efficient Discrete Simulations on Multicore and GPU Architectures
In this paper we show how to efficiently implement parallel discrete simulations on multicore and GPU architectures through a real example of an application: a cellular automata model of laser dynamics. We describe the techniques employed to build and optimize the implementations using the OpenMP and CUDA frameworks. We have evaluated the performance on two hardware platforms that represent different target market segments: a high-end platform for scientific computing, using an Intel Xeon Platinum 8259CL server with 48 cores and an NVIDIA Tesla V100 GPU, both running on the Amazon Web Services (AWS) cloud; and a consumer-oriented platform, using an Intel Core i9-9900K CPU and an NVIDIA GeForce GTX 1050 Ti GPU. Performance results were compared and analyzed in detail. We show that excellent performance and scalability can be obtained on both platforms, and we identify several important issues that degrade performance on them. We also found that current multicore CPUs with large core counts can deliver performance very close to that of GPUs, and even identical in some cases. Funding: Ministerio de Economía, Industria y Competitividad, Gobierno de España (MINECO), and the Agencia Estatal de Investigación (AEI) of Spain, cofinanced by FEDER funds (EU), TIN2017-89842
Lattice QCD based on OpenCL
We present an OpenCL-based Lattice QCD application using a heatbath algorithm
for the pure gauge case and Wilson fermions in the twisted mass formulation.
The implementation is platform independent and can be used on AMD or NVIDIA
GPUs, as well as on classical CPUs. On the AMD Radeon HD 5870 our double
precision dslash implementation performs at 60 GFLOPS over a wide range of
lattice sizes. The hybrid Monte-Carlo presented reaches a speedup of four over
the reference code running on a server CPU. Comment: 19 pages, 11 figures
System Design of Internet-of-Things for Residential Smart Grid
The Internet of Things (IoT) envisions integrating and coordinating real-world
objects, and having them communicate and collaborate, in order to perform daily
tasks in a more intelligent and efficient manner. To realize this vision, this
paper studies the design of a large-scale IoT system for smart grid applications,
which comprises a large number of home users and requires a fast
response time. In particular, we focus on the messaging protocol of a universal
IoT home gateway, where our cloud enabled system consists of a backend server,
unified home gateway (UHG) at the end users, and user interface for mobile
devices. We discuss the features of such an IoT system to support a large-scale
deployment with a UHG and real-time residential smart grid applications. Based
on the requirements, we design an IoT system using the XMPP protocol and
implement it in a testbed for energy management applications. To show the
effectiveness of the designed testbed, we present some results using the
proposed IoT architecture. Comment: 10 pages, 6 figures, journal paper