Search CORE

30 research outputs found

CUDA Fortran による確率的ボラティリティ変動モデルのGPU計算

Author: GPU Computing of Stochastic Volatility Model with CUDA Fortran高石哲弥
Publication venue: 広島経済大学経済学会
Publication date: 30/09/2014
Field of study

Disengaged Scheduling for Fair, Protected Access to Fast Computational Accelerators

Author: Dwarakinath A.
GPU
Gupta V.
Intel Corporation
Kato S.
Kato S.
Kyriazis G.
Menychtas K.
Shen K.
Soares L.
Publication venue
Publication date: 04/09/2014
Field of study

Today’s operating systems treat GPUs and other computational accelerators as if they were simple devices, with bounded and predictable response times. With accelerators assuming an increasing share of the workload on modern machines, this strategy is already problematic, and likely to become untenable soon. If the operating system is to enforce fair sharing of the machine, it must assume responsibility for accelerator scheduling and resource management. Fair, safe scheduling is a particular challenge on fast accelerators, which allow applications to avoid kernel-crossing overhead by interacting directly with the device. We propose a disengaged scheduling strategy in which the kernel intercedes between applications and the accelerator on an infrequent basis, to monitor their use of accelerator cycles and to determine which applications should be granted access over the next time interval. Our strategy assumes a well defined, narrow interface exported by the accelerator. We build upon such an interface, systematically inferred for the latest Nvidia GPUs. We construct several example schedulers, including Disengaged Timeslice with overuse control that guarantees fairness and Disengaged Fair Queueing that is effective in limiting resource idleness, but probabilistic. Both schedulers ensure fair sharing of the GPU, even among uncooperative or adversarial applications; Disengaged Fair Queueing incurs a 4 % overhead on average (max 18%) compared to direct devic

CiteSeerX

Crossref

Procesamiento distribuido y paralelo de bajo costo basado en cloud&movil

Author: De Giusti Armando Eduardo
Distributed Systems
eficiencia energética
GPU
Naiouf Marcelo
Petrocelli David
Publication venue
Publication date: 01/01/2017
Field of study

Actualmente el procesamiento intensivo se realiza a través de estructuras HPC híbridas (Grid, Cluster, Cloud) utilizando procesadores de arquitectura x86 y GPUs Nvidia, AMD o Intel, incurriendo en altísimos costos económicos y energéticos. Sin embargo, gracias a la evolución constante del hardware y con el advenimiento de los dispositivos móviles/microcomputadores con CPUs/GPUs ARM acompañado de la masividad de los mismos es posible pensar en una solución de bajo costo y consumo energético para solventar este tipo de problemas. Estos dispositivos incrementan su capacidad, eficiencia, estabilidad y potencia a diario, mientras ganan mercado, conservando un bajo costo, tamaño y consumo energético. A su vez, presentan lapsos de ociosidad, lo que representa una gran capacidad de recursos desaprovechados. Por tal motivo, el objetivo de este trabajo es presentar un prototipo de arquitectura distribuida dinámica, escalable y redundante geográficamente para explotar esta disponibilidad y realizar procesamiento intensivo aprovechando recursos y reduciendo costos.XVIII Workshop de Procesamiento Distribuido y Paralelo (WPDP

Centro de Servicios en Gestión de Información

Cerita Cinta INDONESIA 45 Cerpen Pilihan

Author: 45 Penulis GPU
Publication venue: Gramedia
Publication date: 01/01/2014
Field of study

www.digilib.unipdu.ac.id

CERITA CINTA INDONESIA : Kumpulan Cerita Pendek

Author: 45 Penulis GPU
Publication venue: PT GRAMEDIA PUSTAKA UTAMA
Publication date: 01/01/2014
Field of study

www.digilib.unipdu.ac.id

Track pattern-recognition on GPGPUs in the LHCb experiment

Author: Gallorini Stefano
GPU@LHCb Trigger working group Collaboration
Publication venue: Deutsches Elektronen-Synchrotron, DESY
Publication date: 01/01/2015
Field of study

The LHCb experiment is entering in its upgrading phase, with its detector and read-out system re-designed to cope with the increased LHC energy after the long shutdown of 2018. In this upgrade, a trigger-less data acquisition is being developed to read-out the full detector at the bunch-crossing rate of 40 MHz. In particular, the High Level Trigger (HLT) system has to be heavily revised. Since the small LHCb event size (about 100 kB), many-core architectures such as General Purpose Graphics Processing Units (GPGPUs) and multi-core CPUs can be used to process many events in parallel for real-time selection, and may offer a solution for reducing the cost of the HLT farm. Track reconstruction and vertex finding are the more time-consuming applications running in HLT and therefore are the first to be ported on many-core. In this talk we present our implementation of the existing tracking algorithms on GPGPU, discussing in detail the case of the VErtex LOcator detector (VELO), and we show the achieved performances. We discuss also other tracking algorithms that can be used in view of the LHCb upgra

DESY Publication Database

DESY