Search CORE

25 research outputs found

Simultaneous Branch and Warp Interweaving for Sustained GPU Performance

Author: Brunie Nicolas
Collange Caroline
Diamos Gregory
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 21/11/2011
Field of study

International audienceSingle-Instruction Multiple-Thread (SIMT) micro-architectures implemented in Graphics Processing Units (GPUs) run fine-grained threads in lockstep by grouping them into units, referred to as warps, to amortize the cost of instruction fetch, decode and control logic over multiple execution units. As individual threads take divergent execution paths, their processing takes place sequentially, defeating part of the efficiency advantage of SIMD execution. We present two complementary techniques that mitigate the impact of thread divergence on SIMT micro-architectures. Both techniques relax the SIMD execution model by allowing two distinct instructions to be scheduled to disjoint subsets of the the same row of execution units, instead of one single instruction. They increase flexibility by providing more thread grouping opportunities than SIMD, while preserving the affinity between threads to avoid introducing extra memory divergence. We consider (1) co-issuing instructions from different divergent paths of the same warp and (2) co-issuing instructions from different warps. To support (1), we introduce a novel thread reconvergence technique that ensures threads are run back in lockstep at control-flow reconvergence points without hindering their ability to run branches in parallel. We propose a lane shuffling technique to allow solution (2) to benefit from inter-warp correlations in divergence patterns. The combination of all these techniques improves performance by 23% on a set of regular GPGPU applications and by 40% on irregular applications, while maintaining the same instruction-fetch and processing-unit resource requirements as the contemporary Fermi GPU architecture

HAL-ENS-LYON

Crossref

INRIA a CCSD electronic archive server

Hal-Diderot

HAL-Rennes 1

Federated Benchmarking of Medical Artificial Intelligence With MedPerf

Author: Abid Abubakar
AI4SafeChole Consortium
Albrecht Jacob
Aristizabal Alejandro
Awad Mark M
Baid Ujjwal
Bakas Spyridon
Bala Srini
Bergquist Timothy
Beutel Daniel J
Bittorf Victor
BraTS-2020 Consortium
Chaudhari Akshay
Chowdhury Alexander
Chung Verena
Coleman Cody
Desinghu Bala
Diamos Gregory
Dimitriadis Dimitrios
Dutta Debo
Eddy James
Feddema Diane
Ferro Moraes Cassiano
FeTS Consortium
Fu Huazhu
Fursin Grigori
George Johnu
Goh Rick Siow Mong
Guo Junyi
Howard Jeremy P
Huang Xinyuan
Johnson Jason M
Kanter David
Karargyris Alexandros
Kashyap Satyananda
Kassem Hasan
Lane Nicholas
Liu Yong
Loda Massimo
Mallick Indranil
Marchionni Luigi
Mascagni Pietro
Mattson Peter
Mehta Virendra
Miller Geralyn
Nalawade Sahil
Narayana Moorthy Prakash
Natarajan Vivek
Nikolov Nikola
Padoy Nicolas
Pati Sarthak
Pekhimenko Gennady
Qian Yiming
Reddi Vijay Janapa
Reddy Puchala Sreekar
Reina G Anthony
Ribalta Pablo
Ricciuti Biagio
Rosenthal Jacob
Rosenthal Michael
Samineni Soujanya
Sanseviero Omar
Sengupta Eshna
Shah Prashant
Sheller Micah J
Singh Abhishek
Talby David
Thiagarajan Jayaraman J
Tunstall Lewis
Umeton Renato
Wolf Thomas
Wuest Anna
Xenochristou Maria
Xu Daguang
Xu Xinxing
Yadav Poonam
Zenk Maximilian
Publication venue: DigitalCommons@TMC
Publication date: 01/07/2023
Field of study

Medical artificial intelligence (AI) has tremendous potential to advance healthcare by supporting and contributing to the evidence-based practice of medicine, personalizing patient treatment, reducing costs, and improving both healthcare provider and patient experience. Unlocking this potential requires systematic, quantitative evaluation of the performance of medical AI models on large-scale, heterogeneous data capturing diverse patient populations. Here, to meet this need, we introduce MedPerf, an open platform for benchmarking AI models in the medical domain. MedPerf focuses on enabling federated evaluation of AI models, by securely distributing them to different facilities, such as healthcare organizations. This process of bringing the model to the data empowers each facility to assess and verify the performance of AI models in an efficient and human-supervised process, while prioritizing privacy. We describe the current challenges healthcare and AI communities face, the need for an open platform, the design philosophy of MedPerf, its current implementation status and real-world deployment, our roadmap and, importantly, the use of MedPerf with multiple international institutions within cloud-based technology and on-premises scenarios. Finally, we welcome new contributions by researchers and organizations to further strengthen MedPerf as an open benchmarking platform

DigitalCommons@The Texas Medical Center