Search CORE

41 research outputs found

Multigrain Affinity for Heterogeneous Work Stealing

Author: Carribault Patrick
Cohen Albert
Vet Jean-Yves
Publication venue: HAL CCSD
Publication date: 23/01/2012
Field of study

International audienceIn a parallel computing context, peak performance is hard to reach with irregular applications such as sparse linear algebra operations. It requires dynamic adjustments to automatically balance the workload between several processors. The problem becomes even more complicated when an architecture contains processing units with radically different computing capabilities. We present a hierarchical scheduling scheme designed to harness several CPUs and a GPU. It is built on a two-level work stealing mechanism tightly coupled to a software-managed cache. We show that our approach is well suited to dynamically control heterogeneous architectures, while taking advantage of a reduction of data transfers

INRIA a CCSD electronic archive server

HAL-CEA

Studies on automatic parallelization for heterogeneous and homogeneous multicore processors

Author: Hayashi Akihiro
Publication venue
Publication date: 01/01/2012
Field of study

制度:新 ; 報告番号:甲3537号 ; 学位の種類:博士(工学) ; 授与年月日:2012/2/25 ; 早大学位記番号:新587

Waseda University Repository

Factory: A n Object-Oriented Parallel Programming Substrate for Deep Multiprocessors

Author: Schneider Scott Arthur
Publication venue: W&M ScholarWorks
Publication date: 01/01/2005
Field of study

College of William & Mary: W&M Publish

Studies on parallelism improvement and power reduction in multigrain automatic parallelizing compiler

Author: 白子準
Publication venue: [出版者不明]
Publication date: 01/03/2007
Field of study

制度:新 ; 文部省報告番号:甲2421号 ; 学位の種類:博士(工学) ; 授与年月日:2007/3/15 ; 早大学位記番号:新450

Waseda University Repository

Multigrain Affinity for Heterogeneous Work Stealing

Author: Carribault Patrick
Cohen Albert
Vet Jean-Yves
Publication venue: HAL CCSD
Publication date: 23/01/2012
Field of study

INRIA a CCSD electronic archive server

Factory: An Object-Oriented Parallel Programming Substrate for Deep Multiprocessors

Author: L. Hammond
M. Frigo
S. Shah
Z. Radović
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

Crossref

SIMPLE: A Methodology for Programming High Performance Algorithms on Clusters of Symmetric Multiprocessors (SMPs) (Preliminary Version)

Author: Bader D.A.
Publication venue: UNM Digital Repository
Publication date: 01/11/1998
Field of study

Multigrain shared memory

Author: Yeung Donald, 1968-
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/1998
Field of study

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1998.Includes bibliographical references (p. 197-203).by Donald Yeung.Ph.D

CiteSeerX

DSpace@MIT

Algorithm/Architecture Co-Exploration of Visual Computing: Overview and Future Perspectives

Author: Chen Yen-Kuang
Lee Gwo Giun (Chris)
Mattavelli Marco
S. Jang Euee
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 21/01/2010
Field of study

Concurrently exploring both algorithmic and architectural optimizations is a new design paradigm. This survey paper addresses the latest research and future perspectives on the simultaneous development of video coding, processing, and computing algorithms with emerging platforms that have multiple cores and reconfigurable architecture. As the algorithms in forthcoming visual systems become increasingly complex, many applications must have different profiles with different levels of performance. Hence, with expectations that the visual experience in the future will become continuously better, it is critical that advanced platforms provide higher performance, better flexibility, and lower power consumption. To achieve these goals, algorithm and architecture co-design is significant for characterizing the algorithmic complexity used to optimize targeted architecture. This paper shows that seamless weaving of the development of previously autonomous visual computing algorithms and multicore or reconfigurable architectures will unavoidably become the leading trend in the future of video technology

Infoscience - École polytechnique fédérale de Lausanne