37 research outputs found

    The TheLMA project: Multi-GPU Implementation of the Lattice Boltzmann Method

    In this paper, we describe the implementation of a multi-graphics processing unit (GPU) fluid flow solver based on the lattice Boltzmann method (LBM). The LBM is a novel approach in computational fluid dynamics, with numerous interesting features from computational, numerical, and physical standpoints. Our program is based on CUDA and uses POSIX threads to manage multiple computation devices. Using recently released hardware, our solver may therefore run on eight GPUs in parallel, which allows us to perform simulations at a rather large scale. Performance and scalability are excellent, with speedups over sequential implementations of at least two orders of magnitude. In addition, we discuss tiling and communication issues for present and forthcoming implementations.
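
    A minimal structural sketch of the one-thread-per-GPU design the abstract describes (CUDA kernels driven by POSIX threads): each thread owns one device and one tile of the domain, and all threads meet at a barrier where boundary layers would be exchanged. This is plain C with no CUDA calls; device binding and kernel launches are only marked in comments, and the step count and tile handling are illustrative assumptions, not the TheLMA code.

        /* One POSIX thread per GPU/tile; barriers stand in for the
         * inter-device halo exchange of the real solver. */
        #include <pthread.h>
        #include <stdio.h>

        #define NUM_DEVICES 8   /* the abstract reports runs on up to eight GPUs */
        #define NUM_STEPS   4   /* illustrative */

        static pthread_barrier_t halo_barrier;

        struct worker { int device_id; };

        static void *run_tile(void *arg)
        {
            struct worker *w = arg;
            /* Real solver: cudaSetDevice(w->device_id); allocate this
             * tile's lattice in that device's memory. */
            for (int step = 0; step < NUM_STEPS; ++step) {
                /* Real solver: launch collision/streaming kernels here. */
                printf("device %d: step %d\n", w->device_id, step);
                pthread_barrier_wait(&halo_barrier); /* all tiles finish the step */
                /* Real solver: copy boundary (halo) layers between devices. */
                pthread_barrier_wait(&halo_barrier); /* halos in place, next step */
            }
            return NULL;
        }

        int main(void)
        {
            pthread_t t[NUM_DEVICES];
            struct worker w[NUM_DEVICES];
            pthread_barrier_init(&halo_barrier, NULL, NUM_DEVICES);
            for (int i = 0; i < NUM_DEVICES; ++i) {
                w[i].device_id = i;
                pthread_create(&t[i], NULL, run_tile, &w[i]);
            }
            for (int i = 0; i < NUM_DEVICES; ++i)
                pthread_join(t[i], NULL);
            pthread_barrier_destroy(&halo_barrier);
            return 0;
        }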

    A reduced-reference perceptual image and video quality metric based on edge preservation

    In image and video compression and transmission, it is important to rely on an objective image/video quality metric which accurately represents the subjective quality of processed images and video sequences. In some scenarios, it is also important to evaluate the quality of the received video sequence with minimal reference to the transmitted one. For instance, for quality improvement of video transmission through closed-loop optimisation, the video quality measure can be evaluated at the receiver and provided as feedback information to the system controller. The original image/video sequence, prior to compression and transmission, is not usually available at the receiver side, so the receiver must rely on an objective video quality metric that needs no reference, or only minimal reference, to the original video sequence. The observation that the human eye is very sensitive to edge and contour information of an image underpins the proposal of our reduced-reference (RR) quality metric, which compares edge information between the distorted and the original image. Results highlight that the metric correlates well with subjective observations, also in comparison with commonly used full-reference metrics and with a state-of-the-art RR metric.
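
    A toy sketch of the edge-comparison idea, under stated assumptions: the abstract does not specify the edge detector or the pooling, so this version uses Sobel gradient magnitudes and a normalized correlation between the two edge maps, and it compares full maps, whereas a true reduced-reference deployment would transmit only a compact edge descriptor alongside the video. The image size and test data are invented.

        /* Edge-preservation score between a reference and a distorted image:
         * extract edge maps, then correlate them (1.0 = edges preserved). */
        #include <math.h>
        #include <stdio.h>

        #define W 8
        #define H 8

        /* Sobel gradient magnitude at interior pixels; borders left at 0. */
        static void edge_map(const double img[H][W], double out[H][W])
        {
            for (int y = 0; y < H; ++y)
                for (int x = 0; x < W; ++x)
                    out[y][x] = 0.0;
            for (int y = 1; y < H - 1; ++y) {
                for (int x = 1; x < W - 1; ++x) {
                    double gx = img[y-1][x+1] + 2*img[y][x+1] + img[y+1][x+1]
                              - img[y-1][x-1] - 2*img[y][x-1] - img[y+1][x-1];
                    double gy = img[y+1][x-1] + 2*img[y+1][x] + img[y+1][x+1]
                              - img[y-1][x-1] - 2*img[y-1][x] - img[y-1][x+1];
                    out[y][x] = sqrt(gx * gx + gy * gy);
                }
            }
        }

        /* Normalized correlation between two edge maps; lower values
         * indicate that distortion has degraded edge structure. */
        static double edge_preservation(const double a[H][W], const double b[H][W])
        {
            double dot = 0.0, na = 0.0, nb = 0.0;
            for (int y = 0; y < H; ++y)
                for (int x = 0; x < W; ++x) {
                    dot += a[y][x] * b[y][x];
                    na  += a[y][x] * a[y][x];
                    nb  += b[y][x] * b[y][x];
                }
            return (na > 0 && nb > 0) ? dot / sqrt(na * nb) : 1.0;
        }

        int main(void)
        {
            double ref[H][W], dist[H][W], eref[H][W], edist[H][W];
            /* Toy reference: a vertical step edge; "distorted" = blurred copy. */
            for (int y = 0; y < H; ++y)
                for (int x = 0; x < W; ++x)
                    ref[y][x] = (x < W / 2) ? 0.0 : 1.0;
            for (int y = 0; y < H; ++y)
                for (int x = 0; x < W; ++x) {
                    int xl = x > 0 ? x - 1 : x, xr = x < W - 1 ? x + 1 : x;
                    dist[y][x] = (ref[y][xl] + ref[y][x] + ref[y][xr]) / 3.0;
                }
            edge_map(ref, eref);
            edge_map(dist, edist);
            printf("edge preservation score: %.3f\n",
                   edge_preservation(eref, edist));
            return 0;
        }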

    The All-Data-Based Evolutionary Hypothesis of Ciliated Protists with a Revised Classification of the Phylum Ciliophora (Eukaryota, Alveolata)


    Performance Study of LU Factorization with Low Communication Overhead on Multiprocessors

    In this paper, we make efficient use of asynchronous communications in the LU decomposition algorithm with pivoting and a column-scattered data decomposition to derive precise computational complexities. We then compare these results with experiments on the Intel iPSC/860 and Paragon machines and show that very good performance can be obtained on a ring with asynchronous communications.
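
    The kernel analyzed in this entry (and in its pipelined companion in the next one) is LU decomposition with partial pivoting. Below is a minimal sequential C version whose comments mark where a column-scattered (column-cyclic) distribution would place each column and where the pivot information would be sent asynchronously around the ring; the matrix, its contents, and the processor count are illustrative.

        /* LU decomposition with partial pivoting, stored in place:
         * L below the diagonal (unit diagonal implied), U on and above. */
        #include <math.h>
        #include <stdio.h>

        #define N 4
        #define P 2   /* processors in the hypothetical ring */

        int main(void)
        {
            double a[N][N] = {
                {2, 1, 1, 0}, {4, 3, 3, 1}, {8, 7, 9, 5}, {6, 7, 9, 8}
            };
            int piv[N];

            for (int k = 0; k < N; ++k) {
                /* Under a column-scattered decomposition, column k lives on
                 * processor k % P; that owner selects the pivot and would
                 * send the pivot index and multipliers around the ring,
                 * overlapping the send with its next local update. */
                int p = k;
                for (int i = k + 1; i < N; ++i)
                    if (fabs(a[i][k]) > fabs(a[p][k])) p = i;
                piv[k] = p;
                for (int j = 0; j < N; ++j) {  /* swap rows k and p */
                    double t = a[k][j]; a[k][j] = a[p][j]; a[p][j] = t;
                }
                for (int i = k + 1; i < N; ++i) {
                    a[i][k] /= a[k][k];                /* multiplier, stored in L */
                    for (int j = k + 1; j < N; ++j)    /* trailing update: each   */
                        a[i][j] -= a[i][k] * a[k][j];  /* owner updates its cols  */
                }
            }
            for (int i = 0; i < N; ++i) {
                for (int j = 0; j < N; ++j) printf("%7.3f ", a[i][j]);
                printf("  piv[%d]=%d\n", i, piv[i]);
            }
            return 0;
        }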

    Performance complexity of LU factorization with efficient pipelining and overlap on a multiprocessor

    In this paper, we make efficient use of pipelining in LU decomposition with pivoting and a column-scattered data decomposition to derive precise computational complexity estimates. We then compare these results with experiments on the Intel iPSC/860 and Paragon machines.

    On the Convergence of Computational and Data Grids

    Great advances in high-performance computing have given rise to scientific applications that place large demands on software and hardware infrastructures for both computational and data services. These trends have made it necessary for distributed systems developers, who once treated these elements separately, to acknowledge that computational and data services are tightly coupled and need to be addressed simultaneously. In this article, we compile and discuss several strategies and techniques, such as co-scheduling and co-allocation of computational and data services, dynamic storage capabilities, and quality-of-service, that can be used to help resolve some of the aforementioned issues. We present our interactions with a distributed computing system, NetSolve, and a distributed storage infrastructure, IBP, as a case study of how some of these techniques can be effectively deployed, and offer experimental evidence from early prototypes that validates our motivation and direction.
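
    A toy illustration of the co-scheduling/co-allocation idea discussed here: choose a compute node for a job by weighing compute speed against the cost of staging the job's input from the storage depot that holds it. The cost model, node names, and all numbers are invented for illustration; NetSolve and IBP expose far richer interfaces than this.

        /* Co-scheduling sketch: total time = data staging + computation;
         * the fastest node is not necessarily the best choice. */
        #include <stdio.h>

        struct node {
            const char *name;
            double gflops;               /* compute throughput */
            double mb_per_s_from_depot;  /* bandwidth to the depot holding the data */
        };

        int main(void)
        {
            const double work_gflop = 500.0;   /* job size (assumed) */
            const double input_mb   = 2000.0;  /* input data size (assumed) */
            struct node nodes[] = {
                {"fast-but-far",  8.0,  10.0},
                {"slow-but-near", 2.0, 200.0},
                {"balanced",      4.0,  80.0},
            };
            int best = 0;
            double best_t = 1e300;
            for (int i = 0; i < 3; ++i) {
                double t = input_mb / nodes[i].mb_per_s_from_depot
                         + work_gflop / nodes[i].gflops;
                printf("%-14s estimated %.1f s\n", nodes[i].name, t);
                if (t < best_t) { best_t = t; best = i; }
            }
            printf("schedule on %s (%.1f s)\n", nodes[best].name, best_t);
            return 0;
        }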