Search CORE

10 research outputs found

FPGA acceleration of the phylogenetic likelihood function for Bayesian MCMC inference methods

Author: A Stamataki
A Stamatakis
B Minh
C Than
CL Schoch
D Zwickl
DR Robinson
F de Dinechin
F Ronquist
G Altekar
H Fu
H Schmidt
J Felsenstein
J Felsenstein
J Felsenstein
J Felsenstein
J Williams
Jason D Bakos
JW Spatafora
KH Abed
L Zhuo
M A Suchard
M Binder
ME Alfaro
ML Berbee
N Alachiotis
N Alachiotis
R Bauer
R-C Li
SM Barns
Stephanie Zierke
T Hamada
T Keane
TST Mak
X Feng
Z Yang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Background Likelihood (ML)-based phylogenetic inference has become a popular method for estimating the evolutionary relationships among species based on genomic sequence data. This method is used in applications such as RAxML, GARLI, MrBayes, PAML, and PAUP. The Phylogenetic Likelihood Function (PLF) is an important kernel computation for this method. The PLF consists of a loop with no conditional behavior or dependencies between iterations. As such it contains a high potential for exploiting parallelism using micro-architectural techniques. In this paper, we describe a technique for mapping the PLF and supporting logic onto a Field Programmable Gate Array (FPGA)-based co-processor. By leveraging the FPGA\u27s on-chip DSP modules and the high-bandwidth local memory attached to the FPGA, the resultant co-processor can accelerate ML-based methods and outperform state-of-the-art multi-core processors. Results We use the MrBayes 3 tool as a framework for designing our co-processor. For large datasets, we estimate that our accelerated MrBayes, if run on a current-generation FPGA, achieves a 10× speedup relative to software running on a state-of-the-art server-class microprocessor. The FPGA-based implementation achieves its performance by deeply pipelining the likelihood computations, performing multiple floating-point operations in parallel, and through a natural log approximation that is chosen specifically to leverage a deeply pipelined custom architecture. Conclusions Heterogeneous computing, which combines general-purpose processors with special-purpose co-processors such as FPGAs and GPUs, is a promising approach for high-performance phylogeny inference as shown by the growing body of literature in this field. FPGAs in particular are well-suited for this task because of their low power consumption as compared to many-core processors and Graphics Processor Units (GPUs)

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Scholar Commons - Institutional Repository of the University of South Carolina

Mechanizing conventional SSA for a verified destruction with coalescing

Author: Demange D.
Dupont de Dinechin B.
Sreedhar V.
Publication venue: HAL CCSD
Publication date: 17/03/2016
Field of study

International audienceModern optimizing compilers rely on the Static Single Assignment (SSA) form to make optimizations fast and simpler to implement. From a semantic perspective, the SSA form is nowadays fairly well understood, as witnessed by recent advances in the field of formally verified compilers. The destruction of the SSA form, however, remains a difficult problem, even in a non-verified environment. In fact, the out-of-SSA transformation has been revisited, for correctness and performance issues, up until recently. Unsurprisingly, state-of-the-art compiler formalizations thus either completely ignore, only partially handle, or implement naively the SSA destruction. This paper reports on the implementation of such a destruction within a verified compiler. We formally define and prove the properties of the generation of Conventional SSA (CSSA) which make its destruction simple to implement and prove. Second, we implement and prove correct a coalescing destruction of CSSA, à la Boissinot et al., where variables can be coalesced according to a refined notion of interference. This formalization work extends the CompCertSSA compiler, whose correctness proof is mechanized in the Coq proof assistant. Our CSSA-based, coalescing destruction removes, on average , more than 99% of introduced copies, and leads to encouraging results concerning spilling during post-SSA register allocation

HAL-CentraleSupelec

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

Novel Arithmetics in Deep Neural Networks Signal Processing for Autonomous Driving: Challenges and Opportunities

Author: Cococcioni M.
De Dinechin B. D.
Rossi F.
Ruffaldi E.
Saponara S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2021
Field of study

This article focuses on the trends, opportunities, and challenges of novel arithmetic for deep neural network (DNN) signal processing, with particular reference to assisted- and autonomous driving applications. Due to strict constraints in terms of the latency, dependability, and security of autonomous driving, machine perception (i.e., detection and decision tasks) based on DNNs cannot be implemented by relying on remote cloud access. These tasks must be performed in real time in embedded systems on board the vehicle, particularly for the inference phase (considering the use of DNNs pretrained during an offline step). When developing a DNN computing platform, the choice of the computing arithmetic matters. Moreover, functional safe applications, such as autonomous driving, impose severe constraints on the effect that signal processing accuracy has on the final rate of wrong detection/decisions. Hence, after reviewing the different choices and tradeoffs concerning arithmetic, both in academia and industry, we highlight the issues in implementing DNN accelerators to achieve accurate and lowcomplexity processing of automotive sensor signals (the latter coming from diverse sources, such as cameras, radar, lidar, and ultrasonics). The focus is on both general-purpose operations massively used in DNNs, such as multiplying, accumulating, and comparing, and on specific functions, including, for example, sigmoid or hyperbolic tangents used for neuron activation

Archivio della Ricerca - Università di Pisa

Efficient Method for Periodic Task Scheduling with Storage Requirement Minimization

Author: A. Schrijver
A.E. Eichenberger
B..D.. Dinechin de
C. Hanen
D. Fimmel
H.W. Kuhn
M.M. Strout
W. Thies
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Crossref

Verifying floating-point programs with constraint programming and abstract interpretation techniques

Author: B Botella
Claude Michel
D Goldberg
F Dinechin de
H Collavizza
L Granvilliers
M Barnett
Michel Rueher
Olivier Ponsini
PV Hentenryck
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

High Performance and Low Power Fixed-point Special Function Unit for Mobile Vertex Processors

Author: Chung K and Kim L-S
De Caro D
De Dinechin F and Tisserand A
Kim H
Nagayama S
Nam B
Nam Byeong-gyu and Yoo Hoi-jun
Pineiro J A
Schulte M J and Swartzlander E E
Woo Jeong-Ho
Yu Chang-Hyo
Publication venue: 'China Science Publishing & Media Ltd.'
Publication date
Field of study

Crossref

Worst case analysis of decomposed software pipelining for cyclic unitary RCPSP with precedence delays

Author: A. Darte
A. Dasdan
A. Munier
Abir Benabid
B. Dupont de Dinechin
B. R. Rau
B. R. Rau
C. E. Leiserson
C. Hanen
Claire Hanen
D. Alcaide
E. Levner
E. Levner
F. Gasperoni
H. C. Chou
J. Llosa
J. M. Proth
J. Wang
M. Lam
P. Brucker
P. Y. Calland
R. A. Huff
V. H. Allan
V. Kats
Y. Robert
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A space oddity: Geographic and specific modulation of migration in Eudyptes penguins

Author: A Clarke
A Giret
A Giret
A Raya Rey
A Tsoar
AES Kemp
AJ Baker
Azwianewi B. Makhado
B Robson
C Calenge
C Cotté
C Freitas
C Péron
CA Bost
CA Bost
Charles-André Bost
CL Hull
CL Hull
CL Hull
CR Brown
CW Clark
D Grémillet
David Pinaud
DK Cairns
Eric J. Woehler
G Ballard
H Dingle
H Weimerskirch
H Weimerskirch
I McDougall
IJ Staniland
IM Belkin
J Banks
J González-Solís
J González-Solís
JA Clarke
JA Green
JB Thiebot
JB Thiebot
JB Thiebot
JB Thiebot
JC Stahl
Jean-Baptiste Thiebot
K Pütz
KA Cresswell
M de Dinechin
M Frederiksen
MD Brooke
P Berthold
P Berthold
P Pinet
P Ward
Philip N. Trathan
PN Trathan
R Andersen
RJM Crawford
RJM Crawford
Robert J. M. Crawford
RP Wilson
RP Wilson
RP Wilson
RP Wilson
S Becquey
T Alerstam
T Guilford
T Mueller
V Ridoux
VL Friesen
WJ Sutherland
WZ Trivelpiece
Y Cherel
Y Cherel
Y Tremblay
Yves Cherel
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Post-breeding migration in land-based marine animals is thought to offset seasonal deterioration in foraging or other important environmental conditions at the breeding site. However the inter-breeding distribution of such animals may reflect not only their optimal habitat, but more subtle influences on an individual’s migration path, including such factors as the intrinsic influence of each locality’s paleoenvironment, thereby influencing animals’ wintering distribution. In this study we investigated the influence of the regional marine environment on the migration patterns of a poorly known, but important seabird group. We studied the inter-breeding migration patterns in three species of Eudyptes penguins (E. chrysolophus, E. filholi and E. moseleyi), the main marine prey consumers amongst the World’s seabirds. Using ultra-miniaturized logging devices (light-based geolocators) and satellite tags, we tracked 87 migrating individuals originating from 4 sites in the southern Indian Ocean (Marion, Crozet, Kerguelen and Amsterdam Islands) and modelled their wintering habitat using the MADIFA niche modelling technique. For each site, sympatric species followed a similar compass bearing during migration with consistent species-specific latitudinal shifts. Within each species, individuals breeding on different islands showed contrasting migration patterns but similar winter habitat preferences driven by sea-surface temperatures. Our results show that inter-breeding migration patterns in sibling penguin species depend primarily on the site of origin and secondly on the species. Such site-specific migration bearings, together with similar wintering habitat used by parapatrics, support the hypothesis that migration behaviour is affected by the intrinsic characteristics of each site. The paleo-oceanographic conditions (primarily, sea-surface temperatures) when the populations first colonized each of these sites may have been an important determinant of subsequent migration patterns. Based on previous chronological schemes of taxonomic radiation and geographical expansion of the genus Eudyptes, we propose a simple scenario to depict the chronological onset of contrasting migration patterns within this penguin group

CiteSeerX

Public Library of Science (PLOS)

Cape Town University OpenUCT

Crossref

Directory of Open Access Journals

PubMed Central

NERC Open Research Archive

FigShare