10 research outputs found

    FPGA acceleration of the phylogenetic likelihood function for Bayesian MCMC inference methods

    Get PDF
    Background Likelihood (ML)-based phylogenetic inference has become a popular method for estimating the evolutionary relationships among species based on genomic sequence data. This method is used in applications such as RAxML, GARLI, MrBayes, PAML, and PAUP. The Phylogenetic Likelihood Function (PLF) is an important kernel computation for this method. The PLF consists of a loop with no conditional behavior or dependencies between iterations. As such it contains a high potential for exploiting parallelism using micro-architectural techniques. In this paper, we describe a technique for mapping the PLF and supporting logic onto a Field Programmable Gate Array (FPGA)-based co-processor. By leveraging the FPGA\u27s on-chip DSP modules and the high-bandwidth local memory attached to the FPGA, the resultant co-processor can accelerate ML-based methods and outperform state-of-the-art multi-core processors. Results We use the MrBayes 3 tool as a framework for designing our co-processor. For large datasets, we estimate that our accelerated MrBayes, if run on a current-generation FPGA, achieves a 10Ă— speedup relative to software running on a state-of-the-art server-class microprocessor. The FPGA-based implementation achieves its performance by deeply pipelining the likelihood computations, performing multiple floating-point operations in parallel, and through a natural log approximation that is chosen specifically to leverage a deeply pipelined custom architecture. Conclusions Heterogeneous computing, which combines general-purpose processors with special-purpose co-processors such as FPGAs and GPUs, is a promising approach for high-performance phylogeny inference as shown by the growing body of literature in this field. FPGAs in particular are well-suited for this task because of their low power consumption as compared to many-core processors and Graphics Processor Units (GPUs)

    Mechanizing conventional SSA for a verified destruction with coalescing

    Get PDF
    International audienceModern optimizing compilers rely on the Static Single Assignment (SSA) form to make optimizations fast and simpler to implement. From a semantic perspective, the SSA form is nowadays fairly well understood, as witnessed by recent advances in the field of formally verified compilers. The destruction of the SSA form, however, remains a difficult problem, even in a non-verified environment. In fact, the out-of-SSA transformation has been revisited, for correctness and performance issues, up until recently. Unsurprisingly, state-of-the-art compiler formalizations thus either completely ignore, only partially handle, or implement naively the SSA destruction. This paper reports on the implementation of such a destruction within a verified compiler. We formally define and prove the properties of the generation of Conventional SSA (CSSA) which make its destruction simple to implement and prove. Second, we implement and prove correct a coalescing destruction of CSSA, Ă  la Boissinot et al., where variables can be coalesced according to a refined notion of interference. This formalization work extends the CompCertSSA compiler, whose correctness proof is mechanized in the Coq proof assistant. Our CSSA-based, coalescing destruction removes, on average , more than 99% of introduced copies, and leads to encouraging results concerning spilling during post-SSA register allocation

    Novel Arithmetics in Deep Neural Networks Signal Processing for Autonomous Driving: Challenges and Opportunities

    No full text
    This article focuses on the trends, opportunities, and challenges of novel arithmetic for deep neural network (DNN) signal processing, with particular reference to assisted- and autonomous driving applications. Due to strict constraints in terms of the latency, dependability, and security of autonomous driving, machine perception (i.e., detection and decision tasks) based on DNNs cannot be implemented by relying on remote cloud access. These tasks must be performed in real time in embedded systems on board the vehicle, particularly for the inference phase (considering the use of DNNs pretrained during an offline step). When developing a DNN computing platform, the choice of the computing arithmetic matters. Moreover, functional safe applications, such as autonomous driving, impose severe constraints on the effect that signal processing accuracy has on the final rate of wrong detection/decisions. Hence, after reviewing the different choices and tradeoffs concerning arithmetic, both in academia and industry, we highlight the issues in implementing DNN accelerators to achieve accurate and lowcomplexity processing of automotive sensor signals (the latter coming from diverse sources, such as cameras, radar, lidar, and ultrasonics). The focus is on both general-purpose operations massively used in DNNs, such as multiplying, accumulating, and comparing, and on specific functions, including, for example, sigmoid or hyperbolic tangents used for neuron activation

    A space oddity: Geographic and specific modulation of migration in Eudyptes penguins

    Get PDF
    Post-breeding migration in land-based marine animals is thought to offset seasonal deterioration in foraging or other important environmental conditions at the breeding site. However the inter-breeding distribution of such animals may reflect not only their optimal habitat, but more subtle influences on an individual’s migration path, including such factors as the intrinsic influence of each locality’s paleoenvironment, thereby influencing animals’ wintering distribution. In this study we investigated the influence of the regional marine environment on the migration patterns of a poorly known, but important seabird group. We studied the inter-breeding migration patterns in three species of Eudyptes penguins (E. chrysolophus, E. filholi and E. moseleyi), the main marine prey consumers amongst the World’s seabirds. Using ultra-miniaturized logging devices (light-based geolocators) and satellite tags, we tracked 87 migrating individuals originating from 4 sites in the southern Indian Ocean (Marion, Crozet, Kerguelen and Amsterdam Islands) and modelled their wintering habitat using the MADIFA niche modelling technique. For each site, sympatric species followed a similar compass bearing during migration with consistent species-specific latitudinal shifts. Within each species, individuals breeding on different islands showed contrasting migration patterns but similar winter habitat preferences driven by sea-surface temperatures. Our results show that inter-breeding migration patterns in sibling penguin species depend primarily on the site of origin and secondly on the species. Such site-specific migration bearings, together with similar wintering habitat used by parapatrics, support the hypothesis that migration behaviour is affected by the intrinsic characteristics of each site. The paleo-oceanographic conditions (primarily, sea-surface temperatures) when the populations first colonized each of these sites may have been an important determinant of subsequent migration patterns. Based on previous chronological schemes of taxonomic radiation and geographical expansion of the genus Eudyptes, we propose a simple scenario to depict the chronological onset of contrasting migration patterns within this penguin group
    corecore