230 research outputs found

    ATM: approximate task memoization in the runtime system

    Get PDF
    Redundant computations appear during the execution of real programs. Multiple factors contribute to these unnecessary computations, such as repetitive inputs and patterns, calling functions with the same parameters or bad programming habits. Compilers minimize non useful code with static analysis. However, redundant execution might be dynamic and there are no current approaches to reduce these inefficiencies. Additionally, many algorithms can be computed with different levels of accuracy. Approximate computing exploits this fact to reduce execution time at the cost of slightly less accurate results. In this case, expert developers determine the desired tradeoff between performance and accuracy for each application. In this paper, we present Approximate Task Memoization (ATM), a novel approach in the runtime system that transparently exploits both dynamic redundancy and approximation at the task granularity of a parallel application. Memoization of previous task executions allows predicting the results of future tasks without having to execute them and without losing accuracy. To further increase performance improvements, the runtime system can memoize similar tasks, which leads to task approximate computing. By defining how to measure task similarity and correctness, we present an adaptive algorithm in the runtime system that automatically decides if task approximation is beneficial or not. When evaluated on a real 8-core processor with applications from different domains (financial analysis, stencil-computation, machine-learning and linear-algebra), ATM achieves a 1.4x average speedup when only applying memoization techniques. When adding task approximation, ATM achieves a 2.5x average speedup with an average 0.7% accuracy loss (maximum of 3.2%).This work has been supported by the RoMoL ERC Advanced Grant (GA 321253), by the Spanish Government (grant SEV2015-0493 of the Severo Ochoa Program), by the Spanish Ministry of Science and Innovation (contracts TIN2015-65316), by Generalitat de Catalunya (contracts 2014-SGR-1051 and 2014-SGR-1272) and the European HiPEAC Network of Excellence. M. Moretó has been partially supported by the Ministry of Economy and Competitiveness under Juan de la Cierva postdoctoral fellowship number JCI-2012-15047. M. Casas is supported by the Secretary for Universities and Research of the Ministry of Economy and Knowledge of the Government of Catalonia and the Cofund programme of the Marie Curie Actions of the 7th R&D Framework Programme of the European Union (Contract 2013 BP B 00243). I. Brumar has been partially supported by the Spanish Ministry of Education, Culture and Sports under grant FPU2015/12849.Peer ReviewedPostprint (author's final draft

    Consumption of low-moderate level arsenic contaminated water does not increase spontaneous pregnancy loss: a case control study

    Get PDF
    Previous work suggests an increased risk for spontaneous pregnancy loss linked to high levels of inorganic arsenic (iAs) in drinking water sources (\u3e10 μg/L). However, there has been little focus to date on the impact of low-moderate levels of iAs in drinking water (\u3c10 \u3eμg/L). To address this data gap we conducted a hospital-based case–control study in Timis County, Romania

    In vitro corrosion of titanium nitride and oxynitride-based biocompatible coatings deposited on stainless steel

    Get PDF
    The reactive cathodic arc deposition technique was used to produce Ti nitride and oxynitride coatings on 304 stainless steel substrates (SS). Both mono (SS/TiN, SS/TiNO) and bilayer coatings (SS/TiN/TiNO and SS/TiNO/TiN) were investigated in terms of elemental and phase composition, microstructure, grain size, morphology, and roughness. The corrosion behavior in a solution consisting of 0.10 M NaCl + 1.96 M H2O2 was evaluated, aiming for biomedical applications. The results showed that the coatings were compact, homogeneously deposited on the substrate, and displaying rough surfaces. The XRD analysis indicated that both mono and bilayer coatings showed only cubic phases with (111) and (222) preferred orientations. The highest crystallinity was shown by the SS/TiN coating, as indicated also by the largest grain size of 23.8 nm, which progressively decreased to 16.3 nm for the SS/TiNO monolayer. The oxynitride layers exhibited the best in vitro corrosion resistance either as a monolayer or as a top layer in the bilayer structure, making them a good candidate for implant applications

    Densely connected GCN model for motion prediction

    Get PDF
    © 2020 The Authors. Computer Animation and Virtual Worlds published by John Wiley & Sons, Ltd. Human motion prediction is a fundamental problem in understanding human natural movements. This task is very challenging due to the complex human body constraints and diversity of action types. Due to the human body being a natural graph, graph convolutional network (GCN)-based models perform better than the traditional recurrent neural network (RNN)-based models on modeling the natural spatial and temporal dependencies lying in the motion data. In this paper, we develop the GCN-based models further by adding densely connected links to increase their feature utilizations and address oversmoothing problem. More specifically, the GCN block is used to learn the spatial relationships between the nodes and each feature map of the GCN block propagates directly to every following block as input rather than residual linked. In this way, the spatial dependency of human motion data is exploited more sufficiently and the features of different level of scale are fused more efficiently. Extensive experiments demonstrate our model achieving the state-of-the-art results on CMU dataset

    On the generalized Freedman-Townsend model

    Full text link
    Consistent interactions that can be added to a free, Abelian gauge theory comprising a finite collection of BF models and a finite set of two-form gauge fields (with the Lagrangian action written in first-order form as a sum of Abelian Freedman-Townsend models) are constructed from the deformation of the solution to the master equation based on specific cohomological techniques. Under the hypotheses of smoothness in the coupling constant, locality, Lorentz covariance, and Poincare invariance of the interactions, supplemented with the requirement on the preservation of the number of derivatives on each field with respect to the free theory, we obtain that the deformation procedure modifies the Lagrangian action, the gauge transformations as well as the accompanying algebra. The interacting Lagrangian action contains a generalized version of non-Abelian Freedman-Townsend model. The consistency of interactions to all orders in the coupling constant unfolds certain equations, which are shown to have solutions.Comment: LaTeX, 62 page

    Co-Evaluation of Pattern Matching Algorithms on IoT Devices with Embedded GPUs

    Get PDF
    Pattern matching is an important building block for many security applications, including Network Intrusion Detection Systems (NIDS). As NIDS grow in functionality and complexity, the time overhead and energy consumption of pattern matching become a significant consideration that limits the deployability of such systems, especially on resource-constrained devices.\ua0On the other hand, the emergence of new computing platforms, such as embedded devices with integrated, general-purpose Graphics Processing Units (GPUs), brings new, interesting challenges and opportunities for algorithm design in this setting: how to make use of new architectural features and how to evaluate their effect on algorithm performance. Up to now, work that focuses on pattern matching for such platforms has been limited to specific algorithms in isolation.In this work, we present a systematic and comprehensive benchmark that allows us to co-evaluate both existing and new pattern matching algorithms on heterogeneous devices equipped with embedded GPUs, suitable for medium- to high-level IoT deployments. We evaluate the algorithms on such a heterogeneous device, in close connection with the architectural features of the platform and provide insights on how these features affect the algorithms\u27 behavior. We find that, in our target embedded platform, GPU-based pattern matching algorithms have competitive performance compared to the CPU and consume half as much energy as the CPU-based variants.\ua0Based on these insights, we also propose HYBRID, a new pattern matching approach that efficiently combines techniques from existing approaches and outperforms them by 1.4x, across a range of realistic and synthetic data sets. Our benchmark details the effect of various optimizations, thus providing a path forward to make existing security mechanisms such as NIDS deployable on IoT devices

    Design and Experimental Evaluation of a Haptic Robot-Assisted System for Femur Fracture Surgery

    Full text link
    In the face of challenges encountered during femur fracture surgery, such as the high rates of malalignment and X-ray exposure to operating personnel, robot-assisted surgery has emerged as an alternative to conventional state-of-the-art surgical methods. This paper introduces the development of Robossis, a haptic system for robot-assisted femur fracture surgery. Robossis comprises a 7-DOF haptic controller and a 6-DOF surgical robot. A unilateral control architecture is developed to address the kinematic mismatch and the motion transfer between the haptic controller and the Robossis surgical robot. A real-time motion control pipeline is designed to address the motion transfer and evaluated through experimental testing. The analysis illustrates that the Robossis surgical robot can adhere to the desired trajectory from the haptic controller with an average translational error of 0.32 mm and a rotational error of 0.07 deg. Additionally, a haptic rendering pipeline is developed to resolve the kinematic mismatch by constraining the haptic controller (user hand) movement within the permissible joint limits of the Robossis surgical robot. Lastly, in a cadaveric lab test, the Robossis system assisted surgeons during a mock femur fracture surgery. The result shows that Robossis can provide an intuitive solution for surgeons to perform femur fracture surgery.Comment: This paper is to be submitted to an IEEE journa

    A Mosquito Pick-and-Place System for PfSPZ-based Malaria Vaccine Production

    Full text link
    The treatment of malaria is a global health challenge that stands to benefit from the widespread introduction of a vaccine for the disease. A method has been developed to create a live organism vaccine using the sporozoites (SPZ) of the parasite Plasmodium falciparum (Pf), which are concentrated in the salivary glands of infected mosquitoes. Current manual dissection methods to obtain these PfSPZ are not optimally efficient for large-scale vaccine production. We propose an improved dissection procedure and a mechanical fixture that increases the rate of mosquito dissection and helps to deskill this stage of the production process. We further demonstrate the automation of a key step in this production process, the picking and placing of mosquitoes from a staging apparatus into a dissection assembly. This unit test of a robotic mosquito pick-and-place system is performed using a custom-designed micro-gripper attached to a four degree of freedom (4-DOF) robot under the guidance of a computer vision system. Mosquitoes are autonomously grasped and pulled to a pair of notched dissection blades to remove the head of the mosquito, allowing access to the salivary glands. Placement into these blades is adapted based on output from computer vision to accommodate for the unique anatomy and orientation of each grasped mosquito. In this pilot test of the system on 50 mosquitoes, we demonstrate a 100% grasping accuracy and a 90% accuracy in placing the mosquito with its neck within the blade notches such that the head can be removed. This is a promising result for this difficult and non-standard pick-and-place task.Comment: 12 pages, 11 figures, Manuscript submitted for Special Issue of IEEE CASE 2019 for IEEE T-AS

    Dissection of quantitative blackleg resistance reveals novel variants of resistance gene Rlm9 in elite Brassica napus

    Get PDF
    Blackleg is one of the major fungal diseases in oilseed rape/canola worldwide. Most commercial cultivars carry R gene-mediated qualitative resistances that confer a high level of race-specific protection against Leptosphaeria maculans, the causal fungus of blackleg disease. However, monogenic resistances of this kind can potentially be rapidly overcome by mutations in the pathogen’s avirulence genes. To counteract pathogen adaptation in this evolutionary arms race, there is a tremendous demand for quantitative background resistance to enhance durability and efficacy of blackleg resistance in oilseed rape. In this study, we characterized genomic regions contributing to quantitative L. maculans resistance by genome-wide association studies in a multiparental mapping population derived from six parental elite varieties exhibiting quantitative resistance, which were all crossed to one common susceptible parental elite variety. Resistance was screened using a fungal isolate with no corresponding avirulence (AvrLm) to major R genes present in the parents of the mapping population. Genome-wide association studies revealed eight significantly associated quantitative trait loci (QTL) on chromosomes A07 and A09, with small effects explaining 3–6% of the phenotypic variance. Unexpectedly, the qualitative blackleg resistance gene Rlm9 was found to be located within a resistance-associated haploblock on chromosome A07. Furthermore, long-range sequence data spanning this haploblock revealed high levels of singlenucleotide and structural variants within the Rlm9 coding sequence among the parents of the mapping population. The results suggest that novel variants of Rlm9 could play a previously unknown role in expression of quantitative disease resistance in oilseed rape
    corecore