240 research outputs found
ATM: approximate task memoization in the runtime system
Redundant computations appear during the execution of real programs. Multiple factors contribute to these unnecessary computations, such as repetitive inputs and patterns, calling functions with the same parameters or bad programming habits. Compilers minimize non useful code with static analysis. However, redundant execution might be dynamic and there are no current approaches to reduce these inefficiencies. Additionally, many algorithms can be computed with different levels of accuracy. Approximate computing exploits this fact to reduce execution time at the cost of slightly less accurate results. In this case, expert developers determine the desired tradeoff between performance and accuracy for each application. In this paper, we present Approximate Task Memoization (ATM), a novel approach in the runtime system that transparently exploits both dynamic redundancy and approximation at the task granularity of a parallel application. Memoization of previous task executions allows predicting the results of future tasks without having to execute them and without losing accuracy. To further increase performance improvements, the runtime system can memoize similar tasks, which leads to task approximate computing. By defining how to measure task similarity and correctness, we present an adaptive algorithm in the runtime system that automatically decides if task approximation is beneficial or not. When evaluated on a real 8-core processor with applications from different domains (financial analysis, stencil-computation, machine-learning and linear-algebra), ATM achieves a 1.4x average speedup when only applying memoization techniques. When adding task approximation, ATM achieves a 2.5x average speedup with an average 0.7% accuracy loss (maximum of 3.2%).This work has been supported by the RoMoL ERC Advanced Grant (GA 321253), by the Spanish Government (grant SEV2015-0493 of the Severo Ochoa Program), by the Spanish Ministry of Science and Innovation (contracts
TIN2015-65316), by Generalitat de Catalunya (contracts 2014-SGR-1051 and 2014-SGR-1272) and the European HiPEAC Network of Excellence. M. MoretĂł has been partially supported by the Ministry of Economy and Competitiveness under Juan de la Cierva postdoctoral fellowship number JCI-2012-15047. M. Casas is supported by the
Secretary for Universities and Research of the Ministry of Economy and Knowledge of the Government of Catalonia and the Cofund programme of the Marie Curie Actions of
the 7th R&D Framework Programme of the European Union (Contract 2013 BP B 00243). I. Brumar has been partially supported by the Spanish Ministry of Education, Culture and
Sports under grant FPU2015/12849.Peer ReviewedPostprint (author's final draft
Consumption of low-moderate level arsenic contaminated water does not increase spontaneous pregnancy loss: a case control study
Previous work suggests an increased risk for spontaneous pregnancy loss linked to high levels of inorganic arsenic (iAs) in drinking water sources (\u3e10 ÎŒg/L). However, there has been little focus to date on the impact of low-moderate levels of iAs in drinking water (\u3c10 \u3eÎŒg/L). To address this data gap we conducted a hospital-based caseâcontrol study in Timis County, Romania
In vitro corrosion of titanium nitride and oxynitride-based biocompatible coatings deposited on stainless steel
The reactive cathodic arc deposition technique was used to produce Ti nitride and oxynitride
coatings on 304 stainless steel substrates (SS). Both mono (SS/TiN, SS/TiNO) and bilayer coatings
(SS/TiN/TiNO and SS/TiNO/TiN) were investigated in terms of elemental and phase composition,
microstructure, grain size, morphology, and roughness. The corrosion behavior in a solution consisting
of 0.10 M NaCl + 1.96 M H2O2 was evaluated, aiming for biomedical applications. The results showed
that the coatings were compact, homogeneously deposited on the substrate, and displaying rough
surfaces. The XRD analysis indicated that both mono and bilayer coatings showed only cubic phases
with (111) and (222) preferred orientations. The highest crystallinity was shown by the SS/TiN coating,
as indicated also by the largest grain size of 23.8 nm, which progressively decreased to 16.3 nm
for the SS/TiNO monolayer. The oxynitride layers exhibited the best in vitro corrosion resistance
either as a monolayer or as a top layer in the bilayer structure, making them a good candidate for
implant applications
Densely connected GCN model for motion prediction
© 2020 The Authors. Computer Animation and Virtual Worlds published by John Wiley & Sons, Ltd. Human motion prediction is a fundamental problem in understanding human natural movements. This task is very challenging due to the complex human body constraints and diversity of action types. Due to the human body being a natural graph, graph convolutional network (GCN)-based models perform better than the traditional recurrent neural network (RNN)-based models on modeling the natural spatial and temporal dependencies lying in the motion data. In this paper, we develop the GCN-based models further by adding densely connected links to increase their feature utilizations and address oversmoothing problem. More specifically, the GCN block is used to learn the spatial relationships between the nodes and each feature map of the GCN block propagates directly to every following block as input rather than residual linked. In this way, the spatial dependency of human motion data is exploited more sufficiently and the features of different level of scale are fused more efficiently. Extensive experiments demonstrate our model achieving the state-of-the-art results on CMU dataset
On the generalized Freedman-Townsend model
Consistent interactions that can be added to a free, Abelian gauge theory
comprising a finite collection of BF models and a finite set of two-form gauge
fields (with the Lagrangian action written in first-order form as a sum of
Abelian Freedman-Townsend models) are constructed from the deformation of the
solution to the master equation based on specific cohomological techniques.
Under the hypotheses of smoothness in the coupling constant, locality, Lorentz
covariance, and Poincare invariance of the interactions, supplemented with the
requirement on the preservation of the number of derivatives on each field with
respect to the free theory, we obtain that the deformation procedure modifies
the Lagrangian action, the gauge transformations as well as the accompanying
algebra. The interacting Lagrangian action contains a generalized version of
non-Abelian Freedman-Townsend model. The consistency of interactions to all
orders in the coupling constant unfolds certain equations, which are shown to
have solutions.Comment: LaTeX, 62 page
Co-Evaluation of Pattern Matching Algorithms on IoT Devices with Embedded GPUs
Pattern matching is an important building block for many security applications, including Network Intrusion Detection Systems (NIDS). As NIDS grow in functionality and complexity, the time overhead and energy consumption of pattern matching become a significant consideration that limits the deployability of such systems, especially on resource-constrained devices.\ua0On the other hand, the emergence of new computing platforms, such as embedded devices with integrated, general-purpose Graphics Processing Units (GPUs), brings new, interesting challenges and opportunities for algorithm design in this setting: how to make use of new architectural features and how to evaluate their effect on algorithm performance. Up to now, work that focuses on pattern matching for such platforms has been limited to specific algorithms in isolation.In this work, we present a systematic and comprehensive benchmark that allows us to co-evaluate both existing and new pattern matching algorithms on heterogeneous devices equipped with embedded GPUs, suitable for medium- to high-level IoT deployments. We evaluate the algorithms on such a heterogeneous device, in close connection with the architectural features of the platform and provide insights on how these features affect the algorithms\u27 behavior. We find that, in our target embedded platform, GPU-based pattern matching algorithms have competitive performance compared to the CPU and consume half as much energy as the CPU-based variants.\ua0Based on these insights, we also propose HYBRID, a new pattern matching approach that efficiently combines techniques from existing approaches and outperforms them by 1.4x, across a range of realistic and synthetic data sets. Our benchmark details the effect of various optimizations, thus providing a path forward to make existing security mechanisms such as NIDS deployable on IoT devices
Design and Experimental Evaluation of a Haptic Robot-Assisted System for Femur Fracture Surgery
In the face of challenges encountered during femur fracture surgery, such as
the high rates of malalignment and X-ray exposure to operating personnel,
robot-assisted surgery has emerged as an alternative to conventional
state-of-the-art surgical methods. This paper introduces the development of
Robossis, a haptic system for robot-assisted femur fracture surgery. Robossis
comprises a 7-DOF haptic controller and a 6-DOF surgical robot. A unilateral
control architecture is developed to address the kinematic mismatch and the
motion transfer between the haptic controller and the Robossis surgical robot.
A real-time motion control pipeline is designed to address the motion transfer
and evaluated through experimental testing. The analysis illustrates that the
Robossis surgical robot can adhere to the desired trajectory from the haptic
controller with an average translational error of 0.32 mm and a rotational
error of 0.07 deg. Additionally, a haptic rendering pipeline is developed to
resolve the kinematic mismatch by constraining the haptic controller (user
hand) movement within the permissible joint limits of the Robossis surgical
robot. Lastly, in a cadaveric lab test, the Robossis system assisted surgeons
during a mock femur fracture surgery. The result shows that Robossis can
provide an intuitive solution for surgeons to perform femur fracture surgery.Comment: This paper is to be submitted to an IEEE journa
A Mosquito Pick-and-Place System for PfSPZ-based Malaria Vaccine Production
The treatment of malaria is a global health challenge that stands to benefit
from the widespread introduction of a vaccine for the disease. A method has
been developed to create a live organism vaccine using the sporozoites (SPZ) of
the parasite Plasmodium falciparum (Pf), which are concentrated in the salivary
glands of infected mosquitoes. Current manual dissection methods to obtain
these PfSPZ are not optimally efficient for large-scale vaccine production. We
propose an improved dissection procedure and a mechanical fixture that
increases the rate of mosquito dissection and helps to deskill this stage of
the production process. We further demonstrate the automation of a key step in
this production process, the picking and placing of mosquitoes from a staging
apparatus into a dissection assembly. This unit test of a robotic mosquito
pick-and-place system is performed using a custom-designed micro-gripper
attached to a four degree of freedom (4-DOF) robot under the guidance of a
computer vision system. Mosquitoes are autonomously grasped and pulled to a
pair of notched dissection blades to remove the head of the mosquito, allowing
access to the salivary glands. Placement into these blades is adapted based on
output from computer vision to accommodate for the unique anatomy and
orientation of each grasped mosquito. In this pilot test of the system on 50
mosquitoes, we demonstrate a 100% grasping accuracy and a 90% accuracy in
placing the mosquito with its neck within the blade notches such that the head
can be removed. This is a promising result for this difficult and non-standard
pick-and-place task.Comment: 12 pages, 11 figures, Manuscript submitted for Special Issue of IEEE
CASE 2019 for IEEE T-AS
Dissection of quantitative blackleg resistance reveals novel variants of resistance gene Rlm9 in elite Brassica napus
Blackleg is one of the major fungal diseases in oilseed rape/canola worldwide. Most
commercial cultivars carry R gene-mediated qualitative resistances that confer a high
level of race-specific protection against Leptosphaeria maculans, the causal fungus of
blackleg disease. However, monogenic resistances of this kind can potentially be rapidly
overcome by mutations in the pathogenâs avirulence genes. To counteract pathogen
adaptation in this evolutionary arms race, there is a tremendous demand for quantitative
background resistance to enhance durability and efficacy of blackleg resistance in
oilseed rape. In this study, we characterized genomic regions contributing to quantitative
L. maculans resistance by genome-wide association studies in a multiparental mapping
population derived from six parental elite varieties exhibiting quantitative resistance,
which were all crossed to one common susceptible parental elite variety. Resistance
was screened using a fungal isolate with no corresponding avirulence (AvrLm) to
major R genes present in the parents of the mapping population. Genome-wide
association studies revealed eight significantly associated quantitative trait loci (QTL)
on chromosomes A07 and A09, with small effects explaining 3â6% of the phenotypic
variance. Unexpectedly, the qualitative blackleg resistance gene Rlm9 was found to be
located within a resistance-associated haploblock on chromosome A07. Furthermore,
long-range sequence data spanning this haploblock revealed high levels of singlenucleotide
and structural variants within the Rlm9 coding sequence among the parents
of the mapping population. The results suggest that novel variants of Rlm9 could play a
previously unknown role in expression of quantitative disease resistance in oilseed rape
- âŠ