1,338 research outputs found

    Fuzzy memoization for floating-point multimedia applications

    Get PDF
    Instruction memoization is a promising technique to reduce the power consumption and increase the performance of future low-end/mobile multimedia systems. Power and performance efficiency can be improved by reusing instances of an already executed operation. Unfortunately, this technique may not always be worth the effort due to the power consumption and area impact of the tables required to leverage an adequate level of reuse. In this paper, we introduce and evaluate a novel way of understanding multimedia floating-point operations based on the fuzzy computation paradigm: performance and power consumption can be improved at the cost of small precision losses in computation. By exploiting this implicit characteristic of multimedia applications, we propose a new technique called tolerant memoization. This technique expands the capabilities of classic memoization by associating entries with similar inputs to the same output. We evaluate this new technique by measuring the effect of tolerant memoization for floating-point operations in a low-power multimedia processor and discuss the trade-offs between performance and quality of the media outputs. We report energy improvements of 12 percent for a set of key multimedia applications with small LUT of 6 Kbytes, compared to 3 percent obtained using previously proposed techniques.Peer ReviewedPostprint (published version

    Análisis cognitivo en la preparación de un experimento de enseñanza acerca de variables estadísticas y sus escalas de medición

    Get PDF
    El ciclo de análisis didáctico (Gómez, 2002) se compone entre otros elementos del análisis de contenido, análisis cognitivo, análisis de instrucción y análisis de actuación. En el proceso de preparación del experimento de enseñanza se trabaja en el desarrollo de los primeros tres primeros elementos. Se describe de manera general los referentes teóricos relacionados con el ciclo de análisis didáctico específicamente el análisis cognitivo, en el cual según González y Gómez (2013) señalan que el foco de atención recae sobre el aprendizaje del estudiante, razón por la cual se especifican lo que se espera que aprendan los estudiantes acerca de las escalas de medición y la variable estadística, esto concretado en el planteamiento de competencias, objetivos y capacidades. Esta comunicación reporta un avance del trabajo de maestría titulado ‘Variable estadística y escalas de medición’ enmarcado en la línea de investigación en Educación Estadística de la Universidad Pedagógica Nacional

    Initial results on fuzzy floating point computation for multimedia processors

    Get PDF
    During the recent years, the market of mid/low-end portable systems such as PDAs or mobile digital phones have experimented a revolution in both selling volume and features as handheld devices incorporate Multimedia applications. This fact brings to an increase in the computational demands of the devices, while still having the limitation of power (and energy) consumption. Instruction memoization is a promising technique to help alleviate the problem of power consumption of expensive functional units such as the floating-point one. Unfortunately, this technique could be energy-inefficient for low-end systems due to the additional power consumption of the relatively big tables required. In this paper we present a novel way of understanding multimedia floating point operations based on the fuzzy computation paradigm: losses in the computation precision may exchange performance for negligible errors in the output. Exploiting the implicit characteristics of media FP computation, we propose a new technique called fuzzy memoization. Fuzzy memoization expands the capabilities of classic memoization by attaching entries with similar inputs to the same output. We present a case of study for a SH4 like processor and report good performance and power-delay improvements with feasible hardware requirements.Peer ReviewedPostprint (published version

    Performance analysis of a hardware accelerator of dependence management for taskbased dataflow programming models

    Get PDF
    Along with the popularity of multicore and manycore, task-based dataflow programming models obtain great attention for being able to extract high parallelism from applications without exposing the complexity to programmers. One of these pioneers is the OpenMP Superscalar (OmpSs). By implementing dynamic task dependence analysis, dataflow scheduling and out-of-order execution in runtime, OmpSs achieves high performance using coarse and medium granularity tasks. In theory, for the same application, the more parallel tasks can be exposed, the higher possible speedup can be achieved. Yet this factor is limited by task granularity, up to a point where the runtime overhead outweighs the performance increase and slows down the application. To overcome this handicap, Picos was proposed to support task-based dataflow programming models like OmpSs as a fast hardware accelerator for fine-grained task and dependence management, and a simulator was developed to perform design space exploration. This paper presents the very first functional hardware prototype inspired by Picos. An embedded system based on a Zynq 7000 All-Programmable SoC is developed to study its capabilities and possible bottlenecks. Initial scalability and hardware consumption studies of different Picos designs are performed to find the one with the highest performance and lowest hardware cost. A further thorough performance study is employed on both the prototype with the most balanced configuration and the OmpSs software-only alternative. Results show that our OmpSs runtime hardware support significantly outperforms the software-only implementation currently available in the runtime system for finegrained tasks.This work is supported by the Spanish Government through Programa Severo Ochoa (SEV-2015-0493), by the Spanish Ministry of Science and Technology through TIN2015-65316-P project, by the Generalitat de Catalunya (contracts 2014-SGR-1051 and 2014-SGR-1272) and by the European Research Council RoMoL Grant Agreement number 321253. We also thank the Xilinx University Program for its hardware and software donations.Peer ReviewedPostprint (published version

    MCMC Bayesian spatial filtering for hedonic models in real estate markets

    Get PDF
    The traditional hedonic model postulates that housing prices depend on their characteristics and their location. However, this model assumes a constant relationship between the dependent and the independent variables. This assumption is unrealistic because empirical studies have shown that the regression coefficients depend on the housing location. For this reason, it is necessary to use models with spatially varying coefficients. The approaches proposed in the literature used to estimate this type of models do not incorporate the uncertainty associated with the estimation and selection of models and/or are computationally expensive. To improve these aspects, this paper proposes spatial filtering techniques to parsimoniously model the spatial dependencies of the hedonic coefficients and an adaptive MCMC Bayesian algorithm to select the most appropriate filters. The method is illustrated through an application to the real estate market of Zaragoza, and a comparison with alternative procedures is conducted. Our results show that our valuation methodology has better goodness of fit and predictive performance properties than alternative methods. Although our proposal assumes normality and homoscedasticity of the model error distribution, the method is easy to implement and not very computationally demanding, which makes this approach attractive and useful from a practical viewpoint

    Los Almendros (La Roda): a new Paleolithic site in Albacete

    Get PDF
    El yacimiento de Los Almendros se ubica en la localidad manchega de La Roda (Albacete). El material lítico estudiado fue recogido utilizando un sistema radial de sectorización que cubría 2500 metros cuadrados de una superficie problemática, con zonas de abundante monte bajo, y otras, más despejadas. En total se hallaron 1004 restos líticos, fundamentalmente en cuarcita. De ellos, 163 son útiles, destacando las raederas, lascas retocadas, y raspadores. Una primera aproximación a las características de la industria lítica nos ha permitido sugerir su atribución crono-cultural al tecno-complejo Musteriense.The site of Los Almendros is located in La Roda (Albacete). The lithic material studied was gathered through a radial sectorization collecting system which covered a problematic surface of 2500 m2, including zones of abundant scrubland and others clearer. A total amount of 1004 lithic pieces were registred, mainly in quartzite, 163 of which were tools. The side-scrapers, retouched flakes, and scrapers are the most numerous. A first approach to the features of the lithic industry has allowed us to suggest the crono-cultural attribution of this site to a Mousterian period

    Architectural support for task dependence management with flexible software scheduling

    Get PDF
    The growing complexity of multi-core architectures has motivated a wide range of software mechanisms to improve the orchestration of parallel executions. Task parallelism has become a very attractive approach thanks to its programmability, portability and potential for optimizations. However, with the expected increase in core counts, finer-grained tasking will be required to exploit the available parallelism, which will increase the overheads introduced by the runtime system. This work presents Task Dependence Manager (TDM), a hardware/software co-designed mechanism to mitigate runtime system overheads. TDM introduces a hardware unit, denoted Dependence Management Unit (DMU), and minimal ISA extensions that allow the runtime system to offload costly dependence tracking operations to the DMU and to still perform task scheduling in software. With lower hardware cost, TDM outperforms hardware-based solutions and enhances the flexibility, adaptability and composability of the system. Results show that TDM improves performance by 12.3% and reduces EDP by 20.4% on average with respect to a software runtime system. Compared to a runtime system fully implemented in hardware, TDM achieves an average speedup of 4.2% with 7.3x less area requirements and significant EDP reductions. In addition, five different software schedulers are evaluated with TDM, illustrating its flexibility and performance gains.This work has been supported by the RoMoL ERC Advanced Grant (GA 321253), by the European HiPEAC Network of Excellence, by the Spanish Ministry of Science and Innovation (contracts TIN2015-65316-P, TIN2016-76635-C2-2-R and TIN2016-81840-REDT), by the Generalitat de Catalunya (contracts 2014-SGR-1051 and 2014-SGR-1272), and by the European Union’s Horizon 2020 research and innovation programme under grant agreement No 671697 and No. 671610. M. Moretó has been partially supported by the Ministry of Economy and Competitiveness under Juan de la Cierva postdoctoral fellowship number JCI-2012-15047.Peer ReviewedPostprint (author's final draft

    Simultaneous solution for water, heat and salt balances in a Mediterranean coastal lagoon (Mar Menor, Spain)

    Get PDF
    A modelling approach is proposed to evaluate the environmental dynamics of coastal lagoons. The water, heat and salt balances are addressed simultaneously, providing a better estimation of evaporation and water exchanges. Compared to traditional approaches, the model presented accounts for the effects of water salinity, heat storage and net energy advected in the water body. The model was applied daily to the Mar Menor coastal lagoon (SE Spain) from 2003 through 2006. Water exchanges with the Mediterranean Sea were estimated based on the monthly trend of the lagoon salinity and were correlated with monthly averages of wind speed. The mean daily water exchange with the sea was 1.77 hm3 d 1. This exchange accounted for only 1% of the heat losses in the lagoon heat balance, and it is the most important flow in the water balance. The mean annual evaporation flux amounted to 101.3 W m-2 (3.55 mm d-1), while the sensible heat flux amounted to 19.7 W m-2, leading to an annual Bowen ratio on the order of 0.19. To validate the model, daily water temperatures were predicted based on the daily heat balance of the water body and were compared with remote sensing data from water surface standard products.The authors acknowledge the Fundación Instituto Euromediterráneo del Agua (Murcia, Spain) for the financial support that made possible this study
    corecore