1,904 research outputs found

    A Survey of Prediction and Classification Techniques in Multicore Processor Systems

    Get PDF
    In multicore processor systems, being able to accurately predict the future provides new optimization opportunities, which otherwise could not be exploited. For example, an oracle able to predict a certain application\u27s behavior running on a smart phone could direct the power manager to switch to appropriate dynamic voltage and frequency scaling modes that would guarantee minimum levels of desired performance while saving energy consumption and thereby prolonging battery life. Using predictions enables systems to become proactive rather than continue to operate in a reactive manner. This prediction-based proactive approach has become increasingly popular in the design and optimization of integrated circuits and of multicore processor systems. Prediction transforms from simple forecasting to sophisticated machine learning based prediction and classification that learns from existing data, employs data mining, and predicts future behavior. This can be exploited by novel optimization techniques that can span across all layers of the computing stack. In this survey paper, we present a discussion of the most popular techniques on prediction and classification in the general context of computing systems with emphasis on multicore processors. The paper is far from comprehensive, but, it will help the reader interested in employing prediction in optimization of multicore processor systems

    Simulation of 1+1 dimensional surface growth and lattices gases using GPUs

    Get PDF
    Restricted solid on solid surface growth models can be mapped onto binary lattice gases. We show that efficient simulation algorithms can be realized on GPUs either by CUDA or by OpenCL programming. We consider a deposition/evaporation model following Kardar-Parisi-Zhang growth in 1+1 dimensions related to the Asymmetric Simple Exclusion Process and show that for sizes, that fit into the shared memory of GPUs one can achieve the maximum parallelization speedup ~ x100 for a Quadro FX 5800 graphics card with respect to a single CPU of 2.67 GHz). This permits us to study the effect of quenched columnar disorder, requiring extremely long simulation times. We compare the CUDA realization with an OpenCL implementation designed for processor clusters via MPI. A two-lane traffic model with randomized turning points is also realized and the dynamical behavior has been investigated.Comment: 20 pages 12 figures, 1 table, to appear in Comp. Phys. Com

    Massively Parallel Computation Using Graphics Processors with Application to Optimal Experimentation in Dynamic Control

    Get PDF
    The rapid increase in the performance of graphics hardware, coupled with recent improvements in its programmability has lead to its adoption in many non-graphics applications, including wide variety of scientific computing fields. At the same time, a number of important dynamic optimal policy problems in economics are athirst of computing power to help overcome dual curses of complexity and dimensionality. We investigate if computational economics may benefit from new tools on a case study of imperfect information dynamic programming problem with learning and experimentation trade-off that is, a choice between controlling the policy target and learning system parameters. Specifically, we use a model of active learning and control of linear autoregression with unknown slope that appeared in a variety of macroeconomic policy and other contexts. The endogeneity of posterior beliefs makes the problem difficult in that the value function need not be convex and policy function need not be continuous. This complication makes the problem a suitable target for massively-parallel computation using graphics processors. Our findings are cautiously optimistic in that new tools let us easily achieve a factor of 15 performance gain relative to an implementation targeting single-core processors and thus establish a better reference point on the computational speed vs. coding complexity trade-off frontier. While further gains and wider applicability may lie behind steep learning barrier, we argue that the future of many computations belong to parallel algorithms anyway.Graphics Processing Units, CUDA programming, Dynamic programming, Learning, Experimentation

    A Real-Time Capable Software-Defined Receiver Using GPU for Adaptive Anti-Jam GPS Sensors

    Get PDF
    Due to their weak received signal power, Global Positioning System (GPS) signals are vulnerable to radio frequency interference. Adaptive beam and null steering of the gain pattern of a GPS antenna array can significantly increase the resistance of GPS sensors to signal interference and jamming. Since adaptive array processing requires intensive computational power, beamsteering GPS receivers were usually implemented using hardware such as field-programmable gate arrays (FPGAs). However, a software implementation using general-purpose processors is much more desirable because of its flexibility and cost effectiveness. This paper presents a GPS software-defined radio (SDR) with adaptive beamsteering capability for anti-jam applications. The GPS SDR design is based on an optimized desktop parallel processing architecture using a quad-core Central Processing Unit (CPU) coupled with a new generation Graphics Processing Unit (GPU) having massively parallel processors. This GPS SDR demonstrates sufficient computational capability to support a four-element antenna array and future GPS L5 signal processing in real time. After providing the details of our design and optimization schemes for future GPU-based GPS SDR developments, the jamming resistance of our GPS SDR under synthetic wideband jamming is presented. Since the GPS SDR uses commercial-off-the-shelf hardware and processors, it can be easily adopted in civil GPS applications requiring anti-jam capabilities

    THE ENVIRONMENT DYNAMICS IDENTIFICATION BASED ON THE MODULAR COMPUTING COMPLEX

    Get PDF
    The research aims at covering the mathematical modeling issues of multidimensional applied problems of ecology based on the application of a modular computing complex. The problem of modeling air pollution processes is solved by mathematical models that adequately describe fundamental processes. That reveals issues such as a detailed analysis of the atmosphere of the city or industrial area, short-term forecast of air quality in the region, assessment of long term air purification programs, optimal emission management, transboundary transfer, etc. At the same time, the formulation and methods of solving problems of environmental dynamics identification are considered, which essence is to estimate the input parameters based on the factual information about the modeled system known from the experiment. In these studies, the multidimensional equation of harmful impurities transfer was reduced to a sequence of schemes involving unknown values in a single direction, alternately in the longitudinal, transverse and vertical.The implicit schemes lead to systems of algebraic linear equations with a three-diagonal structure. Thus, the methodological basis of the difference splitting schemes provides the economic and sustainable implementation of numerical models by the scalar runs method. That approach focuses on the fact that the greatest effect of a parallel processor is achieved when it is used to perform matrix computations of linear algebra.In order to analyze the feasibility of mathematical models, a package of applications was developed to compute the transfer of harmful impurities. A solution to several applied problems for the identification of the environmental dynamics is given.The research aims at covering the mathematical modeling issues of multidimensional applied problems of ecology based on the application of a modular computing complex. The problem of modeling air pollution processes is solved by mathematical models that adequately describe fundamental processes. That reveals issues such as a detailed analysis of the atmosphere of the city or industrial area, short-term forecast of air quality in the region, assessment of long term air purification programs, optimal emission management, transboundary transfer, etc. At the same time, the formulation and methods of solving problems of environmental dynamics identification are considered, which essence is to estimate the input parameters based on the factual information about the modeled system known from the experiment. In these studies, the multidimensional equation of harmful impurities transfer was reduced to a sequence of schemes involving unknown values in a single direction, alternately in the longitudinal, transverse and vertical.The implicit schemes lead to systems of algebraic linear equations with a three-diagonal structure. Thus, the methodological basis of the difference splitting schemes provides the economic and sustainable implementation of numerical models by the scalar runs method. That approach focuses on the fact that the greatest effect of a parallel processor is achieved when it is used to perform matrix computations of linear algebra.In order to analyze the feasibility of mathematical models, a package of applications was developed to compute the transfer of harmful impurities. A solution to several applied problems for the identification of the environmental dynamics is given

    Energy reduction in multiprocessor systems using transactional memory

    Get PDF

    Optimal Power Cost Management Using Stored Energy in Data Centers

    Get PDF
    Since the electricity bill of a data center constitutes a significant portion of its overall operational costs, reducing this has become important. We investigate cost reduction opportunities that arise by the use of uninterrupted power supply (UPS) units as energy storage devices. This represents a deviation from the usual use of these devices as mere transitional fail-over mechanisms between utility and captive sources such as diesel generators. We consider the problem of opportunistically using these devices to reduce the time average electric utility bill in a data center. Using the technique of Lyapunov optimization, we develop an online control algorithm that can optimally exploit these devices to minimize the time average cost. This algorithm operates without any knowledge of the statistics of the workload or electricity cost processes, making it attractive in the presence of workload and pricing uncertainties. An interesting feature of our algorithm is that its deviation from optimality reduces as the storage capacity is increased. Our work opens up a new area in data center power management.Comment: Full version of Sigmetrics 2011 pape

    Activities of the Institute for Computer Applications in Science and Engineering

    Get PDF
    Research conducted at the Institute for Computer Applications in Science and Engineering in applied mathematics, numerical analysis, and computer science during the period April 1, 1985 through October 2, 1985 is summarized
    corecore