Search CORE

665 research outputs found

A Survey of Prediction and Classification Techniques in Multicore Processor Systems

Author: Ababei Cristinel
Moghaddam Milad Ghorbani
Publication venue: e-Publications@Marquette
Publication date: 01/05/2019
Field of study

In multicore processor systems, being able to accurately predict the future provides new optimization opportunities, which otherwise could not be exploited. For example, an oracle able to predict a certain application\u27s behavior running on a smart phone could direct the power manager to switch to appropriate dynamic voltage and frequency scaling modes that would guarantee minimum levels of desired performance while saving energy consumption and thereby prolonging battery life. Using predictions enables systems to become proactive rather than continue to operate in a reactive manner. This prediction-based proactive approach has become increasingly popular in the design and optimization of integrated circuits and of multicore processor systems. Prediction transforms from simple forecasting to sophisticated machine learning based prediction and classification that learns from existing data, employs data mining, and predicts future behavior. This can be exploited by novel optimization techniques that can span across all layers of the computing stack. In this survey paper, we present a discussion of the most popular techniques on prediction and classification in the general context of computing systems with emphasis on multicore processors. The paper is far from comprehensive, but, it will help the reader interested in employing prediction in optimization of multicore processor systems

epublications@Marquette

To boldly go:an occam-π mission to engineer emergence

Author: Klein Mark
Sampson Adam T.
Wallnau Kurt
Welch Peter H.
Publication venue
Publication date: 01/01/2012
Field of study

Future systems will be too complex to design and implement explicitly. Instead, we will have to learn to engineer complex behaviours indirectly: through the discovery and application of local rules of behaviour, applied to simple process components, from which desired behaviours predictably emerge through dynamic interactions between massive numbers of instances. This paper describes a process-oriented architecture for fine-grained concurrent systems that enables experiments with such indirect engineering. Examples are presented showing the differing complex behaviours that can arise from minor (non-linear) adjustments to low-level parameters, the difficulties in suppressing the emergence of unwanted (bad) behaviour, the unexpected relationships between apparently unrelated physical phenomena (shown up by their separate emergence from the same primordial process swamp) and the ability to explore and engineer completely new physics (such as force fields) by their emergence from low-level process interactions whose mechanisms can only be imagined, but not built, at the current time

Abertay Research Portal

Crossref

Kent Academic Repository

Speculative Thread Framework for Transient Management and Bumpless Transfer in Reconfigurable Digital Filters

Author: Ferri Aldo
Ferri Bonnie
Giardino Michael
Maxwell Wayne
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 12/06/2018
Field of study

There are many methods developed to mitigate transients induced when abruptly changing dynamic algorithms such as those found in digital filters or controllers. These "bumpless transfer" methods have a computational burden to them and take time to implement, causing a delay in the desired switching time. This paper develops a method that automatically reconfigures the computational resources in order to implement a transient management method without any delay in switching times. The method spawns a speculative thread when it predicts if a switch in algorithms is imminent so that the calculations are done prior to the switch being made. The software framework is described and experimental results are shown for a switching between filters in a filter bank.Comment: 6 pages, 7 figures, to be presented at American Controls Conference 201

arXiv.org e-Print Archive

Crossref

The 2014 International Planning Competition: Progress and Trends

Author: Chrpa Lukas
Editor Managing
Grześ Marek
McCluskey Thomas Leo
Roberts Mark
Sanner Scott
Vallati Mauro
Publication venue: 'Association for the Advancement of Artificial Intelligence (AAAI)'
Publication date: 01/03/2015
Field of study

We review the 2014 International Planning Competition (IPC-2014), the eighth in a series of competitions starting in 1998. IPC-2014 was held in three separate parts to assess state-of-the-art in three prominent areas of planning research: the deterministic (classical) part (IPCD), the learning part (IPCL), and the probabilistic part (IPPC). Each part evaluated planning systems in ways that pushed the edge of existing planner performance by introducing new challenges, novel tasks, or both. The competition surpassed again the number of competitors than its predecessor, highlighting the competition’s central role in shaping the landscape of ongoing developments in evaluating planning systems

Crossref

Kent Academic Repository

Association for the Advancement of Artificial Intelligence: AAAI Publications

University of Huddersfield Repository

Huddersfield Research Portal

Applications Exploiting e-Infrastructures Across Europe and India Within the EU-IndiaGrid Project

Author: Alberto Masoni
Stefano Cozzini
Publication venue: 'IntechOpen'
Publication date: 01/01/2012
Field of study

In the last few years e-Infrastructures across Europe and India faced remarkable developments. Both national and international connectivity improved considerably and Grid Computing also profited of significant developments. As a consequence scientific applications were in the position of taking substantial benefits from this progress. The most relevant cases are represented by High Energy Physics (with the contribution to the program of Large Hadron Collider at CERN, Geneva) and Nano Science (exploiting NKN-TEIN3-GEANT interconnection for crystallography experiments with the remote access & control of experimental facility at the ESRF Synchrotron based in Grenoble, France directly from Mumbai, India). Other relevant application areas include Climate Change research, Biology, and several areas in Material Science. Within this framework, in the last five years period two specific EU funded projects (the EU-IndiaGrid and EU-IndiaGrid2) played a bridging role supporting several applications that exploited these advanced e-Infrastructures for the benefit of Euro-India common research programs. A crucial important part in the projects activity was the support offered to selected applications which ranges from the training the user communities behind up to the porting of their scientific applications on the grid computing infrastructure. This article aims to present and review the main e Infrastructures development in India and their full exploitation by scientific applications with a focus on the role played by the EUIndiaGrid and EU-IndiaGrid2 projects

IntechOpen

Crossref

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Advances in High Performance Computing and Related Issues

Author: Borko Furht
Nenad Korolija
Veljko Milutinović
Zoran Obradović
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2016
Field of study

Crossref

Directory of Open Access Journals

Efficient Neural Network Implementations on Parallel Embedded Platforms Applied to Real-Time Torque-Vectoring Optimization Using Predictions for Multi-Motor Electric Vehicles

Author: Cosco Francesco
Dendaluce Jahnke Martin
Gomez-Garay Vicente
Novickis Rihards
Pérez Rastelli Joshué
Publication venue: 'MDPI AG'
Publication date: 01/01/2019
Field of study

The combination of machine learning and heterogeneous embedded platforms enables new potential for developing sophisticated control concepts which are applicable to the field of vehicle dynamics and ADAS. This interdisciplinary work provides enabler solutions -ultimately implementing fast predictions using neural networks (NNs) on field programmable gate arrays (FPGAs) and graphical processing units (GPUs)- while applying them to a challenging application: Torque Vectoring on a multi-electric-motor vehicle for enhanced vehicle dynamics. The foundation motivating this work is provided by discussing multiple domains of the technological context as well as the constraints related to the automotive field, which contrast with the attractiveness of exploiting the capabilities of new embedded platforms to apply advanced control algorithms for complex control problems. In this particular case we target enhanced vehicle dynamics on a multi-motor electric vehicle benefiting from the greater degrees of freedom and controllability offered by such powertrains. Considering the constraints of the application and the implications of the selected multivariable optimization challenge, we propose a NN to provide batch predictions for real-time optimization. This leads to the major contribution of this work: efficient NN implementations on two intrinsically parallel embedded platforms, a GPU and a FPGA, following an analysis of theoretical and practical implications of their different operating paradigms, in order to efficiently harness their computing potential while gaining insight into their peculiarities. The achieved results exceed the expectations and additionally provide a representative illustration of the strengths and weaknesses of each kind of platform. Consequently, having shown the applicability of the proposed solutions, this work contributes valuable enablers also for further developments following similar fundamental principles.Some of the results presented in this work are related to activities within the 3Ccar project, which has received funding from ECSEL Joint Undertaking under grant agreement No. 662192. This Joint Undertaking received support from the European Union’s Horizon 2020 research and innovation programme and Germany, Austria, Czech Republic, Romania, Belgium, United Kingdom, France, Netherlands, Latvia, Finland, Spain, Italy, Lithuania. This work was also partly supported by the project ENABLES3, which received funding from ECSEL Joint Undertaking under grant agreement No. 692455-2

Multidisciplinary Digital Publishing Institute

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

TECNALIA Publications

GPU-Accelerated Large-Eddy Simulation of Turbulent Channel Flows

Author: Antoniou A.S.
Briggs W. L.
Cheng W.
Chorin A.J.
Chung D.
Deardorff J.W.
Driest E.V.
Geveler M.
Griebel M.
Hoyas S.
Jacobsen D.A.
Jacobsen D.A.
Jacobsen D.A.
Kerr A.
Kogge P. M.
Meneveau C.
Smagorinksy J.
The Portland Group
Thibault J.C.
Publication venue: 'IUScholarWorks'
Publication date: 09/01/2012
Field of study

High performance computing clusters that are augmented with cost and power efficient graphics processing unit (GPU) provide new opportunities to broaden the use of large-eddy simulation technique to study high Reynolds number turbulent flows in fluids engineering applications. In this paper, we extend our earlier work on multi-GPU acceleration of an incompressible Navier-Stokes solver to include a large-eddy simulation (LES) capability. In particular, we implement the Lagrangian dynamic subgrid scale model and compare our results against existing direct numerical simulation (DNS) data of a turbulent channel flow at Reτ = 180. Overall, our LES results match fairly well with the DNS data. Our results show that the Reτ = 180 case can be entirely simulated on a single GPU, whereas higher Reynolds cases can benefit from a GPU cluster

Crossref

Boise State University - ScholarWorks

CROSS-STACK PREDICTIVE CONTROL FRAMEWORK FOR MULTICORE REAL-TIME APPLICATIONS

Author: Cao Guangyi
NC DOCKS at The University of North Carolina at Charlotte
Publication venue
Publication date: 01/01/2014
Field of study

Many of the next generation applications in entertainment, human computer interaction, infrastructure, security and medical systems are computationally intensive, always-on, and have soft real time (SRT) requirements. While failure to meet deadlines is not catastrophic in SRT systems, missing deadlines can result in an unacceptable degradation in the quality of service (QoS). To ensure acceptable QoS under dynamically changing operating conditions such as changes in the workload, energy availability, and thermal constraints, systems are typically designed for worst case conditions. Unfortunately, such over-designing of systems increases costs and overall power consumption. In this dissertation we formulate the real-time task execution as a Multiple-Input, Single- Output (MISO) optimal control problem involving tracking a desired system utilization set point with control inputs derived from across the computing stack. We assume that an arbitrary number of SRT tasks may join and leave the system at arbitrary times. The tasks are scheduled on multiple cores by a dynamic priority multiprocessor scheduling algorithm. We use a model predictive controller (MPC) to realize optimal control. MPCs are easy to tune, can handle multiple control variables, and constraints on both the dependent and independent variables. We experimentally demonstrate the operation of our controller on a video encoder application and a computer vision application executing on a dual socket quadcore Xeon processor with a total of 8 processing cores. We establish that the use of DVFS and application quality as control variables enables operation at a lower power op- erating point while meeting real-time constraints as compared to non cross-stack control approaches. We also evaluate the role of scheduling algorithms in the control of homo- geneous and heterogeneous workloads. Additionally, we propose a novel adaptive control technique for time-varying workloads

The University of North Carolina at Greensboro