Search CORE

3,650 research outputs found

GPU Resource Optimization and Scheduling for Shared Execution Environments

Author: Luley Ryan Seamus
Publication venue: SURFACE at Syracuse University
Publication date: 23/08/2020
Field of study

General purpose graphics processing units have become a computing workhorse for a variety of data- and compute-intensive applications, from large supercomputing systems for massive data analytics to small, mobile embedded devices for autonomous vehicles. Making effective and efficient use of these processors traditionally relies on extensive programmer expertise to design and develop kernel methods which simultaneously trade off task decomposition and resource exploitation. Often, new architecture designs force code refinements in order to continue to achieve optimal performance. At the same time, not all applications require full utilization of the system to achieve that optimal performance. In this case, the increased capability of new architectures introduces an ever-widening gap between the level of resources necessary for optimal performance and the level necessary to maintain system efficiency. The ability to schedule and execute multiple independent tasks on a GPU, known generally as concurrent kernel execution, enables application programmers and system developers to balance application performance and system efficiency. Various approaches to develop both coarse- and fine-grained scheduling mechanisms to achieve a high degree of resource utilization and improved application performance have been studied. Most of these works focus on mechanisms for the management of compute resources, while a small percentage consider the data transfer channels. In this dissertation, we propose a pragmatic approach to scheduling and managing both types of resources – data transfer and compute – that is transparent to an application programmer and capable of providing near-optimal system performance. Furthermore, the approaches described herein rely on reinforcement learning methods, which enable the scheduling solutions to be flexible to a variety of factors, such as transient application behaviors, changing system designs, and tunable objective functions. Finally, we describe a framework for the practical implementation of learned scheduling policies to achieve high resource utilization and efficient system performance

Syracuse University Research Facility and Collaborative Environment

Virtue integrated platform : holistic support for distributed ship hydrodynamic design

Author: Duffy Alex H.B.
Vassalos Dracos
Whitfield Ian R.
Wu Zichao
York Phil
Publication venue
Publication date: 01/08/2007
Field of study

Ship hydrodynamic design today is often still done in a sequential approach. Tools used for the different aspects of CFD (Computational Fluid Dynamics) simulation (e.g. wave resistance, cavitation, seakeeping, and manoeuvring), and even for the different levels of detail within a single aspect, are often poorly integrated. VIRTUE (the VIRtual Tank Utility in Europe) project has the objective to develop a platform that will enable various distributed CFD and design applications to be integrated so that they may operate in a unified and holistic manner. This paper presents an overview of the VIRTUE Integrated Platform (VIP), e.g. research background, objectives, current work, user requirements, system architecture, its implementation, evaluation, and current development and future work

University of Strathclyde Institutional Repository

Weiterentwicklung analytischer Datenbanksysteme

Author: Kipf Andreas Michael
Publication venue: Technische Universität München
Publication date
Field of study

This thesis contributes to the state of the art in analytical database systems. First, we identify and explore extensions to better support analytics on event streams. Second, we propose a novel polygon index to enable efficient geospatial data processing in main memory. Third, we contribute a new deep learning approach to cardinality estimation, which is the core problem in cost-based query optimization.Diese Arbeit trägt zum aktuellen Forschungsstand von analytischen Datenbanksystemen bei. Wir identifizieren und explorieren Erweiterungen um Analysen auf Eventströmen besser zu unterstützen. Wir stellen eine neue Indexstruktur für Polygone vor, die eine effiziente Verarbeitung von Geodaten im Hauptspeicher ermöglicht. Zudem präsentieren wir einen neuen Ansatz für Kardinalitätsschätzungen mittels maschinellen Lernens

Bio-inspired computation: where we stand and what's next

Author: Abouhawwash
Abraham
Afifi
Ahmadi-Javid
Al Amro
Al-Faris
Alba
Alba
Alba
Alba
Alba
Amine Bouhlel
Andrade
Andres
Andrés-Pérez
Andrés-Pérez
Antonio
Antonio
Antonio
Arcuri
Arnold
Arnold
Atashpaz-Gargari
Atencia
Auger
Auger
Awad
Awais
Baringo
Barrera
Barták
Basak
Beale
Bechikh
Bello-Orgaz
Ben-Tal
Bermejo
Bertsimas
Bessaou
Beume
Beyer
Bhosekar
Biamonte
Binitha
Biswas
Biswas
Blackwell
Blanchard
Bokrantz
Bonabeau
Bouhlel
Branke
Brest
Bucking
Burke
Burke
Bäck
Camacho
Camacho-Villalón
Cao
Cao
Cao
Carrasco
Chen
Chen
Chen
Cheng
Cheng
Cheng
Chica
Chicano
Choraś
Christelis
Ciliberto
Clerc
Cobb
Coello
Coello Coello
Coello Coello
Collette
Cowling
Cruz
Cuadra
Cui
Dantzig
Das
Das
Das
Das
De Falco
de França
De Jong
Deb
Deb
Deb
Deb
Del Ser
Demertzis
Derrac
Diez-Olivan
Dilek
Dorigo
Drugan
Du
Duan
Duarte
Durillo
Easum
Eberhart
Eiben
Eiben
Eichfelder
Elsayed
Engelbrecht
Epitropakis
Eskandar
Falcón-Cardona
Farina
Fazenda
Fiore
Fister
Fister
Fogel
Forrester
Fu
Gal
Gamarra
Gao
Garcia
García-Martínez
Gen
Gen
Ghaheri
Goh
Goldberg
Gong
Gong
Gonzalez-Pardo
Gonzalez-Pardo
González-Pardo
González-Pardo
Gonçalves
Grandell
Greene
Grobler
Gutjahr
Gálvez
Gómez
Gómez
Hadka
Han
Hansen
Hansen
Hansen
Hansen
He
Hellwig
Holland
Hong
Hooper
Hu
Hu
Huband
Hussain
Hussain
Igel
Igel
Inuiguchi
Ishibuchi
Ishibuchi
Jabbarpour
Jaimes
Jana
Janson
Janson
Jena
Jiang
Jiang
Jin
Jin
Jin
Jones
Jordehi
Joyce
Kalyanmoy
Kamyab
Kar
Kar
Karaboga
Karafotias
Karnan
Kashan
Kim
Komatsu
Kononova
Koza
Koziel
Koziel
Krasnogor
Krasnogor
Kuhn
Kusyk
Lara
Lara-Cabrera
Laszczyk
LaTorre
LaTorre
Lee
Lehman
Lehre
Leskinen
Li
Li
Li
Li
Li
Li
Li
Li
Li
Liang
Liang
Liang
Liang
Liao
Lim
Lin
Liu
Liu
Liu
Liu
Logenthiran
Logeswari
Lones
Lozano
Lu
Lucas
Lynn
Lynn
Lynn
López-Ibáńez
Ma
Maashi
Mahdavi
Mahdavi
Mahdavi
Mahfoud
Malikopoulos
Mallipeddi
Mallipeddi
Mandal
Mane
Martinez
Martí
Mashwani
Maul
Maučec
Mavrovouniotis
Mazzara
McClymont
Melcer
Mendiburu
Meuth
Miikkulainen
Molina
Molina
Molina
Montana
Moscato
Moscato
Moser
Mostaghim
Müller
Müller
Naldi
Nannen
Nebro
Neri
Neumann
Nguyen
Nguyen
Ni
Nogueira Collazo
Novoa-Hernández
Oliveira
Omidvar
Omidvar
Omidvar
Ong
Ong
Orgaz
Parpinelli
Passino
Payne
Peng
Peng
Pescador-Rojas
Piotrowski
Piotrowski
Piotrowski
Piotrowski
Pitzer
Pizzuti
Polakova
Potter
Potter
Pošík
Praditwong
Prebeg
Pétrowski
Qian
Qin
Qu
Qu
Qu
Qu
Queipo
Rajasekhar
Rakshit
Ramírez-Gallego
Rashedi
Ray
Rechenberg
Recio
Remde
Ross
Ross
Rothlauf
Roy
Sahinidis
Saka
Salcedo-Sanz
Salcedo-Sanz
Salcedo-Sanz
Salcedo-Sanz
Salcedo-Sanz
Sareni
Schumacher
Schutze
Schwefel
Senapati
Seredynski
Sergeyev
Shaker
Shang
Shen
Simon
Sivakumar
Smit
Smit
Smith
Smith
Soleimani
Srinivas
Stanley
Starzynski
Storn
Subbu
Subbu
Such
Suganthan
Suganthan
Suganthan
Suganthi
Suganuma
Sun
Sun
Sutton
Swan
Sörensen
Talbi
Tanabe
Tanabe
Tang
Tang
Tang
Tassiulas
Ter Braak
Thangavel
Thomsen
Tintner
Tomassini
Tricoire
Tsai
Tsang
Ursem
Vafaee
Verma
Verma
Vitaliy
Vrugt
Vrugt
Vrugt
Vrugt
Walker
Wang
Wang
Wang
Wang
Wang
Wang
Wari
Weber
Wedyan
Wessing
Weyland
Whitley
Whitley
Woldesenbet
Wolpert
Wu
Wu
Xiao
Xiong
Xue
Xue
Yang
Yang
Yang
Yang
Yang
Yang
Yang
Yang
Yang
Yang
Yanıkoğlu
Yannakakis
Yazdani
Yazdi
Yu
Yu
Yu
Yue
Zabinsky
Zainud-Deen
Zhang
Zhang
Zhang
Zhang
Zhao
Zhao
Zhao
Zhou
Zhou
Zhu
Zhuang
Zille
Zille
Zitzler
Özcan
Črepinšek
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

In recent years, the research community has witnessed an explosion of literature dealing with the adaptation of behavioral patterns and social phenomena observed in nature towards efficiently solving complex computational tasks. This trend has been especially dramatic in what relates to optimization problems, mainly due to the unprecedented complexity of problem instances, arising from a diverse spectrum of domains such as transportation, logistics, energy, climate, social networks, health and industry 4.0, among many others. Notwithstanding this upsurge of activity, research in this vibrant topic should be steered towards certain areas that, despite their eventual value and impact on the field of bio-inspired computation, still remain insufficiently explored to date. The main purpose of this paper is to outline the state of the art and to identify open challenges concerning the most relevant areas within bio-inspired optimization. An analysis and discussion are also carried out over the general trajectory followed in recent years by the community working in this field, thereby highlighting the need for reaching a consensus and joining forces towards achieving valuable insights into the understanding of this family of optimization techniques

Crossref

Middlesex University Research Repository

DR-NTU (Digital Repository of NTU)

Bio-inspired computation: where we stand and what's next

Author: Camacho D.
Camacho D.
Coello Coello C.
Coello Coello C.
Das S.
Das S.
Del Ser J.
Del Ser J.
Herrera F.
Herrera F.
Molina D.
Molina D.
Osaba E.
Osaba E.
Salcedo-Sanz S.
Salcedo-Sanz S.
Suganthan P.
Suganthan P.
Yang X.
Yang X.
Publication venue: Elsevier
Publication date: 01/01/2019
Field of study

Middlesex University Research Repository

Provably-Efficient and Internally-Deterministic Parallel Union-Find

Author: Alistarh Dan
Fedorov Alexander
Hashemi Diba
Nadiradze Giorgi
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2023
Field of study

Determining the degree of inherent parallelism in classical sequential algorithms and leveraging it for fast parallel execution is a key topic in parallel computing, and detailed analyses are known for a wide range of classical algorithms. In this paper, we perform the first such analysis for the fundamental Union-Find problem, in which we are given a graph as a sequence of edges, and must maintain its connectivity structure under edge additions. We prove that classic sequential algorithms for this problem are well-parallelizable under reasonable assumptions, addressing a conjecture by [Blelloch, 2017]. More precisely, we show via a new potential argument that, under uniform random edge ordering, parallel union-find operations are unlikely to interfere:

T

concurrent threads processing the graph in parallel will encounter memory contention

O(T^2 \cdot \log |V| \cdot \log |E|)

times in expectation, where

|E|

and

|V|

are the number of edges and nodes in the graph, respectively. We leverage this result to design a new parallel Union-Find algorithm that is both internally deterministic, i.e., its results are guaranteed to match those of a sequential execution, but also work-efficient and scalable, as long as the number of threads

T

O(|E|^{\frac{1}{3} - \varepsilon})

, for an arbitrarily small constant

\varepsilon > 0

, which holds for most large real-world graphs. We present lower bounds which show that our analysis is close to optimal, and experimental results suggesting that the performance cost of internal determinism is limited

arXiv.org e-Print Archive

IST Austria: PubRep (Institute of Science and Technology)

How Is a Moving Target Continuously Tracked Behind Occluding Cover?

Author: Grossberg Stephen
Publication venue: Boston University Center for Adaptive Systems and Department of Cognitive and Neural Systems
Publication date: 01/12/1995
Field of study

Office of Naval Research (N00014-95-1-0657, N00014-95-1-0409

Boston University Institutional Repository (OpenBU)

Aerospace medicine and biology: A continuing bibliography with indexes, supplement 204

Author
Publication venue
Publication date
Field of study

This bibliography lists 140 reports, articles, and other documents introduced into the NASA scientific and technical information system in February 1980

NASA Technical Reports Server