Search CORE

16,237 research outputs found

Preemptive Thread Block Scheduling with Online Structural Runtime Prediction for Concurrent GPGPU Kernels

Author: Govindarajan R.
Pai Sreepathi
Thazhuthaveetil Matthew J.
Publication venue
Publication date: 01/01/2014
Field of study

Recent NVIDIA Graphics Processing Units (GPUs) can execute multiple kernels concurrently. On these GPUs, the thread block scheduler (TBS) uses the FIFO policy to schedule their thread blocks. We show that FIFO leaves performance to chance, resulting in significant loss of performance and fairness. To improve performance and fairness, we propose use of the preemptive Shortest Remaining Time First (SRTF) policy instead. Although SRTF requires an estimate of runtime of GPU kernels, we show that such an estimate of the runtime can be easily obtained using online profiling and exploiting a simple observation on GPU kernels' grid structure. Specifically, we propose a novel Structural Runtime Predictor. Using a simple Staircase model of GPU kernel execution, we show that the runtime of a kernel can be predicted by profiling only the first few thread blocks. We evaluate an online predictor based on this model on benchmarks from ERCBench, and find that it can estimate the actual runtime reasonably well after the execution of only a single thread block. Next, we design a thread block scheduler that is both concurrent kernel-aware and uses this predictor. We implement the SRTF policy and evaluate it on two-program workloads from ERCBench. SRTF improves STP by 1.18x and ANTT by 2.25x over FIFO. When compared to MPMax, a state-of-the-art resource allocation policy for concurrent kernels, SRTF improves STP by 1.16x and ANTT by 1.3x. To improve fairness, we also propose SRTF/Adaptive which controls resource usage of concurrently executing kernels to maximize fairness. SRTF/Adaptive improves STP by 1.12x, ANTT by 2.23x and Fairness by 2.95x compared to FIFO. Overall, our implementation of SRTF achieves system throughput to within 12.64% of Shortest Job First (SJF, an oracle optimal scheduling policy), bridging 49% of the gap between FIFO and SJF.Comment: 14 pages, full pre-review version of PACT 2014 poste

arXiv.org e-Print Archive

CiteSeerX

Crossref

A Graph-Partition-Based Scheduling Policy for Heterogeneous Architectures

Author: Lohmann Daniel
Schröder-Preikschat Wolfgang
Wu Hao
Publication venue
Publication date: 26/02/2015
Field of study

In order to improve system performance efficiently, a number of systems choose to equip multi-core and many-core processors (such as GPUs). Due to their discrete memory these heterogeneous architectures comprise a distributed system within a computer. A data-flow programming model is attractive in this setting for its ease of expressing concurrency. Programmers only need to define task dependencies without considering how to schedule them on the hardware. However, mapping the resulting task graph onto hardware efficiently remains a challenge. In this paper, we propose a graph-partition scheduling policy for mapping data-flow workloads to heterogeneous hardware. According to our experiments, our graph-partition-based scheduling achieves comparable performance to conventional queue-base approaches.Comment: Presented at DATE Friday Workshop on Heterogeneous Architectures and Design Methods for Embedded Image Systems (HIS 2015) (arXiv:1502.07241

arXiv.org e-Print Archive

CiteSeerX

Managing Climatic Risks to Combat Land Degradation and Enhance Food security: Key Information Needs

Author: Aggarwal P.K.
Baethegan W.E.
Cooper P.
Gommes R.
Lee B.
Meinke H.B.
Rathore L.S.
Sivakumar M.V.K.
Publication venue: Elsevier B.V.
Publication date: 01/01/2010
Field of study

This paper discusses the key information needs to reduce the negative impacts of weather variability and climate change on land degradation and food security, and identifies the opportunities and barriers between the information and services needed. It suggests that vulnerability assessments based on a livelihood concept that includes climate information and key socio-economic variables can overcome the narrow focus of common one-dimensional vulnerability studies. Both current and future climatic risks can be managed better if there is appropriate policy and institutional support together with technological interventions to address the complexities of multiple risks that agriculture has to face. This would require effective partnerships among agencies dealing with meteorological and hydrological services, agricultural research, land degradation and food security issues. In addition a state-of-the-art infrastructure to measure, record, store and disseminate data on weather variables, and access to weather and seasonal climate forecasts at desired spatial and temporal scales would be needed

ICRISAT Open Access Repository

Elsevier - Publisher Connector

Wageningen University & Research Publications

Consultation on fish genetic resources

Author: Casal C.M.V. (eds.)
Pullin R.S.V.
Publication venue
Publication date
Field of study

Naturally occurring fish genetic resources are of great importance for fisheries and aquaculture. Fish farmers and fishers face a future in which the diversity of their basic resources is under threat and the genetic composition of these resources will be increasingly reliant on human protection and manipulation. Fish genetic resources research, information and training in the context of existing and future activities are discussed.Fishery resources, Genetic resources, Germplasm conservation Pisces

Research Papers in Economics

Datacenter Traffic Control: Understanding Techniques and Trade-offs

Author: Noormohammadpour Mohammad
Raghavendra Cauligi S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 10/12/2017
Field of study

Datacenters provide cost-effective and flexible access to scalable compute and storage resources necessary for today's cloud computing needs. A typical datacenter is made up of thousands of servers connected with a large network and usually managed by one operator. To provide quality access to the variety of applications and services hosted on datacenters and maximize performance, it deems necessary to use datacenter networks effectively and efficiently. Datacenter traffic is often a mix of several classes with different priorities and requirements. This includes user-generated interactive traffic, traffic with deadlines, and long-running traffic. To this end, custom transport protocols and traffic management techniques have been developed to improve datacenter network performance. In this tutorial paper, we review the general architecture of datacenter networks, various topologies proposed for them, their traffic properties, general traffic control challenges in datacenters and general traffic control objectives. The purpose of this paper is to bring out the important characteristics of traffic control in datacenters and not to survey all existing solutions (as it is virtually impossible due to massive body of existing research). We hope to provide readers with a wide range of options and factors while considering a variety of traffic control mechanisms. We discuss various characteristics of datacenter traffic control including management schemes, transmission control, traffic shaping, prioritization, load balancing, multipathing, and traffic scheduling. Next, we point to several open challenges as well as new and interesting networking paradigms. At the end of this paper, we briefly review inter-datacenter networks that connect geographically dispersed datacenters which have been receiving increasing attention recently and pose interesting and novel research problems.Comment: Accepted for Publication in IEEE Communications Surveys and Tutorial

arXiv.org e-Print Archive

ZENODO

FigShare

Energy management in mobile devices with the cinder operating system

Author: Levis Philip
Mazieres David
Roy Arjun
Rumble Stephen M.
Stutsman Ryan
Zeldovich Nickolai
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2011
Field of study

We argue that controlling energy allocation is an increasingly useful and important feature for operating systems, especially on mobile devices. We present two new low-level abstractions in the Cinder operating system, reserves and taps, which store and distribute energy for application use. We identify three key properties of control -- isolation, delegation, and subdivision -- and show how using these abstractions can achieve them. We also show how the architecture of the HiStar information-flow control kernel lends itself well to energy control. We prototype and evaluate Cinder on a popular smartphone, the Android G1.National Science Foundation (U.S.) (grant #0831163)National Science Foundation (U.S.) (grant #0846014)Natural Sciences and Engineering Research Council of Canada (NSERC)King Abdullah University of Science and TechnologyMicrosoft ResearchNational Science Foundation (U.S.) (Cybertrust award CNS-0716806)National Science Foundation (U.S.) (POMI (Programmable Open Mobile Internet) 2020 Expedition Grant #0832820)T-Mobile US

CiteSeerX

DSpace@MIT

Crossref