Real-time load balancing in an interactive multiplayer game server
In this paper we investigate optimal load balancing
strategies for a scalable parallel game server architecture.
Our work builds on an existing multi-threaded implementation
of the QuakeWorld game server: we investigate
the comparative effectiveness of different load-balancing
algorithms and determine how different metrics can be
used to analyse performance. We find that achieving optimal
QuakeWorld server performance is a trade-off between
consistently achieving an even workload distribution and
reducing intra-frame wait time, and that a combined set of
metrics is required to fully understand how load balancing
affects server performance.
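To make the trade-off concrete, here is a minimal C++ sketch of one plausible per-frame balancing strategy alongside the intra-frame wait metric. The greedy least-loaded heuristic, the per-entity cost model, and all names are illustrative assumptions, not the paper's implementation.

```cpp
// Sketch of per-frame load balancing across game-server worker threads.
// Entity costs and the greedy heuristic are assumed for illustration.
#include <algorithm>
#include <numeric>
#include <vector>

struct Entity { double cost; };  // estimated per-frame processing cost

// Greedy balancing: assign each entity to the currently least-loaded worker.
std::vector<std::vector<size_t>>
balance(const std::vector<Entity>& entities, size_t workers) {
    std::vector<std::vector<size_t>> assignment(workers);
    std::vector<double> load(workers, 0.0);
    for (size_t i = 0; i < entities.size(); ++i) {
        size_t w = std::min_element(load.begin(), load.end()) - load.begin();
        assignment[w].push_back(i);
        load[w] += entities[i].cost;
    }
    return assignment;
}

// Intra-frame wait time: total idle time of faster workers while the
// slowest worker finishes its share of the frame.
double intra_frame_wait(const std::vector<double>& per_worker_time) {
    double slowest = *std::max_element(per_worker_time.begin(),
                                       per_worker_time.end());
    double total = std::accumulate(per_worker_time.begin(),
                                   per_worker_time.end(), 0.0);
    return slowest * per_worker_time.size() - total;
}
```

A perfectly even distribution minimises the second metric by construction, but a balancer that rebalances every frame to achieve it can itself add overhead, which is the tension the abstract describes.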
Porting Decision Tree Algorithms to Multicore using FastFlow
The computer hardware industry has embraced multicores. On these machines,
extreme optimisation of sequential algorithms is no longer sufficient to
exploit the full power of the hardware, which can only be reached via
thread-level parallelism. Decision tree algorithms exhibit natural
concurrency that makes them well suited to parallelisation. This paper
presents an approach for easy yet efficient porting of an implementation
of the C4.5 algorithm to multicores. The parallel port requires minimal
changes to the original sequential code and achieves up to a 7x speedup on an Intel
dual quad-core machine.
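As an illustration of the loop-level concurrency C4.5 exposes, the sketch below evaluates candidate split attributes in parallel with FastFlow's ParallelFor. The information_gain function, its dummy body, and the surrounding structure are placeholders for illustration, not the code ported in the paper.

```cpp
// Sketch: the information gain of each candidate attribute at a tree node
// can be computed independently, so the scan parallelises naturally.
#include <ff/parallel_for.hpp>
#include <vector>

// Placeholder gain computation; a real implementation would scan the
// training data for the given attribute (assumed, for illustration only).
double information_gain(int attribute) {
    return 1.0 / (1.0 + attribute);  // dummy value so the sketch links
}

int best_attribute(int num_attributes, int num_workers) {
    std::vector<double> gain(num_attributes, 0.0);
    ff::ParallelFor pf(num_workers);
    // Evaluate all candidate splits concurrently across the workers.
    pf.parallel_for(0, num_attributes, [&](const long a) {
        gain[a] = information_gain(static_cast<int>(a));
    });
    int best = 0;
    for (int a = 1; a < num_attributes; ++a)
        if (gain[a] > gain[best]) best = a;
    return best;
}
```

Because the parallelism lives in one loop, this style of port leaves the surrounding sequential tree-building code essentially untouched, which matches the minimal-changes claim above.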
Structure-Aware Dynamic Scheduler for Parallel Machine Learning
Training large machine learning (ML) models with many variables or parameters
can take a long time if one employs sequential procedures even with stochastic
updates. A natural solution is to turn to distributed computing on a cluster;
however, naive, unstructured parallelization of ML algorithms does not usually
lead to a proportional speedup and can even result in divergence, because
dependencies between model elements can attenuate the computational gains from
parallelization and compromise correctness of inference. Recent efforts to address
this issue have benefited from exploiting the static, a priori block structures
residing in ML algorithms. In this paper, we take this path further by
exploring the dynamic block structures and workloads that arise during ML
program execution, which offers new opportunities for improving convergence,
correctness, and load balancing in distributed ML. We propose and showcase a
general-purpose scheduler, STRADS, for coordinating distributed updates in ML
algorithms, which harnesses these opportunities in a systematic
way. We provide theoretical guarantees for our scheduler and demonstrate its
efficacy versus static block structures on Lasso and Matrix Factorization.
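Below is a minimal sketch of the kind of dependency-aware dynamic scheduling described here, assuming a pairwise correlation matrix as the dependency measure and recent update magnitude as the priority signal; STRADS's actual mechanism may differ.

```cpp
// Illustrative dynamic scheduler: prefer parameters with large recent
// updates (workload / convergence), while rejecting candidates strongly
// coupled to already-selected ones (correctness). The correlation matrix,
// threshold, and priority rule are assumptions, not STRADS itself.
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

std::vector<size_t> schedule_block(
    const std::vector<double>& last_delta,        // recent update magnitudes
    const std::vector<std::vector<double>>& corr, // pairwise dependency
    double max_corr, size_t block_size) {
    // Order parameters by how much they changed in the last round.
    std::vector<size_t> order(last_delta.size());
    for (size_t i = 0; i < order.size(); ++i) order[i] = i;
    std::sort(order.begin(), order.end(), [&](size_t a, size_t b) {
        return std::fabs(last_delta[a]) > std::fabs(last_delta[b]);
    });
    // Greedily pick a block of nearly independent parameters that can be
    // updated in parallel without compromising inference.
    std::vector<size_t> block;
    for (size_t cand : order) {
        bool ok = true;
        for (size_t chosen : block) {
            if (std::fabs(corr[cand][chosen]) > max_corr) { ok = false; break; }
        }
        if (ok) block.push_back(cand);
        if (block.size() == block_size) break;
    }
    return block;
}
```

For Lasso, for instance, feature correlations are a natural stand-in for the dependency matrix: updating two highly correlated coefficients in parallel is exactly the kind of unstructured parallelism the abstract warns can cause divergence.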