Search CORE

34,854 research outputs found

Rational bidding using reinforcement learning: an application in automated resource allocation

Author: A. Sherstov
C. Watkins
D. Gode
D. Reeves
E. Medernach
H.J. Herik van den
I. Erev
K. Lai
L. Panait
M. He
M. Kearns
M. Wellman
P. Green
R. Luce
S. Kaplan
T. Saaty
W. Smith
Y. Shoham
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

The application of autonomous agents by the provisioning and usage of computational resources is an attractive research field. Various methods and technologies in the area of artificial intelligence, statistics and economics are playing together to achieve i) autonomic resource provisioning and usage of computational resources, to invent ii) competitive bidding strategies for widely used market mechanisms and to iii) incentivize consumers and providers to use such market-based systems. The contributions of the paper are threefold. First, we present a framework for supporting consumers and providers in technical and economic preference elicitation and the generation of bids. Secondly, we introduce a consumer-side reinforcement learning bidding strategy which enables rational behavior by the generation and selection of bids. Thirdly, we evaluate and compare this bidding strategy against a truth-telling bidding strategy for two kinds of market mechanisms – one centralized and one decentralized

Crossref

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Software institutes' Online Digital Archive

Collaborative signal and information processing for target detection with heterogeneous sensor networks

Author: He B.
Li M.
Lu Y.
Publication venue: 'OMICS Publishing Group'
Publication date: 01/01/2013
Field of study

In this paper, an approach for target detection and acquisition with heterogeneous sensor networks through strategic resource allocation and coordination is presented. Based on sensor management and collaborative signal and information processing, low-capacity low-cost sensors are strategically deployed to guide and cue scarce high performance sensors in the network to improve the data quality, with which the mission is eventually completed more efficiently with lower cost. We focus on the problem of designing such a network system in which issues of resource selection and allocation, system behaviour and capacity, target behaviour and patterns, the environment, and multiple constraints such as the cost must be addressed simultaneously. Simulation results offer significant insight into sensor selection and network operation, and demonstrate the great benefits introduced by guided search in an application of hunting down and capturing hostile vehicles on the battlefield

Crossref

Enlighten

Q-Strategy: A Bidding Strategy for Market-Based Allocation of Grid Services

Author: A. Sherstov
C. Watkins
D. Cliff
D. Gode
D. Minoli
E. Medernach
H.J. Herik van den
I. Erev
K. Lai
L. Panait
M. He
M. Wellman
P. Green
R. Luce
R. Wolski
S. Gjerstad
S. Kaplan
T. Saaty
W. Smith
Y. Shoham
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

The application of autonomous agents by the provisioning and usage of computational services is an attractive research field. Various methods and technologies in the area of artificial intelligence, statistics and economics are playing together to achieve i) autonomic service provisioning and usage of Grid services, to invent ii) competitive bidding strategies for widely used market mechanisms and to iii) incentivize consumers and providers to use such market-based systems. The contributions of the paper are threefold. First, we present a bidding agent framework for implementing artificial bidding agents, supporting consumers and providers in technical and economic preference elicitation as well as automated bid generation by the requesting and provisioning of Grid services. Secondly, we introduce a novel consumer-side bidding strategy, which enables a goal-oriented and strategic behavior by the generation and submission of consumer service requests and selection of provider offers. Thirdly, we evaluate and compare the Q-strategy, implemented within the presented framework, against the Truth-Telling bidding strategy in three mechanisms – a centralized CDA, a decentralized on-line machine scheduling and a FIFO-scheduling mechanisms

Crossref

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Software institutes' Online Digital Archive

Recommended from our members

A survey of simulation techniques in commerce and defence

Author: Eldabi T
Jahangirian M
Mustafee N
Naseer A
Stergioulas L
Young T
Publication venue: 'The Operational Research Society'
Publication date: 01/01/2008
Field of study

Despite the developments in Modelling and Simulation (M&S) tools and techniques over the past years, there has been a gap in the M&S research and practice in healthcare on developing a toolkit to assist the modellers and simulation practitioners with selecting an appropriate set of techniques. This study is a preliminary step towards this goal. This paper presents some results from a systematic literature survey on applications of M&S in the commerce and defence domains that could inspire some improvements in the healthcare. Interim results show that in the commercial sector Discrete-Event Simulation (DES) has been the most widely used technique with System Dynamics (SD) in second place. However in the defence sector, SD has gained relatively more attention. SD has been found quite useful for qualitative and soft factors analysis. From both the surveys it becomes clear that there is a growing trend towards using hybrid M&S approaches

Brunel University Research Archive

Markov Decision Processes with Applications in Wireless Sensor Networks: A Survey

Author: Alsheikh Mohammad Abu
Hoang Dinh Thai
Lin Shaowei
Niyato Dusit
Tan Hwee-Pink
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 04/01/2015
Field of study

Wireless sensor networks (WSNs) consist of autonomous and resource-limited devices. The devices cooperate to monitor one or more physical phenomena within an area of interest. WSNs operate as stochastic systems because of randomness in the monitored environments. For long service time and low maintenance cost, WSNs require adaptive and robust methods to address data exchange, topology formulation, resource and power optimization, sensing coverage and object detection, and security challenges. In these problems, sensor nodes are to make optimized decisions from a set of accessible strategies to achieve design goals. This survey reviews numerous applications of the Markov decision process (MDP) framework, a powerful decision-making tool to develop adaptive algorithms and protocols for WSNs. Furthermore, various solution methods are discussed and compared to serve as a guide for using MDPs in WSNs

arXiv.org e-Print Archive

University of Canberra Research Repository

An Agent-based approach to modelling integrated product teams undertaking a design activity.

Author: Crowder Richard
Hughes Helen
Robinson Mark
Sim Yee Wai
Publication venue
Publication date: 01/09/2009
Field of study

The interactions between individual designers, within integrated product teams, and the nature of design tasks, all have a significant impact upon how well a design task can be performed, and hence the quality of the resultant product and the time in which it can be delivered. In this paper we describe an ongoing research project which aims to model integrated product teams through the use of multi-agent systems. We first describe the background and rationale for our work, and then present our initial computational model and results from the simulation of an integrated product team. The paper concludes with a discussion of how the model will evolve to improve the accuracy of the simulation

Southampton (e-Prints Soton)

Power Load Management as a Computational Market

Author: Akkermans Hans
Ygge Fredrik
Publication venue: University of Twente, Centre for Telematics and Information Technology
Publication date: 01/01/1996
Field of study

Power load management enables energy utilities to reduce peak loads and thereby save money. Due to the large number of different loads, power load management is a complicated optimization problem. We present a new decentralized approach to this problem by modeling direct load management as a computational market. Our simulation results demonstrate that our approach is very efficient with a superlinear rate of convergence to equilibrium and an excellent scalability, requiring few iterations even when the number of agents is in the order of one thousand. Aframework for analysis of this and similar problems is given which shows how nonlinear optimization and numerical mathematics can be exploited to characterize, compare, and tailor problem-solving strategies in market-oriented programming

Blekinge Institute of Technology

CiteSeerX

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Electronic Research Archive - Blekinge Tekniska Högskola

University of Twente Research Information

A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning

Author: Li Zhe
Lin Sheng
Liu Ning
Qiu Qinru
Tang Jian
Wang Yanzhi
Xu Jielong
Xu Zhiyuan
Publication venue
Publication date: 11/08/2017
Field of study

Automatic decision-making approaches, such as reinforcement learning (RL), have been applied to (partially) solve the resource allocation problem adaptively in the cloud computing system. However, a complete cloud resource allocation framework exhibits high dimensions in state and action spaces, which prohibit the usefulness of traditional RL techniques. In addition, high power consumption has become one of the critical concerns in design and control of cloud computing systems, which degrades system reliability and increases cooling cost. An effective dynamic power management (DPM) policy should minimize power consumption while maintaining performance degradation within an acceptable level. Thus, a joint virtual machine (VM) resource allocation and power management framework is critical to the overall cloud computing system. Moreover, novel solution framework is necessary to address the even higher dimensions in state and action spaces. In this paper, we propose a novel hierarchical framework for solving the overall resource allocation and power management problem in cloud computing systems. The proposed hierarchical framework comprises a global tier for VM resource allocation to the servers and a local tier for distributed power management of local servers. The emerging deep reinforcement learning (DRL) technique, which can deal with complicated control problems with large state space, is adopted to solve the global tier problem. Furthermore, an autoencoder and a novel weight sharing structure are adopted to handle the high-dimensional state space and accelerate the convergence speed. On the other hand, the local tier of distributed server power managements comprises an LSTM based workload predictor and a model-free RL based power manager, operating in a distributed manner.Comment: accepted by 37th IEEE International Conference on Distributed Computing (ICDCS 2017

arXiv.org e-Print Archive

Crossref