Search CORE

262 research outputs found

Comparing policy gradient and value function based reinforcement learning methods in simulated electrical power trade

Author: Burt Graeme
Galloway Stuart
Lincoln Richard
Stephen Bruce
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/02/2012
Field of study

In electrical power engineering, reinforcement learning algorithms can be used to model the strategies of electricity market participants. However, traditional value function based reinforcement learning algorithms suffer from convergence issues when used with value function approximators. Function approximation is required in this domain to capture the characteristics of the complex and continuous multivariate problem space. The contribution of this paper is the comparison of policy gradient reinforcement learning methods, using artificial neural networks for policy function approximation, with traditional value function based methods in simulations of electricity trade. The methods are compared using an AC optimal power flow based power exchange auction market model and a reference electric power system model

Crossref

University of Strathclyde Institutional Repository

Использование иммунного и генетического алгоритмов для оптимизации обучения нейронной сети

Author: Гергет Ольга Михайловна
Голенков В. В.
Publication venue
Publication date: 01/01/2015
Field of study

Nowadays, computer technologies are widely implemented and used in all areas of human activity, including in medicine. They can significantly improve the quality of healthcare by modeling a pathological process in a particular disease. A neural network can be trained to determine diseases, but training may take a long time because of the large number of indicators of human health, as well as increased demands on the accuracy of recognition. Training time can be reduced by using optimization algorithms presented in this article

Electronic archive of Tomsk Polytechnic University

Towards Automatic Learning of Heuristics for Mechanical Transformations of Procedural Code

Author: Carro Manuel
Mariño Julio
Tamarit Salvador
Vigueras Guillermo
Publication venue
Publication date: 09/03/2016
Field of study

The current trend in next-generation exascale systems goes towards integrating a wide range of specialized (co-)processors into traditional supercomputers. However, the integration of different specialized devices increases the degree of heterogeneity and the complexity in programming such type of systems. Due to the efficiency of heterogeneous systems in terms of Watt and FLOPS per surface unit, opening the access of heterogeneous platforms to a wider range of users is an important problem to be tackled. In order to bridge the gap between heterogeneous systems and programmers, in this paper we propose a machine learning-based approach to learn heuristics for defining transformation strategies of a program transformation system. Our approach proposes a novel combination of reinforcement learning and classification methods to efficiently tackle the problems inherent to this type of systems. Preliminary results demonstrate the suitability of the approach for easing the programmability of heterogeneous systems.Comment: Part of the Program Transformation for Programmability in Heterogeneous Architectures (PROHA) workshop, Barcelona, Spain, 12th March 2016, 9 pages, LaTe

arXiv.org e-Print Archive

Directory of Open Access Journals

On the Memory Properties of Recurrent Neural Models

Author: Benetos E
Garcez AD
IEEE
Russell AJ
Publication venue
Publication date: 15/02/2017
Field of study

Queen Mary Research Online

Scikit-learn: Machine Learning in Python

Author: Blondel Mathieu
Brucher Matthieu
Cournapeau David
Dubourg Vincent
Duchesnay Edouard
Gramfort Alexandre
Grisel Olivier
Michel Vincent
Passos Alexandre
Pedregosa Fabian
Perrot Matthieu
Prettenhofer Peter
Thirion Bertrand
Vanderplas Jake
Varoquaux Gaël
Weiss Ron
Publication venue: Microtome Publishing
Publication date: 12/10/2011
Field of study

International audienceScikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing machine learning to non-specialists using a general-purpose high-level language. Emphasis is put on ease of use, performance, documentation, and API consistency. It has minimal dependencies and is distributed under the simplified BSD license, encouraging its use in both academic and commercial settings. Source code, binaries, and documentation can be downloaded from http://scikit-learn.sourceforge.net

HAL Clermont Université

INRIA a CCSD electronic archive server

HAL-CEA