34,126 research outputs found
Recommended from our members
A Double Error Dynamic Asymptote Model of Associative Learning
In this paper a formal model of associative learning is presented which incorporates representational and computational mechanisms that, as a coherent corpus, empower it to make accurate predictions of a wide variety of phenomena that so far have eluded a unified account in learning theory. In particular, the Double Error Dynamic Asymptote (DDA) model introduces: 1) a fully-connected network architecture in which stimuli are represented as temporally clustered elements that associate to each other, so that elements of one cluster engender activity on other clusters, which naturally implements neutral stimuli associations and mediated learning; 2) a predictor error term within the traditional error correction rule (the double error), which reduces the rate of learning for expected predictors; 3) a revaluation associability rate that operates on the assumption that the outcome predictiveness is tracked over time so that prolonged uncertainty is learned, reducing the levels of attention to initially surprising outcomes; and critically 4) a biologically plausible variable asymptote, which encapsulates the principle of Hebbian learning, leading to stronger associations for similar levels of cluster activity. The outputs of a set of simulations of the DDA model are presented along with empirical results from the literature. Finally, the predictive scope of the model is discussed
Recommended from our members
Load Frequency Control: A Deep Multi-Agent Reinforcement Learning Approach
The paradigm shift in energy generation towards microgrid-based architectures is changing the landscape of the energy control structure heavily in distribution systems. More specifically, distributed generation is deployed in the network demanding decentralised control mechanisms to ensure reliable power system operations. In this work, a Multi-Agent Reinforcement Learning approach is proposed to deliver an agentbased solution to implement load frequency control without the need of a centralised authority. Multi-Agent Deep Deterministic Policy Gradient is used to approximate the frequency control at the primary and the secondary levels. Each generation unit is represented as an agent that is modelled by a Recurrent Neural Network. Agents learn the optimal way of acting and interacting with the environment to maximise their long term performance and to balance generation and load, thus restoring frequency. In this paper we prove using three test systems, with two, four and eight generators, that our Multi-Agent Reinforcement Learning approach can efficiently be used to perform frequency control in a decentralised way
Recommended from our members
Cost Efficient Distributed Load Frequency Control in Power Systems
The introduction of new technologies and increased penetration of renewable resources is altering the power distribution landscape which now includes a larger numbers of micro-generators. The centralized strategies currently employed for performing frequency control in a cost efficient way need to be revisited and decentralized to conform with the increase of distributed generation in the grid. In this paper, the use of Multi-Agent and Multi-Objective Reinforcement Learning techniques to train models to perform cost efficient frequency control through decentralized decision making is proposed. More specifically, we cast the frequency control problem as a Markov Decision Process and propose the use of reward composition and action composition multi-objective techniques and compare the results between the two. Reward composition is achieved by increasing the dimensionality of the reward function, while action composition is achieved through linear combination of actions produced by multiple single objective models. The proposed framework is validated through comparing the observed dynamics with the acceptable limits enforced in the industry and the cost optimal setups
Cooler and bigger than thought? Planetary host stellar parameters from the InfraRed Flux Method
Effective temperatures and radii for 92 planet-hosting stars as determined
from the InfraRed Flux Method (IRFM) are presented and compared with those
given by other authors using different approaches. The IRFM temperatures we
have derived are systematically lower than those determined from the
spectroscopic condition of excitation equilibrium, the mean difference being as
large as 110 K. They are, however, consistent with previous IRFM studies and
with the colors derived from Kurucz and MARCS model atmospheres. Comparison
with direct measurements of stellar diameters for 7 dwarf stars, which
approximately cover the range of temperatures of the planet-hosting stars,
suggest that the IRFM radii and temperatures are reliable in an absolute scale.
A better understanding of the fundamental properties of the stars with planets
will be achieved once this discrepancy between the IRFM and the spectroscopic
temperature scales is resolved.Comment: 15 pages, 4 figures. Accepted for publication in Ap
Geometrical and spectral study of beta-skeleton graphs
We perform an extensive numerical analysis of beta-skeleton graphs, a particular type of proximity graphs. In beta-skeleton graph (BSG) two vertices are connected if a proximity rule, that depends of the parameter beta is an element of (0, infinity), is satisfied. Moreover, for beta > 1 there exist two different proximity rules, leading to lune-based and circle-based BSGs. First, by computing the average degree of large ensembles of BSGs we detect differences, which increase with the increase of beta, between lune-based and circle-based BSGs. Then, within a random matrix theory (RMT) approach, we explore spectral and eigenvector properties of random BSGs by the use of the nearest-neighbor energy-level spacing distribution and the entropic eigenvector localization length, respectively. The RMT analysis allows us to conclude that a localization transition occurs at beta = 1
Synonymy of Rhamphidera Skelley with Bancous Pic, termitophilous fungus beetles (Coleoptera: Erotylidae).
ThegenusBancousPic, originally described in the Heteromera (Rhysopaussidae) and later transferred to Cucujiformia (incertae sedis), was found to be congeneric with Rhamphidera Skelley (Erotylidae). Bancous is here placed in the family Erotylidae (Erotylinae, Tritomini) and Rhamphidera is moved into synonymy. This synonymy creates two new combinations: Bancous perplexus (Skelley) and Bancous eureka (Skelley). Bancous is redescribed and a lectotype is designated for Bancous irregularis Pic
Recommended from our members
Quantum Probability and Operant Conditioning: Behavioral Uncertainty in Reinforcement Learning
An implicit assumption in the study of operant conditioning and reinforcement learning is that behavior is stochastic, in that it depends on the probability that an outcome follows a response and on how the presence or absence of the output affects the frequency of the response. In this paper we argue that classical probability is not the right tool to represent uncertainty operant conditioning and propose an interpretation of behavioral states in terms of quantum probability instead
- …