49 research outputs found

    A Parallel Incremental Learning Algorithm for Neural Networks with Fault Tolerance

    No full text
    URL : http://vecpar.fe.up.pt/2008/papers/46.pdfInternational audienceThis paper presents a parallel and fault tolerant version of an incremental learning algorithm for feed-forward neural networks used as function approximators. It has been shown in previous works that our incremental algorithm builds networks of reduced size while providing high quality approximations for real data sets. However, for very large sets, the use of our learning process on a single machine may be quite long and even sometimes impossible, due to memory limitations. The parallel algorithm presented in this paper is usable in any parallel system, and in particular, with large dynamical systems such as clusters and grids in which faults may occur. Finally, the quality and performances (without and with faults) of that algorithm are experimentally evaluated

    Basis Expansion in Natural Actor Critic Methods

    Get PDF
    International audienceIn reinforcement learning, the aim of the agent is to find a policy that maximizes its expected return. Policy gradient methods try to accomplish this goal by directly approximating the policy using a parametric function approximator; the expected return of the current policy is estimated and its parameters are updated by steepest ascent in the direction of the gradient of the expected return with respect to the policy parameters. In general, the policy is defined in terms of a set of basis functions that capture important features of the problem. Since the quality of the resulting policies directly depend on the set of basis func- tions, and defining them gets harder as the complexity of the problem increases, it is important to be able to find them automatically. In this paper, we propose a new approach which uses cascade-correlation learn- ing architecture for automatically constructing a set of basis functions within the context of Natural Actor-Critic (NAC) algorithms. Such basis functions allow more complex policies be represented, and consequently improve the performance of the resulting policies. We also present the effectiveness of the method empirically

    Extending and benchmarking the CasPer algorithm

    No full text

    A cascade network algorithm employing Progressive RPROP

    No full text

    Avoiding Roundoff Error in Backpropagating Derivatives

    No full text

    Constructing higher order neurons of increasing complexity in cascade networks

    No full text

    Implementing a Holistic Approach for the Smart City

    No full text

    Nonmonotonic parallel inheritance network

    No full text
    corecore