77,399 research outputs found

    Evolutionary Algorithms for Reinforcement Learning

    Full text link
    There are two distinct approaches to solving reinforcement learning problems, namely, searching in value function space and searching in policy space. Temporal difference methods and evolutionary algorithms are well-known examples of these approaches. Kaelbling, Littman and Moore recently provided an informative survey of temporal difference methods. This article focuses on the application of evolutionary algorithms to the reinforcement learning problem, emphasizing alternative policy representations, credit assignment methods, and problem-specific genetic operators. Strengths and weaknesses of the evolutionary approach to reinforcement learning are presented, along with a survey of representative applications

    Self-Evaluation Applied Mathematics 2003-2008 University of Twente

    Get PDF
    This report contains the self-study for the research assessment of the Department of Applied Mathematics (AM) of the Faculty of Electrical Engineering, Mathematics and Computer Science (EEMCS) at the University of Twente (UT). The report provides the information for the Research Assessment Committee for Applied Mathematics, dealing with mathematical sciences at the three universities of technology in the Netherlands. It describes the state of affairs pertaining to the period 1 January 2003 to 31 December 2008

    Why Invest in Collaborative Leadership Development? Summary Report

    Get PDF
    The Casey Foundation values skillful leadership in creating sustained social change. The Foundation partnered with the University of Maryland, School of Public Policy in sculpting a new approach to match leadership ability with constructive results for children, families and communities -- a collaborative leadership style for complex social issues. Readers, especially other foundations and nonprofit investors, get a look at the findings, lessons learned and recommendations from three years of collaborative leadership capacity-building effort

    IMPROVING THE DEPENDABILITY OF DESTINATION RECOMMENDATIONS USING INFORMATION ON SOCIAL ASPECTS

    Get PDF
    Prior knowledge of the social aspects of prospective destinations can be very influential in making travel destination decisions, especially in instances where social concerns do exist about specific destinations. In this paper, we describe the implementation of an ontology-enabled Hybrid Destination Recommender System (HDRS) that leverages an ontological description of five specific social attributes of major Nigerian cities, and hybrid architecture of content-based and case-based filtering techniques to generate personalised top-n destination recommendations. An empirical usability test was conducted on the system, which revealed that the dependability of recommendations from Destination Recommender Systems (DRS) could be improved if the semantic representation of social attributes information of destinations is made a factor in the destination recommendation process

    System Support for Bandwidth Management and Content Adaptation in Internet Applications

    Full text link
    This paper describes the implementation and evaluation of an operating system module, the Congestion Manager (CM), which provides integrated network flow management and exports a convenient programming interface that allows applications to be notified of, and adapt to, changing network conditions. We describe the API by which applications interface with the CM, and the architectural considerations that factored into the design. To evaluate the architecture and API, we describe our implementations of TCP; a streaming layered audio/video application; and an interactive audio application using the CM, and show that they achieve adaptive behavior without incurring much end-system overhead. All flows including TCP benefit from the sharing of congestion information, and applications are able to incorporate new functionality such as congestion control and adaptive behavior.Comment: 14 pages, appeared in OSDI 200
    corecore