Search CORE

218,240 research outputs found

Low Emission Building Control with Zero-Shot Reinforcement Learning

Author: Abate Alessandro
Cullen Jonathan M.
Jeen Scott R.
Publication venue: 'Association for the Advancement of Artificial Intelligence (AAAI)'
Publication date: 05/03/2023
Field of study

Heating and cooling systems in buildings account for 31% of global energy use, much of which are regulated by Rule Based Controllers (RBCs) that neither maximise energy efficiency nor minimise emissions by interacting optimally with the grid. Control via Reinforcement Learning (RL) has been shown to significantly improve building energy efficiency, but existing solutions require access to building-specific simulators or data that cannot be expected for every building in the world. In response, we show it is possible to obtain emission-reducing policies without such knowledge a priori--a paradigm we call zero-shot building control. We combine ideas from system identification and model-based RL to create PEARL (Probabilistic Emission-Abating Reinforcement Learning) and show that a short period of active exploration is all that is required to build a performant model. In experiments across three varied building energy simulations, we show PEARL outperforms an existing RBC once, and popular RL baselines in all cases, reducing building emissions by as much as 31% whilst maintaining thermal comfort. Our source code is available online via https://enjeeneer.io/projects/pearl/Comment: Accepted at AAAI 2023. Code available via https://enjeeneer.io/projects/pearl

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning

Author: Oliehoek Frans A.
van der Pol Elise
van Hoof Herke
Welling Max
Worrall Daniel E.
Publication venue
Publication date: 01/01/2021
Field of study

This paper introduces MDP homomorphic networks for deep reinforcement learning. MDP homomorphic networks are neural networks that are equivariant under symmetries in the joint state-action space of an MDP. Current approaches to deep reinforcement learning do not usually exploit knowledge about such structure. By building this prior knowledge into policy and value networks using an equivariance constraint, we can reduce the size of the solution space. We specifically focus on group-structured symmetries (invertible transformations). Additionally, we introduce an easy method for constructing equivariant network layers numerically, so the system designer need not solve the constraints by hand, as is typically done. We construct MDP homomorphic MLPs and CNNs that are equivariant under either a group of reflections or rotations. We show that such networks converge faster than unstructured baselines on CartPole, a grid world and Pong

arXiv.org e-Print Archive

International Migration, Integration and Social Cohesion online publications

UvA-DARE

A CASE STUDY INVESTIGATING RULE BASED DESIGN IN AN INDUSTRIAL SETTING

Author: Chavali Siva
Publication venue: Clemson University Libraries
Publication date: 11/12/2007
Field of study

This thesis presents a case study on the implementation of a rule based design (RBD) process for an engineer-to-order (ETO) company. The time taken for programming and challenges associated with this process are documented in order to understand the benefits and limitations of RBD. These times are obtained while developing RBD programs for grid assemblies of bottle packaging machines that are designed and manufactured by Hartness International (HI). In this project, commercially available computer-aided design (CAD) and RBD software are integrated to capture the design and manufacturing knowledge used to automate the grid design process of HI. The stages involved in RBD automation are identified as CAD modeling, knowledge acquisition, capturing parameters, RBD programming, debugging, and testing, and production deployment. The stages and associated times in RBD program development process are recorded for eighteen different grid products. Empirical models are developed to predict development times of RBD program, specifically enabling HI to estimate their return on investment. The models are demonstrated for an additional grid product where the predicted time is compared to actual RBD program time, falling within 20% of each other. This builds confidence in the accuracy of the models. Modeling guidelines for preparing CAD models are also presented to help in RBD program development. An important observation from this case study is that a majority of the time is spent capturing information about product during the knowledge acquisition stage, where the programmer\u27s development of a RBD program is dependent upon the designer\u27s product knowledge. Finally, refining these models to include other factors such as time for building CAD models, programmers experience with the RBD software (learning curve), and finally extending these models to other product domains are identified possible areas of future work

Clemson University: TigerPrints

Recommended from our members

Smart labs and social practice: social tools for pervasive laboratory workspaces: a position paper

Author: Brasher Andrew
Joinson Adam
McAndrew Patrick
Minocha Shailey
Taylor Josie
Publication venue
Publication date: 06/01/2006
Field of study

The emergence of pervasive and ubiquitous computing stimulates a view of future work environments where sharing of information, data and knowledge is easy and commonplace, particularly in highly interactive settings. Much of the work in this area focuses on tool development to support activities such as data collection, data recording and sharing, and so on. We are interested in this kind of technical development, which is both challenging and essential for science communities. But we are also interested in a broader interpretation of knowledge sharing and the human/social side of tools we develop to support this. We are keen to know more about how groups of different kinds of scientists can make their work understandable and shareable with each other in a multidisciplinary setting. This is a complex task because boundaries and barriers can emerge between disciplines engendered by differences in discourses and practices, which may not easily translate into other discipline areas. In the worst case, there may be some hostility between disciplines, or at least doubt and scepticism. Nevertheless, sharing approaches to research, research expertise, data and methods across disciplines can be a very fruitful exercise, and encouragement to engage in this activity is particularly pertinent in the digital era. Issues of privacy and security are also key aspects – knowing when and how to release data or information to other groups is crucial to providing a safe environment for people to work, and there are several sensitivities to be explored here. In this paper we describe an evolving situation that captures many of these issues, which we aim to track longitudinally

Open Research Online (The Open University)

The Knowledge Life Cycle for e-learning

Author: Davis Hugh
Doody Karl
Millard David
Tao Feng
Woukeu Arouna
Publication venue
Publication date: 01/01/2006
Field of study

In this paper, we examine the semantic aspects of e-learning from both pedagogical and technological points of view. We suggest that if semantics are to fulfil their potential in the learning domain then a paradigm shift in perspective is necessary, from information-based content delivery to knowledge-based collaborative learning services. We propose a semantics driven Knowledge Life Cycle that characterises the key phases in managing semantics and knowledge, show how this can be applied to the learning domain and demonstrate the value of semantics via an example of knowledge reuse in learning assessment management

CiteSeerX

Southampton (e-Prints Soton)

Technologies may help thinking

Author: Costa Fernando Albuquerque
Cruz Elisabete
Ferreira Rute
Publication venue: Associação de Professores de Sintra
Publication date: 01/01/2006
Field of study

The objective of teachers’ personal and professional development is an excellent reason to reflect upon the innovation issues in education and a rare opportunity to implement the use of portfolios in the teaching practices. The most recent developments of digital technologies allow experiencing new organisational and knowledge building that state the diversity and multiplicity of purposes, both alone and as a group. From the reflection on these two aspects comes up the present proposal for the analysis and evaluation of the technologies which may easily be accessed by the educational community and may be used in the process of electronic portfolios building. In what teachers are concerned the use of portfolios can become a powerful means helping the change of the educational practices (Cardoso, Peixoto, Serrano and Moreira, 1996) if it is adopted as a metacognitive and reflexive strategy about teaching about them (Galvão, 2005). However there is a lack of information about what portfolios are, which technologies can be used, how they are prepared and how to take advantage of them. All these questions point out to the need of a specific training in this field. Accordingly, this chapter especially aims at helping teachers in that process, providing an analysis and evaluation technologies grid based on their pedagogical potentialities for the building of digital portfolios

Universidade de Lisboa: Repositório.UL

Recommended from our members

Building thermal load prediction through shallow machine learning and deep learning

Author: Hong T
Piette MA
Wang Z
Publication venue: eScholarship, University of California
Publication date: 01/04/2020
Field of study

Building thermal load prediction informs the optimization of cooling plant and thermal energy storage. Physics-based prediction models of building thermal load are constrained by the model and input complexity. In this study, we developed 12 data-driven models (7 shallow learning, 2 deep learning, and 3 heuristic methods) to predict building thermal load and compared shallow machine learning and deep learning. The 12 prediction models were compared with the measured cooling demand. It was found XGBoost (Extreme Gradient Boost) and LSTM (Long Short Term Memory) provided the most accurate load prediction in the shallow and deep learning category, and both outperformed the best baseline model, which uses the previous day's data for prediction. Then, we discussed how the prediction horizon and input uncertainty would influence the load prediction accuracy. Major conclusions are twofold: first, LSTM performs well in short-term prediction (1 h ahead) but not in long term prediction (24 h ahead), because the sequential information becomes less relevant and accordingly not so useful when the prediction horizon is long. Second, the presence of weather forecast uncertainty deteriorates XGBoost's accuracy and favors LSTM, because the sequential information makes the model more robust to input uncertainty. Training the model with the uncertain rather than accurate weather data could enhance the model's robustness. Our findings have two implications for practice. First, LSTM is recommended for short-term load prediction given that weather forecast uncertainty is unavoidable. Second, XGBoost is recommended for long term prediction, and the model should be trained with the presence of input uncertainty

eScholarship - University of California