Search CORE

12 research outputs found

Recommended from our members

Smart Computer Security Audit: Reinforcement Learning with a Deep Neural Network Approximator

Author: Alonso E.
Jones K.
Pozdniakov K.
Stankovic V.
Tam K.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 14/07/2020
Field of study

A significant challenge in modern computer security is the growing skill gap as intruder capabilities increase, making it necessary to begin automating elements of penetration testing so analysts can contend with the growing number of cyber threats. In this paper, we attempt to assist human analysts by automating a single host penetration attack. To do so, a smart agent performs different attack sequences to find vulnerabilities in a target system. As it does so, it accumulates knowledge, learns new attack sequences and improves its own internal penetration testing logic. As a result, this agent (AgentPen for simplicity) is able to successfully penetrate hosts it has never interacted with before. A computer security administrator using this tool would receive a comprehensive, automated sequence of actions leading to a security breach, highlighting potential vulnerabilities, and reducing the amount of menial tasks a typical penetration tester would need to execute. To achieve autonomy, we apply an unsupervised machine learning algorithm, Q-learning, with an approximator that incorporates a deep neural network architecture. The security audit itself is modelled as a Markov Decision Process in order to test a number of decisionmaking strategies and compare their convergence to optimality. A series of experimental results is presented to show how this approach can be effectively used to automate penetration testing using a scalable, i.e. not exhaustive, and adaptive approach

City Research Online

Smart Security Audit: Reinforcement Learning with a Deep Neural Network Approximator

Author: Alonso E
Jones K
Pozdniakov K
Stankovic V
Tam K
Publication venue
Publication date: 15/06/2020
Field of study

No embargo require

Crossref

Plymouth Electronic Archive and Research Library

A novel switched model predictive control of wind turbines using artificial neural network-Markov chains prediction with load mitigation

Author: Fernández-Ramírez Luis M.
Kamal Tariq
Pervez Mahum
Publication venue: 'Elsevier BV'
Publication date: 01/03/2022
Field of study

The existing model predictive control algorithm based on continuous control using quadratic programming is currently one of the most used modern control strategies applied to wind turbines. However, heavy computational time involved and complexity in implementation are still obstructions in existing model predictive control algorithm. Owing to this, a new switched model predictive control technique is developed for the control of wind turbines with the ability to reduce complexity while maintaining better efficiency. The proposed technique combines model predictive control operating on finite control set and artificial intelligence with reinforcement techniques (Markov Chains, MC) to design a new effective control law which allows to achieve the control objectives in different wind speed zones with minimization of computational complexity. The proposed method is compared with the existing model predictive control algorithm, and it has been found that the proposed algorithm is better in terms of computational time, load mitigation, and dynamic response. The proposed research is a forward step towards refining modern control techniques to achieve optimization in nonlinear process control using novel hybrid structures based on conventional control laws and artificial intelligence.© 2021 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)fi=vertaisarvioitu|en=peerReviewed

Osuva

Repositorio de Objetos de Docencia e Investigación de la Universidad de Cádiz

Recommended from our members

Resource-Aware Predictive Models in Cyber-Physical Systems

Author: AMIR MARAL
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

Cyber-Physical Systems (CPS) are composed of computing devices interacting with physical systems. Model-based design is a powerful methodology in CPS design in the implementation of control systems. For instance, Model Predictive Control (MPC) is typically implemented in CPS applications, e.g., in path tracking of autonomous vehicles. MPC deploys a model to estimate the behavior of the physical system at future time instants for a specific time horizon. Ordinary Differential Equations (ODE) are the most commonly used models to emulate the behavior of continuous-time (non-)linear dynamical systems. A complex physical model may comprise thousands of ODEs that pose scalability, performance and power consumption challenges. One approach to address these model complexity challenges are frameworks that automate the development of model-to-model transformation. In this dissertation, a state-based model with tunable parameters is proposed to operate as a reconfigurable predictive model of the physical system. Moreover, we propose a run-time switching algorithm that selects the best model using machine learning. We employed a metric that formulates the trade-off between the error and computational savings due to model reduction. Building statistical models are constrained to having expert knowledge and an actual understanding of the modeled phenomenon or process. Also, statistical models may not produce solutions that are as robust in a real-world context as factors outside the model, like disruptions would not be taken into account. Machine learning models have emerged as a solution to account for the dynamic behavior of the environment and automate intelligence acquisition and refinement. Neural networks are machine learning models, well-known to have the ability to learn linear and nonlinear relations between input and output variables without prior knowledge. However, the ability to efficiently exploit resource-hungry neural networks in embedded resource-bound settings is a major challenge.Here, we proposed Priority Neuron Network (PNN), a resource-aware neural networks model that can be reconfigured into smaller sub-networks at runtime. This approach enables a trade-off between the model's computation time and accuracy based on available resources. The PNN model is memory efficient since it stores only one set of parameters to account for various sub-network sizes. We propose a training algorithm that applies regularization techniques to constrain the activation value of neurons and assigns a priority to each one. We consider the neuron's ordinal number as our priority criteria in that the priority of the neuron is inversely proportional to its ordinal number in the layer. This imposes a relatively sorted order on the activation values. We conduct experiments to employ our PNN as the predictive model in a CPS application. We can see that not only our technique will resolve the memory overhead of DNN architectures but it also reduces the computation overhead for the training process substantially. The training time is a critical matter especially in embedded systems where many NN models are trained on the fly

eScholarship - University of California

Recommended from our members

A machine learning approach for smart computer security audit

Author: Pozdniakov K.
Publication venue
Publication date
Field of study

This thesis presents a novel application of machine learning technology to automate network security audit and penetration testing processes in particular. A model-free reinforcement learning approach is presented. It is characterized by the absence of the environmental model. The model is derived autonomously by the audit system while acting in the tested computer network. The penetration testing process is specified as a Markov decision process (MDP) without definition of reward and transition functions for every state/action pair. The presented approach includes application of traditional and modified Q-learning algorithms. A traditional Q-learning algorithm learns the action-value function stored in the table, which gives the expected utility of executing a particular action in a particular state of the penetration testing process. The modified Q-learning algorithm differs by incorporation of the state space approximator and representation of the action-value function as a linear combination of features. Two deep architectures of the approximator are presented: autoencoder joint with artificial neural network (ANN) and autoencoder joint with recurrent neural network (RNN). The autoencoder is used to derive the feature set defining audited hosts. ANN is intended to approximate the state space of the audit process based on derived features. RNN is a more advanced version of the approximator and differs by the existence of the additional loop connections from hidden to input layers of the neural network. Such architecture incorporates previously executed actions into new inputs. It gives the opportunity to audit system learn sequences of actions leading to the goal of the audit, which is defined as receiving administrator rights on the host. The model-free reinforcement learning approach based on traditional Q-learning algorithms was also applied to reveal new vulnerabilities, buffer overflow in particular. The penetration testing system showed the ability to discover a string, exploiting potential vulnerability, by learning its formation process on the go. In order to prove the concept and to test the efficiency of an approach, audit tool was developed. Presented results are intended to demonstrate the adaptivity of the approach, performance of the algorithms and deep machine learning architectures. Different sets of hyperparameters are compared graphically to test the ability of convergence to the optimal action policy. An action policy is a sequence of actions, leading to the audit goal (getting admin rights on the remote host). The testing environment is also presented. It consists of 80+ virtual machines based on a vSphere virtualization platform. This combination of hosts represents a typical corporate network with Users segment, Demilitarized zone (DMZ) and external segment (Internet). The network has typical corporate services available: web server, mail server, file server, SSH, SQL server. During the testing process, the audit system acts as an attacker from the Internet

City Research Online

Nonlinear moving-horizon state estimation for hardware implementation and a model predictive control application

Author: Brunello Rafael Koji Vatanabe
Publication venue
Publication date: 07/04/2021
Field of study

Dissertação (mestrado)—Universidade de Brasília, Faculdade de Tecnologia, Departamento de Engenharia Mecânica, 2021.Nesta dissertação, exploramos a aplicação de redes neurais artificiais de funções de base radial (RBFs) embutidas em hardware para estimação de estados e controle em tempo real utilizando os algoritmos de Moving-Horizon Estimation(MHE) e Model Predictive Control (MPC). Esses algoritmos foram posteriormente aproximados por RBFs e implementados em um Field Programmable Gate Array (FPGA), que tem mostrado bons resultados em termos de precisão e tempo ˜ computacional. Mostramos que a estimativa de estado usando a versão aproximada do MHE ˜ pode ser executada usando um kit em escala de laboratório de aproximadamente 500 kHz para ´ um pendulo invertido a uma taxa de clock de cerca de 110 MHz. A latência para fornecer uma estimativa pode ser reduzida ainda mais quando FPGAs com clocks mais altos são usados, pois a ˜ arquitetura da rede neural artificial e inerentemente paralela. Após uma inspeção mais detalhada, ˜ descobriu-se que era possível reduzir o custo da área de chip trocando a função de custo por uma ˜ com resultados mais facilmente representáveis. Ele poderia então utilizar uma representação em ˜ 32 bits e o modulo CORDIC poderia ser removido, usando apenas a aproximação mais simples da ˜ serie de Taylor de 2 ´ ª ordem. Em seguida, expandimos isso, investigando a ideia de usar uma única rede neural para substituir tanto o controle quanto o estimatidor de estados. Comparado a um MPC com informações completas, sua versão utilizando o MHE não teve um bom desempenho contra ˜ ruídos de saída. A princípio não foi possível aproximar o controle e a estimativa do pêndulo com um bom resultado, porem ao separar o controle em duas partes obtivemos melhores resultados. Por fim, verificamos que tal rede neural foi capaz de estabilizar o sistema de pendulo invertido, ˆ mas não de aproximar sua parte oscilante n ˜ ao linear. A solução aqui apresentada ˜ e encorajada a ser estendida para sistemas mais complexos e não lineares, uma vez que uma arquitetura com ˜ complexidade razoável é encontrada para a rede neural artificial para ser implementada.Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES).In this dissertation, we explore the application of radial basis functions (RBFs) artificial neural networks embedded in hardware for real-time estimation and control algorithms as the Moving- Horizon Estimation (MHE) and the Model Predictive Control (MPC). These algorithms are then approximated using RBFs and implemented in a Field Programmable Gate Array (FPGA), which has shown good results in terms of accuracy and computational time. We show that the state estimate using the approximate version of the MHE can be run using a laboratory-scale kit of approximately 500 kHz for an inverted pendulum at a clock rate of about 110 MHz. The latency to provide an estimate can be further reduced when FPGAs with higher clocks are used as the artificial neural network architecture is inherently parallel. Upon further inspection, it was found to be possible to reduce the chip area cost by switching the cost function for one with more easily representable results. It could then utilize a 32-bits representation and the CORDIC module could be removed, using instead only the simpler 2o order Taylor approximation. We then expand upon this, probing at the idea of using a single neural network to substitute both the control and state-estimation. Compared to a MPC with full information, its version utilizing the MHE did not perform well against output noises. At first, it was not possible to approximate the pendulum control and estimation with a good result, however when separating the control in two parts we gained better outcomes. Lastly, we verify that such a neural network was capable of stabilizing the inverted pendulum system, but not of approximating the non-linear swing-up part of it. The solution herein presented is encouraged to be further extended for more complex and nonlinear systems, given that an architecture is found for the artificial neural network with reasonable complexity to be implemented

Repositório Institucional da Universidade de Brasília

Arquiteturas de hardware para aceleração de algoritmos de controle preditivo não-linear

Author: Sampaio Renato Coral
Publication venue
Publication date: 26/10/2018
Field of study

Tese (doutorado)—Universidade de Brasília, Faculdade de Tecnologia, Departamento de Engenharia Mecânica, 2018.O Controle Preditivo Baseado em Modelos (MPC) é uma técnica avançada de controle que vem ganhando espaço tanto na academia quanto na indústria ao longo das últimas décadas. O fato de incorporar restrições em sua lei de controle e de poder ser aplicada tanto para sistemas lineares simples quanto para sistemas não-lineares complexos com múltiplas entradas e múltiplas saídas tornam seu emprego bastante atraente. Porém, seu alto custo computacional muitas vezes impede sua aplicação a sistemas com dinâmicas rápidas, principalmente a sistemas não-lineares embarcados onde há restrições computacionais e de consumo de energia. Baseado nisso, este trabalho se propõe a desenvolver algoritmos e arquiteturas em hardware capazes de viabilizar a aplicação do Controle Preditivo Não-Linear (NMPC) para sistemas embarcados. Duas abordagens são desenvolvidas ao longo do trabalho. A primeira aplica técnicas de aprendizado de máquina utilizando Redes Neurais Artificiais (RNAs) e Máquinas de Vetor de Suporte (SVMs) para criar soluções que aproximam o comportamento do NMPC em hardware. Neste caso, técnicas para o treinamento das RNAs e SVMs são exploradas com o intuito de generalizar uma solução capaz de lidar com uma ampla faixa de referências de controle. Em seguida, arquiteturas de hardware em ponto-flutuante para a implementação de RNAs do tipo RBF (Radial Basis Functions) e SVMs são desenvolvidas juntamente com configurador automático capaz de gerar os códigos VHDL (VHSIC Hardware Description Language) das respectivas arquiteturas baseado nos resultados de treinamento e sua topologia. As arquiteturas resultantes são testadas em um FPGA (Field-Programmable Gate Array) de baixo custo e são capazes de computar soluções em menos de 1 s. Na segunda abordagem, o algoritmo heurístico de Otimização por Enxame de Partículas (PSO), é estudado e adaptado para etapa de busca da sequência de controle ótima do NMPC. Dentre as modificações estão incluídas a adição de funções de penalização para obedecer às restrições de estados do sistema, o aprimoramento da técnica KPSO (Knowledge-Based PSO), denominada KPSO+SS, onde resultados de períodos de soluções de períodos amostragem anteriores são combinados com informações sobre o sinal de controle em estado estacionário e seus valores máximos e mínimos para agilizar a busca pela solução ótima. Mais uma vez, arquiteturas de hardware em ponto-flutuante são desenvolvidas para viabilizar a aplicação do controlador NMPC-PSO a sistemas embarcados. Um gerador de códigos da solução NMPC-PSO é proposto para permitir a aplicação da mesma arquitetura a outros sistemas. Em seguida, a solução é testada para o procedimento de swing-up do pêndulo invertido utilizando uma plataforma hardware-inthe- loop (HIL) e apresentou bom desempenho em tempo-real calculando a solução em menos de 3 ms. Finalmente, a solução NMPC-PSO é validada em um sistema de pêndulos gêmeos e outro sistema de controle de atitudes de um satélite.Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) e Decanato de Pesquisa e Inovação -(DPI/ UnB).Model-based Predictive Control (MPC) is an advanced control technique that has been gaining adoption in industry and the academy along the last few decades. Its ability to incorporate system constraints in the control law and be applied from simple linear systems up to more complex nonlinear systems with multiple inputs and outputs attracts its usage. However, the high computational cost associated with this technique often hinders its use, especially in embedded nonlinear systems with fast dynamics with computational and restrictions. Based on these facts, this work aims to study and develop algorithms and hardware architectures that can enable the application of Nonlinear Model Predictive Control (NMPC) on embedded systems. Two approaches are developed throughout this work. The first one applies machine learning techniques using Artificial Neural Networks (ANNs) and Support Vector Machines (SVMs) to create solutions that approximate the NMPC behavior in hardware. In this case, ANN and SVM training techniques are explored with the aim to generalize the control solution and work on a large range of reference control inputs. Next, floating-point hardware architectures to implement Radial Basis Function ANNs and SVM solutions are developed along with an automatic architectural configuration too, capable of generating the VHDL (VHSIC Hardware Description Language) codes based on the training results and its topology. Resulting architectures are tested on a low-cost FPGA (Field-Programmable Gate Array) and are capable of computing the solution in under 1 s. In a second approach, the Particle Swarm Optimization (PSO), which is a heuristic algorithm, is studied and adapted to perform the optimal control sequence search phase of the NMPC. Among the main optimizations performed are the addition of penalty functions to address the controlled system state constraints, an improved KPSO (Knowledge-Based PSO) technique named KPSO+SS, where results from previous sampling periods are combined with steady-state control information to speed-up the optimal solution search. Hardware architectures with floating-point arithmetic to enable the application of the NMPC-PSO solution on embedded systems are developed. Once again, a hardware description configuration tool is created to allow the architecture to be applied to multiple systems. Then, the solution is applied to a real-time inverted pendulum swing-up procedure tested on a hardware-in-the-loop (HIL) platform. The experiment yielding good performance and control results and was able to compute the solutions in under 3 ms. Finally, the NMPC-PSO solution is further validated performing a swing-up procedure on a Twin Pendulum system and then on a satellite control platform, a system with multiple inputs and output

Repositório Institucional da Universidade de Brasília

Machine Learning for Identification and Optimal Control of Advanced Automotive Engines.

Author: Janakiraman Vijay Manikandan
Publication venue
Publication date: 01/01/2013
Field of study

The complexity of automotive engines continues to increase to meet increasing performance requirements such as high fuel economy and low emissions. The increased sensing capabilities associated with such systems generate a large volume of informative data. With advancements in computing technologies, predictive models of complex dynamic systems useful for diagnostics and controls can be developed using data based learning. Such models have a short development time and can serve as alternatives to traditional physics based modeling. In this thesis, the modeling and control problem of an advanced automotive engine, the homogeneous charge compression ignition (HCCI) engine, is addressed using data based learning techniques. Several frameworks including design of experiments for data generation, identification of HCCI combustion variables, modeling the HCCI operating envelope and model predictive control have been developed and analyzed. In addition, stable online learning algorithms for a general class of nonlinear systems have been developed using extreme learning machine (ELM) model structure.PHDMechanical EngineeringUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/102392/1/vijai_1.pd

Deep Blue Documents at the University of Michigan

Elman Neural Networks in Model Predictive Control

Author: Sámek David
Publication venue: European Council for Modelling and Simulation (ECMS)
Publication date: 04/06/2012
Field of study

The goal of this paper is to present interesting way how to model and predict nonlinear systems using recurrent neural network. This type of artificial neural networks is underestimated and marginalized. Nevertheless, it offers superior modelling features at reasonable computational costs. This contribution is focused on Elman Neural Network, two-layered recurrent neural network. The abilities of this network are presented in the nonlinear system control. The task of the controller is to control the liquid level in the second of two interconnected cylindrical tanks. The mathematical model of the real-time system was derived in order to test predictor and consequently the controller in Matlab/Simulink simulations

Institutional repository of Tomas Bata University Library