57,401 research outputs found

    Shaping of molecular weight distribution by iterative learning probability density function control strategies

    Get PDF
    A mathematical model is developed for the molecular weight distribution (MWD) of free-radical styrene polymerization in a simulated semi-batch reactor system. The generation function technique and moment method are employed to establish the MWD model in the form of Schultz-Zimmdistribution. Both static and dynamic models are described in detail. In order to achieve the closed-loop MWD shaping by output probability density function (PDF) control, the dynamic MWD model is further developed by a linear B-spline approximation. Based on the general form of the B-spline MWD model, iterative learning PDF control strategies have been investigated in order to improve the MWD control performance. Discussions on the simulation studies show the advantages and limitations of the methodology

    Payload Oscillations Minimization via Open Loop Control.

    Get PDF
    The results of tests of payload oscillations, forced by linear control function which allows to minimize payload sway after acceleration phase and after overhead crane stopping are presented in this paper. The analysis of solution of this problem has been carried out. The algorithm of operation for real drive system which takes into account the possibilities of driving of an overhead crane is also presented. The impact of inaccuracies of measurement of the ropes length on minimizing a displacements of payload during the duty cycle is shown as well. The correctness of the method is confirmed by results both simulation and experimental tests

    Modelling and control of the flame temperature distribution using probability density function shaping

    Get PDF
    This paper presents three control algorithms for the output probability density function (PDF) control of the 2D and 3D flame distribution systems. For the 2D flame distribution systems, control methods for both static and dynamic flame systems are presented, where at first the temperature distribution of the gas jet flames along the cross-section is approximated. Then the flame energy distribution (FED) is obtained as the output to be controlled by using a B-spline expansion technique. The general static output PDF control algorithm is used in the 2D static flame system, where the dynamic system consists of a static temperature model of gas jet flames and a second-order actuator. This leads to a second-order closed-loop system, where a singular state space model is used to describe the dynamics with the weights of the B-spline functions as the state variables. Finally, a predictive control algorithm is designed for such an output PDF system. For the 3D flame distribution systems, all the temperature values of the flames are firstly mapped into one temperature plane, and the shape of the temperature distribution on this plane can then be controlled by the 3D flame control method proposed in this paper. Three cases are studied for the proposed control methods and desired simulation results have been obtained

    A new Potential-Based Reward Shaping for Reinforcement Learning Agent

    Full text link
    Potential-based reward shaping (PBRS) is a particular category of machine learning methods which aims to improve the learning speed of a reinforcement learning agent by extracting and utilizing extra knowledge while performing a task. There are two steps in the process of transfer learning: extracting knowledge from previously learned tasks and transferring that knowledge to use it in a target task. The latter step is well discussed in the literature with various methods being proposed for it, while the former has been explored less. With this in mind, the type of knowledge that is transmitted is very important and can lead to considerable improvement. Among the literature of both the transfer learning and the potential-based reward shaping, a subject that has never been addressed is the knowledge gathered during the learning process itself. In this paper, we presented a novel potential-based reward shaping method that attempted to extract knowledge from the learning process. The proposed method extracts knowledge from episodes' cumulative rewards. The proposed method has been evaluated in the Arcade learning environment and the results indicate an improvement in the learning process in both the single-task and the multi-task reinforcement learner agents

    Hedging Expected Losses on Derivatives in Electricity Futures Markets

    Full text link
    We investigate the problem of pricing and hedging derivatives of Electricity Futures contract when the underlying asset is not available. We propose to use a cross hedging strategy based on the Futures contract covering the larger delivery period. A quick overview of market data shows a basis risk for this market incompleteness. For that purpose we formulate the pricing problem in a stochastic target form along the lines of Bouchard and al. (2008), with a moment loss function. Following the same techniques as in the latter, we avoid to demonstrate the uniqueness of the value function by comparison arguments and explore convex duality methods to provide a semi-explicit solution to the problem. We then propose numerical results to support the new hedging strategy and compare our method to the Black-Scholes naive approach

    Max-Min SNR Signal Energy based Spectrum Sensing Algorithms for Cognitive Radio Networks with Noise Variance Uncertainty

    Full text link
    This paper proposes novel spectrum sensing algorithms for cognitive radio networks. By assuming known transmitter pulse shaping filter, synchronous and asynchronous receiver scenarios have been considered. For each of these scenarios, the proposed algorithm is explained as follows: First, by introducing a combiner vector, an over-sampled signal of total duration equal to the symbol period is combined linearly. Second, for this combined signal, the Signal-to-Noise ratio (SNR) maximization and minimization problems are formulated as Rayleigh quotient optimization problems. Third, by using the solutions of these problems, the ratio of the signal energy corresponding to the maximum and minimum SNRs are proposed as a test statistics. For this test statistics, analytical probability of false alarm (PfP_f) and detection (PdP_d) expressions are derived for additive white Gaussian noise (AWGN) channel. The proposed algorithms are robust against noise variance uncertainty. The generalization of the proposed algorithms for unknown transmitter pulse shaping filter has also been discussed. Simulation results demonstrate that the proposed algorithms achieve better PdP_d than that of the Eigenvalue decomposition and energy detection algorithms in AWGN and Rayleigh fading channels with noise variance uncertainty. The proposed algorithms also guarantee the desired Pf(Pd)P_f(P_d) in the presence of adjacent channel interference signals

    Feedback MPC for Torque-Controlled Legged Robots

    Full text link
    The computational power of mobile robots is currently insufficient to achieve torque level whole-body Model Predictive Control (MPC) at the update rates required for complex dynamic systems such as legged robots. This problem is commonly circumvented by using a fast tracking controller to compensate for model errors between updates. In this work, we show that the feedback policy from a Differential Dynamic Programming (DDP) based MPC algorithm is a viable alternative to bridge the gap between the low MPC update rate and the actuation command rate. We propose to augment the DDP approach with a relaxed barrier function to address inequality constraints arising from the friction cone. A frequency-dependent cost function is used to reduce the sensitivity to high-frequency model errors and actuator bandwidth limits. We demonstrate that our approach can find stable locomotion policies for the torque-controlled quadruped, ANYmal, both in simulation and on hardware.Comment: Paper accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019

    Dynamic autonomous intelligent control of an asteroid lander

    Get PDF
    One of the future flagship missions of the European Space Agency (ESA) is the asteroid sample return mission Marco-Polo. Although there have been a number of past missions to asteroids, a sample has never been successfully returned. The return of asteroid regolith to the Earth's surface introduces new technical challenges. This paper develops attitude control algorithms for the descent phase onto an asteroid in micro-gravity conditions and draws a comparison between the algorithms considered. Two studies are also performed regarding the Fault Detection Isolation and Recovery (FDIR) of the control laws considered. The potential of using Direct Adaptive Control (DAC) as a controller for the surface sampling process is also investigated. Use of a DAC controller incorporates increased levels of robustness by allowing realtime variation of control gains. This leads to better response to uncertainties encountered during missions
    corecore