18 research outputs found

    On periodically pendulum-diven systems for underactuated locomotion: a viscoelastic jointed model

    Get PDF
    This paper investigates the locomotion principles and nonlinear dynamics of the periodically pendulum-driven (PD) systems using the case of a 2-DOF viscoelastic jointed model. As a mechanical system with underactuation degree one, the proposed system has strongly coupled nonlinearities and can be utilized as a potential benchmark for studying complicated PD systems. By mathematical modeling and non-dimensionalization of the physical system, an insight is obtained to the global system dynamics. The proposed 2-DOF viscoelastic jointed model establishes a commendable interconnection between the system dynamics and the periodically actuated force. Subsequently, the periodic locomotion principles of the actuated subsystem are elaborately studied and synthesized with the characteristic of viscoelastic element. Then the analysis of qualitative changes is conducted respectively under the varying excitation amplitude and frequency. Simulation results validate the efficiency and performance of the proposed system comparing with the conventional system

    Simultaneous state and input estimation with application to a two-link robotic system

    Full text link
    This paper addresses the problem of estimating simultaneously the state and input of a nonlinear system with application to a two link robotic manipulator - the Pendubot. The system nonlinearity comprises a Lipschitz function with respect to the state, and a nonlinear term which is a function of both the state and input. It is shown that under some conditions, an observer can be designed to estimate simultaneously the system&rsquo;s state and input. Simulation and experimental results, obtained around the inverted equilibrium position, are presented to demonstrate the validity of the approach.<br /

    A Practical and Conceptual Framework for Learning in Control

    No full text
    We propose a fully Bayesian approach for efficient reinforcement learning (RL) in Markov decision processes with continuous-valued state and action spaces when no expert knowledge is available. Our framework is based on well-established ideas from statistics and machine learning and learns fast since it carefully models, quantifies, and incorporates available knowledge when making decisions. The key ingredient of our framework is a probabilistic model, which is implemented using a Gaussian process (GP), a distribution over functions. In the context of dynamic systems, the GP models the transition function. By considering all plausible transition functions simultaneously, we reduce model bias, a problem that frequently occurs when deterministic models are used. Due to its generality and efficiency, our RL framework can be considered a conceptual and practical approach to learning models and controllers whe

    Model learning for trajectory tracking of robot manipulators

    Get PDF
    Abstract Model based controllers have drastically improved robot performance, increasing task accuracy while reducing control effort. Nevertheless, all this was realized with a very strong assumption: the exact knowledge of the physical properties of both the robot and the environment that surrounds it. This assertion is often misleading: in fact modern robots are modeled in a very approximate way and, more important, the environment is almost never static and completely known. Also for systems very simple, such as robot manipulators, these assumptions are still too strong and must be relaxed. Many methods were developed which, exploiting previous experiences, are able to refine the nominal model: from classic identification techniques to more modern machine learning based approaches. Indeed, the topic of this thesis is the investigation of these data driven techniques in the context of robot control for trajectory tracking. In the first two chapters, preliminary knowledge is provided on both model based controllers, used in robotics to assure precise trajectory tracking, and model learning techniques. In the following three chapters, are presented the novelties introduced by the author in this context with respect to the state of the art: three works with the same premise (an inaccurate system modeling), an identical goal (accurate trajectory tracking control) but with small differences according to the specific platform of application (fully actuated, underactuated, redundant robots). In all the considered architectures, an online learning scheme has been introduced to correct the nominal feedback linearization control law. Indeed, the method has been primarily introduced in the literature to cope with fully actuated systems, showing its efficacy in the accurate tracking of joint space trajectories also with an inaccurate dynamic model. The main novelty of the technique was the use of only kinematics information, instead of torque measurements (in general very noisy), to online retrieve and compensate the dynamic mismatches. After that the method has been extended to underactuated robots. This new architecture was composed by an online learning correction of the controller, acting on the actuated part of the system (the nominal partial feedback linearization), and an offline planning phase, required to realize a dynamically feasible trajectory also for the zero dynamics of the system. The scheme was iterative: after each trial, according to the collected information, both the phases were improved and then repeated until the task achievement. Also in this case the method showed its capability, both in numerical simulations and on real experiments on a robotics platform. Eventually the method has been applied to redundant systems: differently from before, in this context the task consisted in the accurate tracking of a Cartesian end effector trajectory. In principle very similar to the fully actuated case, the presence of redundancy slowed down drastically the learning machinery convergence, worsening the performance. In order to cope with this, a redundancy resolution was proposed that, exploiting an approximation of the learning algorithm (Gaussian process regression), allowed to locally maximize the information and so select the most convenient self motion for the system; moreover, all of this was realized with just the resolution of a quadratic programming problem. Also in this case the method showed its performance, realizing an accurate online tracking while reducing both the control effort and the joints velocity, obtaining so a natural behaviour. The thesis concludes with summary considerations on the proposed approach and with possible future directions of research

    Efficient Reinforcement Learning using Gaussian Processes

    Get PDF
    This book examines Gaussian processes (GPs) in model-based reinforcement learning (RL) and inference in nonlinear dynamic systems. First, we introduce PILCO, a fully Bayesian approach for efficient RL in continuous-valued state and action spaces when no expert knowledge is available. PILCO learns fast since it takes model uncertainties consistently into account during long-term planning and decision making. Thus, it reduces model bias, a common problem in model-based RL. Due to its generality and efficiency, PILCO is a conceptual and practical approach to jointly learning models and controllers fully automatically. Across all tasks, we report an unprecedented degree of automation and an unprecedented speed of learning. Second, we propose principled algorithms for robust filtering and smoothing in GP dynamic systems. Our methods are based on analytic moment matching and clearly advance state-of-the-art methods

    Intelligent model-based control of complex three-link mechanisms

    Get PDF
    The aim of this study is to understand the complexity and control challenges of the locomotion of a three-link mechanism of a robot system. In order to do this a three-link robot gymnast (Robogymnast) has been built in Cardiff University. The Robogymnast is composed of three links (one arm, one torso, one leg) and is powered by two geared DC motors. Currently the robot has three potentiometers to measure the relative angles between adjacent links and only one tachometer to measure the relative angular position of the first link. A mathematical model for the robot is derived using Lagrange equations. Since the model is inherently nonlinear and multivariate, it presents more challenges when modelling the Robogymnast and dealing with control motion problems. The proposed approach for dealing with the design of the control system is based on a discrete-time linear model around the upright position of the Robogymnast. To study the swinging motion of the Robogymnast, a new technique is proposed to manipulate the frequency and the amplitude of the sinusoidal signals as a means of controlling the motors. Due to the many combinations of the frequency and amplitude, an optimisation method is required to find the optimal set. The Bees Algorithm (BA), a novel swarm-based optimisation technique, is used to enhance the performance of the swinging motion through optimisation of the manipulated parameters of the control actions. The time taken to reach the upright position at its best is 128 seconds. Two different control methods are adopted to study the balancing/stablising of the Robogymnast in both the downward and upright configurations. The first is the optimal control algorithm using the Linear Quadratic Regulator (LQR) technique with integrators to help achieve and maintain the set of reference trajectories. The second is a combination of Local Control (LC) and LQR. Each controller is implemented via reduced order state observer to estimate the unmeasured states in terms of their relative angular velocities. From the identified data in the relative angular positions of the upright balancing control, it is reported that the maximum amplitude of the deviation in the relative angles on average are approximately 7.5° for the first link and 18° for the second link. It is noted that the third link deviated approximately by 2.5° using only the LQR controller, and no significant deviation when using the LQR with LC. To explore the combination between swinging and balancing motions, a switching mechanism between swinging and balancing algorithm is proposed. This is achieved by dividing the controller into three stages. The first stage is the swinging control, the next stage is the transition control which is accomplished using the Independent Joint Control (IJC) technique and finally balancing control is achieved by the LQR. The duration time of the transition controller to track the reference trajectory of the Robogymnast at its best is found to be within 0.4 seconds. An external disturbance is applied to each link of the Robogymnast separately in order to study the controller's ability to overcome the disturbance and to study the controller response. The simulation of the Robogymnast and experimental realization of the controllers are implemented using MATLAB® software and the C++ program environment respectively

    Controller Design for Rotary Inverted Pendulum System Using Evolutionary Algorithms

    Get PDF
    This paper presents evolutionary approaches for designing rotational inverted pendulum (RIP) controller including genetic algorithms (GA), particle swarm optimization (PSO), and ant colony optimization (ACO) methods. The goal is to balance the pendulum in the inverted position. Simulation and experimental results demonstrate the robustness and effectiveness of the proposed controllers with regard to parameter variations, noise effects, and load disturbances. The proposed methods can be considered as promising ways for control of various similar nonlinear systems

    Sliding Mode Control

    Get PDF
    The main objective of this monograph is to present a broad range of well worked out, recent application studies as well as theoretical contributions in the field of sliding mode control system analysis and design. The contributions presented here include new theoretical developments as well as successful applications of variable structure controllers primarily in the field of power electronics, electric drives and motion steering systems. They enrich the current state of the art, and motivate and encourage new ideas and solutions in the sliding mode control area

    Bio-inspired robotic control in underactuation: principles for energy efficacy, dynamic compliance interactions and adaptability.

    Get PDF
    Biological systems achieve energy efficient and adaptive behaviours through extensive autologous and exogenous compliant interactions. Active dynamic compliances are created and enhanced from musculoskeletal system (joint-space) to external environment (task-space) amongst the underactuated motions. Underactuated systems with viscoelastic property are similar to these biological systems, in that their self-organisation and overall tasks must be achieved by coordinating the subsystems and dynamically interacting with the environment. One important question to raise is: How can we design control systems to achieve efficient locomotion, while adapt to dynamic conditions as the living systems do? In this thesis, a trajectory planning algorithm is developed for underactuated microrobotic systems with bio-inspired self-propulsion and viscoelastic property to achieve synchronized motion in an energy efficient, adaptive and analysable manner. The geometry of the state space of the systems is explicitly utilized, such that a synchronization of the generalized coordinates is achieved in terms of geometric relations along the desired motion trajectory. As a result, the internal dynamics complexity is sufficiently reduced, the dynamic couplings are explicitly characterised, and then the underactuated dynamics are projected onto a hyper-manifold. Following such a reduction and characterization, we arrive at mappings of system compliance and integrable second-order dynamics with the passive degrees of freedom. As such, the issue of trajectory planning is converted into convenient nonlinear geometric analysis and optimal trajectory parameterization. Solutions of the reduced dynamics and the geometric relations can be obtained through an optimal motion trajectory generator. Theoretical background of the proposed approach is presented with rigorous analysis and developed in detail for a particular example. Experimental studies are conducted to verify the effectiveness of the proposed method. Towards compliance interactions with the environment, accurate modelling or prediction of nonlinear friction forces is a nontrivial whilst challenging task. Frictional instabilities are typically required to be eliminated or compensated through efficiently designed controllers. In this work, a prediction and analysis framework is designed for the self-propelled vibro-driven system, whose locomotion greatly relies on the dynamic interactions with the nonlinear frictions. This thesis proposes a combined physics-based and analytical-based approach, in a manner that non-reversible characteristic for static friction, presliding as well as pure sliding regimes are revealed, and the frictional limit boundaries are identified. Nonlinear dynamic analysis and simulation results demonstrate good captions of experimentally observed frictional characteristics, quenching of friction-induced vibrations and satisfaction of energy requirements. The thesis also performs elaborative studies on trajectory tracking. Control schemes are designed and extended for a class of underactuated systems with concrete considerations on uncertainties and disturbances. They include a collocated partial feedback control scheme, and an adaptive variable structure control scheme with an elaborately designed auxiliary control variable. Generically, adaptive control schemes using neural networks are designed to ensure trajectory tracking. Theoretical background of these methods is presented with rigorous analysis and developed in detail for particular examples. The schemes promote the utilization of linear filters in the control input to improve the system robustness. Asymptotic stability and convergence of time-varying reference trajectories for the system dynamics are shown by means of Lyapunov synthesis
    corecore