202 research outputs found

    SYNLOCO: Synthesizing Central Pattern Generator and Reinforcement Learning for Quadruped Locomotion

    Full text link
    The Central Pattern Generator (CPG) is adept at generating rhythmic gait patterns characterized by consistent timing and adequate foot clearance. Yet, its open-loop configuration often compromises the system's control performance in response to environmental variations. On the other hand, Reinforcement Learning (RL), celebrated for its model-free properties, has gained significant traction in robotics due to its inherent adaptability and robustness. However, initiating traditional RL approaches from the ground up presents computational challenges and a heightened risk of converging to suboptimal local minima. In this paper, we propose an innovative quadruped locomotion framework, SYNLOCO, by synthesizing CPG and RL that can ingeniously integrate the strengths of both methods, enabling the development of a locomotion controller that is both stable and natural. Furthermore, we introduce a set of performance-driven reward metrics that augment the learning of locomotion control. To optimize the learning trajectory of SYNLOCO, a two-phased training strategy is presented. Our empirical evaluation, conducted on a Unitree GO1 robot under varied conditions--including distinct velocities, terrains, and payload capacities--showcases SYNLOCO's ability to produce consistent and clear-footed gaits across diverse scenarios. The developed controller exhibits resilience against substantial parameter variations, underscoring its potential for robust real-world applications.Comment: 7 Page

    Design, implementation and integration of an autonomy payload for a quadruped-legged robot

    Get PDF
    Aquesta tesi final de màster versa sobre la descripció, disseny i implantació d’un conjunt de millores que fan possible l’autonomia d’un robot quadrúpede. El conjunt de millores descrites es troben separades en dos grups per millorar la comprensió el lector. Aquests són: Hardware i Software. Durant la primera part, es descriu al complet: el dimensionament, el disseny i la implementació dels elements mecànics i electrònics (bateries, ordinador a bord, càmera, i elements passius) que formen part de la millora en el hardware. Aquesta inclou el disseny i impressió en 3D, així com el procés d’acoblament al robot original. Pel que fa a la millora en "Software"; es planteja, descriu, i implementa un observador d’estat amb la intenció futura d’implementar-lo al mateix robot. L’observador es descriu mitjançant l’ús d’un conegut algoritme de fusió de dades anomenat Filtre de Kalman (on, en aquest cas, se n’utilitza la versió estesa). Per provar l’observador dissenyat, es realitza una simulació amb dades generades artificialment a partir d’un model cinemàtic. A continuació, tot i que la futura implantació es pretén que sigui al robot quadrúpede, per poder prova el filtre amb dades reals, es registren les dades del vol d’un dron fent ús de la comunicació ROS1 2 Galactic. De les mateixes dades, es torna a experimentar amb el filtre, però, aquest cop, fent ús del registre de dades del controlador del dron (model PX4). El treball conclou amb quatre estudis addicionals sobre la tasca: econòmic, temporal, social i mediambiental; així com amb unes conclusions que inclouen propostes de futures tasques a portar a capThis master thesis is based on the description, design and implementation of a set of improve- ments that make possible the autonomy of a quadruped robot. The set of improvements described are separated into two large groups that are disassociated in this memory as Hardware and Software, in order to improve the reader’s comprehension. During the first part, the dimensioning, design and implementation of the mechanical and elec- tronic elements (batteries, on-board computer, camera, and passive elements) that are part of the improvement in the hardware is described in full. This part, in turn, includes the design and 3D printing, as well as the assembly process to the original robot. As for the improvement in software, a state observer is proposed, described and implemented with the future intention of being implemented in the robot itself. This is described through the use of a well-known data fusion algorithm called Kalman Filter (which, in this case, is used through its extended version). To test the designed observer, three experiments are performed. The first one is based on a simulation with artificially generated data from a kinematic model. The second is based on real data collected from the flight of a drone. It also uses the ROS2 2 Galactic operating system to establish communication between data. Finally, the third uses data extracted from the drone’s own controller (PX4 model). The thesis ends with four additional studies: economic, temporal, social, and environmental, as well as conclusions that incorporate proposals for future lines of workEsta tesis final de máster se basa en la descripción, diseño e implementación de un conjunto de mejoras que hacen posible la autonomía de un robot cuadrúpedo. El conjunto de mejoras descritas se encuentran separadas en dos grandes grupos que se desasocian en esta memoria como Hardware y Software, a fin de mejorar la compresión del lector. Durante la primera parte, se describe al completo: el dimensionamiento, el diseño y la imple- mentación de los elementos mecánicos y electrónicos (baterías, ordenador a bordo, cámara, y elementos pasivos) que forman parte de la mejora en el hardware. Esta parte, a su vez, incluye el diseño e impresión en 3D, así como el proceso de ensamblaje al robot original. En cuanto a la mejora en software; se plantea, describe e implementa, un observador de estado con la intención futura de ser implementado en el mismo robot. Este se describe mediante el uso de un conocido algoritmo de fusión de datos llamado Filtro de Kalman (que, en este caso, es usado mediante su versión extendida). Para probar el observador diseñado, se realizan tres experimentos. El primero se basa en una simulación con datos generados artificialmente a partir de un modelo cinemático. El segundo, aunque la futura implementación está pensada sobre el robot cuadrúpedo, se realiza utilizando datos reales recogidos del vuelo de un dron, haciendo uso de la comunicación ROS3 2 Galactic. Por último, el tercero utiliza datos extraídos del propio controlador del dron (modelo PX4). La tesis finaliza con cuatro estudios adicionales sobre la misma: económico, temporal, social y medioambiental; así como, con unas conclusiones que incorporan propuestas para futuras líneas de trabaj

    Contact Optimization for Non-Prehensile Loco-Manipulation via Hierarchical Model Predictive Control

    Full text link
    Recent studies on quadruped robots have focused on either locomotion or mobile manipulation using a robotic arm. Legged robots can manipulate heavier and larger objects using non-prehensile manipulation primitives, such as planar pushing, to drive the object to the desired location. In this paper, we present a novel hierarchical model predictive control (MPC) for contact optimization of the manipulation task. Using two cascading MPCs, we split the loco-manipulation problem into two parts: the first to optimize both contact force and contact location between the robot and the object, and the second to regulate the desired interaction force through the robot locomotion. Our method is successfully validated in both simulation and hardware experiments. While the baseline locomotion MPC fails to follow the desired trajectory of the object, our proposed approach can effectively control both object's position and orientation with minimal tracking error. This capability also allows us to perform obstacle avoidance for both the robot and the object during the loco-manipulation task.Comment: 7 pages, 9 figure

    CAJun: Continuous Adaptive Jumping using a Learned Centroidal Controller

    Full text link
    We present CAJun, a novel hierarchical learning and control framework that enables legged robots to jump continuously with adaptive jumping distances. CAJun consists of a high-level centroidal policy and a low-level leg controller. In particular, we use reinforcement learning (RL) to train the centroidal policy, which specifies the gait timing, base velocity, and swing foot position for the leg controller. The leg controller optimizes motor commands for the swing and stance legs according to the gait timing to track the swing foot target and base velocity commands using optimal control. Additionally, we reformulate the stance leg optimizer in the leg controller to speed up policy training by an order of magnitude. Our system combines the versatility of learning with the robustness of optimal control. By combining RL with optimal control methods, our system achieves the versatility of learning while enjoys the robustness from control methods, making it easily transferable to real robots. We show that after 20 minutes of training on a single GPU, CAJun can achieve continuous, long jumps with adaptive distances on a Go1 robot with small sim-to-real gaps. Moreover, the robot can jump across gaps with a maximum width of 70cm, which is over 40% wider than existing methods.Comment: Please visit https://yxyang.github.io/cajun/ for additional result

    Learning to See Physical Properties with Active Sensing Motor Policies

    Full text link
    Knowledge of terrain's physical properties inferred from color images can aid in making efficient robotic locomotion plans. However, unlike image classification, it is unintuitive for humans to label image patches with physical properties. Without labeled data, building a vision system that takes as input the observed terrain and predicts physical properties remains challenging. We present a method that overcomes this challenge by self-supervised labeling of images captured by robots during real-world traversal with physical property estimators trained in simulation. To ensure accurate labeling, we introduce Active Sensing Motor Policies (ASMP), which are trained to explore locomotion behaviors that increase the accuracy of estimating physical parameters. For instance, the quadruped robot learns to swipe its foot against the ground to estimate the friction coefficient accurately. We show that the visual system trained with a small amount of real-world traversal data accurately predicts physical parameters. The trained system is robust and works even with overhead images captured by a drone despite being trained on data collected by cameras attached to a quadruped robot walking on the ground.Comment: In CoRL 2023. Website: https://gmargo11.github.io/active-sensing-loco

    Coupling Vision and Proprioception for Navigation of Legged Robots

    Full text link
    We exploit the complementary strengths of vision and proprioception to develop a point-goal navigation system for legged robots, called VP-Nav. Legged systems are capable of traversing more complex terrain than wheeled robots, but to fully utilize this capability, we need a high-level path planner in the navigation system to be aware of the walking capabilities of the low-level locomotion policy in varying environments. We achieve this by using proprioceptive feedback to ensure the safety of the planned path by sensing unexpected obstacles like glass walls, terrain properties like slipperiness or softness of the ground and robot properties like extra payload that are likely missed by vision. The navigation system uses onboard cameras to generate an occupancy map and a corresponding cost map to reach the goal. A fast marching planner then generates a target path. A velocity command generator takes this as input to generate the desired velocity for the walking policy. A safety advisor module adds sensed unexpected obstacles to the occupancy map and environment-determined speed limits to the velocity command generator. We show superior performance compared to wheeled robot baselines, and ablation studies which have disjoint high-level planning and low-level control. We also show the real-world deployment of VP-Nav on a quadruped robot with onboard sensors and computation. Videos at https://navigation-locomotion.github.ioComment: CVPR 2022 final version. Website at https://navigation-locomotion.github.i

    Design Principles for a Family of Direct-Drive Legged Robots

    Get PDF
    This letter introduces Minitaur, a dynamically running and leaping quadruped, which represents a novel class of direct-drive (DD) legged robots. We present a methodology that achieves the well-known benefits of DD robot design (transparency, mechanical robustness/efficiency, high-actuation bandwidth, and increased specific power), affording highly energetic behaviors across our family of machines despite severe limitations in specific force. We quantify DD drivetrain benefits using a variety of metrics, compare our machines\u27 performance to previously reported legged platforms, and speculate on the potential broad-reaching value of “transparency” for legged locomotion. For more information: Kod*lab
    corecore