Generalization with Lossy Affordances: Leveraging Broad Offline Data for Learning Visuomotor Tasks
The utilization of broad datasets has proven to be crucial for generalization
in a wide range of fields. However, how to effectively make use of diverse
multi-task data for novel downstream tasks remains a grand challenge in
robotics. To tackle this challenge, we introduce a framework that acquires
goal-conditioned policies for unseen temporally extended tasks via offline
reinforcement learning on broad data, in combination with online fine-tuning
guided by subgoals in a learned lossy representation space. When faced with a
novel task goal, the framework uses an affordance model to plan a sequence of
lossy representations as subgoals that decomposes the original task into easier
problems. Learned from the broad data, the lossy representation emphasizes
task-relevant information about states and goals while abstracting away
redundant contexts that hinder generalization. It thus enables subgoal planning
for unseen tasks, provides a compact input to the policy, and facilitates
reward shaping during fine-tuning. We show that our framework can be
pre-trained on large-scale datasets of robot experiences from prior work and
efficiently fine-tuned for novel tasks, entirely from visual inputs without any
manual reward engineering.
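
The planning loop the abstract describes can be sketched roughly as follows. This is a minimal illustration in Python, assuming hypothetical encode, affordance_model, and policy components; it is a sketch of the idea, not the authors' actual implementation.

    import numpy as np

    def plan_and_execute(env, encode, affordance_model, policy, goal_image,
                         num_subgoals=4, steps_per_subgoal=50):
        """Decompose a long-horizon task into lossy subgoal representations."""
        obs = env.reset()
        z_goal = encode(goal_image)              # lossy goal representation
        # The affordance model proposes a sequence of intermediate lossy
        # representations connecting the current state to the goal.
        subgoals = affordance_model.plan(encode(obs), z_goal, num_subgoals)
        for z_sub in list(subgoals) + [z_goal]:
            for _ in range(steps_per_subgoal):
                z_obs = encode(obs)
                action = policy(z_obs, z_sub)    # compact input to the policy
                obs, _, done, _ = env.step(action)
                # During fine-tuning, distance in the lossy space can serve
                # as a shaped reward, per the abstract:
                shaped_reward = -np.linalg.norm(encode(obs) - z_sub)
                if done:
                    return obs
        return obs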
Don't Start From Scratch: Leveraging Prior Data to Automate Robotic Reinforcement Learning
Reinforcement learning (RL) algorithms hold the promise of enabling
autonomous skill acquisition for robotic systems. However, in practice,
real-world robotic RL typically requires time-consuming data collection and
frequent human intervention to reset the environment. Moreover, robotic
policies learned with RL often fail when deployed beyond the carefully
controlled setting in which they were learned. In this work, we study how these
challenges can all be tackled by effective utilization of diverse offline
datasets collected from previously seen tasks. When faced with a new task, our
system adapts previously learned skills to quickly learn to both perform the
new task and return the environment to an initial state, effectively performing
its own environment reset. Our empirical results demonstrate that incorporating
prior data into robotic reinforcement learning enables autonomous learning,
substantially improves sample-efficiency of learning, and enables better
generalization. Project website: https://sites.google.com/view/ariel-berkeley/
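
As a rough illustration of the reset-free loop described above: the robot alternates between a forward policy that attempts the task and a reset policy that returns the scene to an initial state. The names here (forward_policy, reset_policy, buffer) are illustrative assumptions, not the paper's exact code.

    def autonomous_rl(env, forward_policy, reset_policy, buffer,
                      episodes=1000, horizon=200):
        """Reset-free training: the robot performs its own environment resets."""
        obs = env.reset()                    # a single manual reset up front
        for _ in range(episodes):
            # Alternate between attempting the task and undoing it.
            for policy in (forward_policy, reset_policy):
                for _ in range(horizon):
                    action = policy.act(obs)
                    next_obs, reward, done, _ = env.step(action)
                    buffer.add(obs, action, reward, next_obs, done)
                    obs = next_obs
                    if done:
                        break
                # Prior offline data pre-fills the buffer, which is what
                # makes this loop sample-efficient per the abstract.
                policy.update(buffer)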
Stabilizing Contrastive RL: Techniques for Offline Goal Reaching
In the same way that the computer vision (CV) and natural language processing
(NLP) communities have developed self-supervised methods, reinforcement
learning (RL) can be cast as a self-supervised problem: learning to reach any
goal, without requiring human-specified rewards or labels. However, actually
building a self-supervised foundation for RL faces some important challenges.
Building on prior contrastive approaches to this RL problem, we conduct careful
ablation experiments and discover that a shallow and wide architecture,
combined with careful weight initialization and data augmentation, can
significantly boost the performance of these contrastive RL approaches on
challenging simulated benchmarks. Additionally, we demonstrate that, with these
design decisions, contrastive approaches can solve real-world robotic
manipulation tasks, with each task specified by a single goal image provided
after training.
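
The design decisions named above can be made concrete with a small sketch: a contrastive goal-reaching critic built from shallow, wide MLPs and trained with an InfoNCE-style objective, where each state-action pair's own future state is the positive and the rest of the batch serves as negatives. Layer widths and names are illustrative assumptions, written here in PyTorch.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class ContrastiveCritic(nn.Module):
        def __init__(self, obs_dim, act_dim, goal_dim, width=2048, embed=64):
            super().__init__()
            # "Shallow and wide": few layers, large hidden width.
            self.sa_encoder = nn.Sequential(
                nn.Linear(obs_dim + act_dim, width), nn.ReLU(),
                nn.Linear(width, embed))
            self.g_encoder = nn.Sequential(
                nn.Linear(goal_dim, width), nn.ReLU(),
                nn.Linear(width, embed))

        def forward(self, obs, act, goal):
            phi = self.sa_encoder(torch.cat([obs, act], dim=-1))
            psi = self.g_encoder(goal)
            # Logits for every (state-action, goal) pair in the batch.
            return phi @ psi.T

    def contrastive_loss(critic, obs, act, future_obs):
        # Each state-action's own future state is the positive; other
        # batch elements act as negatives (InfoNCE).
        logits = critic(obs, act, future_obs)
        labels = torch.arange(len(obs))
        return F.cross_entropy(logits, labels)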
Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control
Our goal is for robots to follow natural language instructions like "put the
towel next to the microwave." But getting large amounts of labeled data, i.e.
data that contains demonstrations of tasks labeled with the language
instruction, is prohibitive. In contrast, obtaining policies that respond to
image goals is much easier, because any autonomous trial or demonstration can
be labeled in hindsight with its final state as the goal. In this work, we
contribute a method that taps into joint image- and goal-conditioned policies
with language using only a small amount of language data. Prior work has made
progress on this using vision-language models or by jointly training
language-goal-conditioned policies, but so far neither method has scaled
effectively to real-world robot tasks without significant human annotation. Our
method achieves robust performance in the real world by learning an embedding
from the labeled data that aligns language not to the goal image, but rather to
the desired change between the start and goal images that the instruction
corresponds to. We then train a policy on this embedding: the policy benefits
from all the unlabeled data, but the aligned embedding provides an interface
for language to steer the policy. We show instruction following across a
variety of manipulation tasks in different scenes, with generalization to
language instructions outside of the labeled data. Videos and code for our
approach can be found on our website: https://rail-berkeley.github.io/grif/ .
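
One way to make the alignment idea concrete: embed the change between start and goal images and pull it toward the instruction's embedding with a symmetric contrastive loss over the small labeled set. A minimal PyTorch sketch, assuming hypothetical img_encoder and lang_encoder modules; the paper's actual encoder of the start/goal pair may differ from the simple embedding difference used here.

    import torch
    import torch.nn.functional as F

    def alignment_loss(img_encoder, lang_encoder, start_imgs, goal_imgs,
                       instructions, temperature=0.1):
        # Represent the task as the change from start to goal,
        # not the goal image alone.
        z_task = img_encoder(goal_imgs) - img_encoder(start_imgs)
        z_lang = lang_encoder(instructions)
        z_task = F.normalize(z_task, dim=-1)
        z_lang = F.normalize(z_lang, dim=-1)
        logits = z_task @ z_lang.T / temperature
        labels = torch.arange(len(start_imgs))
        # Symmetric contrastive objective over the labeled pairs.
        return (F.cross_entropy(logits, labels) +
                F.cross_entropy(logits.T, labels)) / 2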
BridgeData V2: A Dataset for Robot Learning at Scale
We introduce BridgeData V2, a large and diverse dataset of robotic
manipulation behaviors designed to facilitate research on scalable robot
learning. BridgeData V2 contains 60,096 trajectories collected across 24
environments on a publicly available low-cost robot. BridgeData V2 provides
extensive task and environment variability, leading to skills that can
generalize across environments, domains, and institutions, making the dataset a
useful resource for a broad range of researchers. Additionally, the dataset is
compatible with a wide variety of open-vocabulary, multi-task learning methods
conditioned on goal images or natural language instructions. In our
experiments, we train 6 state-of-the-art imitation learning and offline
reinforcement learning methods on our dataset, and find that they succeed on a
suite of tasks requiring varying amounts of generalization. We also demonstrate
that the performance of these methods improves with more data and higher
capacity models, and that training on a greater variety of skills leads to
improved generalization. By publicly sharing BridgeData V2 and our pre-trained
models, we aim to accelerate research in scalable robot learning methods.
Project page at https://rail-berkeley.github.io/bridgedata
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Large, high-capacity models trained on diverse datasets have shown remarkable success in efficiently tackling downstream applications. In domains from NLP to computer vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning methods train a separate model for every application, every robot, and even every environment. Can we instead train a "generalist" X-robot policy that can be adapted efficiently to new robots, tasks, and environments? In this paper, we provide datasets in standardized data formats and models to make it possible to explore this possibility in the context of robotic manipulation, alongside experimental results that provide an example of effective X-robot policies. We assemble a dataset from 22 different robots collected through a collaboration between 21 institutions, demonstrating 527 skills (160,266 tasks). We show that a high-capacity model trained on this data, which we call RT-X, exhibits positive transfer and improves the capabilities of multiple robots by leveraging experience from other platforms. The project website is robotics-transformer-x.github.io
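
A toy sketch of the cross-embodiment recipe described above: sample training batches that mix steps from many robots' datasets and feed them all to one high-capacity model. The dataset objects, their sample_step method, and the model interface are hypothetical placeholders, not the released RT-X code or data schema.

    import random

    def mixed_batches(datasets, batch_size=256):
        """Yield batches mixing steps from many robot embodiments."""
        while True:
            batch = []
            for _ in range(batch_size):
                ds = random.choice(datasets)   # sample an embodiment/dataset
                batch.append(ds.sample_step()) # e.g. (image, instruction, action)
            yield batch

    def train(model, datasets, num_steps=10000):
        for _, batch in zip(range(num_steps), mixed_batches(datasets)):
            # One high-capacity model consumes all embodiments' data, which
            # is the source of the positive transfer reported above.
            model.update(batch)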
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
The creation of large, diverse, high-quality robot manipulation datasets is
an important stepping stone on the path toward more capable and robust robotic
manipulation policies. However, creating such datasets is challenging:
collecting robot manipulation data in diverse environments poses logistical and
safety challenges and requires substantial investments in hardware and human
labour. As a result, even the most general robot manipulation policies today
are mostly trained on data collected in a small number of environments with
limited scene and task diversity. In this work, we introduce DROID (Distributed
Robot Interaction Dataset), a diverse robot manipulation dataset with 76k
demonstration trajectories or 350 hours of interaction data, collected across
564 scenes and 84 tasks by 50 data collectors in North America, Asia, and
Europe over the course of 12 months. We demonstrate that training with DROID
leads to policies with higher performance and improved generalization ability.
We open source the full dataset, policy learning code, and a detailed guide for
reproducing our robot hardware setup. Project website: https://droid-dataset.github.io