default search action
Pieter Abbeel
Person information
- affiliation: University of California, Berkeley, USA
- affiliation: Stanford University, USA
- award (2021): ACM Prize in Computing
- award (2013): Presidential Early Career Award for Scientists and Engineers
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j28]Philipp Wu, Kourosh Hakhamaneshi, Yuqing Du, Igor Mordatch, Aravind Rajeswaran, Pieter Abbeel:
Semi-Supervised One Shot Imitation Learning. RLJ 5: 2284-2297 (2024) - [c340]Kuba Grudzien Kuba, Masatoshi Uehara, Sergey Levine, Pieter Abbeel:
Functional Graphical Models: Structure Enables Offline Data-Driven Optimization. AISTATS 2024: 2449-2457 - [c339]Hao Liu, Carmelo Sferrazza, Pieter Abbeel:
Chain of Hindsight aligns Language Models with Feedback. ICLR 2024 - [c338]Hao Liu, Matei Zaharia, Pieter Abbeel:
RingAttention with Blockwise Transformers for Near-Infinite Context. ICLR 2024 - [c337]Yilun Du, Sherry Yang, Pete Florence, Fei Xia, Ayzaan Wahid, Brian Ichter, Pierre Sermanet, Tianhe Yu, Pieter Abbeel, Joshua B. Tenenbaum, Leslie Pack Kaelbling, Andy Zeng, Jonathan Tompson:
Video Language Planning. ICLR 2024 - [c336]Arnav Gudibande, Eric Wallace, Charlie Snell, Xinyang Geng, Hao Liu, Pieter Abbeel, Sergey Levine, Dawn Song:
The False Promise of Imitating Proprietary Language Models. ICLR 2024 - [c335]Vint Lee, Pieter Abbeel, Youngwoon Lee:
DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing. ICLR 2024 - [c334]Sam Toyer, Olivia Watkins, Ethan Adrian Mendes, Justin Svegliato, Luke Bailey, Tiffany Wang, Isaac Ong, Karim Elmaaroufi, Pieter Abbeel, Trevor Darrell, Alan Ritter, Stuart Russell:
Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game. ICLR 2024 - [c333]Sherry Yang, KwangHwan Cho, Amil Merchant, Pieter Abbeel, Dale Schuurmans, Igor Mordatch, Ekin Dogus Cubuk:
Scalable Diffusion for Materials Generation. ICLR 2024 - [c332]Sherry Yang, Yilun Du, Bo Dai, Dale Schuurmans, Joshua B. Tenenbaum, Pieter Abbeel:
Probabilistic Adaptation of Black-Box Text-to-Video Models. ICLR 2024 - [c331]Sherry Yang, Yilun Du, Seyed Kamyar Seyed Ghasemipour, Jonathan Tompson, Leslie Pack Kaelbling, Dale Schuurmans, Pieter Abbeel:
Learning Interactive Real-World Simulators. ICLR 2024 - [c330]Kevin Frans, Seohong Park, Pieter Abbeel, Sergey Levine:
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings. ICML 2024 - [c329]Huiwon Jang, Dongyoung Kim, Junsu Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo:
Visual Representation Learning with Stochastic Frame Prediction. ICML 2024 - [c328]Jessy Lin, Yuqing Du, Olivia Watkins, Danijar Hafner, Pieter Abbeel, Dan Klein, Anca D. Dragan:
Learning to Model the World With Language. ICML 2024 - [c327]Michael Psenka, Alejandro Escontrela, Pieter Abbeel, Yi Ma:
Learning a Diffusion Model Policy from Rewards via Q-Score Matching. ICML 2024 - [c326]Sherry Yang, Jacob C. Walker, Jack Parker-Holder, Yilun Du, Jake Bruce, André Barreto, Pieter Abbeel, Dale Schuurmans:
Position: Video as the New Language for Real-World Decision Making. ICML 2024 - [c325]Xingyu Lin, John So, Sashwat Mahalingam, Fangchen Liu, Pieter Abbeel:
SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Network. ICRA 2024: 4781-4787 - [c324]Abby O'Neill, Abdul Rehman, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alexander Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, Anthony Brohan, Antonin Raffin, Archit Sharma, Arefeh Yavary, Arhan Jain, Ashwin Balakrishna, Ayzaan Wahid, Ben Burgess-Limerick, Beomjoon Kim, Bernhard Schölkopf, Blake Wulfe, Brian Ichter, Cewu Lu, Charles Xu, Charlotte Le, Chelsea Finn, Chen Wang, Chenfeng Xu, Cheng Chi, Chenguang Huang, Christine Chan, Christopher Agia, Chuer Pan, Chuyuan Fu, Coline Devin, Danfei Xu, Daniel Morton, Danny Driess, Daphne Chen, Deepak Pathak, Dhruv Shah, Dieter Büchler, Dinesh Jayaraman, Dmitry Kalashnikov, Dorsa Sadigh, Edward Johns, Ethan Paul Foster, Fangchen Liu, Federico Ceola, Fei Xia, Feiyu Zhao, Freek Stulp, Gaoyue Zhou, Gaurav S. Sukhatme, Gautam Salhotra, Ge Yan, Gilbert Feng, Giulio Schiavi, Glen Berseth, Gregory Kahn, Guanzhi Wang, Hao Su, Haoshu Fang, Haochen Shi, Henghui Bao, Heni Ben Amor, Henrik I. Christensen, Hiroki Furuta, Homer Walke, Hongjie Fang, Huy Ha, Igor Mordatch, Ilija Radosavovic, Isabel Leal, Jacky Liang, Jad Abou-Chakra, Jaehyung Kim, Jaimyn Drake, Jan Peters, Jan Schneider, Jasmine Hsu, Jeannette Bohg, Jeffrey Bingham, Jeffrey Wu, Jensen Gao, Jiaheng Hu, Jiajun Wu, Jialin Wu, Jiankai Sun, Jianlan Luo, Jiayuan Gu, Jie Tan, Jihoon Oh, Jimmy Wu, Jingpei Lu, Jingyun Yang, Jitendra Malik, João Silvério, Joey Hejna, Jonathan Booher, Jonathan Tompson, Jonathan Yang, Jordi Salvador, Joseph J. Lim, Junhyek Han, Kaiyuan Wang, Kanishka Rao, Karl Pertsch, Karol Hausman, Keegan Go, Keerthana Gopalakrishnan, Ken Goldberg, Kendra Byrne, Kenneth Oslund, Kento Kawaharazuka, Kevin Black, Kevin Lin, Kevin Zhang, Kiana Ehsani, Kiran Lekkala, Kirsty Ellis, Krishan Rana, Krishnan Srinivasan, Kuan Fang, Kunal Pratap Singh, Kuo-Hao Zeng, Kyle Hatch, Kyle Hsu, Laurent Itti, Lawrence Yunliang Chen, Lerrel Pinto, Li Fei-Fei, Liam Tan, Linxi Jim Fan, Lionel Ott, Lisa Lee, Luca Weihs, Magnum Chen, Marion Lepert, Marius Memmel, Masayoshi Tomizuka, Masha Itkina, Mateo Guaman Castro, Max Spero, Maximilian Du, Michael Ahn, Michael C. Yip, Mingtong Zhang, Mingyu Ding, Minho Heo, Mohan Kumar Srirama, Mohit Sharma, Moo Jin Kim, Naoaki Kanazawa, Nicklas Hansen, Nicolas Heess, Nikhil J. Joshi, Niko Sünderhauf, Ning Liu, Norman Di Palo, Nur Muhammad (Mahi) Shafiullah, Oier Mees, Oliver Kroemer, Osbert Bastani, Pannag R. Sanketi, Patrick Tree Miller, Patrick Yin, Paul Wohlhart, Peng Xu, Peter David Fagan, Peter Mitrano, Pierre Sermanet, Pieter Abbeel, Priya Sundaresan, Qiuyu Chen, Quan Vuong, Rafael Rafailov, Ran Tian, Ria Doshi, Roberto Martín-Martín, Rohan Baijal, Rosario Scalise, Rose Hendrix, Roy Lin, Runjia Qian, Ruohan Zhang, Russell Mendonca, Rutav Shah, Ryan Hoque, Ryan Julian, Samuel Bustamante, Sean Kirmani, Sergey Levine, Shan Lin, Sherry Moore, Shikhar Bahl, Shivin Dass, Shubham D. Sonawani, Shuran Song, Sichun Xu, Siddhant Haldar, Siddharth Karamcheti, Simeon Adebola, Simon Guist, Soroush Nasiriany, Stefan Schaal, Stefan Welker, Stephen Tian, Subramanian Ramamoorthy, Sudeep Dasari, Suneel Belkhale, Sungjae Park, Suraj Nair, Suvir Mirchandani, Takayuki Osa, Tanmay Gupta, Tatsuya Harada, Tatsuya Matsushima, Ted Xiao, Thomas Kollar, Tianhe Yu, Tianli Ding, Todor Davchev, Tony Z. Zhao, Travis Armstrong, Trevor Darrell, Trinity Chung, Vidhi Jain, Vincent Vanhoucke, Wei Zhan, Wenxuan Zhou, Wolfram Burgard, Xi Chen, Xiaolong Wang, Xinghao Zhu, Xinyang Geng, Xiyuan Liu, Liangwei Xu, Xuanlin Li, Yao Lu, Yecheng Jason Ma, Yejin Kim, Yevgen Chebotar, Yifan Zhou, Yifeng Zhu, Yilin Wu, Ying Xu, Yixuan Wang, Yonatan Bisk, Yoonyoung Cho, Youngwoon Lee, Yuchen Cui, Yue Cao, Yueh-Hua Wu, Yujin Tang, Yuke Zhu, Yunchu Zhang, Yunfan Jiang, Yunshuang Li, Yunzhu Li, Yusuke Iwasawa, Yutaka Matsuo, Zehan Ma, Zhuo Xu, Zichen Jeff Cui, Zichen Zhang, Zipeng Lin:
Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration. ICRA 2024: 6892-6903 - [c323]Nikhil Mishra, Maximilian Sieb, Pieter Abbeel, Xi Chen:
Closing the Visual Sim-to-Real Gap with Object-Composable NeRFs. ICRA 2024: 11202-11208 - [i340]Chuan Wen, Xingyu Lin, John So, Kai Chen, Qi Dou, Yang Gao, Pieter Abbeel:
Any-point Trajectory Modeling for Policy Learning. CoRR abs/2401.00025 (2024) - [i339]Jakub Grudzien Kuba, Masatoshi Uehara, Pieter Abbeel, Sergey Levine:
Functional Graphical Models: Structure Enables Offline Data-Driven Optimization. CoRR abs/2401.05442 (2024) - [i338]Jianlan Luo, Charles Xu, Fangchen Liu, Liam Tan, Zipeng Lin, Jeffrey Wu, Pieter Abbeel, Sergey Levine:
FMB: a Functional Manipulation Benchmark for Generalizable Robotic Learning. CoRR abs/2401.08553 (2024) - [i337]Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath:
Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control. CoRR abs/2401.16889 (2024) - [i336]Hao Liu, Wilson Yan, Matei Zaharia, Pieter Abbeel:
World Model on Million-Length Video And Language With Blockwise RingAttention. CoRR abs/2402.08268 (2024) - [i335]Alexandra Souly, Qingyuan Lu, Dillon Bowen, Tu Trinh, Elvis Hsieh, Sana Pandey, Pieter Abbeel, Justin Svegliato, Scott Emmons, Olivia Watkins, Sam Toyer:
A StrongREJECT for Empty Jailbreaks. CoRR abs/2402.10260 (2024) - [i334]Kevin Frans, Seohong Park, Pieter Abbeel, Sergey Levine:
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings. CoRR abs/2402.17135 (2024) - [i333]Sherry Yang, Jacob C. Walker, Jack Parker-Holder, Yilun Du, Jake Bruce, André Barreto, Pieter Abbeel, Dale Schuurmans:
Video as the New Language for Real-World Decision Making. CoRR abs/2402.17139 (2024) - [i332]Toru Lin, Zhao-Heng Yin, Haozhi Qi, Pieter Abbeel, Jitendra Malik:
Twisting Lids Off with Two Hands. CoRR abs/2403.02338 (2024) - [i331]Fangchen Liu, Kuan Fang, Pieter Abbeel, Sergey Levine:
MOKA: Open-Vocabulary Robotic Manipulation through Mark-Based Visual Prompting. CoRR abs/2403.03174 (2024) - [i330]Nikhil Mishra, Maximilian Sieb, Pieter Abbeel, Xi Chen:
Closing the Visual Sim-to-Real Gap with Object-Composable NeRFs. CoRR abs/2403.04114 (2024) - [i329]Carmelo Sferrazza, Dun-Ming Huang, Xingyu Lin, Youngwoon Lee, Pieter Abbeel:
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation. CoRR abs/2403.10506 (2024) - [i328]Yide Shentu, Philipp Wu, Aravind Rajeswaran, Pieter Abbeel:
From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control. CoRR abs/2405.04798 (2024) - [i327]Huiwon Jang, Dongyoung Kim, Junsu Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo:
Visual Representation Learning with Stochastic Frame Prediction. CoRR abs/2406.07398 (2024) - [i326]Vint Lee, Chun Deng, Leena Elzeiny, Pieter Abbeel, John Wawrzynek:
Chip Placement with Diffusion. CoRR abs/2407.12282 (2024) - [i325]Zhao-Heng Yin, Pieter Abbeel:
Offline Imitation Learning Through Graph Search and Retrieval. CoRR abs/2407.15403 (2024) - [i324]Philipp Wu, Kourosh Hakhamaneshi, Yuqing Du, Igor Mordatch, Aravind Rajeswaran, Pieter Abbeel:
Semi-Supervised One-Shot Imitation Learning. CoRR abs/2408.05285 (2024) - [i323]Carmelo Sferrazza, Dun-Ming Huang, Fangchen Liu, Jongmin Lee, Pieter Abbeel:
Body Transformer: Leveraging Robot Embodiment for Policy Learning. CoRR abs/2408.06316 (2024) - [i322]Himanshu Gaurav Singh, Antonio Loquercio, Carmelo Sferrazza, Jane Wu, Haozhi Qi, Pieter Abbeel, Jitendra Malik:
Hand-Object Interaction Pretraining from Videos. CoRR abs/2409.08273 (2024) - [i321]Wilson Yan, Matei Zaharia, Volodymyr Mnih, Pieter Abbeel, Aleksandra Faust, Hao Liu:
ElasticTok: Adaptive Tokenization for Image and Video. CoRR abs/2410.08368 (2024) - [i320]Kevin Frans, Danijar Hafner, Sergey Levine, Pieter Abbeel:
One Step Diffusion via Shortcut Models. CoRR abs/2410.12557 (2024) - [i319]Jakub Grudzien Kuba, Pieter Abbeel, Sergey Levine:
Cliqueformer: Model-Based Optimization with Structured Transformers. CoRR abs/2410.13106 (2024) - [i318]Renhao Wang, Kevin Frans, Pieter Abbeel, Sergey Levine, Alexei A. Efros:
Prioritized Generative Replay. CoRR abs/2410.18082 (2024) - 2023
- [j27]Kourosh Hakhamaneshi, Marcel Nassar, Mariano Phielipp, Pieter Abbeel, Vladimir Stojanovic:
Pretraining Graph Neural Networks for Few-Shot Analog Circuit Modeling and Design. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 42(7): 2163-2173 (2023) - [c322]Joey Hejna, Pieter Abbeel, Lerrel Pinto:
Improving Long-Horizon Imitation through Instruction Prediction. AAAI 2023: 7857-7865 - [c321]Kevin Zakka, Philipp Wu, Laura M. Smith, Nimrod Gileadi, Taylor Howell, Xue Bin Peng, Sumeet Singh, Yuval Tassa, Pete Florence, Andy Zeng, Pieter Abbeel:
RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning. CoRL 2023: 2975-2994 - [c320]Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James:
Language-Conditioned Path Planning. CoRL 2023: 3384-3396 - [c319]Ajay Jain, Amber Xie, Pieter Abbeel:
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models. CVPR 2023: 1911-1920 - [c318]Changyeon Kim, Jongjin Park, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee:
Preference Transformer: Modeling Human Preferences using Transformers for RL. ICLR 2023 - [c317]Sherry Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum:
Dichotomy of Control: Separating What You Can Control from What You Cannot. ICLR 2023 - [c316]Weirui Ye, Yunsheng Zhang, Pieter Abbeel, Yang Gao:
Become a Proficient Player with Limited Data through Watching Pure Videos. ICLR 2023 - [c315]Abdus Salam Azad, Izzeddin Gur, Jasper Emhoff, Nathaniel Alexis, Aleksandra Faust, Pieter Abbeel, Ion Stoica:
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning. ICML 2023: 1361-1395 - [c314]Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas:
Guiding Pretraining in Reinforcement Learning with Large Language Models. ICML 2023: 8657-8677 - [c313]Hao Liu, Pieter Abbeel:
Emergent Agentic Transformer from Chain of Hindsight Experience. ICML 2023: 21362-21374 - [c312]Seohong Park, Kimin Lee, Youngwoon Lee, Pieter Abbeel:
Controllability-Aware Unsupervised Skill Discovery. ICML 2023: 27225-27245 - [c311]Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel:
Multi-View Masked World Models for Visual Robotic Manipulation. ICML 2023: 30613-30632 - [c310]David Venuto, Sherry Yang, Pieter Abbeel, Doina Precup, Igor Mordatch, Ofir Nachum:
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets. ICML 2023: 35024-35036 - [c309]Philipp Wu, Arjun Majumdar, Kevin Stone, Yixin Lin, Igor Mordatch, Pieter Abbeel, Aravind Rajeswaran:
Masked Trajectory Models for Prediction, Representation, and Control. ICML 2023: 37607-37623 - [c308]Wilson Yan, Danijar Hafner, Stephen James, Pieter Abbeel:
Temporally Consistent Transformers for Video Generation. ICML 2023: 39062-39098 - [c307]Tianjun Zhang, Fangchen Liu, Justin Wong, Pieter Abbeel, Joseph E. Gonzalez:
The Wisdom of Hindsight Makes Language Models Better Instruction Followers. ICML 2023: 41414-41428 - [c306]Kai Chen, Stephen James, Congying Sui, Yun-Hui Liu, Pieter Abbeel, Qi Dou:
StereoPose: Category-Level 6D Transparent Object Pose Estimation from Stereo Images via Back-View NOCS. ICRA 2023: 2855-2861 - [c305]Yuxuan Liu, Nikhil Mishra, Pieter Abbeel, Xi Chen:
Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN. ICRA 2023: 7069-7075 - [c304]Gaoyue Zhou, Victoria Dean, Mohan Kumar Srirama, Aravind Rajeswaran, Jyothish Pari, Kyle Hatch, Aryan Jain, Tianhe Yu, Pieter Abbeel, Lerrel Pinto, Chelsea Finn, Abhinav Gupta:
Train Offline, Test Online: A Real Robot Learning Benchmark. ICRA 2023: 9197-9203 - [c303]Yuxuan Liu, Xi Chen, Pieter Abbeel:
Self-Supervised Instance Segmentation by Grasping. IROS 2023: 1162-1169 - [c302]Hiroshi Yoshitake, Pieter Abbeel:
The Impact of Overall Optimization on Warehouse Automation. IROS 2023: 1621-1628 - [c301]Nikhil Mishra, Pieter Abbeel, Xi Chen, Maximilian Sieb:
Convolutional Occupancy Models for Dense Packing of Complex, Novel Objects. IROS 2023: 9536-9542 - [c300]Yilun Du, Sherry Yang, Bo Dai, Hanjun Dai, Ofir Nachum, Josh Tenenbaum, Dale Schuurmans, Pieter Abbeel:
Learning Universal Policies via Text-Guided Video Generation. NeurIPS 2023 - [c299]Alejandro Escontrela, Ademi Adeniji, Wilson Yan, Ajay Jain, Xue Bin Peng, Ken Goldberg, Youngwoon Lee, Danijar Hafner, Pieter Abbeel:
Video Prediction Models as Rewards for Reinforcement Learning. NeurIPS 2023 - [c298]Ying Fan, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee:
Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models. NeurIPS 2023 - [c297]Dongyoung Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo:
Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration. NeurIPS 2023 - [c296]Hao Liu, Pieter Abbeel:
Blockwise Parallel Transformers for Large Context Models. NeurIPS 2023 - [c295]Hao Liu, Wilson Yan, Pieter Abbeel:
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment. NeurIPS 2023 - [c294]Arjun Majumdar, Karmesh Yadav, Sergio Arnaud, Yecheng Jason Ma, Claire Chen, Sneha Silwal, Aryan Jain, Vincent-Pierre Berges, Tingfan Wu, Jay Vakil, Pieter Abbeel, Jitendra Malik, Dhruv Batra, Yixin Lin, Oleksandr Maksymets, Aravind Rajeswaran, Franziska Meier:
Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence? NeurIPS 2023 - [c293]Daiki E. Matsunaga, Jongmin Lee, Jaeseok Yoon, Stefanos Leonardos, Pieter Abbeel, Kee-Eung Kim:
AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation. NeurIPS 2023 - [c292]Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath:
Robust and Versatile Bipedal Jumping Control through Reinforcement Learning. Robotics: Science and Systems 2023 - [i317]Yilun Du, Mengjiao Yang, Bo Dai, Hanjun Dai, Ofir Nachum, Joshua B. Tenenbaum, Dale Schuurmans, Pieter Abbeel:
Learning Universal Policies via Text-Guided Video Generation. CoRR abs/2302.00111 (2023) - [i316]Hao Liu, Wilson Yan, Pieter Abbeel:
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment. CoRR abs/2302.00902 (2023) - [i315]Younggyo Seo, Junsu Kim, Stephen James, Kimin Lee, Jinwoo Shin, Pieter Abbeel:
Multi-View Masked World Models for Visual Robotic Manipulation. CoRR abs/2302.02408 (2023) - [i314]Hao Liu, Carmelo Sferrazza, Pieter Abbeel:
Chain of Hindsight Aligns Language Models with Feedback. CoRR abs/2302.02676 (2023) - [i313]Seohong Park, Kimin Lee, Youngwoon Lee, Pieter Abbeel:
Controllability-Aware Unsupervised Skill Discovery. CoRR abs/2302.05103 (2023) - [i312]Tianjun Zhang, Fangchen Liu, Justin Wong, Pieter Abbeel, Joseph E. Gonzalez:
The Wisdom of Hindsight Makes Language Models Better Instruction Followers. CoRR abs/2302.05206 (2023) - [i311]Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas:
Guiding Pretraining in Reinforcement Learning with Large Language Models. CoRR abs/2302.06692 (2023) - [i310]Zhongyu Li, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath:
Robust and Versatile Bipedal Jumping Control through Multi-Task Reinforcement Learning. CoRR abs/2302.09450 (2023) - [i309]Kimin Lee, Hao Liu, Moonkyung Ryu, Olivia Watkins, Yuqing Du, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Shixiang Shane Gu:
Aligning Text-to-Image Models using Human Feedback. CoRR abs/2302.12192 (2023) - [i308]Changyeon Kim, Jongjin Park, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee:
Preference Transformer: Modeling Human Preferences using Transformers for RL. CoRR abs/2303.00957 (2023) - [i307]Sherry Yang, Ofir Nachum, Yilun Du, Jason Wei, Pieter Abbeel, Dale Schuurmans:
Foundation Models for Decision Making: Problems, Methods, and Opportunities. CoRR abs/2303.04129 (2023) - [i306]Arjun Majumdar, Karmesh Yadav, Sergio Arnaud, Yecheng Jason Ma, Claire Chen, Sneha Silwal, Aryan Jain, Vincent-Pierre Berges, Pieter Abbeel, Jitendra Malik, Dhruv Batra, Yixin Lin, Oleksandr Maksymets, Aravind Rajeswaran, Franziska Meier:
Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence? CoRR abs/2303.18240 (2023) - [i305]Kevin Zakka, Laura M. Smith, Nimrod Gileadi, Taylor A. Howell, Xue Bin Peng, Sumeet Singh, Yuval Tassa, Pete Florence, Andy Zeng, Pieter Abbeel:
RoboPianist: A Benchmark for High-Dimensional Robot Control. CoRR abs/2304.04150 (2023) - [i304]Yuxuan Liu, Nikhil Mishra, Pieter Abbeel, Xi Chen:
Distributional Instance Segmentation: Modeling Uncertainty and High Confidence Predictions with Latent-MaskRCNN. CoRR abs/2305.01910 (2023) - [i303]Philipp Wu, Arjun Majumdar, Kevin Stone, Yixin Lin, Igor Mordatch, Pieter Abbeel, Aravind Rajeswaran:
Masked Trajectory Models for Prediction, Representation, and Control. CoRR abs/2305.02968 (2023) - [i302]Yuxuan Liu, Xi Chen, Pieter Abbeel:
Self-Supervised Instance Segmentation by Grasping. CoRR abs/2305.06305 (2023) - [i301]Alejandro Escontrela, Ademi Adeniji, Wilson Yan, Ajay Jain, Xue Bin Peng, Ken Goldberg, Youngwoon Lee, Danijar Hafner, Pieter Abbeel:
Video Prediction Models as Rewards for Reinforcement Learning. CoRR abs/2305.14343 (2023) - [i300]Arnav Gudibande, Eric Wallace, Charlie Snell, Xinyang Geng, Hao Liu, Pieter Abbeel, Sergey Levine, Dawn Song:
The False Promise of Imitating Proprietary LLMs. CoRR abs/2305.15717 (2023) - [i299]Ying Fan, Olivia Watkins, Yuqing Du, Hao Liu, Moonkyung Ryu, Craig Boutilier, Pieter Abbeel, Mohammad Ghavamzadeh, Kangwook Lee, Kimin Lee:
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models. CoRR abs/2305.16381 (2023) - [i298]Hao Liu, Pieter Abbeel:
Emergent Agentic Transformer from Chain of Hindsight Experience. CoRR abs/2305.16554 (2023) - [i297]Hao Liu, Pieter Abbeel:
Blockwise Parallel Transformer for Long Context Large Models. CoRR abs/2305.19370 (2023) - [i296]Dongyoung Kim, Jinwoo Shin, Pieter Abbeel, Younggyo Seo:
Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration. CoRR abs/2305.19476 (2023) - [i295]Gaoyue Zhou, Victoria Dean, Mohan Kumar Srirama, Aravind Rajeswaran, Jyothish Pari, Kyle Hatch, Aryan Jain, Tianhe Yu, Pieter Abbeel, Lerrel Pinto, Chelsea Finn, Abhinav Gupta:
Train Offline, Test Online: A Real Robot Learning Benchmark. CoRR abs/2306.00942 (2023) - [i294]Mengjiao Yang, Yilun Du, Bo Dai, Dale Schuurmans, Joshua B. Tenenbaum, Pieter Abbeel:
Probabilistic Adaptation of Text-to-Video Models. CoRR abs/2306.01872 (2023) - [i293]Xinran Liang, Anthony Han, Wilson Yan, Aditi Raghunathan, Pieter Abbeel:
ALP: Action-Aware Embodied Learning for Perception. CoRR abs/2306.10190 (2023) - [i292]Joey Hejna, Pieter Abbeel, Lerrel Pinto:
Improving Long-Horizon Imitation Through Instruction Prediction. CoRR abs/2306.12554 (2023) - [i291]Xingyu Lin, John So, Sashwat Mahalingam, Fangchen Liu, Pieter Abbeel:
SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks. CoRR abs/2307.03567 (2023) - [i290]Nikhil Mishra, Pieter Abbeel, Xi Chen, Maximilian Sieb:
Convolutional Occupancy Models for Dense Packing of Complex, Novel Objects. CoRR abs/2308.00091 (2023) - [i289]Jessy Lin, Yuqing Du, Olivia Watkins, Danijar Hafner, Pieter Abbeel, Dan Klein, Anca D. Dragan:
Learning to Model the World with Language. CoRR abs/2308.01399 (2023) - [i288]Hiroshi Yoshitake, Pieter Abbeel:
The Impact of Overall Optimization on Warehouse Automation. CoRR abs/2308.06036 (2023) - [i287]Ademi Adeniji, Amber Xie, Carmelo Sferrazza, Younggyo Seo, Stephen James, Pieter Abbeel:
Language Reward Modulation for Pretraining Reinforcement Learning. CoRR abs/2308.12270 (2023) - [i286]Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James:
Language-Conditioned Path Planning. CoRR abs/2308.16893 (2023) - [i285]Philipp Wu, Yide Shentu, Zhongke Yi, Xingyu Lin, Pieter Abbeel:
GELLO: A General, Low-Cost, and Intuitive Teleoperation Framework for Robot Manipulators. CoRR abs/2309.13037 (2023) - [i284]Jiangliu Wang, Jianbo Jiao, Yibing Song, Stephen James, Zhan Tong, Chongjian Ge, Pieter Abbeel, Yunhui Liu:
Speed Co-Augmentation for Unsupervised Audio-Visual Pre-training. CoRR abs/2309.13942 (2023) - [i283]Hao Liu, Matei Zaharia, Pieter Abbeel:
Ring Attention with Blockwise Transformers for Near-Infinite Context. CoRR abs/2310.01889 (2023) - [i282]Weirui Ye, Yunsheng Zhang, Mengchen Wang, Shengjie Wang, Xianfan Gu, Pieter Abbeel, Yang Gao:
Foundation Reinforcement Learning: towards Embodied Generalist Agents with Foundation Prior Assistance. CoRR abs/2310.02635 (2023) - [i281]Mengjiao Yang, Yilun Du, Kamyar Ghasemipour, Jonathan Tompson, Dale Schuurmans, Pieter Abbeel:
Learning Interactive Real-World Simulators. CoRR abs/2310.06114 (2023) - [i280]Hao Liu, Matei Zaharia, Pieter Abbeel:
Exploration with Principles for Diverse AI Supervision. CoRR abs/2310.08899 (2023) - [i279]Yilun Du, Mengjiao Yang, Pete Florence, Fei Xia, Ayzaan Wahid, Brian Ichter, Pierre Sermanet, Tianhe Yu, Pieter Abbeel, Joshua B. Tenenbaum, Leslie Pack Kaelbling, Andy Zeng, Jonathan Tompson:
Video Language Planning. CoRR abs/2310.10625 (2023) - [i278]Boyi Li, Philipp Wu, Pieter Abbeel, Jitendra Malik:
Interactive Task Planning with Language Models. CoRR abs/2310.10645 (2023) - [i277]Yoshua Bengio, Geoffrey E. Hinton, Andrew Yao, Dawn Song, Pieter Abbeel, Yuval Noah Harari, Ya-Qin Zhang, Lan Xue, Shai Shalev-Shwartz, Gillian K. Hadfield, Jeff Clune, Tegan Maharaj, Frank Hutter, Atilim Günes Baydin, Sheila A. McIlraith, Qiqi Gao, Ashwin Acharya, David Krueger, Anca D. Dragan, Philip H. S. Torr, Stuart Russell, Daniel Kahneman, Jan Brauner, Sören Mindermann:
Managing AI Risks in an Era of Rapid Progress. CoRR abs/2310.17688 (2023) - [i276]Carmelo Sferrazza, Younggyo Seo, Hao Liu, Youngwoon Lee, Pieter Abbeel:
The Power of the Senses: Generalizable Manipulation from Vision and Touch through Masked Multimodal Learning. CoRR abs/2311.00924 (2023) - [i275]Sam Toyer, Olivia Watkins, Ethan Adrian Mendes, Justin Svegliato, Luke Bailey, Tiffany Wang, Isaac Ong, Karim Elmaaroufi, Pieter Abbeel, Trevor Darrell, Alan Ritter, Stuart Russell:
Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game. CoRR abs/2311.01011 (2023) - [i274]Vint Lee, Pieter Abbeel, Youngwoon Lee:
DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing. CoRR abs/2311.01450 (2023) - [i273]Daiki E. Matsunaga, Jongmin Lee, Jaeseok Yoon, Stefanos Leonardos, Pieter Abbeel, Kee-Eung Kim:
AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation. CoRR abs/2311.02194 (2023) - [i272]Mengjiao Yang, KwangHwan Cho, Amil Merchant, Pieter Abbeel, Dale Schuurmans, Igor Mordatch, Ekin Dogus Cubuk:
Scalable Diffusion for Materials Generation. CoRR abs/2311.09235 (2023) - [i271]Wilson Yan, Andrew Brown, Pieter Abbeel, Rohit Girdhar, Samaneh Azadi:
Motion-Conditioned Image Animation for Video Editing. CoRR abs/2311.18827 (2023) - [i270]Michael Psenka, Alejandro Escontrela, Pieter Abbeel, Yi Ma:
Learning a Diffusion Model Policy from Rewards via Q-Score Matching. CoRR abs/2312.11752 (2023) - 2022
- [j26]Freek Stulp, Michael Spranger, Kim Listmann, Stéphane Doncieux, Moritz Tenorth, George Konidaris, Pieter Abbeel:
Innovation Paths for Machine Learning in Robotics [Industry Activities]. IEEE Robotics Autom. Mag. 29(4): 141-144 (2022) - [c291]Abdus Salam Azad, Edward Kim, Qiancheng Wu, Kimin Lee, Ion Stoica, Pieter Abbeel, Alberto L. Sangiovanni-Vincentelli, Sanjit A. Seshia:
Programmatic Modeling and Generation of Real-Time Strategic Soccer Environments for Reinforcement Learning. AAAI 2022: 6028-6036 - [c290]Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch:
Frozen Pretrained Transformers as Universal Computation Engines. AAAI 2022: 7628-7636 - [c289]Ryan Hoque, Lawrence Yunliang Chen, Satvik Sharma, Karthik Dharmarajan, Brijen Thananjeyan, Pieter Abbeel, Ken Goldberg:
Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision. CoRL 2022: 368-380 - [c288]Ilija Radosavovic, Tete Xiao, Stephen James, Pieter Abbeel, Jitendra Malik, Trevor Darrell:
Real-World Robot Learning with Masked Visual Pre-training. CoRL 2022: 416-426 - [c287]Younggyo Seo, Danijar Hafner, Hao Liu, Fangchen Liu, Stephen James, Kimin Lee, Pieter Abbeel:
Masked World Models for Visual Control. CoRL 2022: 1332-1344 - [c286]John So, Amber Xie, Sunggoo Jung, Jeffrey A. Edlund, Rohan Thakker, Ali-akbar Agha-mohammadi, Pieter Abbeel, Stephen James:
Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data. CoRL 2022: 1871-1881 - [c285]Philipp Wu, Alejandro Escontrela, Danijar Hafner, Pieter Abbeel, Ken Goldberg:
DayDreamer: World Models for Physical Robot Learning. CoRL 2022: 2226-2240 - [c284]Ajay Jain, Ben Mildenhall, Jonathan T. Barron, Pieter Abbeel, Ben Poole:
Zero-Shot Text-Guided Object Generation with Dream Fields. CVPR 2022: 857-866 - [c283]Kai Chen, Rui Cao, Stephen James, Yichuan Li, Yun-Hui Liu, Pieter Abbeel, Qi Dou:
Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin Picking. ECCV (39) 2022: 533-550 - [c282]Yuxuan Liu, Nikhil Mishra, Maximilian Sieb, Yide Shentu, Pieter Abbeel, Xi Chen:
Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction. ECCV (10) 2022: 673-694 - [c281]Younggyo Seo, Kimin Lee, Fangchen Liu, Stephen James, Pieter Abbeel:
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator. ICIP 2022: 3943-3947 - [c280]Yuqing Du, Pieter Abbeel, Aditya Grover:
It Takes Four to Tango: Multiagent Self Play for Automatic Curriculum Generation. ICLR 2022 - [c279]Kourosh Hakhamaneshi, Ruihan Zhao, Albert Zhan, Pieter Abbeel, Michael Laskin:
Hierarchical Few-Shot Imitation with Skill Transition Models. ICLR 2022 - [c278]Xinran Liang, Katherine Shu, Kimin Lee, Pieter Abbeel:
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning. ICLR 2022 - [c277]Jongjin Park, Younggyo Seo, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee:
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning. ICLR 2022 - [c276]Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch:
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents. ICML 2022: 9118-9147 - [c275]Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox:
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks. ICML 2022: 13285-13301 - [c274]Younggyo Seo, Kimin Lee, Stephen James, Pieter Abbeel:
Reinforcement Learning with Action-Free Pre-Training from Videos. ICML 2022: 19561-19579 - [c273]Mandi Zhao, Fangchen Liu, Kimin Lee, Pieter Abbeel:
Towards More Generalizable One-shot Visual Imitation Learning. ICRA 2022: 2434-2444 - [c272]Alejandro Escontrela, Xue Bin Peng, Wenhao Yu, Tingnan Zhang, Atil Iscen, Ken Goldberg, Pieter Abbeel:
Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions. IROS 2022: 25-32 - [c271]Sarah Young, Jyothish Pari, Pieter Abbeel, Lerrel Pinto:
Playful Interactions for Representation Learning. IROS 2022: 992-999 - [c270]Albert Zhan, Ruihan Zhao, Lerrel Pinto, Pieter Abbeel, Michael Laskin:
Learning Visual Robotic Control Efficiently with Contrastive Pre-training and Data Augmentation. IROS 2022: 4040-4047 - [c269]Kyle Hollins Wray, Stas Tiomkin, Mykel J. Kochenderfer, Pieter Abbeel:
Multi-Objective Policy Gradients with Topological Constraints. IROS 2022: 9034-9039 - [c268]Danijar Hafner, Kuang-Huei Lee, Ian Fischer, Pieter Abbeel:
Deep Hierarchical Planning from Pixels. NeurIPS 2022 - [c267]Michael Laskin, Hao Liu, Xue Bin Peng, Denis Yarats, Aravind Rajeswaran, Pieter Abbeel:
Unsupervised Reinforcement Learning with Contrastive Intrinsic Control. NeurIPS 2022 - [c266]Fangchen Liu, Hao Liu, Aditya Grover, Pieter Abbeel:
Masked Autoencoding for Scalable and Generalizable Decision Making. NeurIPS 2022 - [c265]Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum:
Chain of Thought Imitation with Procedure Cloning. NeurIPS 2022 - [c264]Weirui Ye, Pieter Abbeel, Yang Gao:
Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions. NeurIPS 2022 - [c263]Mandi Zhao, Pieter Abbeel, Stephen James:
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning. NeurIPS 2022 - [c262]Qiyang Li, Ajay Jain, Pieter Abbeel:
AdaCat: Adaptive categorical discretization for autoregressive models. UAI 2022: 1188-1198 - [i269]Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch:
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents. CoRR abs/2201.07207 (2022) - [i268]Julius Frost, Olivia Watkins, Eric Weiner, Pieter Abbeel, Trevor Darrell, Bryan A. Plummer, Kate Saenko:
Explaining Reinforcement Learning Policies through Counterfactual Trajectories. CoRR abs/2201.12462 (2022) - [i267]Denis Yarats, David Brandfonbrener, Hao Liu, Michael Laskin, Pieter Abbeel, Alessandro Lazaric, Lerrel Pinto:
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning. CoRR abs/2201.13425 (2022) - [i266]Michael Laskin, Hao Liu, Xue Bin Peng, Denis Yarats, Aravind Rajeswaran, Pieter Abbeel:
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery. CoRR abs/2202.00161 (2022) - [i265]Stephen James, Pieter Abbeel:
Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning. CoRR abs/2202.03957 (2022) - [i264]Yuqing Du, Pieter Abbeel, Aditya Grover:
It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation. CoRR abs/2202.10608 (2022) - [i263]Jongjin Park, Younggyo Seo, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee:
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning. CoRR abs/2203.10050 (2022) - [i262]Olivia Watkins, Trevor Darrell, Pieter Abbeel, Jacob Andreas, Abhishek Gupta:
Teachable Reinforcement Learning via Advice Distillation. CoRR abs/2203.11197 (2022) - [i261]Younggyo Seo, Kimin Lee, Stephen James, Pieter Abbeel:
Reinforcement Learning with Action-Free Pre-Training from Videos. CoRR abs/2203.13880 (2022) - [i260]Alejandro Escontrela, Xue Bin Peng, Wenhao Yu, Tingnan Zhang, Atil Iscen, Ken Goldberg, Pieter Abbeel:
Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions. CoRR abs/2203.15103 (2022) - [i259]Kourosh Hakhamaneshi, Marcel Nassar, Mariano Phielipp, Pieter Abbeel, Vladimir Stojanovic:
Pretraining Graph Neural Networks for few-shot Analog Circuit Modeling and Design. CoRR abs/2203.15913 (2022) - [i258]Stephen James, Pieter Abbeel:
Coarse-to-Fine Q-attention with Learned Path Ranking. CoRR abs/2204.01571 (2022) - [i257]Carl Qi, Pieter Abbeel, Aditya Grover:
Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning. CoRR abs/2204.03597 (2022) - [i256]Kai Chen, Rui Cao, Stephen James, Yichuan Li, Yun-Hui Liu, Pieter Abbeel, Qi Dou:
Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin-picking. CoRR abs/2204.07049 (2022) - [i255]Stephen James, Pieter Abbeel:
Coarse-to-fine Q-attention with Tree Expansion. CoRR abs/2204.12471 (2022) - [i254]Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H. Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah:
An Empirical Investigation of Representation Learning for Imitation. CoRR abs/2205.07886 (2022) - [i253]Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum:
Chain of Thought Imitation with Procedure Cloning. CoRR abs/2205.10816 (2022) - [i252]Xinran Liang, Katherine Shu, Kimin Lee, Pieter Abbeel:
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning. CoRR abs/2205.12401 (2022) - [i251]Xinyang Geng, Hao Liu, Lisa Lee, Dale Schuurams, Sergey Levine, Pieter Abbeel:
Multimodal Masked Autoencoders Learn Transferable Representations. CoRR abs/2205.14204 (2022) - [i250]Mandi Zhao, Pieter Abbeel, Stephen James:
On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning. CoRR abs/2206.03271 (2022) - [i249]Wilson Yan, Ryo Okumura, Stephen James, Pieter Abbeel:
Patch-based Object-centric Transformers for Efficient Video Generation. CoRR abs/2206.04003 (2022) - [i248]Danijar Hafner, Kuang-Huei Lee, Ian Fischer, Pieter Abbeel:
Deep Hierarchical Planning from Pixels. CoRR abs/2206.04114 (2022) - [i247]Philipp Wu, Alejandro Escontrela, Danijar Hafner, Ken Goldberg, Pieter Abbeel:
DayDreamer: World Models for Physical Robot Learning. CoRR abs/2206.14176 (2022) - [i246]Younggyo Seo, Danijar Hafner, Hao Liu, Fangchen Liu, Stephen James, Kimin Lee, Pieter Abbeel:
Masked World Models for Visual Control. CoRR abs/2206.14244 (2022) - [i245]Ryan Hoque, Lawrence Yunliang Chen, Satvik Sharma, Karthik Dharmarajan, Brijen Thananjeyan, Pieter Abbeel, Ken Goldberg:
Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision. CoRR abs/2206.14349 (2022) - [i244]Qiyang Li, Ajay Jain, Pieter Abbeel:
AdaCat: Adaptive Categorical Discretization for Autoregressive Models. CoRR abs/2208.02246 (2022) - [i243]Kyle Hollins Wray, Stas Tiomkin, Mykel J. Kochenderfer, Pieter Abbeel:
Multi-Objective Policy Gradients with Topological Constraints. CoRR abs/2209.07096 (2022) - [i242]Younggyo Seo, Kimin Lee, Fangchen Liu, Stephen James, Pieter Abbeel:
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator. CoRR abs/2209.07143 (2022) - [i241]Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox:
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks. CoRR abs/2209.07670 (2022) - [i240]Wilson Yan, Danijar Hafner, Stephen James, Pieter Abbeel:
Temporally Consistent Video Transformer for Long-Term Video Prediction. CoRR abs/2210.02396 (2022) - [i239]Ilija Radosavovic, Tete Xiao, Stephen James, Pieter Abbeel, Jitendra Malik, Trevor Darrell:
Real-World Robot Learning with Masked Visual Pre-training. CoRR abs/2210.03109 (2022) - [i238]Yuxuan Liu, Nikhil Mishra, Maximilian Sieb, Yide Shentu, Pieter Abbeel, Xi Chen:
Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction. CoRR abs/2210.07424 (2022) - [i237]Ademi Adeniji, Amber Xie, Pieter Abbeel:
Skill-Based Reinforcement Learning with Intrinsic Reward Matching. CoRR abs/2210.07426 (2022) - [i236]Abdus Salam Azad, Izzeddin Gur, Aleksandra Faust, Pieter Abbeel, Ion Stoica:
CLUTR: Curriculum Learning via Unsupervised Task Representation Learning. CoRR abs/2210.10243 (2022) - [i235]Weirui Ye, Pieter Abbeel, Yang Gao:
Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions. CoRR abs/2210.12628 (2022) - [i234]Hao Liu, Lisa Lee, Kimin Lee, Pieter Abbeel:
Instruction-Following Agents with Jointly Pre-Trained Vision-Language Models. CoRR abs/2210.13431 (2022) - [i233]Hao Liu, Xinyang Geng, Lisa Lee, Igor Mordatch, Sergey Levine, Sharan Narang, Pieter Abbeel:
FCM: Forgetful Causal Masking Makes Causal Language Models Better Zero-Shot Learners. CoRR abs/2210.13432 (2022) - [i232]Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum:
Dichotomy of Control: Separating What You Can Control from What You Cannot. CoRR abs/2210.13435 (2022) - [i231]John So, Amber Xie, Sunggoo Jung, Jeffrey A. Edlund, Rohan Thakker, Ali-Akbar Agha-Mohammadi, Pieter Abbeel, Stephen James:
Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data. CoRR abs/2210.14721 (2022) - [i230]Kai Chen, Stephen James, Congying Sui, Yun-Hui Liu, Pieter Abbeel, Qi Dou:
StereoPose: Category-Level 6D Transparent Object Pose Estimation from Stereo Images via Back-View NOCS. CoRR abs/2211.01644 (2022) - [i229]Ajay Jain, Amber Xie, Pieter Abbeel:
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models. CoRR abs/2211.11319 (2022) - [i228]Fangchen Liu, Hao Liu, Aditya Grover, Pieter Abbeel:
Masked Autoencoding for Scalable and Generalizable Decision Making. CoRR abs/2211.12740 (2022) - [i227]David Venuto, Sherry Yang, Pieter Abbeel, Doina Precup, Igor Mordatch, Ofir Nachum:
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets. CoRR abs/2211.13337 (2022) - 2021
- [j25]Gregory Kahn, Pieter Abbeel, Sergey Levine:
BADGR: An Autonomous Self-Supervised Learning-Based Navigation System. IEEE Robotics Autom. Lett. 6(2): 1312-1319 (2021) - [j24]Gregory Kahn, Pieter Abbeel, Sergey Levine:
LaND: Learning to Navigate From Disengagements. IEEE Robotics Autom. Lett. 6(2): 1872-1879 (2021) - [j23]Xue Bin Peng, Ze Ma, Pieter Abbeel, Sergey Levine, Angjoo Kanazawa:
AMP: adversarial motion priors for stylized physics-based character control. ACM Trans. Graph. 40(4): 144:1-144:20 (2021) - [c261]Xiaofei Wang, Kimin Lee, Kourosh Hakhamaneshi, Pieter Abbeel, Michael Laskin:
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback. CoRL 2021: 1259-1268 - [c260]Seunghyun Lee, Younggyo Seo, Kimin Lee, Pieter Abbeel, Jinwoo Shin:
Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble. CoRL 2021: 1702-1712 - [c259]Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani:
Bottleneck Transformers for Visual Recognition. CVPR 2021: 16519-16529 - [c258]Paras Jain, Ajay Jain, Tianjun Zhang, Pieter Abbeel, Joseph Gonzalez, Ion Stoica:
Contrastive Code Representation Learning. EMNLP (1) 2021: 5954-5971 - [c257]Ajay Jain, Matthew Tancik, Pieter Abbeel:
Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis. ICCV 2021: 5865-5874 - [c256]Ruihan Zhao, Kevin Lu, Pieter Abbeel, Stas Tiomkin:
Efficient Empowerment Estimation for Unsupervised Stabilization. ICLR 2021 - [c255]Nicklas Hansen, Rishabh Jangir, Yu Sun, Guillem Alenyà, Pieter Abbeel, Alexei A. Efros, Lerrel Pinto, Xiaolong Wang:
Self-Supervised Policy Adaptation during Deployment. ICLR 2021 - [c254]Donald Joseph Hejna III, Pieter Abbeel, Lerrel Pinto:
Task-Agnostic Morphology Evolution. ICLR 2021 - [c253]David Lindner, Rohin Shah, Pieter Abbeel, Anca D. Dragan:
Learning What To Do by Simulating the Past. ICLR 2021 - [c252]Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch:
Reset-Free Lifelong Learning with Skill-Space Planning. ICLR 2021 - [c251]Rui Zhao, Yang Gao, Pieter Abbeel, Volker Tresp, Wei Xu:
Mutual Information State Intrinsic Control. ICLR 2021 - [c250]Boyuan Chen, Pieter Abbeel, Deepak Pathak:
Unsupervised Learning of Visual 3D Keypoints for Control. ICML 2021: 1539-1549 - [c249]Kimin Lee, Michael Laskin, Aravind Srinivas, Pieter Abbeel:
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning. ICML 2021: 6131-6141 - [c248]Kimin Lee, Laura M. Smith, Pieter Abbeel:
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training. ICML 2021: 6152-6163 - [c247]Hao Liu, Pieter Abbeel:
APS: Active Pretraining with Successor Features. ICML 2021: 6736-6747 - [c246]Roshan Rao, Jason Liu, Robert Verkuil, Joshua Meier, John F. Canny, Pieter Abbeel, Tom Sercu, Alexander Rives:
MSA Transformer. ICML 2021: 8844-8856 - [c245]Younggyo Seo, Lili Chen, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee:
State Entropy Maximization with Random Encoders for Efficient Exploration. ICML 2021: 9443-9454 - [c244]Adam Stooke, Kimin Lee, Pieter Abbeel, Michael Laskin:
Decoupling Representation Learning from Reinforcement Learning. ICML 2021: 9870-9879 - [c243]Yuqing Du, Olivia Watkins, Trevor Darrell, Pieter Abbeel, Deepak Pathak:
Auto-Tuned Sim-to-Real Transfer. ICRA 2021: 1290-1296 - [c242]Zhongyu Li, Xuxin Cheng, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath:
Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots. ICRA 2021: 2811-2817 - [c241]Cynthia Chen, Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H. Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah:
An Empirical Investigation of Representation Learning for Imitation. NeurIPS Datasets and Benchmarks 2021 - [c240]Charles Packer, Pieter Abbeel, Joseph E. Gonzalez:
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL. NeurIPS 2021: 2466-2477 - [c239]Olivia Watkins, Abhishek Gupta, Trevor Darrell, Pieter Abbeel, Jacob Andreas:
Teachable Reinforcement Learning via Advice Distillation. NeurIPS 2021: 6920-6933 - [c238]Lili Chen, Kevin Lu, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas, Igor Mordatch:
Decision Transformer: Reinforcement Learning via Sequence Modeling. NeurIPS 2021: 15084-15097 - [c237]Michael Laskin, Denis Yarats, Hao Liu, Kimin Lee, Albert Zhan, Kevin Lu, Catherine Cang, Lerrel Pinto, Pieter Abbeel:
URLB: Unsupervised Reinforcement Learning Benchmark. NeurIPS Datasets and Benchmarks 2021 - [c236]Kimin Lee, Laura M. Smith, Anca D. Dragan, Pieter Abbeel:
B-Pref: Benchmarking Preference-Based Reinforcement Learning. NeurIPS Datasets and Benchmarks 2021 - [c235]Hao Liu, Pieter Abbeel:
Behavior From the Void: Unsupervised Active Pre-Training. NeurIPS 2021: 18459-18473 - [c234]Wenling Shang, Xiaofei Wang, Aravind Srinivas, Aravind Rajeswaran, Yang Gao, Pieter Abbeel, Michael Laskin:
Reinforcement Learning with Latent Flow. NeurIPS 2021: 22171-22183 - [c233]Weirui Ye, Shaohuai Liu, Thanard Kurutach, Pieter Abbeel, Yang Gao:
Mastering Atari Games with Limited Data. NeurIPS 2021: 25476-25488 - [c232]Lili Chen, Kimin Lee, Aravind Srinivas, Pieter Abbeel:
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. NeurIPS 2021: 26779-26791 - [i226]Wenling Shang, Xiaofei Wang, Aravind Srinivas, Aravind Rajeswaran, Yang Gao, Pieter Abbeel, Michael Laskin:
Reinforcement Learning with Latent Flow. CoRR abs/2101.01857 (2021) - [i225]Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani:
Bottleneck Transformers for Visual Recognition. CoRR abs/2101.11605 (2021) - [i224]Younggyo Seo, Lili Chen, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee:
State Entropy Maximization with Random Encoders for Efficient Exploration. CoRR abs/2102.09430 (2021) - [i223]Donald J. Hejna III, Pieter Abbeel, Lerrel Pinto:
Task-Agnostic Morphology Evolution. CoRR abs/2102.13100 (2021) - [i222]Lili Chen, Kimin Lee, Aravind Srinivas, Pieter Abbeel:
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. CoRR abs/2103.02886 (2021) - [i221]Hao Liu, Pieter Abbeel:
Behavior From the Void: Unsupervised Active Pre-Training. CoRR abs/2103.04551 (2021) - [i220]Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch:
Pretrained Transformers as Universal Computation Engines. CoRR abs/2103.05247 (2021) - [i219]Rui Zhao, Yang Gao, Pieter Abbeel, Volker Tresp, Wei Xu:
Mutual Information State Intrinsic Control. CoRR abs/2103.08107 (2021) - [i218]Zhongyu Li, Xuxin Cheng, Xue Bin Peng, Pieter Abbeel, Sergey Levine, Glen Berseth, Koushil Sreenath:
Reinforcement Learning for Robust Parameterized Locomotion Control of Bipedal Robots. CoRR abs/2103.14295 (2021) - [i217]Ajay Jain, Matthew Tancik, Pieter Abbeel:
Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis. CoRR abs/2104.00677 (2021) - [i216]Xue Bin Peng, Ze Ma, Pieter Abbeel, Sergey Levine, Angjoo Kanazawa:
AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control. CoRR abs/2104.02180 (2021) - [i215]Philippe Hansen-Estruch, Wenling Shang, Lerrel Pinto, Pieter Abbeel, Stas Tiomkin:
GEM: Group Enhanced Model for Learning Dynamical Control Systems. CoRR abs/2104.02844 (2021) - [i214]David Lindner, Rohin Shah, Pieter Abbeel, Anca D. Dragan:
Learning What To Do by Simulating the Past. CoRR abs/2104.03946 (2021) - [i213]Yuqing Du, Olivia Watkins, Trevor Darrell, Pieter Abbeel, Deepak Pathak:
Auto-Tuned Sim-to-Real Transfer. CoRR abs/2104.07662 (2021) - [i212]Wilson Yan, Yunzhi Zhang, Pieter Abbeel, Aravind Srinivas:
VideoGPT: Video Generation using VQ-VAE and Transformers. CoRR abs/2104.10157 (2021) - [i211]Kourosh Hakhamaneshi, Pieter Abbeel, Vladimir Stojanovic, Aditya Grover:
JUMBO: Scalable Multi-task Bayesian Optimization using Offline Data. CoRR abs/2106.00942 (2021) - [i210]Lili Chen, Kevin Lu, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas, Igor Mordatch:
Decision Transformer: Reinforcement Learning via Sequence Modeling. CoRR abs/2106.01345 (2021) - [i209]Kimin Lee, Laura M. Smith, Pieter Abbeel:
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training. CoRR abs/2106.05091 (2021) - [i208]Boyuan Chen, Pieter Abbeel, Deepak Pathak:
Unsupervised Learning of Visual 3D Keypoints for Control. CoRR abs/2106.07643 (2021) - [i207]Catherine Cang, Aravind Rajeswaran, Pieter Abbeel, Michael Laskin:
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL. CoRR abs/2106.09119 (2021) - [i206]Abdus Salam Azad, Edward Kim, Qiancheng Wu, Kimin Lee, Ion Stoica, Pieter Abbeel, Sanjit A. Seshia:
Scenic4RL: Programmatic Modeling and Generation of Reinforcement Learning Environments. CoRR abs/2106.10365 (2021) - [i205]Seunghyun Lee, Younggyo Seo, Kimin Lee, Pieter Abbeel, Jinwoo Shin:
Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble. CoRR abs/2107.00591 (2021) - [i204]Rohin Shah, Cody Wild, Steven H. Wang, Neel Alex, Brandon Houghton, William H. Guss, Sharada P. Mohanty, Anssi Kanervisto, Stephanie Milani, Nicholay Topin, Pieter Abbeel, Stuart Russell, Anca D. Dragan:
The MineRL BASALT Competition on Learning from Human Feedback. CoRR abs/2107.01969 (2021) - [i203]Kourosh Hakhamaneshi, Ruihan Zhao, Albert Zhan, Pieter Abbeel, Michael Laskin:
Hierarchical Few-Shot Imitation with Skill Transition Models. CoRR abs/2107.08981 (2021) - [i202]Sarah Young, Jyothish Pari, Pieter Abbeel, Lerrel Pinto:
Playful Interactions for Representation Learning. CoRR abs/2107.09046 (2021) - [i201]Xiaofei Wang, Kimin Lee, Kourosh Hakhamaneshi, Pieter Abbeel, Michael Laskin:
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback. CoRR abs/2108.05382 (2021) - [i200]Hao Liu, Pieter Abbeel:
APS: Active Pretraining with Successor Features. CoRR abs/2108.13956 (2021) - [i199]Mandi Zhao, Fangchen Liu, Kimin Lee, Pieter Abbeel:
Towards More Generalizable One-shot Visual Imitation Learning. CoRR abs/2110.13423 (2021) - [i198]Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox:
Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates. CoRR abs/2110.14818 (2021) - [i197]Michael Laskin, Denis Yarats, Hao Liu, Kimin Lee, Albert Zhan, Kevin Lu, Catherine Cang, Lerrel Pinto, Pieter Abbeel:
URLB: Unsupervised Reinforcement Learning Benchmark. CoRR abs/2110.15191 (2021) - [i196]Weirui Ye, Shaohuai Liu, Thanard Kurutach, Pieter Abbeel, Yang Gao:
Mastering Atari Games with Limited Data. CoRR abs/2111.00210 (2021) - [i195]Kimin Lee, Laura M. Smith, Anca D. Dragan, Pieter Abbeel:
B-Pref: Benchmarking Preference-Based Reinforcement Learning. CoRR abs/2111.03026 (2021) - [i194]Wenlong Huang, Igor Mordatch, Pieter Abbeel, Deepak Pathak:
Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning. CoRR abs/2111.03062 (2021) - [i193]Dailin Hu, Pieter Abbeel, Roy Fox:
Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning. CoRR abs/2111.14204 (2021) - [i192]Charles Packer, Pieter Abbeel, Joseph E. Gonzalez:
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL. CoRR abs/2112.00901 (2021) - [i191]Ajay Jain, Ben Mildenhall, Jonathan T. Barron, Pieter Abbeel, Ben Poole:
Zero-Shot Text-Guided Object Generation with Dream Fields. CoRR abs/2112.01455 (2021) - [i190]Yaosheng Xu, Dailin Hu, Litian Liang, Stephen McAleer, Pieter Abbeel, Roy Fox:
Target Entropy Annealing for Discrete Soft Actor-Critic. CoRR abs/2112.02852 (2021) - 2020
- [c231]Wilson Yan, Ashwin Vangipuram, Pieter Abbeel, Lerrel Pinto:
Learning Predictive Representations for Deformable Objects Using Contrastive Estimation. CoRL 2020: 564-574 - [c230]Sarah Young, Dhiraj Gandhi, Shubham Tulsiani, Abhinav Gupta, Pieter Abbeel, Lerrel Pinto:
Visual Imitation Made Easy. CoRL 2020: 1992-2005 - [c229]Ignasi Clavera, Yao Fu, Pieter Abbeel:
Model-Augmented Actor-Critic: Backpropagating through Paths. ICLR 2020 - [c228]Alexander C. Li, Carlos Florensa, Ignasi Clavera, Pieter Abbeel:
Sub-policy Adaptation for Hierarchical Reinforcement Learning. ICLR 2020 - [c227]Donald J. Hejna III, Lerrel Pinto, Pieter Abbeel:
Hierarchically Decoupled Imitation For Morphological Transfer. ICML 2020: 4159-4171 - [c226]Michael Laskin, Aravind Srinivas, Pieter Abbeel:
CURL: Contrastive Unsupervised Representations for Reinforcement Learning. ICML 2020: 5639-5650 - [c225]Eric Liang, Zongheng Yang, Ion Stoica, Pieter Abbeel, Yan Duan, Xi Chen:
Variable Skipping for Autoregressive Range Density Estimation. ICML 2020: 6040-6049 - [c224]Kara Liu, Thanard Kurutach, Christine Tung, Pieter Abbeel, Aviv Tamar:
Hallucinative Topological Memory for Zero-Shot Visual Planning. ICML 2020: 6259-6270 - [c223]Ramanan Sekar, Oleh Rybkin, Kostas Daniilidis, Pieter Abbeel, Danijar Hafner, Deepak Pathak:
Planning to Explore via Self-Supervised World Models. ICML 2020: 8583-8592 - [c222]Adam Stooke, Joshua Achiam, Pieter Abbeel:
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods. ICML 2020: 9133-9143 - [c221]Ge Yang, Amy Zhang, Ari S. Morcos, Joelle Pineau, Pieter Abbeel, Roberto Calandra:
Plan2Vec: Unsupervised Representation Learning by Latent Plans. L4DC 2020: 935-946 - [c220]Paras Jain, Ajay Jain, Aniruddha Nrusimha, Amir Gholami, Pieter Abbeel, Kurt Keutzer, Ion Stoica, Joseph Gonzalez:
Checkmate: Breaking the Memory Wall with Optimal Tensor Rematerialization. MLSys 2020 - [c219]Yuqing Du, Stas Tiomkin, Emre Kiciman, Daniel Polani, Pieter Abbeel, Anca D. Dragan:
AvE: Assistance via Empowerment. NeurIPS 2020 - [c218]Scott Emmons, Ajay Jain, Michael Laskin, Thanard Kurutach, Pieter Abbeel, Deepak Pathak:
Sparse Graphical Memory for Robust Planning. NeurIPS 2020 - [c217]Jonathan Ho, Ajay Jain, Pieter Abbeel:
Denoising Diffusion Probabilistic Models. NeurIPS 2020 - [c216]Michael Laskin, Kimin Lee, Adam Stooke, Lerrel Pinto, Pieter Abbeel, Aravind Srinivas:
Reinforcement Learning with Augmented Data. NeurIPS 2020 - [c215]Alex X. Lee, Anusha Nagabandi, Pieter Abbeel, Sergey Levine:
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model. NeurIPS 2020 - [c214]Alexander C. Li, Lerrel Pinto, Pieter Abbeel:
Generalized Hindsight for Reinforcement Learning. NeurIPS 2020 - [c213]Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte, Thanard Kurutach, Jinwoo Shin, Pieter Abbeel:
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning. NeurIPS 2020 - [c212]Yunzhi Zhang, Pieter Abbeel, Lerrel Pinto:
Automatic Curriculum Learning through Value Disagreement. NeurIPS 2020 - [c211]Laura M. Smith, Nikita Dhawan, Marvin Zhang, Pieter Abbeel, Sergey Levine:
AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human Videos. Robotics: Science and Systems 2020 - [c210]Yilin Wu, Wilson Yan, Thanard Kurutach, Lerrel Pinto, Pieter Abbeel:
Learning to Manipulate Deformable Objects without Demonstrations. Robotics: Science and Systems 2020 - [c209]Ajay Jain, Pieter Abbeel, Deepak Pathak:
Locally Masked Convolution for Autoregressive Models. UAI 2020: 1358-1367 - [e2]Ken Goldberg, Pieter Abbeel, Kostas E. Bekris, Lauren Miller:
Algorithmic Foundations of Robotics XII, Proceedings of the Twelfth Workshop on the Algorithmic Foundations of Robotics, WAFR 2016, San Francisco, California, USA, December 18-20, 2016. Springer Proceedings in Advanced Robotics 13, Springer 2020, ISBN 978-3-030-43088-7 [contents] - [i189]Albert Zhan, Stas Tiomkin, Pieter Abbeel:
Preventing Imitation Learning with Adversarial Policy Ensembles. CoRR abs/2002.01059 (2020) - [i188]Gregory Kahn, Pieter Abbeel, Sergey Levine:
BADGR: An Autonomous Self-Supervised Learning-Based Navigation System. CoRR abs/2002.05700 (2020) - [i187]Kourosh Hakhamaneshi, Keertana Settaluri, Pieter Abbeel, Vladimir Stojanovic:
GACEM: Generalized Autoregressive Cross Entropy Method for Multi-Modal Black Box Constraint Satisfaction. CoRR abs/2002.07236 (2020) - [i186]Alexander C. Li, Lerrel Pinto, Pieter Abbeel:
Generalized Hindsight for Reinforcement Learning. CoRR abs/2002.11708 (2020) - [i185]Kara Liu, Thanard Kurutach, Christine Tung, Pieter Abbeel, Aviv Tamar:
Hallucinative Topological Memory for Zero-Shot Visual Planning. CoRR abs/2002.12336 (2020) - [i184]Donald J. Hejna III, Pieter Abbeel, Lerrel Pinto:
Hierarchically Decoupled Imitation for Morphological Transfer. CoRR abs/2003.01709 (2020) - [i183]Wilson Yan, Ashwin Vangipuram, Pieter Abbeel, Lerrel Pinto:
Learning Predictive Representations for Deformable Objects Using Contrastive Estimation. CoRR abs/2003.05436 (2020) - [i182]Michael Laskin, Scott Emmons, Ajay Jain, Thanard Kurutach, Pieter Abbeel, Deepak Pathak:
Sparse Graphical Memory for Robust Planning. CoRR abs/2003.06417 (2020) - [i181]Aravind Srinivas, Michael Laskin, Pieter Abbeel:
CURL: Contrastive Unsupervised Representations for Reinforcement Learning. CoRR abs/2004.04136 (2020) - [i180]Michael Laskin, Kimin Lee, Adam Stooke, Lerrel Pinto, Pieter Abbeel, Aravind Srinivas:
Reinforcement Learning with Augmented Data. CoRR abs/2004.14990 (2020) - [i179]Ge Yang, Amy Zhang, Ari S. Morcos, Joelle Pineau, Pieter Abbeel, Roberto Calandra:
Plan2Vec: Unsupervised Representation Learning by Latent Plans. CoRR abs/2005.03648 (2020) - [i178]Ramanan Sekar, Oleh Rybkin, Kostas Daniilidis, Pieter Abbeel, Danijar Hafner, Deepak Pathak:
Planning to Explore via Self-Supervised World Models. CoRR abs/2005.05960 (2020) - [i177]Ignasi Clavera, Violet Fu, Pieter Abbeel:
Model-Augmented Actor-Critic: Backpropagating through Paths. CoRR abs/2005.08068 (2020) - [i176]Yiming Ding, Ignasi Clavera, Pieter Abbeel:
Mutual Information Maximization for Robust Plannable Representations. CoRR abs/2005.08114 (2020) - [i175]Yunzhi Zhang, Pieter Abbeel, Lerrel Pinto:
Automatic Curriculum Learning through Value Disagreement. CoRR abs/2006.09641 (2020) - [i174]Jonathan Ho, Ajay Jain, Pieter Abbeel:
Denoising Diffusion Probabilistic Models. CoRR abs/2006.11239 (2020) - [i173]Ajay Jain, Pieter Abbeel, Deepak Pathak:
Locally Masked Convolution for Autoregressive Models. CoRR abs/2006.12486 (2020) - [i172]Yuqing Du, Stas Tiomkin, Emre Kiciman, Daniel Polani, Pieter Abbeel, Anca D. Dragan:
AvE: Assistance via Empowerment. CoRR abs/2006.14796 (2020) - [i171]Adam Stooke, Joshua Achiam, Pieter Abbeel:
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods. CoRR abs/2007.03964 (2020) - [i170]Nicklas Hansen, Yu Sun, Pieter Abbeel, Alexei A. Efros, Lerrel Pinto, Xiaolong Wang:
Self-Supervised Policy Adaptation during Deployment. CoRR abs/2007.04309 (2020) - [i169]Kimin Lee, Michael Laskin, Aravind Srinivas, Pieter Abbeel:
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning. CoRR abs/2007.04938 (2020) - [i168]Paras Jain, Ajay Jain, Tianjun Zhang, Pieter Abbeel, Joseph E. Gonzalez, Ion Stoica:
Contrastive Code Representation Learning. CoRR abs/2007.04973 (2020) - [i167]Eric Liang, Zongheng Yang, Ion Stoica, Pieter Abbeel, Yan Duan, Xi Chen:
Variable Skipping for Autoregressive Range Density Estimation. CoRR abs/2007.05572 (2020) - [i166]Ruihan Zhao, Pieter Abbeel, Stas Tiomkin:
Efficient Online Estimation of Empowerment for Reinforcement Learning. CoRR abs/2007.07356 (2020) - [i165]Hao Liu, Pieter Abbeel:
Hybrid Discriminative-Generative Training via Contrastive Learning. CoRR abs/2007.09070 (2020) - [i164]Xingyu Lu, Kimin Lee, Pieter Abbeel, Stas Tiomkin:
Dynamics Generalization via Information Bottleneck in Deep Reinforcement Learning. CoRR abs/2008.00614 (2020) - [i163]Eugene Vinitsky, Yuqing Du, Kanaad Parvate, Kathy Jang, Pieter Abbeel, Alexandre M. Bayen:
Robust Reinforcement Learning using Adversarial Populations. CoRR abs/2008.01825 (2020) - [i162]Sarah Young, Dhiraj Gandhi, Shubham Tulsiani, Abhinav Gupta, Pieter Abbeel, Lerrel Pinto:
Visual Imitation Made Easy. CoRR abs/2008.04899 (2020) - [i161]Adam Stooke, Kimin Lee, Pieter Abbeel, Michael Laskin:
Decoupling Representation Learning from Reinforcement Learning. CoRR abs/2009.08319 (2020) - [i160]Gregory Kahn, Pieter Abbeel, Sergey Levine:
LaND: Learning to Navigate from Disengagements. CoRR abs/2010.04689 (2020) - [i159]Younggyo Seo, Kimin Lee, Ignasi Clavera, Thanard Kurutach, Jinwoo Shin, Pieter Abbeel:
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning. CoRR abs/2010.13303 (2020) - [i158]Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch:
Reset-Free Lifelong Learning with Skill-Space Planning. CoRR abs/2012.03548 (2020) - [i157]Michael Laskin, Luke Metz, Seth Nabarrao, Mark Saroufim, Badreddine Noune, Carlo Luschi, Jascha Sohl-Dickstein, Pieter Abbeel:
Parallel Training of Deep Networks with Local Updates. CoRR abs/2012.03837 (2020) - [i156]Albert Zhan, Philip Zhao, Lerrel Pinto, Pieter Abbeel, Michael Laskin:
A Framework for Efficient Robotic Manipulation. CoRR abs/2012.07975 (2020)
2010 – 2019
- 2019
- [j22]Sandy H. Huang, David Held, Pieter Abbeel, Anca D. Dragan:
Enabling robots to communicate their objectives. Auton. Robots 43(2): 309-326 (2019) - [j21]Zongheng Yang, Eric Liang, Amog Kamsetty, Chenggang Wu, Yan Duan, Xi Chen, Pieter Abbeel, Joseph M. Hellerstein, Sanjay Krishnan, Ion Stoica:
Deep Unsupervised Cardinality Estimation. Proc. VLDB Endow. 13(3): 279-292 (2019) - [c208]Menglong Guo, Philipp Wu, Brent Yi, David V. Gealy, Stephen McKinley, Pieter Abbeel:
Blue Gripper: A Robust, Low-Cost, and Force-Controlled Robot Hand. CASE 2019: 1505-1510 - [c207]Yunzhi Zhang, Ignasi Clavera, Boren Tsai, Pieter Abbeel:
Asynchronous Methods for Model-Based Reinforcement Learning. CoRL 2019: 1338-1347 - [c206]Kourosh Hakhamaneshi, Nick Werblun, Pieter Abbeel, Vladimir Stojanovic:
Analog Circuit Generator based on Deep Neural Network enhanced Combinatorial Optimization. DAC 2019: 228 - [c205]Kourosh Hakhamaneshi, Nick Werblun, Pieter Abbeel, Vladimir Stojanovic:
BagNet: Berkeley Analog Generator with Layout Optimizer Boosted with Deep Neural Networks. ICCAD 2019: 1-8 - [c204]John D. Co-Reyes, Abhishek Gupta, Suvansh Sanjeev, Nick Altieri, Jacob Andreas, John DeNero, Pieter Abbeel, Sergey Levine:
Guiding Policies with Language via Meta-Learning. ICLR (Poster) 2019 - [c203]Anusha Nagabandi, Ignasi Clavera, Simin Liu, Ronald S. Fearing, Pieter Abbeel, Sergey Levine, Chelsea Finn:
Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning. ICLR (Poster) 2019 - [c202]Xue Bin Peng, Angjoo Kanazawa, Sam Toyer, Pieter Abbeel, Sergey Levine:
Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow. ICLR (Poster) 2019 - [c201]Jonas Rothfuss, Dennis Lee, Ignasi Clavera, Tamim Asfour, Pieter Abbeel:
ProMP: Proximal Meta-Policy Search. ICLR (Poster) 2019 - [c200]Rohin Shah, Dmitrii Krasheninnikov, Jordan Alexander, Pieter Abbeel, Anca D. Dragan:
Preferences Implicit in the State of the World. ICLR (Poster) 2019 - [c199]Jonathan Ho, Xi Chen, Aravind Srinivas, Yan Duan, Pieter Abbeel:
Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design. ICML 2019: 2722-2730 - [c198]