A Reliable Representation with Bidirectional Transition Model for Visual Reinforcement Learning Generalization

Hu, Xiaobo; Lin, Youfang; Liu, Yue; Wang, Jinwen; Wang, Shuo; Fan, Hehe; Lv, Kai

arXiv:2312.01915 (cs)

[Submitted on 4 Dec 2023]

Abstract: Visual reinforcement learning has proven effective in solving control tasks with high-dimensional observations. However, extracting reliable and generalizable representations from vision-based observations remains a central challenge. Inspired by the human thought process, we posit that a representation is reliable and accurate in comprehending the environment when it can both predict the future and trace history. Based on this concept, we introduce a Bidirectional Transition (BiT) model, which extracts reliable representations by predicting environmental transitions both forward and backward. Our model demonstrates competitive generalization performance and sample efficiency on two settings of the DeepMind Control suite. Additionally, we use robotic manipulation and CARLA simulators to demonstrate the wide applicability of our method.
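
The bidirectional idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not specify the loss, the architecture, or the latent format, so the squared-error objective, the function names (`bidirectional_loss`, `toy_forward`, `toy_backward`), and the additive toy dynamics below are all assumptions made for illustration. The sketch scores a latent transition by how well a forward model predicts the next latent and how well a backward model recovers the previous one.

```python
from typing import Callable, List, Sequence

Vector = List[float]


def mse(pred: Sequence[float], target: Sequence[float]) -> float:
    """Mean squared error between two equal-length vectors."""
    if len(pred) != len(target):
        raise ValueError("vectors must have the same length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def bidirectional_loss(
    z_t: Vector,
    action: Vector,
    z_next: Vector,
    forward_model: Callable[[Vector, Vector], Vector],
    backward_model: Callable[[Vector, Vector], Vector],
) -> float:
    """Sum of forward and backward latent prediction errors (assumed form)."""
    forward_error = mse(forward_model(z_t, action), z_next)    # predict the future
    backward_error = mse(backward_model(z_next, action), z_t)  # trace history
    return forward_error + backward_error


# Toy dynamics, assumed for illustration only: the action is added to the latent.
def toy_forward(z: Vector, a: Vector) -> Vector:
    return [zi + ai for zi, ai in zip(z, a)]


def toy_backward(z_next: Vector, a: Vector) -> Vector:
    return [zi - ai for zi, ai in zip(z_next, a)]


z_t = [0.5, -1.0]
action = [0.25, 0.5]
consistent_next = [0.75, -0.5]    # agrees with the toy dynamics
inconsistent_next = [1.75, -0.5]  # first coordinate is off by 1.0

loss_consistent = bidirectional_loss(
    z_t, action, consistent_next, toy_forward, toy_backward
)
loss_inconsistent = bidirectional_loss(
    z_t, action, inconsistent_next, toy_forward, toy_backward
)
```

In this toy setting a transition that agrees with the dynamics incurs no loss in either direction, while an inconsistent one is penalized by both the forward and the backward term. In a learned system the two models and the encoder producing the latents would presumably be trained jointly, but the abstract does not give those details.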

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.01915 [cs.CV]
	(or arXiv:2312.01915v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.01915

Submission history

From: Xiaobo Hu
[v1] Mon, 4 Dec 2023 14:19:36 UTC (2,493 KB)
