CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning

Andersen, Per-Arne; Goodwin, Morten; Granmo, Ole-Christoffer

doi:10.1007/978-3-030-63799-6_7

Computer Science > Machine Learning

arXiv:2210.01805 (cs)

[Submitted on 3 Oct 2022]

Title:CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning

Authors:Per-Arne Andersen, Morten Goodwin, Ole-Christoffer Granmo

View PDF

Abstract:Reinforcement Learning (RL) is a general framework concerned with an agent that seeks to maximize rewards in an environment. The learning typically happens through trial and error using explorative methods, such as epsilon-greedy. There are two approaches, model-based and model-free reinforcement learning, that show concrete results in several disciplines. Model-based RL learns a model of the environment for learning the policy while model-free approaches are fully explorative and exploitative without considering the underlying environment dynamics. Model-free RL works conceptually well in simulated environments, and empirical evidence suggests that trial and error lead to a near-optimal behavior with enough training. On the other hand, model-based RL aims to be sample efficient, and studies show that it requires far less training in the real environment for learning a good policy.
A significant challenge with RL is that it relies on a well-defined reward function to work well for complex environments and such a reward function is challenging to define. Goal-Directed RL is an alternative method that learns an intrinsic reward function with emphasis on a few explored trajectories that reveals the path to the goal state.
This paper introduces a novel reinforcement learning algorithm for predicting the distance between two states in a Markov Decision Process. The learned distance function works as an intrinsic reward that fuels the agent's learning. Using the distance-metric as a reward, we show that the algorithm performs comparably to model-free RL while having significantly better sample-efficiently in several test environments.

Comments:	14 pages, 5 figures, In Proceedings of the International Conference on Innovative Techniques and Applications of Artificial Intelligence, SGAI2020
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2210.01805 [cs.LG]
	(or arXiv:2210.01805v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.01805
Journal reference:	2020 Springer Nature Switzerland AG
Related DOI:	https://doi.org/10.1007/978-3-030-63799-6_7

Submission history

From: Per-Arne Andersen [view email]
[v1] Mon, 3 Oct 2022 21:16:14 UTC (472 KB)

Computer Science > Machine Learning

Title:CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators