Reinforcement Learning in Discrete and Continuous Domains Applied to Ship Trajectory Generation

Andrzej Rak; Witold Gierusz

doi:10.2478/v10012-012-0020-8

Reinforcement Learning in Discrete and Continuous Domains Applied to Ship Trajectory Generation

Abstract

This paper presents the application of the reinforcement learning algorithms to the task of autonomous determination of the ship trajectory during the in-harbour and harbour approaching manoeuvres. Authors used Markov decision processes formalism to build up the background of algorithm presentation. Two versions of RL algorithms were tested in the simulations: discrete (Q-learning) and continuous form (Least-Squares Policy Iteration). The results show that in both cases ship trajectory can be found. However discrete Q learning algorithm suffered from many limitations (mainly curse of dimensionality) and practically is not applicable to the examined task. On the other hand, LSPI gave promising results. To be fully operational, proposed solution should be extended by taking into account ship heading and velocity and coupling with advanced multi-variable controller.

Keywords:

ship motion control, trajectory generation, autonomous navigation, reinforcement learning, least-squares policy iteration

Details

Issue

Vol. 19 No. S1 (74) (2012)

Section

Latest Articles

Published

31-10-2012

DOI:

https://doi.org/10.2478/v10012-012-0020-8

Licencja:

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Open Access License

This journal provides immediate open access to its content under the Creative Commons BY 4.0 license. Authors who publish with this journal retain all copyrights and agree to the terms of the CC BY 4.0 license.

Authors

Andrzej Rak
Gdynia Maritime University, Faculty of Marine Electrical Engineering
Witold Gierusz
Gdynia Maritime University, Faculty of Marine Electrical Engineering

Download paper

pdf

Main menu

Reinforcement Learning in Discrete and Continuous Domains Applied to Ship Trajectory Generation

Abstract

Keywords:

Details

Authors

Andrzej Rak

Witold Gierusz

Download paper

Most read articles by the same author(s)