MPRG: Machine Perception and Robotics Group / Chubu University

26 Nov 2017 International Conference

Travel Time-dependent Maximum Entropy Inverse Reinforcement Learning for Seabird Trajectory Prediction

Author: T. Hirakawa, T. Yamashita, K. Yoda, T. Tamaki, H. Fujiyoshi
Publication: Asian Conference on Pattern Recognition, 2017

Download: PDF (English)

Trajectory prediction is a challenging problem in computer vision, robotics, and machine learning, and many methods have been proposed for it. Most of these methods generate goal-directed trajectories, which head toward a goal in a straight line while avoiding obstacles. However, there are also non-goal-directed trajectories, which take detours on the way to the goal. In this paper, we propose a method for predicting such non-goal-directed trajectories based on the maximum entropy inverse reinforcement learning framework. Our method introduces travel time as a state of the Markov decision process. As a practical example, we apply the proposed method to seabird trajectories measured using global positioning system loggers. Experimental results show that the proposed method can effectively predict non-goal-directed trajectories.
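To illustrate what adding travel time to the state can look like, here is a minimal sketch of the backward pass of maximum entropy inverse reinforcement learning (soft value iteration) over time-augmented states `(t, row, col)`. It is not the authors' implementation. The grid world, the action set `ACTIONS`, the function name `soft_value_iteration`, the fixed reward map, and the requirement that the goal be reached exactly at step `horizon` are all assumptions made for illustration. Reward learning from GPS tracks is omitted.

```python
import numpy as np

# Hypothetical action set: stay, or move to a 4-connected neighbor.
# Every action advances travel time by exactly one step.
ACTIONS = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)]


def soft_value_iteration(reward, goal, horizon):
    """Soft value iteration over time-augmented states (t, row, col).

    reward:  (H, W) array of per-cell rewards (assumed given, not learned here).
    goal:    (row, col) cell that must be occupied at time step `horizon`.
    Returns V of shape (horizon + 1, H, W) and a stochastic policy of
    shape (horizon, H, W, len(ACTIONS)).
    """
    h, w = reward.shape
    V = np.full((horizon + 1, h, w), -np.inf)
    V[horizon, goal[0], goal[1]] = 0.0  # only the goal is valid at the final time
    policy = np.zeros((horizon, h, w, len(ACTIONS)))

    for t in range(horizon - 1, -1, -1):
        Q = np.full((h, w, len(ACTIONS)), -np.inf)
        for a, (dr, dc) in enumerate(ACTIONS):
            for r in range(h):
                for c in range(w):
                    nr, nc = r + dr, c + dc
                    if 0 <= nr < h and 0 <= nc < w:
                        Q[r, c, a] = reward[r, c] + V[t + 1, nr, nc]

        # Numerically stable log-sum-exp over actions (the "soft" maximum).
        m = Q.max(axis=2)
        finite = np.isfinite(m)
        safe_m = np.where(finite, m, 0.0)
        with np.errstate(divide="ignore"):
            lse = safe_m + np.log(np.exp(Q - safe_m[..., None]).sum(axis=2))
        V[t] = np.where(finite, lse, -np.inf)

        # pi(a | t, s) = exp(Q - V); states that cannot reach the goal
        # in the remaining time keep an all-zero row.
        policy[t][finite] = np.exp(Q[finite] - V[t][finite][:, None])

    return V, policy


# Usage: 3x3 grid, zero reward, goal at the top-left corner, 4 time steps.
V, policy = soft_value_iteration(np.zeros((3, 3)), goal=(0, 0), horizon=4)
```

Because the remaining travel time is part of the state, the same cell can have different values and action distributions at different time steps. In this sketch, choosing a `horizon` longer than the shortest path forces the policy to spend the extra time somewhere, which is one simple way detours can arise.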