Dept. of Robotics Science and Technology,
Chubu University

Vision Applications Conference

Action Spotting and Temporal Attention Analysis in Soccer Videos

Author
Hiroaki Minoura, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi, Mitsuru Nakazawa, Yeongnam Chae, Bjorn Stenger
Publication
International Conference on Machine Vision Applications, 2021

Download: PDF (English)

This paper introduces an action spotting method for video summarization in soccer games. Action spotting is the task of finding a specific action in a video. In this paper, we consider the task of spotting actions in soccer videos, e.g., goals, player substitutions, and card scenes, which are temporally sparse within a complete game. We spot actions using a Transformer model, which allows capturing important features before and after action scenes. Moreover, we analyze which time instances the model focuses on when predicting an action by observing the internal weights of the transformer. Quantitative results on the public SoccerNet dataset show that the proposed method achieves the best mAP, a significant improvement over previous methods. In addition, by analyzing the attention weights, we discover that the model focuses on different temporal neighborhoods for different actions.

Previous Next