ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos - Sorbonne Université
Conference paper, Year: 2021

ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos

Abstract

Action spotting has recently been proposed as an alternative to action detection and key frame extraction. However, the current state-of-the-art action spotting method requires expensive ground truth consisting of the search sequences that human annotators follow when spotting actions, which is a critical limitation. In this article, we propose a reinforcement learning algorithm that performs efficient action spotting using only the temporal segments from action detection annotations, thus opening an interesting avenue for video understanding. Experiments on the THUMOS14 and ActivityNet datasets show that the proposed method, named ActionSpotter, outperforms state-of-the-art detection outputs redrawn for this application. In particular, the spotting mean Average Precision on THUMOS14 is significantly improved from 59.7% to 65.6% while skipping 23% of the video.
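The abstract reports results as a spotting mean Average Precision. As a rough illustration of how such a score could be computed for a single action class, the sketch below assumes that a predicted spot counts as correct when it falls inside a ground-truth action segment that no higher-scoring spot has already claimed. This matching rule, the function names `match_spots` and `average_precision`, and the toy data are assumptions made for illustration; they are not taken from the paper's evaluation code.

```python
from typing import List, Tuple

Segment = Tuple[float, float]  # (start, end) of a ground-truth action, in seconds
Spot = Tuple[float, float]     # (timestamp, confidence score) of a predicted spot


def match_spots(spots: List[Spot], segments: List[Segment]) -> List[bool]:
    """Flag each spot, visited from highest to lowest score, as a hit or a miss.

    Assumption: a spot is a hit if it lies inside a ground-truth segment
    that no higher-scoring spot has already claimed.
    """
    claimed = [False] * len(segments)
    flags = []
    for time, _score in sorted(spots, key=lambda s: s[1], reverse=True):
        hit = False
        for i, (start, end) in enumerate(segments):
            if not claimed[i] and start <= time <= end:
                claimed[i] = True
                hit = True
                break
        flags.append(hit)
    return flags


def average_precision(flags: List[bool], num_segments: int) -> float:
    """Sum the precision at each hit and divide by the number of segments."""
    if num_segments == 0:
        return 0.0
    true_positives = 0
    precision_sum = 0.0
    for rank, is_hit in enumerate(flags, start=1):
        if is_hit:
            true_positives += 1
            precision_sum += true_positives / rank
    return precision_sum / num_segments


# Toy data (hypothetical): two actions, three predicted spots.
segments = [(2.0, 5.0), (10.0, 12.0)]
spots = [(3.0, 0.9), (4.0, 0.8), (11.0, 0.7)]
flags = match_spots(spots, segments)
ap = average_precision(flags, len(segments))
```

In this toy case the second spot lands in a segment that is already claimed, so it counts as a miss and lowers the precision at the third spot. A mean Average Precision would then be the average of such per-class values over all action classes.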
Main file
bare_conf.pdf (1.06 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-02534615, version 1 (14-04-2020)
hal-02534615, version 2 (05-11-2020)

Identifiers

  • HAL Id: hal-02534615, version 2

Cite

Guillaume Vaudaux-Ruth, Adrien Chan-Hon-Tong, Catherine Achard. ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos. 2020 25th International Conference on Pattern Recognition (ICPR), Jan 2021, Milan, Italy. ⟨hal-02534615v2⟩
204 views
275 downloads
