ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos

Guillaume Vaudaux-Ruth; Adrien Chan-Hon-Tong; Catherine Achard

Communication Dans Un Congrès Année : 2021

ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos

(1, 2) , (2) , (3, 4, 1)

1
2
3
4

Guillaume Vaudaux-Ruth

Fonction : Auteur
PersonId : 1067601
IdRef : 26843591X

Sorbonne Université

DTIS, ONERA, Université Paris Saclay [Palaiseau]

Adrien Chan-Hon-Tong

Fonction : Auteur
PersonId : 923812

DTIS, ONERA, Université Paris Saclay [Palaiseau]

Catherine Achard

Fonction : Auteur
PersonId : 182097
IdHAL : catherine-achard
ORCID : 0000-0002-5790-0830
IdRef : 13796658X

Institut des Systèmes Intelligents et de Robotique

Perception, Interaction, Robotique sociales

Sorbonne Université

Résumé

Action spotting has recently been proposed as an alternative to action detection and key frame extraction. However, the current state-of-the-art method of action spotting requires an expensive ground truth composed of the search sequences employed by human annotators spotting actions - a critical limitation. In this article, we propose to use a reinforcement learning algorithm to perform efficient action spotting using only the temporal segments from the action detection annotations, thus opening an interesting solution for video understanding. Experiments performed on THUMOS14 and ActivityNet datasets show that the proposed method, named ActionSpotter, leads to good results and outperforms state-of-the-art detection outputs redrawn for this application. In particular, the spotting mean Average Precision on THUMOS14 is significantly improved from 59.7% to 65.6% while skipping 23% of video.

Mots clés

Index Terms-Class L A T E X IEEEtran paper typesetting style template

Domaines

Intelligence artificielle [cs.AI] Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

bare_conf.pdf (1.06 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Guillaume Vaudaux-Ruth : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02534615

Soumis le : jeudi 5 novembre 2020-11:43:01

Dernière modification le : vendredi 5 avril 2024-14:10:04

Dates et versions

hal-02534615 , version 1 (14-04-2020)

hal-02534615 , version 2 (05-11-2020)

Identifiants

HAL Id : hal-02534615 , version 2

Citer

Guillaume Vaudaux-Ruth, Adrien Chan-Hon-Tong, Catherine Achard. ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos. 2020 25th International Conference on Pattern Recognition (ICPR), Jan 2021, Milan, Italy. ⟨hal-02534615v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ONERA CNRS ISIR GENCI UNIV-PARIS-SACLAY SORBONNE-UNIVERSITE SU-SCIENCES ISIR_PIROS GS-ENGINEERING GS-COMPUTER-SCIENCE

204 Consultations

275 Téléchargements

ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager