Fine-tuning Siamese Networks to Assess Sport Gestures Quality - Sorbonne Université
Conference paper, Year: 2020

Fine-tuning Siamese Networks to Assess Sport Gestures Quality


This paper presents an Action Quality Assessment (AQA) approach that learns to automatically score how well an action is performed from temporal sequences such as videos. To cope with the small size of most databases capturing actions or gestures, we propose to use Siamese Networks. In the literature, Siamese Networks are widely used to rank action scores: their purpose is not to regress scores but to predict a value that respects the order of the true scores, so that it can be used to rank actions according to their quality. For AQA, we need to predict the real scores, as well as the differences between these scores and their range. Thus, we first introduce a new loss function to train Siamese Networks to regress score gaps. Once the Siamese network is trained, one branch of this network is extracted and fine-tuned for score prediction. We tested our approach on a public database, the AQA-7 dataset, composed of videos from 7 sports. Our results outperform the state of the art on the AQA task. Moreover, we show that the proposed method is also more efficient for action ranking.
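The two-stage idea in the abstract (train a Siamese pair to regress score gaps, then fine-tune one branch on the scores themselves) can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the abstract does not state the paper's loss function, so a plain squared error on the score gap is used here, and a linear scorer on hypothetical 2-D features stands in for the video network. In this toy setting the shared bias cancels in the difference of the two branches, so gap training alone cannot fix the absolute score level, which is what the second stage recovers.

```python
import random

def score(w, b, x):
    """Shared branch: a linear scorer standing in for the video network."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b

def train_gap(data, dim, epochs=200, lr=0.05, seed=0):
    """Stage 1 (assumed loss): squared error between the predicted and
    true score gap of a randomly drawn pair of samples."""
    rng = random.Random(seed)
    w = [0.0] * dim
    for _ in range(epochs):
        for _ in range(len(data)):
            (xa, sa), (xb, sb) = rng.sample(data, 2)
            # Both branches share w; any bias would cancel in the difference.
            err = (score(w, 0.0, xa) - score(w, 0.0, xb)) - (sa - sb)
            for i in range(dim):
                w[i] -= lr * 2 * err * (xa[i] - xb[i])
    return w

def fine_tune(data, w, epochs=200, lr=0.05):
    """Stage 2: keep one branch and fit it to the absolute scores."""
    w, b = list(w), 0.0
    for _ in range(epochs):
        for x, s in data:
            err = score(w, b, x) - s
            for i in range(len(w)):
                w[i] -= lr * 2 * err * x[i]
            b -= lr * 2 * err
    return w, b

# Hypothetical toy data: true score = 3*x0 - 2*x1 + 50.
rng = random.Random(1)
data = []
for _ in range(20):
    x = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
    data.append((x, 3 * x[0] - 2 * x[1] + 50))

w_gap = train_gap(data, dim=2)        # learns score differences only
w_ft, b_ft = fine_tune(data, w_gap)   # recovers the absolute score level
```

After stage 1, `w_gap` already orders samples correctly, so it is usable for ranking; only after stage 2 does `score(w_ft, b_ft, x)` land on the actual score scale.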
Main file
VISAPP.pdf (36.55 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-02933258, version 1 (08-09-2020)



Mégane Millan, Catherine Achard. Fine-tuning Siamese Networks to Assess Sport Gestures Quality. 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2020, Feb 2020, Valletta, Malta. ⟨10.5220/0008924600570065⟩. ⟨hal-02933258⟩