Gaze Latent Support Vector Machine for Image Classification Improved by Weakly Supervised Region Selection

Xin Wang; Nicolas Thome; Matthieu Cord

doi:10.1016/j.patcog.2017.07.001

Journal article, Pattern Recognition, Year: 2017


Xin Wang

Role: Corresponding author
PersonId: 1011924

Machine Learning and Information Access

Nicolas Thome

Role: Author
PersonId: 181803
IdHAL: nicolas-thome
ORCID: 0000-0003-4871-3045
IdRef: 12878332X

Machine Learning and Information Access

Matthieu Cord

Role: Author
PersonId: 13617
IdHAL: matthieucord
ORCID: 0000-0002-0627-5844
IdRef: 132968126

Machine Learning and Information Access

Abstract

This paper addresses Weakly Supervised Learning (WSL), i.e. image classification that leverages local information with models trained from global image labels. We propose a new WSL method that incorporates gaze features collected by an eye-tracker to guide the region selection policy. The approach has two appealing advantages: gaze features are cheap to collect, e.g. one order of magnitude faster than bounding boxes, and the method requires gaze features only during training, remaining gaze-free at test time. To this end, the training objective is enriched with a gaze loss, from which we derive a concave-convex upper bound, leading to an off-the-shelf CCCP optimization scheme. Extensive experiments validate the effectiveness of the approach for WSL image classification on two public datasets with gaze annotation, namely PASCAL VOC 2012 action and POET. In addition, we publicly release a new food-related dataset, the Gaze-based UPMC Food dataset (UPMC-G20), which covers 20 food categories and 2,000 images. The dataset is intended to promote research in the food-related computer vision community.
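As a rough illustration of the region-selection idea in the abstract, the sketch below scores an image by taking the maximum linear score over candidate regions, which is the usual latent SVM prediction rule, and adds a gaze term that is used only at training time. The function names, the feature layout, the `gaze_ratio` input, and the specific form of the gaze loss are all assumptions made for illustration; the paper's actual formulation and its CCCP solver are not reproduced on this page.

```python
import numpy as np

def predict_region(w, region_feats):
    """Latent-SVM-style prediction: select the highest-scoring region.

    w            : (d,) weight vector.
    region_feats : (n_regions, d) per-region features (assumed layout).
    Returns (index of selected region, its score). No gaze data is used,
    so this step is gaze-free, as at test time.
    """
    scores = region_feats @ w
    h = int(np.argmax(scores))
    return h, float(scores[h])

def gaze_loss(h, gaze_ratio):
    """Hypothetical gaze loss: 1 minus the fraction of gaze points in region h."""
    return 1.0 - float(gaze_ratio[h])

def per_image_cost(w, region_feats, gaze_ratio, y, gamma=1.0):
    """Hypothetical training cost for one image with label y in {-1, +1}:
    hinge loss on the selected region's score plus a weighted gaze loss."""
    h, s = predict_region(w, region_feats)
    hinge = max(0.0, 1.0 - y * s)
    return hinge + gamma * gaze_loss(h, gaze_ratio)

# Toy data: 3 candidate regions with 2-d features (made up for illustration).
w = np.array([1.0, -1.0])
feats = np.array([[2.0, 0.0], [1.0, 0.0], [0.0, 3.0]])
gaze = np.array([0.1, 0.9, 0.0])  # assumed fraction of gaze points per region

h, s = predict_region(w, feats)
cost = per_image_cost(w, feats, gaze, y=1, gamma=0.5)
```

In this toy case the classifier selects a region that attracts little gaze, so the gaze term contributes to the cost even though the hinge loss is zero; a training procedure could use such a term to steer region selection toward gazed-at regions.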

Domains

Computer Science [cs]

Main file

Wang_Gaze_Latent_Support.pdf (5.38 MB)

Origin: Files produced by the author(s)

Contributor: Gestionnaire HAL 2 Sorbonne Université

https://hal.sorbonne-universite.fr/hal-01557368

Submitted on: Thursday, July 6, 2017, 11:58:06

Last modified on: Wednesday, October 30, 2024, 18:18:04

Long-term archiving on: Wednesday, January 24, 2018, 12:19:18

Dates and versions

hal-01557368, version 1 (06-07-2017)

Identifiers

HAL Id: hal-01557368, version 1
DOI: 10.1016/j.patcog.2017.07.001

Cite

Xin Wang, Nicolas Thome, Matthieu Cord. Gaze Latent Support Vector Machine for Image Classification Improved by Weakly Supervised Region Selection. Pattern Recognition, 2017, 72, pp.59-71. ⟨10.1016/j.patcog.2017.07.001⟩. ⟨hal-01557368⟩

Collections

UPMC CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES ANR

276 views

397 downloads
