Towards fast and adaptive optimal control policies for robots: A direct policy search approach

Didier Marin; Olivier Sigaud

Communication Dans Un Congrès Année : 2012

Towards fast and adaptive optimal control policies for robots: A direct policy search approach

(1, 2) , (1, 2)

1
2

Didier Marin

Fonction : Auteur correspondant
PersonId : 925852

Connectez-vous pour contacter l'auteur

Institut des Systèmes Intelligents et de Robotique

AMAC

Olivier Sigaud

Fonction : Auteur
PersonId : 14932
IdHAL : olivier-sigaud
ORCID : 0000-0002-8544-0229
IdRef : 072724714

Institut des Systèmes Intelligents et de Robotique

AMAC

Résumé

Optimal control methods are generally too expensive to be applied on-line and in real-time to the control of robots. An alternative method consists in tuning a parametrized reactive controller so that it converges to optimal behavior. In this paper we present such a method based on the "direct Policy Search" paradigm to get a cost-efficient control policy for a simulated two degrees-of-freedom planar arm actuated by six muscles. We learn a parametric controller from demonstration using a few near-optimal trajectories. Then we tune the parameters of this controller using two versions of a Cross-Entropy Policy Search method that we compare. Finally, we show that the resulting controller is 20000 times faster than an optimal control method producing the same trajectories.

Domaines

Intelligence artificielle [cs.AI]

Didier Marin : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-00703755

Soumis le : lundi 4 juin 2012-12:08:46

Dernière modification le : mercredi 27 mars 2024-15:02:03

Dates et versions

hal-00703755 , version 1 (04-06-2012)

Identifiants

HAL Id : hal-00703755 , version 1

Citer

Didier Marin, Olivier Sigaud. Towards fast and adaptive optimal control policies for robots: A direct policy search approach. Robotica 2012, 2012, Guimaraes, Portugal. pp.21-26. ⟨hal-00703755⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS ISIR SORBONNE-UNIVERSITE SU-SCIENCES ISIR_AMAC

116 Consultations

0 Téléchargements

Towards fast and adaptive optimal control policies for robots: A direct policy search approach

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager