Efficient Incremental Planning and Learning with Multi-Valued Decision Diagrams

Jean-Christophe Magnan; Pierre-Henri Wuillemin

doi:10.1016/j.jal.2016.11.032

Article Dans Une Revue Journal of Applied Logic Année : 2017

Efficient Incremental Planning and Learning with Multi-Valued Decision Diagrams

(1) , (1)

Jean-Christophe Magnan

Fonction : Auteur
PersonId : 971578

DECISION

Pierre-Henri Wuillemin

Fonction : Auteur correspondant
PersonId : 8633
IdHAL : pierre-henri-wuillemin
ORCID : 0000-0003-3691-4886
IdRef : 12747627X

Connectez-vous pour contacter l'auteur

DECISION

Résumé

In the domain of decision theoretic planning, the factored framework (Factored Markov Decision Process, fmdp) has produced optimized algorithms using structured representations such as Decision Trees (Structured Value Iteration (svi), Structured Policy Iteration (spi)) or Algebraic Decision Diagrams (Stochastic Planning Using Decision Diagrams (spudd)). Since it may be difficult to elaborate the factored models used by these algorithms, the architecture sdyna, which combines learning and planning algorithms using structured representations, was introduced. However, the state-of-the-art algorithms for incremental learning, for structured decision theoretic planning or for reinforcement learning require the problem to be specified only with binary variables and/or use data structures that can be improved in term of compactness. In this paper, we propose to use Multi-Valued Decision Diagrams (mdds) as a more efficient data structure for the sdyna architecture and describe a planning algorithm and an incremental learning algorithm dedicated to this new structured representation. For both planning and learning algorithms, we experimentally show that they allow significant improvements in time, in compactness of the computed policy and of the learned model. We then analyzed the combination of these two algorithms in an efficient sdynainstance for simultaneous learning and planning using mdds.

Mots clés

Factored Markov decision processes Multi-Valued Decision Diagrams

Domaines

Intelligence artificielle [cs.AI] Probabilités [math.PR] Statistiques [math.ST] Théorie [stat.TH] Machine Learning [stat.ML]

Fichier principal

Magnan_Efficient.pdf (1.2 Mo)

Origine	Fichiers produits par l'(les) auteur(s)

Pierre-Henri Wuillemin : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-01399290

Soumis le : mercredi 23 novembre 2016-15:52:59

Dernière modification le : lundi 15 avril 2024-11:18:12

Archivage à long terme le : mardi 21 mars 2017-05:45:09

Dates et versions

hal-01399290 , version 1 (23-11-2016)

Identifiants

HAL Id : hal-01399290 , version 1
DOI : 10.1016/j.jal.2016.11.032

Citer

Jean-Christophe Magnan, Pierre-Henri Wuillemin. Efficient Incremental Planning and Learning with Multi-Valued Decision Diagrams. Journal of Applied Logic, 2017, 22, pp.63-90. ⟨10.1016/j.jal.2016.11.032⟩. ⟨hal-01399290⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES ANR

328 Consultations

214 Téléchargements

Efficient Incremental Planning and Learning with Multi-Valued Decision Diagrams

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager