The cross-entropy method for policy search in decentralized POMDPs

Oliehoek, F. A.; Kooij, J. F. P.; VLASSIS, Nikos

Download

Article (Scientific journals)

The cross-entropy method for policy search in decentralized POMDPs

Oliehoek, F. A.; Kooij, J. F. P.; VLASSIS, Nikos

2008 • In Informatica, 32 (4), p. 341-357

Peer reviewed

Permalink
https://hdl.handle.net/10993/3365

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

download.pdf

Author postprint (329.88 kB)

Download

All documents in ORBilu are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

multiagent planning; decentralized POMDPs; combinatorial optimization

Abstract :

[en] Decentralized POMDPs (Dec-POMDPs) are becoming increasingly popular as models for multiagent planning under uncertainty, but solving a Dec-POMDP exactly is known to be an intractable combinatorial optimization problem. In this paper we apply the Cross-Entropy (CE) method, a recently introduced method for combinatorial optimization, to Dec-POMDPs, resulting in a randomized (sampling-based) algorithm for approximately solving Dec-POMDPs. This algorithm operates by sampling pure policies from an appropriately parametrized stochastic policy, and then evaluates these policies either exactly or approximately in order to define the next stochastic policy to sample from, and so on until convergence. Experimental results demonstrate that the CE method can search huge spaces efficiently, supporting our claim that combinatorial optimization methods can bring leverage to the approximate solution of Dec-POMDPs.

Disciplines :

Computer science

Identifiers :

UNILU:UL-ARTICLE-2011-702

Author, co-author :

Oliehoek, F. A.

Kooij, J. F. P.

VLASSIS, Nikos ; Technical University of Crete > Dept. of Production Engineering and Management

Language :

English

Title :

The cross-entropy method for policy search in decentralized POMDPs

Publication date :

2008

Journal title :

Informatica

ISSN :

0868-4952

Publisher :

IOS Press

Volume :

Issue :

Pages :

341-357

Peer reviewed :

Peer reviewed

Additional URL :

http://www.informatica.si/index.php/informatica/article/view/208

Available on ORBilu :

since 04 July 2013

Statistics

Number of views

265 (7 by Unilu)

Number of downloads

175 (1 by Unilu)

More statistics

Scopus citations^®

Scopus citations^®
without self-citations