Academic Journal

Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication

التفاصيل البيبلوغرافية
العنوان: Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication
المؤلفون: Oliehoek, Frans, Spaan, Matthijs
المصدر: Proceedings of the AAAI Conference on Artificial Intelligence; Vol. 26 No. 1 (2012): Twenty-Sixth AAAI Conference on Artificial Intelligence; 1415-1421 ; 2374-3468 ; 2159-5399
بيانات النشر: Association for the Advancement of Artificial Intelligence
سنة النشر: 2021
المجموعة: Association for the Advancement of Artificial Intelligence: AAAI Publications
مصطلحات موضوعية: multiagent planning, delayed communication, tree-based pruning
الوصف: Multiagent Partially Observable Markov Decision Processes (MPOMDPs) provide a powerful framework for optimal decision making under the assumption of instantaneous communication. We focus on a delayed communication setting (MPOMDP-DC), in which broadcasted information is delayed by at most one time step. This model allows agents to act on their most recent (private) observation. Such an assumption is a strict generalization over having agents wait until the global information is available and is more appropriate for applications in which response time is critical. In this setting, however, value function backups are significantly more costly, and naive application of incremental pruning, the core of many state-of-the-art optimal POMDP techniques, is intractable. In this paper, we overcome this problem by demonstrating that computation of the MPOMDP-DC backup can be structured as a tree and introducing two novel tree-based pruning techniques that exploit this structure in an effective way. We experimentally show that these methods have the potential to outperform naive incremental pruning by orders of magnitude, allowing for the solution of larger problems.
نوع الوثيقة: article in journal/newspaper
وصف الملف: application/pdf
اللغة: English
Relation: https://ojs.aaai.org/index.php/AAAI/article/view/8257/8116; https://ojs.aaai.org/index.php/AAAI/article/view/8257
DOI: 10.1609/aaai.v26i1.8257
الاتاحة: https://ojs.aaai.org/index.php/AAAI/article/view/8257
https://doi.org/10.1609/aaai.v26i1.8257
Rights: Copyright (c) 2021 Proceedings of the AAAI Conference on Artificial Intelligence
رقم الانضمام: edsbas.BD61F396
قاعدة البيانات: BASE