Academic Journal

Reinforcement Learning with Internal Reward for Multi-Agent Cooperation: A Theoretical Approach

Bibliographic Details
Title: Reinforcement Learning with Internal Reward for Multi-Agent Cooperation: A Theoretical Approach
Authors: Fumito Uwano, Naoki Tatebe, Masaya Nakata, Keiki Takadama, Tim Kovacs
Source: EAI Endorsed Transactions on Collaborative Computing, Vol 2, Iss 8, Pp 1-8 (2016)
Publication Information: European Alliance for Innovation (EAI), 2016.
Publication Year: 2016
Collection: LCC:Technology
Subject Terms: multi-agent system, analysis, q-learning, internal reward, Technology
Description: This paper focuses on multi-agent cooperation, which is generally difficult to achieve without sufficient information about other agents, and proposes a reinforcement learning method that introduces an internal reward to achieve such cooperation without that information. To guarantee that cooperation is achieved, the paper theoretically derives the condition for selecting appropriate actions by changing the internal rewards given to the agents, and extends two reinforcement learning methods (Q-learning and Profit Sharing) so that the agents acquire appropriate Q-values updated according to the derived condition. Concretely, the internal rewards change only when the agents find a better solution than the current one. Intensive simulations on maze problems, used as a testbed, revealed the following: (1) the proposed method enables the agents to select their own appropriate cooperative actions, which contribute to reaching their goals in the minimum number of steps, whereas the conventional methods (i.e., Q-learning and Profit Sharing) cannot always attain the minimum number of steps; and (2) the proposed method based on Profit Sharing performs as well as the proposed method based on Q-learning.
Document Type: article
File Description: electronic resource
Language: English
ISSN: 2312-8623
Relation: https://doaj.org/toc/2312-8623
DOI: 10.4108/eai.3-12-2015.2262878
Access URL: https://doaj.org/article/e75af3a4110c43ada798e036ff209644
Accession Number: edsdoj.75af3a4110c43ada798e036ff209644
Database: Directory of Open Access Journals