
Learning to Communicate and Act using Hierarchical Reinforcement Learning

Full Text: aamas04.pdf (PDF)

In this paper, we address the issue of rational communication behavior among autonomous agents. The goal is for agents to learn a policy to optimize the communication needed for proper coordination, given the communication cost. We extend our previously reported cooperative hierarchical reinforcement learning (HRL) algorithm to include communication decisions and propose a new multiagent HRL algorithm, called COM-Cooperative HRL. In this algorithm, we define cooperative subtasks to be those subtasks in which coordination among agents significantly improves the performance of the overall task. Those levels of the hierarchy that include cooperative subtasks are called cooperation levels. Coordination skills among agents are learned faster by sharing information at the cooperation levels, rather than at the level of primitive actions. We add a communication level to the hierarchical decomposition of the problem below each cooperation level. Before making a decision at a cooperative subtask, agents decide whether it is worthwhile to perform a communication action. A communication action has a certain cost and provides each agent at a certain cooperation level with the actions selected by the other agents at the same level. We demonstrate the efficacy of the COM-Cooperative HRL algorithm, as well as the relation between the communication cost and the learned communication policy, using a multiagent taxi domain.
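
The communicate-then-act decision described in the abstract can be illustrated with a small sketch. This is not the COM-Cooperative HRL algorithm itself: the class name CommLevelAgent, the value tables, the one-step update rule, and all parameter values are hypothetical, chosen only to show how a communication cost can enter the choice between communicating and acting without information about the other agents.

```python
import random
from collections import defaultdict


class CommLevelAgent:
    """Illustrative agent with a communication choice above a cooperative subtask.

    Hypothetical simplification: one-step value updates, not the paper's
    hierarchical update rules.
    """

    COMM_OPTIONS = ["communicate", "not_communicate"]

    def __init__(self, subtasks, comm_cost, epsilon=0.1, alpha=0.1, seed=0):
        self.subtasks = list(subtasks)
        self.comm_cost = comm_cost      # cost charged for each communication action
        self.epsilon = epsilon          # exploration rate
        self.alpha = alpha              # learning rate
        self.rng = random.Random(seed)
        self.q_comm = defaultdict(float)   # value of communicating or not in a state
        self.q_joint = defaultdict(float)  # subtask values given the others' actions
        self.q_solo = defaultdict(float)   # subtask values without that information

    def _eps_greedy(self, table, key, options):
        if self.rng.random() < self.epsilon:
            return self.rng.choice(options)
        return max(options, key=lambda opt: table[(key, opt)])

    def choose(self, state, others_actions_fn):
        """Decide whether to communicate, then pick a cooperative subtask."""
        comm = self._eps_greedy(self.q_comm, state, self.COMM_OPTIONS)
        if comm == "communicate":
            # Communicating reveals the actions selected by the other agents.
            others = tuple(others_actions_fn())
            subtask = self._eps_greedy(self.q_joint, (state, others), self.subtasks)
            return comm, others, subtask
        subtask = self._eps_greedy(self.q_solo, state, self.subtasks)
        return comm, None, subtask

    def update(self, state, comm, others, subtask, reward):
        """Move the value estimates toward the reward net of communication cost."""
        net = reward - (self.comm_cost if comm == "communicate" else 0.0)
        self.q_comm[(state, comm)] += self.alpha * (net - self.q_comm[(state, comm)])
        if comm == "communicate":
            key = ((state, others), subtask)
            self.q_joint[key] += self.alpha * (reward - self.q_joint[key])
        else:
            key = (state, subtask)
            self.q_solo[key] += self.alpha * (reward - self.q_solo[key])
```

In this sketch, an agent created with a communication cost larger than the reward it receives will shift toward not communicating after its first update, which mirrors the dependence of the learned communication policy on communication cost that the abstract reports.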

Citation

M. Ghavamzadeh, S. Mahadevan. "Learning to Communicate and Act using Hierarchical Reinforcement Learning". Joint Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), July 2004.

Category: In Conference

BibTeX

@incollection{Ghavamzadeh+Mahadevan:AAMAS04,
  author = {Mohammad Ghavamzadeh and Sridhar Mahadevan},
  title = {Learning to Communicate and Act using Hierarchical Reinforcement
    Learning},
  booktitle = {Joint Conference on Autonomous Agents and Multi-Agent Systems
    (AAMAS)},
  year = 2004,
}

Last Updated: June 08, 2007
Submitted by Staurt H. Johnson
