Print Close

Cooperation in Multi-Agent Reinforcement Learning with Proximal Policy Optimization

Presented During: Advancements in Reinforcement Learning and Decision Making

Morteza Hashemi Co-Author
University of Kansas

Amin Shojaeighadikolaei Co-Author
University of Kansas

Zsolt Talata First Author
University of Kansas

Zsolt Talata Presenting Author
University of Kansas

Monday, Aug 4: 2:20 PM - 2:35 PM
2174
Contributed Papers

Music City Center

In multi-agent reinforcement learning problems, the interaction of multiple decision-making agents in a shared environment can be modeled by a partially observable Markov game. It extends Markov Decision Processes to a multi-agent setting where agents have individual observations, actions, and rewards. In the multi-agent proximal policy optimization (MA-PPO) approach, the cooperation between the agents is investigated. We present a method to construct a deep stochastic policy that allows efficient optimization based on the actions of the agents. The effectiveness of the obtained statistical model is demonstrated by investigating the cooperation across multiple agents in an industrial application, in electric power distribution networks.

Keywords

reinforcement learning

Markov decision process

proximal policy optimization

multi-agent

cooperation

electric power distribution networks

Main Sponsor

Section on Statistical Learning and Data Science