Cooperation in Multi-Agent Reinforcement Learning with Proximal Policy Optimization

Morteza Hashemi Co-Author
University of Kansas
 
Amin Shojaeighadikolaei Co-Author
University of Kansas
 
Zsolt Talata First Author
University of Kansas
 
Zsolt Talata Presenting Author
University of Kansas
 
Monday, Aug 4: 2:20 PM - 2:35 PM
2174 
Contributed Papers 
Music City Center 
In multi-agent reinforcement learning problems, the interaction of multiple decision-making agents in a shared environment can be modeled by a partially observable Markov game. It extends Markov Decision Processes to a multi-agent setting where agents have individual observations, actions, and rewards. In the multi-agent proximal policy optimization (MA-PPO) approach, the cooperation between the agents is investigated. We present a method to construct a deep stochastic policy that allows efficient optimization based on the actions of the agents. The effectiveness of the obtained statistical model is demonstrated by investigating the cooperation across multiple agents in an industrial application, in electric power distribution networks.

Keywords

reinforcement learning

Markov decision process

proximal policy optimization

multi-agent

cooperation

electric power distribution networks 

Main Sponsor

Section on Statistical Learning and Data Science