The Emergence of Adversarial Communication in Multi-Agent Reinforcement Learning

  Рет қаралды 2,024

Prorok Lab

Prorok Lab

Күн бұрын

Paper: arxiv.org/abs/...
More info: www.proroklab.org
Many real-world problems require the coordination of multiple autonomous agents. Recent work has shown the promise of Graph Neural Networks (GNNs) to learn explicit communication strategies that enable complex multi- agent coordination. These works use models of cooperative multi-agent systems whereby agents strive to achieve a shared global goal. When considering agents with self-interested local objectives, the standard design choice is to model these as separate learning systems (albeit sharing the same environment). Such a design choice, however, precludes the existence of a single, differentiable communication channel, and consequently prohibits the learning of inter-agent communication strategies. In this work, we address this gap by presenting a learning model that accommodates individual non-shared rewards and a differentiable communication channel that is common among all agents. We focus on the case where agents have self-interested objectives, and develop a learning algorithm that elicits the emergence of adversarial communications. We perform experiments on multi-agent coverage and path planning problems, and employ a post-hoc interpretability technique to visualize the messages that agents communicate to each other. We show how a single self-interested agent is capable of learning highly manipulative communication strategies that allows it to significantly outperform a cooperative team of agents.

Пікірлер: 1
@revimfadli4666
@revimfadli4666 9 ай бұрын
Thanks, been looking for this. I wonder how well would this apply to Poker, Bridge, Mascarade, and other deduction games. Or maybe 'positive' interaction modern euro board games like Keyflower
Or is Harriet Quinn good? #cosplay#joker #Harriet Quinn
00:20
佐助与鸣人
Рет қаралды 55 МЛН
哈莉奎因怎么变骷髅了#小丑 #shorts
00:19
好人小丑
Рет қаралды 42 МЛН
when you have plan B 😂
00:11
Andrey Grechka
Рет қаралды 60 МЛН
Variational Autoencoders
15:05
Arxiv Insights
Рет қаралды 498 М.
How might LLMs store facts | Chapter 7, Deep Learning
22:43
3Blue1Brown
Рет қаралды 480 М.
Mr. Anderson escapes from Agents | The Matrix [Open Matte]
4:55
Flashback FM
Рет қаралды 8 МЛН
MIT Introduction to Deep Learning | 6.S191
1:09:58
Alexander Amini
Рет қаралды 556 М.
Lecture 9: Multi-Robot Path Planning
53:12
Prorok Lab
Рет қаралды 12 М.
Heterogeneous Multi-Robot Reinforcement Learning
4:26
Prorok Lab
Рет қаралды 1,1 М.
Or is Harriet Quinn good? #cosplay#joker #Harriet Quinn
00:20
佐助与鸣人
Рет қаралды 55 МЛН