Connecting Generative Adversarial Networks and Actor Critic Methods, NIPS 2016 | David Pfau

1,530 views

Preserve Knowledge

6 years ago

David Pfau, Oriol Vinyals
arxiv.org/abs/1610.01945
NIPS 2016 Workshop on Adversarial Training Spotlight
Both generative adversarial networks (GAN) in unsupervised learning and actor-critic methods in reinforcement learning (RL) have gained a reputation for being difficult to optimize. Practitioners in both fields have amassed a large number of strategies to mitigate these instabilities and improve training. Here we show that GANs can be viewed as actor-critic methods in an environment where the actor cannot affect the reward. We review the strategies for stabilizing training for each class of models, both those that generalize between the two and those that are particular to that model. We also review a number of extensions to GANs and RL algorithms with even more complicated information flow. We hope that by highlighting this formal connection we will encourage both GAN and RL communities to develop general, scalable, and stable algorithms for multilevel optimization with deep networks, and to draw inspiration across communities.
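As a rough illustration of why coupled two-player optimization of this kind can be hard to stabilize, here is a minimal sketch that is not taken from the paper. It assumes a toy bilinear game f(x, y) = x * y, in which one player minimizes over x and the other maximizes over y, and applies simultaneous gradient steps. The function name `simultaneous_gda` and the step size are hypothetical choices made for this example only.

```python
import math


def simultaneous_gda(x, y, lr=0.1, steps=100):
    """Simultaneous gradient descent on x and ascent on y for f(x, y) = x * y.

    Toy setup (an assumption for illustration, not the paper's model):
    the only equilibrium of this game is (0, 0).
    """
    for _ in range(steps):
        grad_x = y  # df/dx
        grad_y = x  # df/dy
        # Both players update at once, each using the other's old value.
        x, y = x - lr * grad_x, y + lr * grad_y
    return x, y


start = (1.0, 1.0)
end = simultaneous_gda(*start)

start_norm = math.hypot(*start)
end_norm = math.hypot(*end)

# Each step multiplies the squared distance from (0, 0) by (1 + lr**2),
# so the iterates spiral away from the equilibrium instead of converging.
print(f"distance from equilibrium: {start_norm:.3f} -> {end_norm:.3f}")
```

In this toy game the distance from the equilibrium grows at every step for any positive step size, which is one simple way to see why both GAN and actor-critic training tend to rely on additional stabilization strategies.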
