generative adversarial networks...applications - text to image • input: text • output: image •...

8
Generative adversarial networks 2019 Philipp Krähenbühl and Chao-Yuan Wu

Upload: others

Post on 19-Jun-2020

14 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Generative adversarial networks...Applications - Text to image • Input: text • Output: Image • Loss: • GAN • Does text and image fit or not? StackGAN: Text to Photo-realistic

Generative adversarial networks

ⓒ 2019 Philipp Krähenbühl and Chao-Yuan Wu

Page 2: Generative adversarial networks...Applications - Text to image • Input: text • Output: Image • Loss: • GAN • Does text and image fit or not? StackGAN: Text to Photo-realistic

Transforming noise

• Input: Random noise

• e.g. �

• Output: Image

• Objective

• Output should look good

z ∼ 𝒩(0, I)Deep

Network

Page 3: Generative adversarial networks...Applications - Text to image • Input: text • Output: Image • Loss: • GAN • Does text and image fit or not? StackGAN: Text to Photo-realistic

Judging output quality

• A second network

• Output looks like training data or not?

Generative Adversarial Nets, Goodfellow et al., NIPS 2014

Generator

Discriminator

real/fake?

real

Page 4: Generative adversarial networks...Applications - Text to image • Input: text • Output: Image • Loss: • GAN • Does text and image fit or not? StackGAN: Text to Photo-realistic

Adversarial objective

• Generator

• Produce an image that fools discriminator

• Discriminator

• Tell difference between generation and training data

Generator

Discriminator

real/fake?

real

Page 5: Generative adversarial networks...Applications - Text to image • Input: text • Output: Image • Loss: • GAN • Does text and image fit or not? StackGAN: Text to Photo-realistic

What does this optimize?

• Discriminator log-likelihood

• Jensen-Shanon divergence data distribution and generator distribution

Page 6: Generative adversarial networks...Applications - Text to image • Input: text • Output: Image • Loss: • GAN • Does text and image fit or not? StackGAN: Text to Photo-realistic

GANs work!

• Sampling is easy

• Learn pixel-distance

• Loss on distributions

Large Scale GAN Training for High Fidelity Natural Image Synthesis, Brock et al., ICLR 2019

Page 7: Generative adversarial networks...Applications - Text to image • Input: text • Output: Image • Loss: • GAN • Does text and image fit or not? StackGAN: Text to Photo-realistic

Applications - Super resolution

• Learn to up-sample images

• Input: Low-res image

• Output: HD image

• Loss

• Reconstruct HD

• GAN for sharp reconstruction

Deep Net

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network, Ledig et al., CVPR 2017

Page 8: Generative adversarial networks...Applications - Text to image • Input: text • Output: Image • Loss: • GAN • Does text and image fit or not? StackGAN: Text to Photo-realistic

Applications - Text to image• Input: text

• Output: Image

• Loss:

• GAN

• Does text and image fit or not?

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks, Zhang et al., ICCV 2017

Image source: StackGAN paper

"This bird has a yellow belly and tarsus, grey back, wings, and brown

throat, nape with a black face"