Info
- Title: Pixel Recurrent Neural Networks
- Task: Image Generation
- Author: A. van den Oord, N. Kalchbrenner, and K. Kavukcuoglu
- Date: Jan 2016
- Arxiv: 1601.06759
- Published: ICML 2016(Best Paper Award)
- Affiliation: Google DeepMind
Highlights & Drawbacks
- Fully tractable modeling of image distribution
- PixelRNN & PixelCNN
Motivation & Design
To estimate the joint distribution $p(x)$ we write it as the product of the conditional distributions over the pixels:
Generating pixel-by-pixel with CNN, LSTM:
Performance & Ablation Study
Samples from ImageNet
Related
- PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modification - Salimans - ICLR 2017
- Gated PixelCNN: Conditional Image Generation with PixelCNN Decoders - van den Oord - NIPS 2016
- VQ-VAE: Neural Discrete Representation Learning - van den Oord - NIPS 2017
- VQ-VAE-2: Generating Diverse High-Fidelity Images with VQ-VAE-2 - Razavi - 2019