This project generates fake faces from random noise vectors. The model is trained on the CelebFaces dataset, so it learns to generate realistic faces.
The method is based on BEGAN (Berthelot et al.); the main contributions of that paper are:
- A GAN with a simple yet robust architecture and a standard training procedure with fast and stable convergence.
- An equilibrium concept that balances the power of the discriminator against the generator.
- A new way to control the trade-off between image diversity and visual quality.
- An approximate measure of convergence.
On top of the original method, I have added the following configurable options:
- Configurable upsampling method: transposed convolution or nearest-neighbour upsampling.
- Configurable number of repeated layers in each convolutional block.
- Configurable input size (8*2^n).
The code should be self-explanatory, since each class and function has a docstring and parameter descriptions; a rough sketch of the configurable upsampling block is shown below.
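For orientation, here is a minimal sketch of what such a configurable upsampling block might look like; the function name `upsample_block` and its arguments are illustrative, not necessarily the names used in the actual code.

```python
import torch.nn as nn

def upsample_block(in_ch, out_ch, n_layers=2, t_conv=False):
    """Illustrative decoder block: upsample by 2x, then n_layers (Conv2d + ELU) pairs."""
    if t_conv:
        # Learned upsampling with a transposed convolution.
        layers = [nn.ConvTranspose2d(in_ch, out_ch, kernel_size=4, stride=2, padding=1)]
    else:
        # Parameter-free nearest-neighbour upsampling followed by a 3x3 convolution.
        layers = [nn.UpsamplingNearest2d(scale_factor=2),
                  nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1)]
    for _ in range(n_layers):
        layers += [nn.Conv2d(out_ch, out_ch, kernel_size=3, padding=1), nn.ELU(inplace=True)]
    return nn.Sequential(*layers)
```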
My thoughts on the paper
For the discriminator, a traditional GAN takes an image as input and outputs a number between 0 and 1, i.e. how likely the image is real. BEGAN instead uses an auto-encoder as the discriminator and judges images by how well it can reconstruct them.
This is a clever idea, because it forces the auto-encoder to keep finding better ways to encode and decode images (which may capture their internal structure) as the generator improves. The loss is constructed so that if the generator does not understand how the auto-encoder reconstructs an image, it ends up with a large loss. As a result, the encoder and the generator develop a real understanding of human faces, which is supported by the interpolation experiment.
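Concretely, with L(v) the pixel-wise L1 reconstruction loss of the auto-encoder, the paper's discriminator and generator losses, the control variable k, and the convergence measure can be written as one training step roughly as follows (the function and variable names here are my own, not the repo's):

```python
import torch

def began_step(D, G, x_real, z, k, gamma=0.75, lambda_k=0.001):
    """One BEGAN step (sketch). D is the auto-encoder discriminator, G the generator."""
    recon = lambda v: torch.mean(torch.abs(D(v) - v))   # L(v): L1 reconstruction loss

    x_fake = G(z)
    loss_real = recon(x_real)
    loss_fake = recon(x_fake.detach())

    loss_D = loss_real - k * loss_fake   # discriminator: reconstruct real well, fakes poorly
    loss_G = recon(x_fake)               # generator: make fakes easy to reconstruct

    # Proportional control keeps E[L(G(z))] / E[L(x)] close to gamma.
    balance = (gamma * loss_real - loss_fake).item()
    k = min(max(k + lambda_k * balance, 0.0), 1.0)

    # Approximate convergence measure from the paper.
    M = loss_real.item() + abs(balance)
    return loss_D, loss_G, k, M
```

The `gamma` and `lambda_k` entries in the training-parameter table below correspond to the two constants in this update.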
A potential way to improve the quality of the generated images is, instead of sampling the latent vector uniformly, to learn the distribution of the latent vectors of real images and then sample the generator's input from that learned distribution; a rough sketch of this idea follows.
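A minimal sketch of that idea, assuming the encoder half of the discriminator is available as a module called `encoder` (a hypothetical name) and the dataloader yields (image, label) pairs: fit a diagonal Gaussian to the latent codes of real images and sample generator inputs from it.

```python
import torch

@torch.no_grad()
def fit_latent_prior(encoder, dataloader, device="cpu"):
    """Estimate per-dimension mean/std of real-image latent codes (sketch)."""
    codes = []
    for x, _ in dataloader:
        codes.append(encoder(x.to(device)).view(x.size(0), -1).cpu())
    codes = torch.cat(codes, dim=0)
    return codes.mean(dim=0), codes.std(dim=0)

def sample_latent(mean, std, batch_size):
    """Draw generator inputs from N(mean, std) instead of a uniform prior."""
    return mean + std * torch.randn(batch_size, mean.numel())
```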
- Download the data at this link and place the zip file in the root folder.
- Create two folders under the root folder: `mkdir data_faces output`
- Train the model with the default settings: `python train.py`
- Train the model with a custom configuration: `python train.py --input_dim 128 --output_dim 128 --t_conv True`
1. Model Parameters
Param | Default | Type | Note |
---|---|---|---|
input_dim | 32 | int | The height / width of the input image to network |
output_dim | 32 | int | The height / width of the output image of the network |
hidden_dim | 64 | int | Hidden dimension of the auto-encoder; should be equal to nz. |
ngf | 64 | int | The number of filters in the generator. |
ndf | 64 | int | The number of filters in the discriminator. |
nc | 3 | int | The number of input channels. |
n_layers | 2 | int | The number of repeated (Conv2d + ELU) layers in each conv block. |
exp | False | bool | How the number of layers in the second conv block grows: exponentially if True, linearly if False. |
t_conv | False | bool | Upsampling method: nn.ConvTranspose2d if True, nn.UpsamplingNearest2d if False. |
mean | 0 | float | Mean of the normal distribution used for weight initialization. |
std | 0.002 | float | Standard deviation of the normal distribution used for weight initialization. |
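The `mean` and `std` entries suggest a normal weight initialization; a plausible sketch (the actual `weights_init` in this repo may differ) is:

```python
import torch.nn as nn

def weights_init(m, mean=0.0, std=0.002):
    """Sketch: initialize conv weights from N(mean, std); the repo's version may differ."""
    if isinstance(m, (nn.Conv2d, nn.ConvTranspose2d)):
        nn.init.normal_(m.weight, mean=mean, std=std)
        if m.bias is not None:
            nn.init.zeros_(m.bias)

# usage: netG.apply(weights_init); netD.apply(weights_init)
```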
2. Training Parameters
Param | Default | Type | Note |
---|---|---|---|
batch_size | 64 | int | Dataloader batch size. |
n_epochs | 1000 | int | Number of epochs to train for. |
lr | 0.0002 | float | Learning rate. |
b1 | 0.5 | float | Beta1 for Adam optimizer. |
b2 | 0.999 | float | Beta2 for Adam optimizer. |
outf | ./output/ | str | Folder to output images and model checkpoints. |
data_path | ./data_faces | str | Path to the dataset to train on. |
lambda_k | 0.001 | float | Learning rate of k. |
gamma | 0.75 | float | Balance between discriminator and generator. |
sample_interval | 1000 | int | Save generated images every this many iterations. |
show_every | 100 | int | Show log info every this many iterations. |
lr_update_step | 3000 | int | Decay the learning rate every this many iterations. |
lr_gamma | 0.5 | float | Gamma of the lr scheduler: the multiplicative factor of the learning-rate decay. |
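For orientation, the optimizer- and scheduler-related defaults above map onto a setup roughly like the following (the network variables here are placeholders, not the repo's classes):

```python
import torch
import torch.nn as nn
from torch.optim.lr_scheduler import StepLR

# Placeholder networks; in the repo these are the BEGAN generator and discriminator.
netG, netD = nn.Linear(64, 64), nn.Linear(64, 64)

# Adam with the default lr / b1 / b2 from the table above.
optimizer_G = torch.optim.Adam(netG.parameters(), lr=0.0002, betas=(0.5, 0.999))
optimizer_D = torch.optim.Adam(netD.parameters(), lr=0.0002, betas=(0.5, 0.999))

# Multiply the learning rate by lr_gamma every lr_update_step iterations.
scheduler_G = StepLR(optimizer_G, step_size=3000, gamma=0.5)
scheduler_D = StepLR(optimizer_D, step_size=3000, gamma=0.5)
```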
The model was trained on Google Colab; below are the current results after 10 epochs.
- Face Generation
- Interpolation