Siamese Network Tensorflow

Siamese network is a neural network that contain two or more identical subnetwork. The objective of this network is to find the similarity or comparing the relationship between two comparable things. Unlike classification task that uses cross entropy as the loss function, siamese network usually uses contrastive loss or triplet loss.

Siamese network has a lot of function, this repository is trying to use Siamese network to do a dimensionality reduction and image retrieval.

This project follows Hadsell-et-al.'06 [1] by computing the Euclidean distance on the output of the shared network and by optimizing the contrastive loss (see paper for more details). The contastive loss is defined as follows

$\begin{align} L_{contrastive} &= L_{similarity}+L_{dissimilarity} \notag \\ &= \frac{1}{2}(Y)(D)^2+\frac{1}{2}(1-Y)(max(0,m-D))^2 \notag \end{align}$

The is the distance of between the output of the network with the input and the input .

The similarity function is defined as . This function will be activated when the Label equal to 1 and deactivated when is equal to 0. The goal of this function is to minimize the distance of the pairs.

The dissimilarity function is defined as . This function will be activated when the Label is equal to 0 and deactivated when is equal to 1. The goal of this function is to give a penalty of the pairs when the distance is lower than margin .

[1] "Dimensionality Reduction by Learning an Invariant Mapping" http://yann.lecun.com/exdb/publis/pdf/hadsell-chopra-lecun-06.pdf

Model

The input of these will be image_left, image_right and . Our model uses 5 layer of convolutional layer and pooling followed. We do not use fully convolutonal net because convolution operation is faster on GPU(especially using CUDNN). See http://cs231n.github.io/convolutional-networks/#convert for more information on converting FC layer to Conv layer.

Run

Train the model

git clone https://github.com/ardiya/siamesenetwork-tensorflow
python train.py

Tensorboard Visualization(After training)

tensorboard --logdir=train.log

Updates

Update the API to 1.0
Cleanup the old code

Dimensionality reduction

The images below shows the final Result on MNIST test dataset. By only using 2 features, we can easily separate the input images.

The gif below shows some animation until it somehow converges.

Image retrieval

Image retrieval uses the trained model to extract the features and get the most similar image using cosine similarity. See here

Retrieving similar test image from trainset

Select id 865 in test image
Retrieved top n similar image from train data with ids of [53144 47864 11074 51561 41350 34215 48182] from train data

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
figure		figure
model		model
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Similar image retrieval.ipynb		Similar image retrieval.ipynb
dataset.py		dataset.py
model.py		model.py
train.py		train.py
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Siamese Network Tensorflow

Model

Run

Updates

Dimensionality reduction

Image retrieval

Retrieving similar test image from trainset

About

Releases

Packages

Contributors 2

Languages

License

ardiya/siamesenetwork-tensorflow

Folders and files

Latest commit

History

Repository files navigation

Siamese Network Tensorflow

Model

Run

Updates

Dimensionality reduction

Image retrieval

Retrieving similar test image from trainset

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages