Error(s) in loading state_dict for ResnetGenerator #296

yfnn · 2018-06-19T02:27:16Z

Today I tried to test my trained model and got errors that did not occur before.

Traceback (most recent call last):
  File "test.py", line 19, in <module>
    model.setup(opt)
  File "/home/t-fayan/vision/pytorch-CycleGAN-and-pix2pix/models/base_model.py", line 43, in setup
    self.load_networks(opt.which_epoch)
  File "/home/t-fayan/vision/pytorch-CycleGAN-and-pix2pix/models/base_model.py", line 130, in load_networks
    net.load_state_dict(state_dict)
  File "/home/t-fayan/anaconda2/envs/py27/lib/python2.7/site-packages/torch/nn/modules/module.py", line 721, in load_state_dict
    self.__class__.__name__, "\n\t".join(error_msgs)))
RuntimeError: Error(s) in loading state_dict for ResnetGenerator:
Missing key(s) in state_dict: "model.10.conv_block.6.bias", "model.10.conv_block.6.weight", "model.10.conv_block.7.running_var", "model.10.conv_block.7.running_mean", "model.11.conv_block.6.bias", "model.11.conv_block.6.weight", "model.11.conv_block.7.running_var", "model.11.conv_block.7.running_mean", "model.12.conv_block.6.bias", "model.12.conv_block.6.weight", "model.12.conv_block.7.running_var", "model.12.conv_block.7.running_mean", "model.13.conv_block.6.bias", "model.13.conv_block.6.weight", "model.13.conv_block.7.running_var", "model.13.conv_block.7.running_mean", "model.14.conv_block.6.bias", "model.14.conv_block.6.weight", "model.14.conv_block.7.running_var", "model.14.conv_block.7.running_mean", "model.15.conv_block.6.bias", "model.15.conv_block.6.weight", "model.15.conv_block.7.running_var", "model.15.conv_block.7.running_mean", "model.16.conv_block.6.bias", "model.16.conv_block.6.weight", "model.16.conv_block.7.running_var", "model.16.conv_block.7.running_mean", "model.17.conv_block.6.bias", "model.17.conv_block.6.weight", "model.17.conv_block.7.running_var", "model.17.conv_block.7.running_mean", "model.18.conv_block.6.bias", "model.18.conv_block.6.weight", "model.18.conv_block.7.running_var", "model.18.conv_block.7.running_mean".
Unexpected key(s) in state_dict: "model.10.conv_block.5.weight", "model.10.conv_block.5.bias", "model.10.conv_block.6.running_mean", "model.10.conv_block.6.running_var", "model.11.conv_block.5.weight", "model.11.conv_block.5.bias", "model.11.conv_block.6.running_mean", "model.11.conv_block.6.running_var", "model.12.conv_block.5.weight", "model.12.conv_block.5.bias", "model.12.conv_block.6.running_mean", "model.12.conv_block.6.running_var", "model.13.conv_block.5.weight", "model.13.conv_block.5.bias", "model.13.conv_block.6.running_mean", "model.13.conv_block.6.running_var", "model.14.conv_block.5.weight", "model.14.conv_block.5.bias", "model.14.conv_block.6.running_mean", "model.14.conv_block.6.running_var", "model.15.conv_block.5.weight", "model.15.conv_block.5.bias", "model.15.conv_block.6.running_mean", "model.15.conv_block.6.running_var", "model.16.conv_block.5.weight", "model.16.conv_block.5.bias", "model.16.conv_block.6.running_mean", "model.16.conv_block.6.running_var", "model.17.conv_block.5.weight", "model.17.conv_block.5.bias", "model.17.conv_block.6.running_mean", "model.17.conv_block.6.running_var", "model.18.conv_block.5.weight", "model.18.conv_block.5.bias", "model.18.conv_block.6.running_mean", "model.18.conv_block.6.running_var".

What does this mean? The same errors also occur when I test a pretrained model such as horse2zebra, and I did not encounter them before.
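One way to read the error: every missing key has a counterpart among the unexpected keys whose index inside conv_block is lower by one (conv_block.6.weight is missing while conv_block.5.weight is unexpected). That is the pattern you would expect if the network built at test time has one more layer per residual block than the network that was saved. The sketch below illustrates this in plain Python. It assumes a block laid out as pad, conv, norm, ReLU, optional dropout, pad, conv, norm; that layer order and the helper names block_layers and param_indices are assumptions made for illustration, not code from the repository.

```python
def block_layers(use_dropout):
    """Return the assumed layer order of one residual block."""
    layers = ["pad", "conv", "norm", "relu"]
    if use_dropout:
        layers.append("dropout")  # one extra layer shifts every later index by one
    layers += ["pad", "conv", "norm"]
    return layers


def param_indices(layers):
    """Indices of the layers that contribute entries to a state_dict."""
    return [i for i, name in enumerate(layers) if name in ("conv", "norm")]


saved = param_indices(block_layers(use_dropout=False))
built = param_indices(block_layers(use_dropout=True))

print("indices in checkpoint:", saved)
print("indices in test model:", built)
```

Under that assumption the checkpoint stores the second conv and norm at indices 5 and 6, while the test-time model looks for them at 6 and 7, which matches the traceback and suggests checking whether dropout was configured the same way for training and testing.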

rawalkhirodkar · 2018-06-21T07:50:33Z

Facing the same issue.

yfnn · 2018-06-21T09:14:08Z

I deleted the root project directory and cloned the repository again, and then it worked. I don't know the reason; this was just a quick way to resolve the issue for me.

junyanz · 2018-07-03T05:22:49Z

Yes, please check out the latest commit.

JungJungyeji · 2018-07-11T08:27:50Z

I face the same problem.
I'd like to run a CycleGAN pre-trained model.
However, "RuntimeError: Unexpected key(s) in state_dict" occurs. Please help me.

mailengm · 2018-07-12T12:41:45Z

I face the same issue. I trained a pix2pix model with a previous version of the code and tried to test it with both the older and the latest commit, and got the same "missing keys in state_dict" error with both.

junyanz · 2018-07-12T22:27:34Z

Could you check if you have used the same normalization (batchnorm, instancenorm) during training and test?

hao44le · 2018-07-13T05:42:41Z

Faced the same error.

mailengm · 2018-07-13T11:34:24Z

I used the default normalization (instancenorm) during training and test.
I was able to solve the problem by downloading the newest version of the code and training the model again.

pencilrocketman · 2018-07-13T11:57:12Z

I faced the same error when applying a pre-trained model (CycleGAN) with the newest version.
Is normalization (instancenorm) also related to this case?

pencilrocketman · 2018-07-13T13:00:22Z

Sorry, I solved this problem by correcting my Docker setup.
When I used pytorch-nightly instead of pytorch, I got good results.
This is my Dockerfile (forgive how messy it is).

FROM nvidia/cuda:9.0-cudnn7-devel

RUN apt-get update && apt-get install -y \
   build-essential curl wget git cmake vim pkg-config unzip libgtk2.0-dev python3 python3-pip \
   imagemagick graphviz > /dev/null

# Miniconda3
ENV PATH /opt/conda/bin:$PATH
ENV LD_LIBRARY_PATH /opt/conda/lib:$LD_LIBRARY_PATH
RUN curl -Ls https://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh -o /tmp/install-miniconda.sh && \
   /bin/bash /tmp/install-miniconda.sh -b -p /opt/conda && \
   conda update -n base conda -y && \
   conda update --all -y

# Basic dependencies
RUN conda config --add channels conda-forge
RUN conda install -y readline mkl openblas numpy scipy hdf5 \
   pillow matplotlib cython pandas gensim protobuf \
   lmdb leveldb boost jupyterlab
RUN pip install pydot_ng nnpack h5py scikit-learn scikit-image hyperdash backports.ssl_match_hostname

# OpenCV
RUN conda install opencv3 -c menpo -y
RUN conda install -y dominate bz2file visdom

# PyTorch
RUN conda install pytorch-nightly torchvision cuda90 -c pytorch -y

# For CycleGAN and pix2pix
ADD ./vision /vision
WORKDIR /vision
RUN python3 setup.py install
WORKDIR /

junyanz · 2018-07-13T17:38:18Z

@taesung89

taesungp · 2018-07-13T23:59:04Z

The issue with unexpected key: num_batches_tracked should be fixed by the latest commit.
Regarding the first error in this thread, I believe it's because a PyTorch default setting has changed. Could you get the latest PyTorch and try again?
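For checkpoints saved under a different PyTorch version, one possible workaround is to drop normalization buffers (running_mean, running_var, num_batches_tracked) that the rebuilt model no longer defines before calling load_state_dict. The following is only a sketch under that assumption and is not the fix that was committed to the repository. The helper name drop_norm_buffers is hypothetical, and a plain dict stands in for the checkpoint; with PyTorch you would pass the result of torch.load and set(net.state_dict().keys()).

```python
# Buffer names that normalization layers may store, depending on the
# PyTorch version and the track_running_stats setting.
STALE_SUFFIXES = ("running_mean", "running_var", "num_batches_tracked")


def drop_norm_buffers(state_dict, model_keys):
    """Return a copy of state_dict without normalization buffers that the
    target model does not define. Buffers the model does expect are kept."""
    cleaned = {}
    for key, value in state_dict.items():
        is_buffer = key.endswith(STALE_SUFFIXES)
        if is_buffer and key not in model_keys:
            continue  # the rebuilt model has no such buffer, so skip it
        cleaned[key] = value
    return cleaned


# Toy checkpoint: strings stand in for tensors.
checkpoint = {
    "model.1.weight": "w",
    "model.2.running_mean": "m",
    "model.2.running_var": "v",
    "model.2.num_batches_tracked": "n",
}
expected = {"model.1.weight"}
print(sorted(drop_norm_buffers(checkpoint, expected)))
```

This only removes surplus buffers. It does not help when keys are missing or sit at different indices, so it would not resolve the first error in this thread by itself.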

JungJungyeji · 2018-07-16T08:22:50Z

Thank you very much. I downloaded the new version of the code and fixed the problem.

kakumarabhishek · 2018-11-13T01:42:36Z

> Could you check if you have used the same normalization (batchnorm, instancenorm) during training and test?

Thank you @junyanz. This resolved it for me.

dovletov · 2018-11-29T16:15:08Z

I had a similar issue while trying to test CycleGAN (--model test) on my own dataset.
The default --norm instance was used for both training and testing.
Deleting the root project directory and cloning this repository again did not help.

I noticed that the problem is wrong keys in the state_dict dictionary.
E.g. the model tries to load the missing key "model.10.conv_block.6.weight", while the checkpoint contains the unexpected key "model.10.conv_block.5.weight". So I decided to fix these wrong keys.

Based on my error message I created two lists (missing_list and expected_list). Then I replaced each wrong key with the corresponding correct one.

Example of my snippet is here. I've inserted it after line 135 here.
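A key-renaming step of this kind could look roughly like the sketch below. It is not the snippet referred to above; the helper name shift_conv_block_keys and its parameters are hypothetical. It assumes the only mismatch is that conv_block indices from 5 upward are one lower in the checkpoint than in the model, as in the error message at the top of this thread.

```python
import re
from collections import OrderedDict

# Matches e.g. "model.10.conv_block.5.weight" and captures the layer index.
_KEY = re.compile(r"^(model\.\d+\.conv_block\.)(\d+)(\..+)$")


def shift_conv_block_keys(state_dict, first_index=5, offset=1):
    """Rename conv_block.<i>.* to conv_block.<i + offset>.* for i >= first_index."""
    renamed = OrderedDict()
    for key, value in state_dict.items():
        match = _KEY.match(key)
        if match and int(match.group(2)) >= first_index:
            prefix, index, suffix = match.groups()
            key = "{}{}{}".format(prefix, int(index) + offset, suffix)
        renamed[key] = value  # values are passed through unchanged
    return renamed


# Toy checkpoint: integers stand in for tensors.
old = OrderedDict([
    ("model.10.conv_block.1.weight", 0),
    ("model.10.conv_block.5.weight", 1),
    ("model.10.conv_block.6.running_mean", 2),
])
new = shift_conv_block_keys(old)
print(list(new))
```

With a real checkpoint you would apply the function to the dictionary returned by torch.load before passing it to net.load_state_dict. Renaming makes the keys match, but it does not change the fact that the test-time network was built with a different layer layout than the one used in training.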

vis-opt · 2019-09-18T23:43:01Z

Another solution to this is to modify net.load_state_dict(state_dict, strict=False) where I added the strict=False option. This allows the network to load weights as long as the sizes and the number of parameters fit, even if the key-names aren't exact.

SunLeL · 2020-05-31T08:58:34Z

I faced the same issue. But I added "--no_dropout" when I tested, the issue was gone. As follows:
python test.py --no_dropout

Songtingt · 2020-08-28T10:42:36Z

> I faced the same issue. But I added "--no_dropout" when I tested, the issue was gone. As follows:
> python test.py --no_dropout

Thank you @SunLeL, it works for me!

anxingle · 2020-12-10T08:53:16Z

> I faced the same issue. But I added "--no_dropout" when I tested, the issue was gone. As follows:
> python test.py --no_dropout

Amazing! I found that it works when I use unet_256; the error happens when I use resnet_6blocks.
By the way, using --no_dropout works for me!

mr-easy · 2021-01-18T05:39:40Z

> Another solution to this is to modify net.load_state_dict(state_dict, strict=False) where I added the strict=False option. This allows the network to load weights as long as the sizes and the number of parameters fit, even if the key-names aren't exact.

Yes, this method works. The change has to be made in the models/base_model.py file.
Btw, --no_dropout also works, but the results are different.

omid-ghozatlou · 2021-03-03T09:19:49Z

> Another solution to this is to modify net.load_state_dict(state_dict, strict=False) where I added the strict=False option. This allows the network to load weights as long as the sizes and the number of parameters fit, even if the key-names aren't exact.

Thank you! It works, but the results are not as good as the samples saved during training; they contain a lot of noise.

PingjunChen · 2021-06-09T19:17:56Z

@vis-opt @omid-ghozatlou I also got a lot of noise when I added strict=False.
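A likely explanation for the noise: load_state_dict matches entries by key name only. With strict=False, keys that do not match are skipped rather than remapped, so the layers listed under "Missing key(s)" keep their freshly initialized weights and the checkpoint values under "Unexpected key(s)" are never used. The sketch below mimics that partition with plain set arithmetic. It is an illustration and not the PyTorch implementation, and the helper name split_keys is hypothetical.

```python
def split_keys(model_keys, checkpoint_keys):
    """Mimic which keys a name-based, non-strict load would use."""
    model_keys, checkpoint_keys = set(model_keys), set(checkpoint_keys)
    loaded = sorted(model_keys & checkpoint_keys)      # names match: weights copied
    missing = sorted(model_keys - checkpoint_keys)     # stay at their initial values
    unexpected = sorted(checkpoint_keys - model_keys)  # ignored
    return loaded, missing, unexpected


# Two keys per side, taken from the error message at the top of this thread.
model_keys = ["model.10.conv_block.1.weight", "model.10.conv_block.6.weight"]
checkpoint_keys = ["model.10.conv_block.1.weight", "model.10.conv_block.5.weight"]

loaded, missing, unexpected = split_keys(model_keys, checkpoint_keys)
print("loaded:    ", loaded)
print("missing:   ", missing)
print("unexpected:", unexpected)
```

In this toy case only conv_block.1.weight would be restored, and the second convolution of the block would run with untrained weights. In recent PyTorch versions, as far as I know, load_state_dict returns the missing and unexpected keys, so it is worth inspecting that return value instead of discarding it when using strict=False.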

junyanz closed this as completed Jul 24, 2018

Keiser04 mentioned this issue Oct 30, 2023

Assistance in training manga-colorization qweasdd/manga-colorization#4
