
Why are there better results when using images in range [0, 255] instead of [0, 1]? #1

Open
Nick-Morgan opened this issue Sep 2, 2020 · 1 comment

Comments

@Nick-Morgan

I was running into issues trying to re-create the original paper, and stumbled upon this repository.

I was able to re-create the results when using the Caffe pretrained model (which takes images in the [0, 255] range), but got drastically different results when using PyTorch's pretrained model (which takes images in the [0, 1] range). I noticed this tidbit of code in your repository:

# normalize using ImageNet's mean
# [0, 255] range worked much better for me than [0, 1] range (even though PyTorch models were trained on latter)
transform = transforms.Compose([
    transforms.ToTensor(),
    transforms.Lambda(lambda x: x.mul(255)),
    transforms.Normalize(mean=IMAGENET_MEAN_255, std=IMAGENET_STD_NEUTRAL)
])

I applied that same transformation and got results comparable to the original paper. I am somewhat confused about why this works, though. If PyTorch's VGG19 was trained on millions of images in the [0, 1] range, wouldn't it just interpret anything above 1 as pure white?
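One way to see why [0, 255] inputs don't saturate anything is that both pipelines are affine transforms of the same image, differing only by a fixed per-channel scale that the first (linear) conv layer could in principle absorb. A minimal NumPy sketch of that relationship (the constants are torchvision's published ImageNet statistics; the variable names are illustrative, not the repo's):

```python
import numpy as np

# Assumed values mirroring the repo's constants (illustrative names):
MEAN_1 = np.array([0.485, 0.456, 0.406]).reshape(3, 1, 1)  # torchvision ImageNet mean
STD_1 = np.array([0.229, 0.224, 0.225]).reshape(3, 1, 1)   # torchvision ImageNet std
MEAN_255 = MEAN_1 * 255                                    # same mean in [0, 255] units

x = np.random.rand(3, 4, 4)  # a fake image in [0, 1], channels-first

# Standard torchvision-style normalization: (x - mean) / std
standard = (x - MEAN_1) / STD_1

# The repo's pipeline: scale to [0, 255], subtract the [0, 255] mean,
# leave std "neutral" (i.e. divide by 1.0)
caffe_style = x * 255 - MEAN_255

# caffe_style = 255 * (x - mean) = standard * (255 * std): the two
# pipelines differ only by a per-channel linear rescaling, so nothing
# is clipped or "interpreted as white" — activations are just scaled.
print(np.allclose(caffe_style, standard * (255 * STD_1)))  # True
```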

@gordicaleksa
Owner

Hi Nick!

I have it on my backlog to try to make it work in the [0, 1] range, as that feels more natural for PyTorch models: as you said, they were pre-trained on [0, 1]-range imagery, in contrast with those old Caffe models.

What I did, because I was puzzled just as you are, was to pass a [0, 255]-range image (say, of a dog) into VGG and check whether the classification output was correct. It was. My hypothesis is that this works because of a kind of symmetry: the features VGG learned appear robust to that linear rescaling of the input, so it still produces correct classifications even for [0, 255]-range inputs.

It should also work for the [0, 1] range; I'd just need a bit more experimentation. If you figure it out before me, please feel free to create a PR and notify me.
