
Relation between scaling weights of paper and implementation #59

Closed
francois-rozet opened this issue Jan 15, 2021 · 7 comments

francois-rozet commented Jan 15, 2021

Hello @richzhang,

In the LPIPS paper, the 1x1 scaling convolution is applied to the difference of the activations before the squaring.

d(x, x_0) = \sum_l \frac{1}{H_l W_l} \sum_{h,w} \left\| w_l \odot (\hat{y}^l_{hw} - \hat{y}^l_{0hw}) \right\|_2^2    (Eq. 1 of the paper)

But in the implementation, the difference of the activations is first squared and then scaled.

diffs[kk] = (feats0[kk]-feats1[kk])**2  # square the difference of the (normalized) activations
...
self.lin[kk](diffs[kk])                 # then scale with the learned 1x1 convolution

Is this a mistake? If so, is it in the paper or in the implementation?


richzhang commented Jan 15, 2021

Yes, the weights in the implementation correspond to w^2 in the paper. Fig 10 is also plotting w^2
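
Concretely, if w denotes the per-channel weights of the paper, the implementation's weights hold w^2 and the two orderings give the same distance. A minimal numerical sketch of this (hypothetical code, not from the repository):

import torch

C, H, W = 64, 28, 28
y, y0 = torch.randn(C, H, W), torch.randn(C, H, W)  # activations of one layer for the two inputs
w = torch.rand(C, 1, 1)                             # per-channel weights w of the paper

# paper ordering: scale the difference by w, then square and sum over channels
paper = ((w * (y - y0)) ** 2).sum(dim=0)

# implementation ordering: square the difference, then scale by w^2 and sum over channels
impl = ((w ** 2) * (y - y0) ** 2).sum(dim=0)

print(torch.allclose(paper, impl))  # True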


francois-rozet commented Jan 15, 2021

Thanks @richzhang! Therefore, if I am not mistaken, the convolution could even be performed after the averaging.

Like,

[formula image]

or even

[formula image]


richzhang commented Jan 15, 2021

You cannot collapse the channel direction before multiplying by w (which is scaling each channel).

In other words, any of these are fine:
(w y - w yhat)^2
= (w (y-yhat))^2
= w^2 (y - yhat)^2

Hope that makes sense

francois-rozet commented

Ah yes sure, my notation isn't very accurate. The norm and MSE should be spatial only.

richzhang commented

Great, yes that seems correct then! (also add a sum over channel direction, in front of w_l)

francois-rozet commented


Oh my bad, I meant to write a dot product, not an element-wise product.

d(x, x_0) = \sum_l w_l^\top \left[ \frac{1}{H_l W_l} \sum_{h,w} (\hat{y}^l_{hw} - \hat{y}^l_{0hw})^2 \right]

(where the square is taken element-wise and w_l^\top denotes the dot product over channels)

In fact, it means that the nn.Conv2d could be replaced by an nn.Linear with the same weights. It might also make the code a bit faster, since the product is then performed on a single channel vector per layer.
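
As an illustration, a minimal sketch of that equivalence for a single layer (hypothetical code, not the repository's): a 1x1 nn.Conv2d followed by spatial averaging gives the same result as spatial averaging followed by an nn.Linear carrying the same weights.

import torch

C, H, W = 64, 28, 28
diff_sq = (torch.randn(1, C, H, W) - torch.randn(1, C, H, W)) ** 2  # squared activation differences

conv = torch.nn.Conv2d(C, 1, kernel_size=1, bias=False)  # 1x1 scaling convolution
lin = torch.nn.Linear(C, 1, bias=False)
lin.weight.data = conv.weight.data.view(1, C)            # same weights, reshaped for the linear layer

a = conv(diff_sq).mean(dim=(2, 3))      # scale each position, then average spatially
b = lin(diff_sq.mean(dim=(2, 3)))       # average spatially, then take the dot product

print(torch.allclose(a, b, atol=1e-6))  # True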

francois-rozet changed the title from "Implementation doesn't match paper" to "Relation between scaling weights of paper and implementation" on Jan 15, 2021
francois-rozet commented

Thank you, you can close the issue 👍
