TY - JOUR
T1 - StyleGAN knows Normal, Depth, Albedo, and More
AU - Bhattad, Anand
AU - McKee, Daniel
AU - Hoiem, Derek
AU - Forsyth, D. A.
N1 - This material is based upon work supported by the National Science Foundation under Grant No.2106825 and by gifts from Amazon and Boeing.
PY - 2023
Y1 - 2023
N2 - Intrinsic images, in the original sense, are image-like maps of scene properties like depth, normal, albedo or shading.This paper demonstrates that StyleGAN can easily be induced to produce intrinsic images.Our procedure is straightforward.We show that, if StyleGAN produces G(w) from latent w, then for each type of intrinsic image, there is a fixed offset dc so that G(w + dc) is that type of intrinsic image for G(w).Here dc is independent of w.The StyleGAN we used was pretrained by others, so this property is not some accident of our training regime.We show that there are image transformations StyleGAN will not produce in this fashion, so StyleGAN is not a generic image regression engine.It is conceptually exciting that an image generator should “know” and represent intrinsic images.There may also be practical advantages to using a generative model to produce intrinsic images.The intrinsic images obtained from StyleGAN compare well both qualitatively and quantitatively with those obtained by using SOTA image regression techniques; but StyleGAN's intrinsic images are robust to relighting effects, unlike SOTA methods.
AB - Intrinsic images, in the original sense, are image-like maps of scene properties like depth, normal, albedo or shading.This paper demonstrates that StyleGAN can easily be induced to produce intrinsic images.Our procedure is straightforward.We show that, if StyleGAN produces G(w) from latent w, then for each type of intrinsic image, there is a fixed offset dc so that G(w + dc) is that type of intrinsic image for G(w).Here dc is independent of w.The StyleGAN we used was pretrained by others, so this property is not some accident of our training regime.We show that there are image transformations StyleGAN will not produce in this fashion, so StyleGAN is not a generic image regression engine.It is conceptually exciting that an image generator should “know” and represent intrinsic images.There may also be practical advantages to using a generative model to produce intrinsic images.The intrinsic images obtained from StyleGAN compare well both qualitatively and quantitatively with those obtained by using SOTA image regression techniques; but StyleGAN's intrinsic images are robust to relighting effects, unlike SOTA methods.
UR - http://www.scopus.com/inward/record.url?scp=85178967137&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85178967137&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85178967137
SN - 1049-5258
VL - 36
JO - Advances in Neural Information Processing Systems
JF - Advances in Neural Information Processing Systems
T2 - 37th Conference on Neural Information Processing Systems, NeurIPS 2023
Y2 - 10 December 2023 through 16 December 2023
ER -