StyleGAN knows Normal, Depth, Albedo, and More

Anand Bhattad, Daniel McKee, Derek Hoiem, D. A. Forsyth

Research output: Contribution to journalConference articlepeer-review

Abstract

Intrinsic images, in the original sense, are image-like maps of scene properties like depth, normal, albedo or shading.This paper demonstrates that StyleGAN can easily be induced to produce intrinsic images.Our procedure is straightforward.We show that, if StyleGAN produces G(w) from latent w, then for each type of intrinsic image, there is a fixed offset dc so that G(w + dc) is that type of intrinsic image for G(w).Here dc is independent of w.The StyleGAN we used was pretrained by others, so this property is not some accident of our training regime.We show that there are image transformations StyleGAN will not produce in this fashion, so StyleGAN is not a generic image regression engine.It is conceptually exciting that an image generator should “know” and represent intrinsic images.There may also be practical advantages to using a generative model to produce intrinsic images.The intrinsic images obtained from StyleGAN compare well both qualitatively and quantitatively with those obtained by using SOTA image regression techniques; but StyleGAN's intrinsic images are robust to relighting effects, unlike SOTA methods.

Original languageEnglish (US)
JournalAdvances in Neural Information Processing Systems
Volume36
StatePublished - 2023
Event37th Conference on Neural Information Processing Systems, NeurIPS 2023 - New Orleans, United States
Duration: Dec 10 2023Dec 16 2023

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Signal Processing

Fingerprint

Dive into the research topics of 'StyleGAN knows Normal, Depth, Albedo, and More'. Together they form a unique fingerprint.

Cite this