TY - GEN
T1 - Counterfactual depth from a single RGB image
AU - Issaranon, Theerasit
AU - Zou, Chuhang
AU - Forsyth, David
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/10
Y1 - 2019/10
N2 - We describe a method that predicts, from a single RGB image, a depth map that describes the scene when a masked object is removed - we call this 'counterfactual depth' that models hidden scene geometry together with the observations. Our method works for the same reason that scene completion works: the spatial structure of objects is simple. But we offer a much higher resolution representation of space than current scene completion methods, as we operate at pixel-level precision and do not rely on a voxel representation. Furthermore, we do not require RGBD inputs. Our method uses a standard encoder-decoder architecture, and with a decoder modified to accept an object mask. We describe a small evaluation dataset that we have collected, which allows inference about what factors affect reconstruction most strongly. Using this dataset, we show that our depth predictions for masked objects are better than other baselines.
AB - We describe a method that predicts, from a single RGB image, a depth map that describes the scene when a masked object is removed - we call this 'counterfactual depth' that models hidden scene geometry together with the observations. Our method works for the same reason that scene completion works: the spatial structure of objects is simple. But we offer a much higher resolution representation of space than current scene completion methods, as we operate at pixel-level precision and do not rely on a voxel representation. Furthermore, we do not require RGBD inputs. Our method uses a standard encoder-decoder architecture, and with a decoder modified to accept an object mask. We describe a small evaluation dataset that we have collected, which allows inference about what factors affect reconstruction most strongly. Using this dataset, we show that our depth predictions for masked objects are better than other baselines.
KW - Depth prediction
KW - Object removal
UR - http://www.scopus.com/inward/record.url?scp=85082503368&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85082503368&partnerID=8YFLogxK
U2 - 10.1109/ICCVW.2019.00268
DO - 10.1109/ICCVW.2019.00268
M3 - Conference contribution
AN - SCOPUS:85082503368
T3 - Proceedings - 2019 International Conference on Computer Vision Workshop, ICCVW 2019
SP - 2129
EP - 2138
BT - Proceedings - 2019 International Conference on Computer Vision Workshop, ICCVW 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 17th IEEE/CVF International Conference on Computer Vision Workshop, ICCVW 2019
Y2 - 27 October 2019 through 28 October 2019
ER -