We present a pipeline for generating defurnished replicas of indoor spaces represented as textured meshes and corresponding multi-view panoramic images. To achieve this, we first segment and remove furniture from the mesh representation, extend planes, and fill holes, obtaining a simplified defurnished mesh (SDM). This SDM acts as an ``X-ray'' of the scene's underlying structure, guiding the defurnishing process. We extract Canny edges from depth and normal images rendered from the SDM. We then use these as a guide to remove the furniture from panorama images via ControlNet inpainting. This control signal ensures the availability of global geometric information that may be hidden from a particular panoramic view by the furniture being removed. The inpainted panoramas are used to texture the mesh. We show that our approach produces higher quality assets than methods that rely on neural radiance fields, which tend to produce blurry low-resolution images, or RGB-D inpainting, which is highly susceptible to hallucinations.
Our pipeline consists of the following components:
This section shows results from our full defurnishing pipeline.
In this section we compare results of our fine-tuned ControlNet inpainting, controlled by Canny edges extracted from depth and normal images, with the following control methods:
In this section we compare results of our full inpainting pipeline to radiance fields methods. For fairness, we run radiance fields methods on perspective images, and we run our method on panoramas and then project to perspective images. We compare to:
We are greateful to Dorra Larnaout, Ky Waegel, Mykhaylo Kurinnyy, Neil Jassal, Will Yu, Senthil Palanisamy, Zack Baker and David Buchhofer for their contributions to this work.
The paper has been accepted to the Workshop on AI for Creative Visual Content Generation, Editing and Understanding at CVPR 2025.
@inproceedings{matterport2025defurnishing,
author = {Dolhasz, Alan and Ma, Chen and Gausebeck, Dave and Chen, Kevin and Miller, Gregor and Hayne, Lucas and Hovden, Gunnar and Sabik, Azwad and Brandt, Olaf and Slavcheva, Mira},
title = {Defurnishing with X-Ray Vision: Joint Removal of Furniture from Panoramas and Mesh},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)},
year = {2025},
}