We propose a pipeline that leverages Stable Diffusion to improve inpainting results in the context of defurnishing---the removal of furniture items from indoor panorama images. Specifically, we illustrate how increased context, domain-specific model fine-tuning, and improved image blending can produce high-fidelity inpaints that are geometrically plausible without needing to rely on room layout estimation. We demonstrate qualitative and quantitative improvements over other furniture removal techniques.
Our pipeline consists of the following components:
This section shows results from our full defurnishing pipeline.
In this section we compare results of our inpainting module only (without pre- and post-processing) to the following methods:
The paper has been accepted to the Workshop on Generative Models for Computer Vision at CVPR 2024. A short version has been accepted to the Workshop on Computer Vision in the Built Environment at CVPR 2024.
@inproceedings{matterport2024defurnishing,
author = {Slavcheva, Mira and Gausebeck, Dave and Chen, Kevin and Buchhofer, David and Sabik, Azwad and Ma, Chen and Dhillon, Sachal and Brandt, Olaf and Dolhasz, Alan},
title = {An Empty Room is All We Want: Automatic Defurnishing of Indoor Panoramas},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)},
year = {2024},
}