LOOSECONTROL: Lifting ControlNet for Generalized Depth Conditioning

Shariq Farooq Bhat, Niloy Mitra, Peter Wonka

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

3 Scopus citations

Abstract

We present LooseControl to allow generalized depth conditioning for diffusion-based image generation. ControlNet, the state of the art for depth-conditioned image generation, produces remarkable results but relies on having access to detailed depth maps for guidance. In many scenarios, creating such exact depth maps is challenging. This paper introduces a generalized version of depth conditioning that enables new content creation workflows. Specifically, we allow (C1) scene boundary control for loosely specifying scenes with only boundary conditions, and (C2) 3D box control for specifying the target objects' layout locations rather than the objects' exact shape and appearance. Using LooseControl, along with text guidance, users can create complex environments (e.g., rooms and street views) by specifying only scene boundaries and locations of primary objects. Further, we provide two editing mechanisms to refine the results: (E1) 3D box editing enables the user to refine images by changing, adding, or removing boxes while freezing the image style. This yields minimal changes apart from changes induced by the edited boxes. (E2) Attribute editing proposes possible editing directions to change one particular aspect of the scene, such as the overall object density or a particular object. Tests and comparisons with baselines demonstrate the generality of our method. We believe that LooseControl can become an important design tool for easily creating complex environments and be extended to other forms of guidance channels.

Original language: English (US)
Title of host publication: Proceedings - SIGGRAPH 2024 Conference Papers
Editors: Stephen N. Spencer
Publisher: Association for Computing Machinery, Inc
ISBN (Electronic): 9798400705250
State: Published - Jul 13 2024
Event: 2024 Special Interest Group on Computer Graphics and Interactive Techniques Conference - Conference Papers, SIGGRAPH 2024 - Denver, United States
Duration: Jul 28 2024 to Aug 1 2024

Publication series

Name: Proceedings - SIGGRAPH 2024 Conference Papers

Conference

Conference: 2024 Special Interest Group on Computer Graphics and Interactive Techniques Conference - Conference Papers, SIGGRAPH 2024
Country/Territory: United States
City: Denver
Period: 07/28/24 to 08/01/24

Keywords

  • control
  • depth condition
  • diffusion models
  • generative models
  • guided editing
  • layout control
  • partial specification

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Visual Arts and Performing Arts
  • Computer Graphics and Computer-Aided Design
