Title |
A Generation Method and Evaluation of Architectural Facade Design Using Stable Diffusion with LoRA and ControlNet |
Authors |
박정민(Park, Jungmin) ; 홍순민(Hong, Soonmin) ; 추승연(Choo, Seungyeon) |
DOI |
https://doi.org/10.5659/JAIK.2025.41.8.85 |
Keywords |
Stable Diffusion; Architecture massing; Facade Design; Generative AI; LoRA; ControlNet |
Abstract |
This study proposes a novel approach for generating architectural facade images by combining the Stable Diffusion model with Low-Rank
Adaptation (LoRA) and ControlNet. The standard Stable Diffusion model faces limitations in accurately reflecting architectural elements and
material characteristics, which are critical in the design process. To address these challenges, this research integrates domain-specific
fine-tuning using LoRA and precise shape control through ControlNet. LoRA allows the model to effectively learn architectural styles and
details, ensuring better representation of essential design elements such as windows, balconies, and facade materials. Meanwhile, ControlNet
utilizes Canny Edge and Depth Map information to enhance shape accuracy and spatial consistency, enabling more reliable image generation.
The generated images were evaluated through Contrastive Language-Image Pretraining (CLIP) scores for quantitative analysis and
GPT-4V-based qualitative evaluation, providing a more comprehensive understanding of architectural coherence and visual fidelity. The
GPT-4V assessment offered insights into spatial relationships, contextual relevance, and material expression that are not easily captured
through traditional metrics. This combined approach reduces the repetitive manual adjustments commonly required in text-prompt-based image
generation and facilitates a more intuitive and efficient design process during the early stages of architectural planning. By improving control
over detailed architectural features, the proposed method contributes to the automation of facade design, offering significant potential for
real-world applications in architectural design and visualization. Future research will focus on expanding the dataset to include diverse
architectural styles and validating its practical application in design and construction. |