Title Automatic Crack Detection in Concrete Structures Using a VGG-T Image Classification Model
Authors 백영건(Beak, Young-Gun) ; 김현승(Kim, Hyen-Seung) ; 홍영록(Hong, Rong-Lu) ; 김주형(Kim, Ju-Hyung)
DOI https://doi.org/10.5659/JAIK.2025.41.8.369
Page pp.369-376
ISSN 2733-6247
Keywords Crack; Convolutional Neural Network (CNN); Vision Transformer
Abstract Crack detection plays a crucial role in monitoring and inspecting the condition of construction structures. Traditional Convolutional Neural Network (CNN) methods, which focus mainly on local feature extraction, face limitations in accuracy. In contrast, Vision Transformer (ViT) models effectively capture global features but require large-scale datasets for training. To overcome these challenges, the VGG-T Image Classification model is proposed. This model combines the local feature extraction strength of the CNN-based VGG-16 with the global feature learning capabilities of ViT. Incorporating transfer learning and data augmentation techniques allows effective training even with small datasets. The model was evaluated using binary classification metrics and compared against VGG-16, VGG-19, ResNet-101, and ViT models. Results showed an accuracy of 99.6%, demonstrating that integrating these two architectures significantly improves detection accuracy. This advancement is expected to contribute to the development of structural safety diagnosis, automated safety maintenance, and crack detection technologies.