Mobile QR Code
Title Single-view 3D Reconstruction through Self and Cross Attention Mechanisms
Authors 김혜림(Hye Rim Kim) ; 김선빈(Seon Bin Kim) ; 고병철(Byoung Chul Ko)
DOI https://doi.org/10.5573/ieie.2024.61.11.112
Page pp.112-119
ISSN 2287-5026
Keywords Single-view 3D reconstruction; 3D object reconstruction; Attention mechanism; Implicit field
Abstract Single-view 3D reconstruction, the task of reconstructing a 3D object from a single 2D image, is an active area of research in computer vision and graphics with broad applications in augmented reality, game development, and other domains. This paper proposes a novel approach to enhance 3D reconstruction performance by incorporating attention mechanisms into a single-view 3D reconstruction framework. The proposed method applies self-attention and cross-attention mechanisms to a baseline 3D generative model consisting of an encoder-decoder architecture. These mechanisms emphasize salient local features within the image while also utilizing global contextual information to improve reconstruction accuracy. Experimental evaluation on the ShapeNet dataset, a widely used benchmark for 3D reconstruction, demonstrates that the proposed method outperforms state-of-the-art approaches on various metrics assessing reconstruction accuracy and fidelity, and also exhibits superior capability in capturing fine-grained details of complex objects.