KIEE - The Transactions of the Korean Institute of Electrical Engineers

Mobile QR Code QR CODE : The Transactions of the Korean Institute of Electrical Engineers

QR CODE : The Transactions of the Korean Institute of Electrical Engineers

The Transactions of the Korean Institute of Electrical Engineers

The Transactions of the Korean Institute of Electrical Engineers

ISO Journal TitleTrans. Korean. Inst. Elect. Eng.

SCImago Journal & Country Rank

Main Menu

Journal Search

XML PDF INFO REF


Title	Model Choice Meets Prompt Choice : A Dual-Factor Study of Zero-Shot Low-Resource Plant Recognition
Authors	좌희정(Heejung Jwa) ; 정문희(Munhee Jeong) ; 조정원(Jungwon Cho)
DOI	https://doi.org/10.5370/KIEE.2025.74.8.1426
Page	pp.1426-1431
ISSN	1975-8359
Keywords	Jeju Plants; Classification; Zero-Shot Learning; Image-Text Alignment; Multimodal Embedding
Abstract	In this study, we have assessed the zero-shot classification performance of Jeju Island plant images using five multimodal vision?language models: CLIP, SigLIP, SigLIP Multilingual, SigLIP SO400M, and SigLIP2. Evaluation data comprised image?text pairs of plant species collected from four ecologically distinct regions (Deonggae Coast, Min-oleum, Jabaebong, and Jeju City). All models were evaluated under an identical zero-shot classification protocol to ensure a fair comparison. Among them, SigLIP SO400M achieved the highest accuracy on the Deonggae Coast subset, attaining a macro accuracy of 0.7460 and a micro accuracy of 0.7612, thereby outperforming the other models. The prompt language format exerted a significant influence on performance: English-only prompts consistently surpassed Korean-only prompts across all models. Confusion matrix analysis revealed region-specific class-level misclassification patterns, identifying species prone to frequent confusion. Collectively, these results demonstrate the robust zero-shot classification capabilities of contemporary vision?language models for fine-grained plant species identification and underscore the importance of selecting both an appropriate model and prompt format for a given task. The code used for these experiments is publicly available at github.com/flyaround365/JejuPlantsClassification.

© KIEE All right's reserved

This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use, distribution and reproduction in any medium, provided the original work is property cited.