Title Knowledge Distillation Based on Adaptive Quantization for Artificial Neural Network Model Compression
Authors 이조은(Jo Eun Lee) ; 한태희(Tae Hee Han)
DOI https://doi.org/10.5573/ieie.2020.57.9.37
Page pp.37-43
ISSN 2287-5026
Keywords Adaptive quantization; Knowledge distillation; Model compression
Abstract Deep neural networks (DNNs) are used in various applications such as image classification and computer vision. However, as the depth and complexity of neural networks increase, deploying them in resource-constrained environments such as embedded systems becomes difficult, which has motivated research on neural network compression. Such techniques include quantization, which reduces the precision of neural network parameters, and knowledge distillation, which trains a small student network using the outputs of a large teacher network. This paper focuses on combining knowledge distillation with quantization to optimize the computational complexity and storage usage of the neural network model. We propose an adaptive quantization-based knowledge distillation that adjusts the precision assigned to each value according to its magnitude. In experiments with ResNet models on the CIFAR10 and CIFAR100 datasets, the proposed method achieved higher average accuracy than quantization alone, and the neural network model size decreased by an average of 69.29% compared to the full-precision model.
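The sketch below illustrates the two ingredients the abstract combines: a magnitude-dependent quantizer and a standard knowledge-distillation loss. It is a minimal example assuming PyTorch; the bit widths, threshold rule, temperature, and loss weighting are illustrative assumptions and not the paper's exact adaptive scheme.

```python
# Illustrative sketch: magnitude-aware quantization + knowledge-distillation loss.
# The specific bit allocation and hyperparameters are assumptions, not the paper's method.
import torch
import torch.nn.functional as F

def adaptive_quantize(w: torch.Tensor, high_bits: int = 8, low_bits: int = 4,
                      threshold: float = 0.5) -> torch.Tensor:
    """Quantize a weight tensor, giving more precision to large-magnitude values.

    Values whose magnitude exceeds threshold * max|w| are quantized with
    high_bits, the rest with low_bits (a hypothetical rule standing in for
    the paper's adaptive precision assignment).
    """
    max_abs = w.abs().max().clamp(min=1e-8)

    def uniform_q(x: torch.Tensor, bits: int) -> torch.Tensor:
        levels = 2 ** (bits - 1) - 1          # symmetric signed range
        scale = max_abs / levels
        return torch.round(x / scale) * scale

    large = w.abs() > threshold * max_abs
    return torch.where(large, uniform_q(w, high_bits), uniform_q(w, low_bits))

def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      labels: torch.Tensor,
                      temperature: float = 4.0,
                      alpha: float = 0.7) -> torch.Tensor:
    """Standard KD objective: softened KL term from the teacher plus hard-label CE."""
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=1),
        F.softmax(teacher_logits / temperature, dim=1),
        reduction="batchmean",
    ) * (temperature ** 2)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```

In a training loop, the student's weights would be passed through adaptive_quantize (e.g. in the forward pass or after each optimizer step) while distillation_loss drives the update from a full-precision teacher such as a larger ResNet; how the two are interleaved is a design choice left open by this sketch.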