Mobile QR Code QR CODE

  1. (Department of Information Convergence, Kwangwoon University / Seoul, Korea hjn040281@gmail.com)
  2. (Department of Computer Engineering, Kwangwoon University / Seoul, Korea {9200heesang@gmail.com, yulisun@telkomuniversity.ac.id} )
  3. (Department of Electrical Engineering, Kwangwoon University / Seoul, Korea kimrja123@naver.com)
  4. (Department of Electronic Engineering, Catholic Kwandong University, 24, Beomil-ro 579beongil, Gangneung-si Gangwon-do 25601, Korea yongkyoung0108@gmail.com )




1. Introduction

Influenza is one of the main causes of respiratory diseases all over the world [1]. Although the influenza virus can manifest as minor infections in healthy adults, it can cause severe acute respiratory diseases in high-risk groups, such as chronic patients, infants, and the elderly, posing a life-threatening risk. In the United States, an estimated 30,000 people died during the influenza pandemic [2]. Hence, early diagnosis of influenza is crucial.

Attempts of an early examination of influenza have been made in various ways. The present study aimed to detect the influenza virus efficiently before the development of the symptoms. Influenza produces more severe symptoms than the common cold and increases the frequency of dangerous complications that require proper stability and treatment [3]. Furthermore, early diagnosis is very important because it varies slightly depending on whether it is type A or B [4]. An influenza diagnosis appears to require a detailed diagnosis, not binary. Therefore, this study suggests the possibility of classification using deep learning.

In this study, influenza was classified by processing its detect kit images based on deep neural networks. A two-dimensional convolutional neural network (2D CNN) algorithm was designed for the classification task, and its structure was fine-tuned for influenza kit images using an optimization tool, Bayesian optimization hyperband (BOHB) [5]. The performance of the designed model was assessed by calculating the accuracy, precision, and recall. The remainder of this paper is classified as follows. Section 2 reviews the background, and Section 3 presents the experiment. The results are reported in Section 4.

2. Background

This section describes the proposed CNN Model and BOHB method.

2.1 CNN MODEL

LeCun $\textit{et al.}$ proposed the fundamental structure of the convolutional neural network [6], which is an algorithm used widely to classify images, sounds, text, and videos. The CNN has improved performance in recognizing image objects and finding patterns by extracting the features directly from the data [7].

Unlike the general neural network, the CNN can encode the characteristics (horizontal, vertical, and color channels) of image data assuming that the input data is an image and consists of learning weights and bias [8].

Each neuron has an internal operation and a non-linear operation according to its choice. The entire network has a one score function and a loss function on the last layer. The same techniques can be applied to general neural network learning.

The CONV(Convolutional) layer is a key component of a CNN. Receiving input from the input image, this connected area and its weight are calculated, and the output of the CONV layer is interpreted as neurons arranged in three dimensions.

ReLU(Rectified Linear Unit. Activation function) is a layer that applies the following function (1) to the output of the CONV layer.

(1)
$ f=\left\{\begin{array}{l} (x<0) f(x)=0 \\ (x \geq 0) f(x)=x \end{array}\right. $

The reason for applying RELU as an activation function is that it solves the problem of a vanishing gradient better than the sigmoid function.

POOL(Pooling) is a layer that reduces the size to reduce the number of parameters or computations in the network periodically in the middle of the CONV layer in the CNN structure. This layer performs downsampling for each depth and controls the overfit.

The neurons in the Fully-Connected (FC) layer are linked to all the activations in the previous layer. Hence, the FC layer activations multiply the metrics, and add the bias.

Fig. 1 presents the CNN model.

Fig. 1. Unilateral CNN Model Structure.
../../Resources/ieie/IEIESPC.2021.10.1.031/fig1.png

2.2 Bayesian Optimize and Hyperband

The BOHB is a combination of the Bayesian optimization method and a hyperband. The algorithm produces hyperparameter sets of Bayesian optimization, selects one of them, and progresses the hyperband to find the hyperparameters.

The performance of the deep learning algorithm depends greatly on the configuration of the hyperparameter. Bayesian optimization is used to reduce the unnecessary hyperparameter repetitive navigation to find the optimal hyperparameter faster.

Hyperband algorithms first extract the arbitrary sets within a given range for hyperparameter navigation. The predictive performance after learning was compared by applying the corresponding hyperparameters. The low-performance hyperparameter is removed, and the remaining values are used. The performance is then compared. This process is repeated to obtain the optimum until the last hyperparameter remains.

The estimation of hyperparameters can be made faster with the BOHB than Bayesian optimization because it combines the hyperband and Bayesian optimization methods using stochastic estimation information [9].

In this experiment, the three parameters were explored: learning rate (learning rate), number of learning (epochs), and weight decay (weight attenuation) using the BOHB. This is a preferable method to estimate the hyperparameters in a deep learning algorithm because there are more parameters to decide than traditional machine learning algorithms.

2.3 Influenza Kit

Influenza A and B antigens can be detected qualitatively and rapidly in nasal swab specimens using an ulti med Influenza A/B Antigen Test, a chromatographic immunoassay. The test can provide a rapid differential diagnosis of influenza A and B viral infections [10].

Influenza (commonly known as ‘flu’) is a highly contagious, acute viral infection of the respiratory tract that can be transmitted easily by coughing and the sneezing of aerosolized droplets containing live virus. Influenza outbreaks occur each year, generally during the fall and winter months. Type A viruses are typically more prevalent and are associated with most serious influenza epidemics compared to type B viruses. In contrast, type B infections are usually milder. The Influenza A/B Antigen Test detects the presence of the Influenza A or B antigen or both qualitatively in nasal swab specimens, providing results within 15 minutes. The test uses antibodies specific to Influenza A and B to detect the Influenza A and B antigen selectively in nasal swab specimens [10].

The Influenza A/B Antigen Test is a qualitative, lateral flow immunoassay that detects Influenza A and B nucleoproteins in nasal swab specimens. In this test, an antibody specific to the Influenza A and B nucleoproteins are coated separately on the test line regions of the test cassette. During testing, the extracted specimen reacts with the antibodies to Influenza A or B or both. The mixture migrates along the membrane and reacts with the antibodies to Influenza A or B or both. A positive outcome is illustrated by one or two colored lines in the test regions. As a procedural control, a colored line always appears in the control region to show that the test had been performed properly [11].

3. Materials and Methods

This section describes data acquisition, augmentation, preprocessing methods, and influenza image classification, respectively. Fig. 2 presents the experimental procedure.

Fig. 2. Experimental procedure.
../../Resources/ieie/IEIESPC.2021.10.1.031/fig2.png

3.1 Data Acquisition

In this experiment, the designed deep learning model trains the patterns of the influenza detection kit images. One hundred images of six class influenza types were used, which included ‘None’ and low, medium, and high concentrations of ‘Type A’ and ‘Type B’ influenza.

Types A and B influenza image data were the sample data produced by diluting them with a mixture of water, borate-based tween, and bovine serum albumin at the same rate, as listed in Table 2. The data were extracted from an influenza detection kit, and the deep learning model was trained using the corresponding images.

Table 1. Influenza image dataset.

Class name (Influenza type)

Number of data

NC (None type)

100

A type (Low)

15

A type (Mid)

15

A type (High)

15

B type (Low)

15

B type (Mid)

15

B type (High)

15

Table 2. Influenza dilution ratio.

Class name(Influenza type)

Dilution

A type (Low)

1/625,000

A type (Mid)

1/75,000

A type (High)

1/25,000

B type (Low)

1/160,000

B type (Mid)

1/20,000

B type (High)

1/5,000

3.2 Data Augmentation

The performance of the designed CNN model was affected strongly by the quality of the image data. Even if the number of image data is sufficient, overfitting can occur when a model is taught with a set of data with a high or low percentage of data in a particular label, i.e., an imbalanced dataset [11].

The dataset in the present study was also unbalanced between the number of ‘None’ and other classes. Hence, data were generated using image aggregation in the remaining classes except for the ‘None’ type class [5].

Geometric variations were used for data augmentation [12]. To set a balance between the images of ‘None’ and the other types, the data were generated by randomly adjusting the image rotation (between 0$^{\circ}$ and 30$^{\circ}$) and brightness level to the remaining data. Data aggregation was carried out by randomly setting the rotation and bright levels of the images, as shown in Table 3.

Table 3. Augmentation image dataset.

Class name(Influenza type)

Number of data

NC (None type)

100

A type (Low)

90

A type (Mid)

90

A type (High)

90

B type (Low)

90

B type (Mid)

90

B type (High)

90

3.3 Image Classification

Image classification of the influenza detection kits was conducted using an optimized deep learning algorithm, which is explained in Section 2. The model used in this experiment was constructed, as shown in Fig. 4.

Approximately 70% of the total data (427) were used as training data, with the remaining 30% (213) used as test data. The input images were resized to 128${\times}$128. The RGB values were separated, and the convolution operation was conducted. The SGD optimizer was used to update the weights of the networks. The decay rate prevents overfitting. As mentioned earlier, the deep learning model attempted to find better hyperparameters using the BOHB. The BOHB algorithm was used to search for the optimal parameters (learning rate, number of epochs, and weight decay) to fine-tune the image classification model, as shown in Table 4.

Fig. 3. Influenza image data example.
../../Resources/ieie/IEIESPC.2021.10.1.031/fig3.png
Fig. 4. Architecture of the 2D CNN.
../../Resources/ieie/IEIESPC.2021.10.1.031/fig4.png
Table 4. Model parameters.

Hyperparameter

Value

Optimization

Learning rate

Optimal value

(0.001 ~ 0.01)

Number of epochs

Optimal value

(100 ~ 1000)

Batch size

32

Optimizer

SGD

Weight decay

Optimal value

(0.001 ~ 0.01)

Model

Input shape

128*128*3

Number of class

7

4. Results

The classification performance was measured using three metrics: average accuracy (ACC), recall (TPR), and precision (PPV) for each class. Each metric was calculated using Eqs. (2)-(4).

(2)
$ ACC=\frac{TP+TN}{TP+FP+TN+FN} $
(3)
$ TPR=\frac{TP}{TP+FN} $
(4)
$ PPV=\frac{TP}{TN+FN} $

TP and TN are the true positives and true negatives. Similarly, the FP and FN are the false positives and false negatives. Table 5 lists the three best models in this experiment among the models tested. Table 6 presents the results of the three cases. Table 7 lists the confusion matrices of the three cases. Overall, the Case 1 Model showed the highest performance, as shown in Tables 6 and 7.

Table 5. Three best models.

3 model’s information

Learning Late

Weight decay

epoch

Case 1

0.00597168

0.00303567

832

Case 2

0.00852943

0.00101431

340

Case 3

0.00227625

0.00316404

500

Table 6. Details of the experiment results.

Experiment results

                Indicator

Model                

ACC (%)

TPR (%)

PPV (%)

Case 1

90.14

90.71

90.00

Case 2

88.73

89.14

88.57

Case 3

82.40

87.57

87.28

Table 7. The confusion matrix for each case.

Case 1

True label

Predicted label

1

2

3

4

5

6

7

1

33

0

0

0

0

0

0

2

0

20

2

0

2

0

0

3

0

6

28

1

0

0

0

4

0

0

0

29

0

0

0

5

0

4

0

0

28

5

0

6

0

0

0

0

0

25

1

7

0

0

0

0

0

0

29

Case 2

True label

Predicted label

1

2

3

4

5

6

7

1

33

0

0

0

0

0

0

2

0

20

2

0

5

0

0

3

0

6

28

1

0

0

0

4

0

0

0

29

0

0

0

5

0

4

0

0

25

5

0

6

0

0

0

0

0

25

1

7

0

0

0

0

0

0

29

Case 3

True label

Predicted label

1

2

3

4

5

6

7

1

33

2

0

0

0

0

0

2

0

9

2

0

6

0

0

3

0

6

27

0

0

0

0

4

0

0

1

30

0

0

0

5

0

13

0

0

24

9

0

6

0

0

0

0

0

21

0

7

0

0

0

0

0

0

30

(None: 1, Type A low : 2, Type A mid : 3, Type A high: 4, Type B low : 5, Type B mid : 6, Type B high: 7 )

5. Conclusion

Influenza infections can have serious consequences for humans. This study simulated the classification of influenza detection kit images. The fraction using 2d-CNN was attempted, and the hyperparameter optimized using the BOHB algorithm yielded the best performance. For future research, various neural architecture search algorithms will be assessed to improve the performance of the detection kit.

ACKNOWLEDGMENTS

“This research was supported by the MIST(Ministry of Science and ICT), under the National Program for Excellence in SW (2017-0-00096), supervised by the IITP(Institute for Information & communications Technology Promotion)”

REFERENCES

1 
KONG Byoungjae, KIM Yuna, MOON Seokoh, JUNG Younghun, YU Seok-Hyeon, SHIN Jonghyeok, PARK Joonbum, KIM Hye Rin, KWEON. Dae-Hyuk, 2018, Perforation of membrane envelope of influenza A virus., The Korean Society for Biotechnology and Bioengineering, pp. 100-100URL
2 
Kim Ju-sung, Park Geun-yong, Lee Min-koo, Kim Young-sun, Ahn Jung-bae, Yoon Hyun-ju., 1995, Rapid and Type-Specific Detection of Human Influenza Viruses using Reverse Transcription-Polymerase Chain Reaction, Journal of the Korean Microbiological Society, Vol. 30, No. 2, pp. 233-243URL
3 
Kim Sang-tae, Kim Young-gyun, Kim Jang-soo., 2011, Early Diagnostic Method of Avian Influenza Virus Subtype Using Ultra Real-Time PCR., Journal of the Microbiological Society, Vol. 47, No. 1, pp. 30-37URL
4 
Park Yoon-hyung, Woo Young-dae, Kim Seung-gon, Bae Hyung-joon, Park Sang-wook, 2001, Simultaneous detection of the cell fusion respiratory virus, influenza virus A (H3N2HH1N1) and B using multiple reversible polymerase chain reactions in a single laboratory., JOURNAL OF BACTERIOLOGY AND VIROLOGY, Vol. 31, No. 3, pp. 269-274URL
5 
Cho Hyung-hun, Lee Won-jong., 2019, Meta-learning and superparametric optimization for enhancing learning speed and performance of enhanced learning., Journal of Information Science, Vol. 37, No. 11, pp. 25-33URL
6 
Park Sung-hyun, Lim Byung-yeon, Jung Hoe-kyung, 2020, CNN-based toxic plant discriminating system., Journal of the Korean Society of Information and Communication, Vol. 24, No. 8, pp. 993-998URL
7 
Kim Jung-jin, Cho Sung-wook, Ji Young-min, 2017, Image Classification Using CNN., Journal of Comprehensive Academic Presentation of the Korean Society of Information Technology, pp. 452-453URL
8 
Falkner Stefan, Klein Aaron, Hutter Frank, 2018, BOHB: Robust and efficient hyperparameter optimization at scale., arXiv preprint arXiv:1807. 01774URL
9 
https://www.ultimed.org/produkte/influenza-ab-antigen-test/URL
10 
Na Seong-Won, Bae Hyo-Churl, Yoon Kyoungro, 2017, A study of efficient learning methods of CNN for small dataset., The Korean Society Of Broad Engineers, pp. 243-244URL
11 
Kim Geo-sik, Lee Moon-sup, Son Dong-hoon, Kim Jong-un, Min Ki-hyun, Kim Kye-eun, Kang Hyun-seo, 20202, Performance analysis of CNN-based dermal disease image classifier using data balancing algorithm., Journal of the Society of Electronics Engineers, Vol. 57, No. 7, pp. 76-8DOI

Author

Junghwan Lee
../../Resources/ieie/IEIESPC.2021.10.1.031/au1.png

Junghwan Lee is in the MSc Program at the Bio Computing & Machine Learning Laboratory (BCML) in the Department of Computer Engineering at Kwangwoon University, Seoul, Republic of Korea. His research interests include machine learning, and deep learning algorithms.

Heesang Eom
../../Resources/ieie/IEIESPC.2021.10.1.031/au2.png

Heesang Eom is in the MSc Program at the Bio Computing & Machine Learning Laboratory (BCML) in the Department of Computer Engineering at Kwangwoon University, Seoul, Republic of Korea. He received a BSc from the Department of Software Engineering, Korea Polytechnic University, Gyeonggi, Republic of Korea, in 2018. His research interests include computer vision, and deep learning algorithms.

Yuli Sun Hariyani
../../Resources/ieie/IEIESPC.2021.10.1.031/au3.png

Yuli Sun Hariyani received the B.S. degree in telecommunication engi-neering and the M.S. degree in electrical engineering from Telkom University, Bandung, Indonesia, in 2010 and 2013, respectively. She is currently pursuing the Ph.D. degree with Computer Engineering Depart-ment, Kwangwoon University, Seoul, South Korea. Since 2014, she has been a Lecturer with Telkom University, Indonesia. Her research interests include pattern recognition, medical image processing and biomedical signal processing.

Cheonjung Kim
../../Resources/ieie/IEIESPC.2021.10.1.031/au4.png

Cheonjung Kim received the B.S. degree in Electronic Engineering from Kwangwoon University, South Korea in 2015. He is recently studing toward the Electrical Egineering, Kwangwoon University. His research interest includes bioelectronics for biomolecule detection and preconcentration.

Yong Kyoung Yoo
../../Resources/ieie/IEIESPC.2021.10.1.031/au5.png

Yong Kyoung Yoo is an assistant professor at Electronic Engineering, Catholic Kwandong University, South Korea. He received the B.S., M.S., and Ph.D. degrees from Kwangwoon University, Seoul,Korea in 2011, 2013 and 2017. He was a researcher in Department of ClinicalPharmacology and Therapeutics, College of Medicine, Kyung Hee University, Seoul,Republic of Korea in 2017. He joined the Department of Electrical Engineering, Kwangwoon University, where he was as a post-doctoral course in 2018. His current research is focused on biosensors and bioelectronics.

Jeong Hoon Lee
../../Resources/ieie/IEIESPC.2021.10.1.031/au6.png

Jeong Hoon Lee is a professor at Electrical Engineering, Kwangwoon University, South Korea. He received the B.S. degree in the department of Ceramic Engineering at Yonsei University, Seoul, South Korea, in 1997. He received the Ph.D from same department in 2004. He specialized in MEMS/Nanomechanics from 1999 to 2005 at Korea Institute of Science and Technology (KIST) in Seoul, South Korea. Before joining Kwangwoon University in Sep 2008, he was a Postdoctoral Associate at RLE and EECS, Massachusetts Institute of Technology (MIT), USA (2005 to 2008). His current main research is the development of simple and powerful POCT and diagnostic systems based on the integration of electronics and fluidics.

Cheolsoo Park
../../Resources/ieie/IEIESPC.2021.10.1.031/au7.png

Cheolsoo Park is an associate professor in the Computer Engineering Department at Kwangwoon University, Seoul, South Korea. He received a B.Eng. in Electrical Engineering from Sogang University, Seoul, and an MSc from the Biomedical Engineering Department at Seoul National Univer-sity, South Korea. In 2012, he received his PhD in Adaptive Nonlinear Signal Processing from Imperial College London, London, U.K., and worked as a postdoctoral researcher in the Bioengineering Department at the University of California, San Diego, U.S.A. His research interests are mainly in the areas of machine learning and adaptive and statistical signal processing, with applications in brain computer interfaces, computational neuroscience, and wearable technology.