<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JBiSE</journal-id><journal-title-group><journal-title>Journal of Biomedical Science and Engineering</journal-title></journal-title-group><issn pub-type="epub">1937-6871</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jbise.2020.137014</article-id><article-id pub-id-type="publisher-id">JBiSE-101777</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Biomedical&amp;Life Sciences</subject></subj-group></article-categories><title-group><article-title>
 
 
  A Deep Learning Interpretable Model for Novel Coronavirus Disease (COVID-19) Screening with Chest CT Images
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Matsuyama</surname><given-names>Eri</given-names></name><xref ref-type="aff" rid="aff1"><sub>1</sub></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff1"><label>1</label><addr-line>Faculty of Informatics, The University of Fukuchiyama, Kyoto, Japan</addr-line></aff><pub-date pub-type="epub"><day>24</day><month>07</month><year>2020</year></pub-date><volume>13</volume><issue>07</issue><fpage>140</fpage><lpage>152</lpage><history><date date-type="received"><day>27</day><month>June</month><year>2020</year></date><date date-type="rev-recd"><day>25</day><month>July</month><year>2020</year></date><date date-type="accepted"><day>28</day><month>July</month><year>2020</year></date></history><permissions><copyright-statement>&#169; Copyright 2014 by authors and Scientific Research Publishing Inc.</copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  In this article, we propose a convolutional neural network (CNN) model based on ResNet-50 for discriminating coronavirus disease 2019 (COVID-19) from Non-COVID-19 using chest CT. We use the wavelet coefficients of the entire image, without cropping any part of it, as input to the CNN model. One of the main contributions of this study is the implementation of an algorithm called gradient-weighted class activation mapping to produce a heat map that visually verifies where in the image the CNN model is looking, thereby helping to confirm that the model is performing correctly. To verify the effectiveness and usefulness of the proposed method, we compare its results with those obtained by using pixel values of the original images as input to the CNN model. The measures used for performance evaluation include accuracy, sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and Matthews correlation coefficient (MCC). The overall classification accuracy, F1 score, and MCC for the proposed method (using wavelet coefficients as input) were 92.2%, 0.915, and 0.839, and those for the compared method (using pixel values of the original image as input) were 88.3%, 0.876, and 0.766, respectively. The experimental results demonstrate the superiority of the proposed method. Moreover, to make the classification model comprehensible, the interpretability of the classification results was addressed: the region of interest extracted by the proposed model was visualized using heat maps, and the probability score was also shown. We believe that the proposed method could provide a promising computerized toolkit to help radiologists and serve as a second eye for them in classifying COVID-19 in CT screening examinations.
 
</p></abstract><kwd-group><kwd>Convolutional Neural Networks</kwd><kwd>Wavelet Transforms</kwd><kwd>Classification</kwd><kwd>Lung Diseases</kwd><kwd>CT Imaging</kwd><kwd>COVID-19</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. INTRODUCTION</title><p>Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has spread all over the world and has become a pandemic. The outbreak has affected many aspects of society, including daily life, public health, and the global economy. As of June 28, 2020, the World Health Organization had reported more than 10 million confirmed cases of COVID-19 worldwide and more than 499,000 deaths. In addition, the basic reproduction number (R0), defined as the average number of secondary cases produced by one infected individual, is about 6.47 (range 1.66-10) in China, 2.6 in South Korea, and 4.7 in Iran [1-3], indicating that the spread of COVID-19 is becoming serious. Because no specific therapeutic drugs or vaccines are available for COVID-19 [<xref ref-type="bibr" rid="scirp.101777-ref4">4</xref>], the top priority is to halt its spread by screening a large number of suspected cases and isolating infected individuals from the community. According to the latest guidelines issued by the Chinese government, the diagnosis of COVID-19 should be confirmed by a reverse transcription polymerase chain reaction (RT-PCR) test. However, the sensitivity of RT-PCR might not be high enough. False negatives can also occur if the sample contains an insufficient quantity of the virus; therefore, the test may need to be conducted several times before the diagnosis is finally confirmed [5-7]. Thus, fast and accurate diagnostic methods or tools are urgently needed to fight SARS-CoV-2.</p><p>Chest CT is a routine imaging tool for pneumonia diagnosis and is therefore beneficial for the diagnosis of COVID-19. 
The majority of COVID-19 patients demonstrate similar features on CT images, including ground-glass opacities, pulmonary consolidation, and/or interstitial changes with a peripheral lung distribution [8, 9]. Although chest CT could serve as a practical approach for early screening of COVID-19, COVID-19 shares some imaging features with other types of infectious and inflammatory lung diseases. Thus, it is not easy to differentiate COVID-19 from other viral pneumonia, and radiologists may take a long time to recognize the features. Moreover, manual reading of CT images is time-consuming and subject to fatigue, which in turn can result in human error. Therefore, techniques using artificial intelligence (AI) based automated analysis have the potential to help radiologists analyze COVID-19 from CT images.</p><p>Deep learning (DL) is an important breakthrough in AI. One of the typical DL architectures is the convolutional neural network (CNN). The CNN has been widely used in the medical field due to its powerful feature representation [10-14]. Application of CNN techniques together with radiological imaging can be helpful for the accurate detection and classification of COVID-19 [<xref ref-type="bibr" rid="scirp.101777-ref15">15</xref>]. Recent works using the CNN approach for classifying CT images of COVID-19 and Non-COVID-19 have been reported [16-19]. Generally, a region of interest (ROI) is cropped from a CT image and used as input to the CNN models. These studies have achieved satisfactory performance; however, there is ample room for improvement in terms of sensitivity and classification accuracy.</p><p>In our previous studies, we used wavelet coefficients of original images as input to CNN systems to histologically classify lung diseases [<xref ref-type="bibr" rid="scirp.101777-ref20">20</xref>] and to discriminate different breast densities [<xref ref-type="bibr" rid="scirp.101777-ref21">21</xref>]. 
We obtained satisfactory results in terms of classification accuracy. In the present study, we propose a wavelet-based CNN system for automatically discriminating COVID-19 pneumonia from Non-COVID-19 pneumonia. The inputs to the network are wavelet coefficients of the entire image, without cropping any part of the original image. The present work mainly focuses on further improving the performance of classification between COVID-19 and Non-COVID-19. In this work, a well-known pre-trained CNN model, ResNet-50, was used [22, 23]. ResNet, short for residual network, is available as a pre-trained model that has been trained on more than one million images from the ImageNet database [<xref ref-type="bibr" rid="scirp.101777-ref24">24</xref>], and it was the winner of the ImageNet challenge in 2015. ResNet can be a very deep network of up to 152 layers. There are 5 versions of the ResNet model, which contain 18, 34, 50, 101, and 152 layers, respectively. ResNet-50 corresponds to a 50-layer residual network.</p><p>While DL has achieved satisfactory accuracy in image classification, one of its problems is model interpretability, a key component in model understanding. Understanding how an accurate classification model reaches its decisions gives us more confidence that the model really captures the correct patterns in the target region. Besides using wavelet coefficients instead of raw image data as input to the CNN model, one of the main contributions of this study is that we implemented an algorithm called gradient-weighted class activation mapping (Grad-CAM) [<xref ref-type="bibr" rid="scirp.101777-ref25">25</xref>] to produce a heat map that visually verifies where in the image the CNN model is looking and helps ensure that the model is performing correctly. 
In order to demonstrate the effectiveness and usefulness of the proposed method, its results are compared with those obtained by using pixel values of the original images as input to the CNN model.</p><p>The remainder of this paper is organized as follows. In Section 2, we describe the image data set, the proposed CNN model, and the interpretability of the CNN model. In Section 3, we present the experimental results. In Section 4, we discuss the results. In Section 5, we draw the conclusions of this work.</p></sec><sec id="s2"><title>2. MATERIALS AND METHODS</title><p>The environment for implementing the CNN model was as follows. Operating system: Windows 10; graphics: NVIDIA Quadro; framework: MATLAB. A pre-trained ResNet-50 CNN was used and fine-tuned. The input data to the network were the wavelet coefficients obtained from COVID-19 and Non-COVID-19 pneumonia CT images. For comparison and verification, the raw data of the original images (pixel values) were also used as input (hereafter referred to as the compared method). The performance of the proposed method and the compared method was evaluated using 10-fold cross-validation. After obtaining the classification results, the localized region of the chest CT image that determined the final classification was identified by Grad-CAM [<xref ref-type="bibr" rid="scirp.101777-ref25">25</xref>]. Furthermore, the important feature region that influences the probability score of the classification class was visualized using the occlusion sensitivity technique.</p><sec id="s2_1"><title>2.1. Image Datasets</title><p>The image datasets used in this study were the COVID-19 CT dataset made publicly available by a team of researchers at the University of California San Diego [26, 27] and the image set published by Joseph Cohen at the University of Montreal [<xref ref-type="bibr" rid="scirp.101777-ref28">28</xref>]. 
Because these datasets are publicly available, ethical issues do not arise in this work and the requirement to obtain informed consent was waived. A total of 720 images were selected from the above databases: 345 COVID-19 CT images and 375 Non-COVID-19 CT images. The collected images varied in matrix size, ranging from 153 &#215; 124 to 1853 &#215; 1485. Since these CT images were collected from COVID-19-related papers in bioRxiv, medRxiv, Lancet, NEJM, JAMA, etc., there is a concern that the Hounsfield unit (HU) values were lost and the resolution of the images was reduced. However, a senior radiologist at Tongji Hospital in Wuhan, China (a doctor who has been diagnosing and treating COVID-19 patients) stated that this did not significantly affect the accuracy of diagnostic decision-making [<xref ref-type="bibr" rid="scirp.101777-ref27">27</xref>]. The difference in the number of images between the two categories arose because the maximum number of images for the same patient was 3 and because inappropriate images, such as those containing markers, were excluded. <xref ref-type="fig" rid="fig1">Figure 1</xref> shows examples of CT images of COVID-19 and Non-COVID-19 pneumonia.</p></sec><sec id="s2_2"><title>2.2. Wavelet Transforms</title><p>The two-dimensional (2D) wavelet transform (WT) has been widely used as an image processing method. Applications to medical image processing include image data compression, image enhancement, and noise removal [<xref ref-type="bibr" rid="scirp.101777-ref29">29</xref>]. In wavelet analysis, the original image is regarded as level 0. The image is decomposed into 4 components at level 1: a low-frequency component called the low-low (LL) component and 3 detail components called the low-high (LH), high-low (HL), and high-high (HH) components. Further decomposition is performed on the LL component. 
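</p><p>As an illustration of the one-level decomposition into LL, LH, HL, and HH components described above, the following minimal Python sketch applies the Haar wavelet to a small list-of-lists image. The Haar filter, the function name haar_dwt2, and the sub-band naming convention (first letter for the filter applied along each row, second letter for the filter applied along each column) are assumptions made for illustration only. The sketch includes down-sampling and is not the db2-based redundant transform used in this study.</p><preformat preformat-type="code">
```python
def haar_dwt2(image):
    """One-level 2D Haar DWT of a list-of-lists image with even height and width.

    Returns (LL, LH, HL, HH); each sub-band has half the height and width.
    """
    rows, cols = len(image), len(image[0])
    if rows % 2 or cols % 2:
        raise ValueError("height and width must be even")
    half_r, half_c = rows // 2, cols // 2
    ll = [[0.0] * half_c for _ in range(half_r)]
    lh = [[0.0] * half_c for _ in range(half_r)]
    hl = [[0.0] * half_c for _ in range(half_r)]
    hh = [[0.0] * half_c for _ in range(half_r)]
    for i in range(half_r):
        for j in range(half_c):
            a = image[2 * i][2 * j]          # top-left pixel of the 2 x 2 block
            b = image[2 * i][2 * j + 1]      # top-right
            c = image[2 * i + 1][2 * j]      # bottom-left
            d = image[2 * i + 1][2 * j + 1]  # bottom-right
            ll[i][j] = (a + b + c + d) / 2.0  # low-pass along rows and columns
            lh[i][j] = (a + b - c - d) / 2.0  # low-pass along rows, high-pass along columns
            hl[i][j] = (a - b + c - d) / 2.0  # high-pass along rows, low-pass along columns
            hh[i][j] = (a - b - c + d) / 2.0  # high-pass along rows and columns
    return ll, lh, hl, hh


ll, lh, hl, hh = haar_dwt2([[1, 2], [3, 4]])
print(ll, lh, hl, hh)  # [[5.0]] [[-2.0]] [[-1.0]] [[0.0]]
```
</preformat><p>Applying the same function again to the returned LL component (provided its height and width are even) would give the level-2 components.</p><p>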
More details about the WT can be found in [<xref ref-type="bibr" rid="scirp.101777-ref30">30</xref>].</p><p>We implemented a 2D redundant discrete WT (RDWT). Unlike the conventional WT, the RDWT does not perform down-sampling. Thus, the four components at each level remain the same size as the original image at level 0. In this way, the lack of shift invariance and the disappearance of contours in the decomposed images can be avoided. In this study, Daubechies order 2 (db2) was used as the wavelet basis function because db2 is a compactly supported orthogonal wavelet. As a result, coefficient values that might be able to distinguish the features of interest shown in chest CT images can be captured. <xref ref-type="fig" rid="fig2">Figure 2</xref> shows two examples of the four RDWT decomposition components at level 1, corresponding to <xref ref-type="fig" rid="fig1">Figure 1</xref>(a) and <xref ref-type="fig" rid="fig1">Figure 1</xref>(c), respectively.</p></sec><sec id="s2_3"><title>2.3. Architecture of the Proposed CNN Model</title><p>In this study, fine-tuning of the ResNet-50 model was implemented. The outline of the network configuration is shown in <xref ref-type="fig" rid="fig3">Figure 3</xref>. The ResNet-50 model consists of 16 processing blocks and incorporates two types of shortcut modules (<xref ref-type="fig" rid="fig3">Figure 3</xref>(b)). One is the identity (ID) block (<xref ref-type="fig" rid="fig3">Figure 3</xref>(c)), which has no convolutional layer in the shortcut path (the input has the same dimension as the output). The other is the convolutional block (<xref ref-type="fig" rid="fig3">Figure 3</xref>(d)), which has a convolutional layer in the shortcut path (the dimension of the input is smaller than that of the output). Both modules contain a bottleneck structure consisting of 3 layers (1 &#215; 1, 3 &#215; 3, and 1 &#215; 1 convolutional layers). 
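</p><p>To make the parameter saving of the bottleneck structure concrete, the following Python sketch counts the convolution weights of one bottleneck (1 &#215; 1, 3 &#215; 3, 1 &#215; 1) and of two full-width 3 &#215; 3 layers. The channel widths 256 and 64 are assumed here for illustration (they follow the first residual stage of the original ResNet description and are not stated in this article), biases and batch-normalization parameters are ignored, and the function names are hypothetical.</p><preformat preformat-type="code">
```python
def conv_weights(kernel, in_channels, out_channels):
    """Number of weights in a kernel x kernel convolution (biases ignored)."""
    return kernel * kernel * in_channels * out_channels


def bottleneck_weights(channels, reduced):
    """Weights of a 1x1 (reduce), 3x3, 1x1 (restore) bottleneck."""
    return (conv_weights(1, channels, reduced)
            + conv_weights(3, reduced, reduced)
            + conv_weights(1, reduced, channels))


def plain_weights(channels):
    """Weights of two stacked 3x3 convolutions at full width, for comparison."""
    return 2 * conv_weights(3, channels, channels)


bottleneck = bottleneck_weights(256, 64)
plain = plain_weights(256)
print(bottleneck, plain)  # 69632 1179648
```
</preformat><p>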
This bottleneck design makes it possible to reduce the number of parameters without a significant decline in model performance.</p><p>We retrained all layers of the network with the CT images obtained from the datasets. In other words, the two categories, COVID-19 and Non-COVID-19, were classified using a fine-tuned network without frozen layers. The last fully connected layer and the final classification layer of the network were replaced with new layers that classify the input images into two categories. The input data to the proposed model were the wavelet coefficients obtained by applying the RDWT to the original chest CT images. Of the LL, LH, HL, and HH components, the combination of the LL, LH, and HH components, which gave high accuracy [20, 21], was selected as the 3-channel input to the network. For comparison with the proposed method, the pixel values of 3 identical copies of the original CT image were also used as the input to the same network (the compared method).</p><p>Since ResNet-50 requires input images of size 224 &#215; 224, the input images were resized using bicubic interpolation. As a pre-processing step, the resizing was performed automatically before the image input layer (<xref ref-type="fig" rid="fig3">Figure 3</xref>(a)). We applied 10-fold cross-validation for the network re-training. Of the total 720 images, 648 images were used for re-training and the remaining 72 images for validation in each fold. The mini-batch size was 81, and the optimization algorithm chosen for re-training was stochastic gradient descent with momentum. During the re-training phase, in order to improve accuracy, the learning rate was set higher in the newly replaced fully connected layer and lower in the transferred layers. Also, the parameters were adjusted so that the learning rate decreased every 5 epochs. Furthermore, L2-norm regularization was applied to the cost function (also referred to as the loss function) in order to prevent overfitting. 
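</p><p>The data split and the learning-rate schedule described above can be sketched in Python as follows. This is only an illustrative sketch, not the MATLAB code used in this study: the function names, the random seed, the initial learning rate, and the decay factor of 0.1 are assumptions, and the split is not stratified by class.</p><preformat preformat-type="code">
```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_idx, val_idx) index lists for k-fold cross-validation."""
    order = list(range(n_samples))
    random.Random(seed).shuffle(order)
    folds = [order[i::k] for i in range(k)]  # k folds of nearly equal size
    for i in range(k):
        train_idx = [idx for j in range(k) if j != i for idx in folds[j]]
        yield train_idx, folds[i]


def step_decay(initial_rate, epoch, drop_every=5, factor=0.1):
    """Piecewise-constant schedule: scale the rate by `factor` every `drop_every` epochs."""
    return initial_rate * factor ** (epoch // drop_every)


splits = list(k_fold_indices(720, k=10))
train_idx, val_idx = splits[0]
print(len(train_idx), len(val_idx))  # 648 72
print(len(train_idx) // 81)          # 8 mini-batches of 81 images per epoch
print([step_decay(1e-3, e) for e in (0, 4, 5, 10)])
```
</preformat><p>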
Regarding the epoch setting, accuracy was verified at each iteration cycle, and re-training was stopped when the accuracy had not improved for 5 consecutive iterations.</p></sec><sec id="s2_4"><title>2.4. Interpretability and Explainability</title><sec id="s2_4_1"><title>2.4.1. Gradient-Weighted Class Activation Mapping (Grad-CAM)</title><p>The CNN model combines the feature extraction and classification modules into one integrated system. In general, the classification module contains a fully connected neural network, and the extracted features are converted into a probability score for each class at the softmax layer. The final prediction (classification result) of the network is the category with the highest probability score. Grad-CAM [<xref ref-type="bibr" rid="scirp.101777-ref25">25</xref>] is class-discriminative and localizes the relevant image regions: it uses the gradient (derivative) of the class score with respect to the feature maps of the final convolutional layer of the network to highlight the regions of the image that are significant for the final prediction. Regions with high gradients are the areas that have a great effect on the classification result. <xref ref-type="fig" rid="fig4">Figure 4</xref> depicts a flowchart of the Grad-CAM implementation. More details about Grad-CAM can be found in [<xref ref-type="bibr" rid="scirp.101777-ref25">25</xref>].</p></sec><sec id="s2_4_2"><title>2.4.2. Occlusion Sensitivity Approach</title><p>Occlusion sensitivity is an approach for understanding which parts of an image are most important for classification. It helps us understand the learning behavior of the network by determining whether the network is actually classifying based on task-specific features [<xref ref-type="bibr" rid="scirp.101777-ref31">31</xref>]. 
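</p><p>The sliding-mask idea behind the occlusion sensitivity procedure of this section can be sketched in Python as follows. This is a minimal sketch under stated assumptions: occlusion_map and toy_score are hypothetical names, toy_score is a stand-in for the probability score returned by the fine-tuned network, and the fill value of 0 for masked pixels is an arbitrary choice. The last lines compute the integers closest to 20% and 30% of a 224-pixel input side.</p><preformat preformat-type="code">
```python
def occlusion_map(image, score_fn, mask_size, stride, fill=0.0):
    """Slide a square mask over `image` and record the drop in score.

    `image` is a list of rows and `score_fn` maps an image to the probability
    score of the target class. Returns (baseline, drops), where drops[r][c] is
    the baseline score minus the score with the mask at grid position (r, c).
    """
    height, width = len(image), len(image[0])
    baseline = score_fn(image)  # score of the unmasked image
    drops = []
    for top in range(0, height - mask_size + 1, stride):
        row = []
        for left in range(0, width - mask_size + 1, stride):
            masked = [list(r) for r in image]  # copy, then mask one region
            for y in range(top, top + mask_size):
                for x in range(left, left + mask_size):
                    masked[y][x] = fill
            row.append(baseline - score_fn(masked))  # score drop for this position
        drops.append(row)
    return baseline, drops


def toy_score(image):
    """Hypothetical stand-in for the network: mean of the lower-right 2 x 2 block."""
    return sum(image[y][x] for y in (2, 3) for x in (2, 3)) / 4.0


toy_image = [[0.0, 0.0, 0.0, 0.0],
             [0.0, 0.0, 0.0, 0.0],
             [0.0, 0.0, 1.0, 1.0],
             [0.0, 0.0, 1.0, 1.0]]
baseline, drops = occlusion_map(toy_image, toy_score, mask_size=2, stride=2)
print(baseline, drops)  # 1.0 [[0.0, 0.0], [0.0, 1.0]]

# Integers closest to 20% and 30% of a 224-pixel input side.
mask_224, stride_224 = round(0.2 * 224), round(0.3 * 224)
print(mask_224, stride_224)  # 45 67
```
</preformat><p>In the toy example, only the mask position that covers the bright block lowers the score, so that position has the largest value in the map.</p><p>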
The occlusion sensitivity procedure can be divided into six steps as follows.</p><p>Step 1: Classify the target image using the fine-tuned network, and confirm the probability score of the classification category.</p><p>Step 2: Mask (block) one part of the target image.</p><p>Step 3: Input the masked image created in step 2 to the fine-tuned network.</p><p>Step 4: Calculate the probability score of the classification category.</p><p>Step 5: Move the masked region to another position and repeat steps 3 and 4.</p><p>Step 6: Repeat steps 2-5 until the mask has moved over the entire image.</p><p>Note that the smaller the mask size and stride, the higher the resolution of the occlusion map. However, in this study, in order to shorten the computation time, the mask size and the stride were set to the integer values closest to 20% and 30% of the input size, respectively. From the results of steps 1-6, two possibilities can be considered when one part of the target image is masked (blocked) and the masked image is input to the network: 1) the image is classified as a non-target class (or the probability score of the target class drops significantly), in which case it is highly likely that the masked region is a very important feature for determining the target class; 2) the probability score of the target class increases significantly, in which case the masked region is likely to be an area that could cause misclassification as a non-target class. In this study, the region that has the greatest influence on the probability score is indicated using a heat map.</p></sec></sec></sec><sec id="s3"><title>3. RESULTS</title><p>In this study, we attempted to construct a CNN model for discriminating COVID-19 pneumonia and to make the model interpretable and explainable. 
The image input layer of the network was replaced with a wavelet transform layer, the redundant wavelet coefficients of the CT image were used as input data, and all layers of the network were re-trained. For comparison, learning and classification were also conducted using the pixel values of the image as input data (the compared method). The entire image, without cropping, was used as input for both methods.</p><p><xref ref-type="table" rid="table1">Table 1</xref> shows the classification performance of the proposed and compared methods. The measures used for performance evaluation include accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), F1 score, and Matthews correlation coefficient (MCC). The PPV and NPV are the proportions of positive and negative results in diagnostic tests that are true positive and true negative results, respectively [<xref ref-type="bibr" rid="scirp.101777-ref32">32</xref>]. They describe the performance of a diagnostic test: the higher the PPV and NPV, the more accurate the test. The F1 score is an overall measure of a model’s accuracy that combines precision and recall. A high F1 score means that the model (or classifier) being evaluated has few false positives and few false negatives. The model is considered perfect when the F1 score is 1 and a total failure when the score is 0. The MCC is used as a measure of the quality of binary and multiclass classifications. It is in essence a correlation coefficient with a value between −1 and +1: a coefficient of +1 indicates a perfect prediction, 0 a prediction no better than random, and −1 a completely inverse prediction. <xref ref-type="fig" rid="fig5">Figure 5</xref> shows the receiver operating characteristic (ROC) curves for the proposed and compared methods. 
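</p><p>The performance measures defined above can all be computed from the four confusion-matrix counts, as in the following Python sketch. The function name binary_metrics and the counts in the example are hypothetical and chosen only for illustration; they are not taken from <xref ref-type="table" rid="table1">Table 1</xref>.</p><preformat preformat-type="code">
```python
import math


def binary_metrics(tp, fn, tn, fp):
    """Screening metrics from confusion-matrix counts (positive = COVID-19).

    Assumes that every row and column total of the confusion matrix is non-zero.
    """
    mcc_den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
        "sensitivity": tp / (tp + fn),       # also called recall
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),               # also called precision
        "npv": tn / (tn + fn),
        "f1": 2 * tp / (2 * tp + fp + fn),   # harmonic mean of PPV and sensitivity
        "mcc": (tp * tn - fp * fn) / mcc_den,
    }


# Hypothetical counts for illustration only.
metrics = binary_metrics(tp=90, fn=10, tn=85, fp=15)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```
</preformat><p>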
The area under the ROC curve (AUC) is also shown in <xref ref-type="fig" rid="fig5">Figure 5</xref> for each method as an overall measure of classification performance.</p><p>In this study, the region on which the network focused at the time of the final judgment (classification) was identified by Grad-CAM. Meanwhile, occlusion sensitivity was used to identify the feature regions that have the strongest influence on the probability score for prediction. <xref ref-type="fig" rid="fig6">Figure 6</xref> illustrates examples of flowcharts for probability scores of classification categories. The upper rows of Figures 6(a)-(c) show COVID-19 CT images. The middle rows show the results obtained using occlusion sensitivity. The red region of the heat map is the feature region that has the greatest effect on the probability score of the classification category. That is, when an image is input with the red region occluded, the probability score of the target class decreases and that of the other category increases. The 6 figures in the lower rows of <xref ref-type="fig" rid="fig6">Figure 6</xref> are the results of Grad-CAM. The red regions are the regions to which the network paid the most attention when making the final decision. <xref ref-type="fig" rid="fig6">Figure 6</xref>(a) is an example that was correctly classified (true positive) when wavelet coefficients were used as input but misclassified (false negative) when the original image was used as input. Figure 6(b) shows a case of correct classification (true positive) by both methods, and <xref ref-type="fig" rid="fig6">Figure 6</xref>(c) shows a case of misclassification (false negative) by both methods.</p></sec><sec id="s4"><title>4. 
DISCUSSION</title><p>As shown in <xref ref-type="table" rid="table1">Table 1</xref>, the overall classification accuracy of the proposed method (fine-tuning with wavelet-coefficient input) is 0.922, and that of the compared method (fine-tuning with the pixel values of the original images) is 0.883 (p &lt; 0.05); the proposed method thus achieves higher accuracy. The sensitivity and specificity of the proposed method are 0.904 and 0.933, and those of the compared method are 0.864 and 0.901, respectively. The proposed method has a PPV of 92.6% and an NPV of 91.4%, whereas the compared method has a PPV of 89.0% and an NPV of 87.8%. The F1 scores of the proposed method and the compared method are 0.915 and 0.876, respectively. The MCCs are 0.839 and 0.766 for the proposed method and the compared method, respectively, which indicates that the predictions of the proposed method are more strongly correlated with the correct labels. As is clear from <xref ref-type="fig" rid="fig5">Figure 5</xref>, the proposed method also achieves a better AUC (0.976) than the compared method (0.959). On every measure, the proposed method is superior to the compared method.</p><table-wrap id="table1"><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title>Performance results obtained from the proposed method (using wavelet coefficients as input) and the compared method (using pixel values of the original image as input).</title></caption><table><thead><tr><th align="center" valign="middle">Performance Measure</th><th align="center" valign="middle">Proposed (Wavelet)</th><th align="center" valign="middle">Compared (Original)</th></tr></thead><tbody><tr><td align="center" valign="middle">Accuracy</td><td align="center" valign="middle">0.922</td><td align="center" valign="middle">0.883</td></tr><tr><td align="center" valign="middle">Sensitivity</td><td align="center" valign="middle">0.904</td><td align="center" valign="middle">0.864</td></tr><tr><td align="center" valign="middle">Specificity</td><td align="center" valign="middle">0.933</td><td align="center" valign="middle">0.901</td></tr><tr><td align="center" valign="middle">Positive Predictive Value (PPV)</td><td align="center" valign="middle">0.926</td><td align="center" valign="middle">0.890</td></tr><tr><td align="center" valign="middle">Negative Predictive Value (NPV)</td><td align="center" valign="middle">0.914</td><td align="center" valign="middle">0.878</td></tr><tr><td align="center" valign="middle">F1 Score</td><td align="center" valign="middle">0.915</td><td align="center" valign="middle">0.876</td></tr><tr><td align="center" valign="middle">Matthews Correlation Coefficient (MCC)</td><td align="center" valign="middle">0.839</td><td align="center" valign="middle">0.766</td></tr></tbody></table></table-wrap>
<p>In the present study, we used the whole CT image without cropping ROIs such as lesion regions. A general concern with this approach is where the network is looking when it makes its judgment. <xref ref-type="fig" rid="fig6">Figure 6</xref> gives examples that illustrate the ROI to which the network pays attention. The red regions in the middle rows (occlusion sensitivity maps) of the 3 figures show the most important feature regions that influence the probability scores of the classification classes. In all the cases in <xref ref-type="fig" rid="fig6">Figure 6</xref>, the important features extracted by the network are strongly concentrated in the thoracic cavity rather than in the background or the body outline. This suggests that the network has learned task-specific features. 
In addition, with the proposed method, the network has a strong tendency to identify COVID-19 with high confidence, with probability scores ranging from 99% to 100%. On the other hand, with the compared method, the probability score varies from 50.2% to 100%. This suggests that learning is less stable with the compared method.</p><p><xref ref-type="fig" rid="fig6">Figure 6</xref>(a) and <xref ref-type="fig" rid="fig6">Figure 6</xref>(b) are examples showing correct classification of COVID-19 with a probability score of 100% by the proposed method. Here, if the red region of the input image is occluded, the probability score decreases, which means that the red region is an important feature for identifying COVID-19. The blue region is generally a region that negatively affects the score of the class. However, for the proposed method in <xref ref-type="fig" rid="fig6">Figure 6</xref>(a) and <xref ref-type="fig" rid="fig6">Figure 6</xref>(b), the probability score for each case is 100%, which means that the probability score is not affected by the blue region. This implies that there is no need to crop the image when the proposed method is used. The red regions in the lower rows (Grad-CAM) of Figures 6(a)-(c) are the regions to which the network pays the most attention at the time of the final judgment and which have a great influence on the classification result.</p><p><xref ref-type="fig" rid="fig6">Figure 6</xref>(b) is an example of successful classification by both methods, although their corresponding Grad-CAM regions do not match. In the proposed method, the judgment is made with emphasis on the consolidation in the posterior basal segment of the left lung. In contrast, in the compared method, not only the consolidation in the posterior basal segments of both lungs and in the medial segment of the left lung but also the latissimus dorsi is regarded as the ROI in the process of judgment. 
In fact, the latissimus dorsi is unrelated to COVID-19. That is, the judgment made by the compared method was influenced by a bias irrelevant to the classification category.</p><p><xref ref-type="fig" rid="fig6">Figure 6</xref>(c) is an example of a case misclassified by both methods. It can be seen from the red region of the occlusion sensitivity map that the network of the proposed method focuses on the pneumonia feature, i.e., ground-glass opacity. In contrast, for the compared method, the anterior mediastinum near the heart is also shown in slightly red, which might be due to learning the wrong features. As a result, the respective probability scores are 55.5% and 57.9%, and the image is misclassified as Non-COVID-19 by both methods. As shown in red on the Grad-CAM maps, the ROI on which both methods focus is the ground-glass opacity in the posterior basal segment of the right lung. The region localization is considered to be correct; however, the judgment is incorrect. In other words, a CT image with a wide mediastinum, a narrow lung field, and ground-glass opacities in both lungs might confuse the judgment made by the network. In order to solve this problem, it would be necessary to add similar training data for re-training.</p><p>There are several limitations in this study. First, high sensitivity is considered an especially important factor in the screening of COVID-19; however, in our experiments, specificity was higher than sensitivity (see <xref ref-type="table" rid="table1">Table 1</xref>). To deal with this issue, further improvement of the proposed method is required, and adjusting the parameters of the network might be necessary. Second, the image data set used in this study contained some images with reduced resolution and loss of CT values. Appropriate selection of training data is considered important in subsequent studies. 
Third, we did not confirm the treatment outcomes of the analyzed patients, because they are beyond the scope of this study. In addition, the wavelet basis function used was db2. However, it cannot be ruled out that other basis functions or an optimized combination of wavelet coefficients might yield better results. This will be investigated in a future study. Moreover, we employed ResNet-50 in this study, so it is possible that the network is too deep for the amount of training data used in the experiments. Verifying this is also one of our future tasks.</p></sec><sec id="s5"><title>5. CONCLUSIONS</title><p>In this study, a pre-trained network based on the ResNet-50 model was employed. Transfer learning was performed using wavelet coefficients of CT images as input to the fine-tuned CNN. The network was used to distinguish COVID-19 pneumonia from Non-COVID-19 pneumonia in chest CT images. To make the classification model comprehensible, interpretability of the classification results was introduced: the region of interest extracted by the network was visualized using heat maps, and the probability score was also shown. For comparison, the case of using pixel values of the original image as the input to the fine-tuned CNN was also shown. The overall classification accuracy of the proposed method was 92.2%, compared with 88.3% for the compared method. Besides overall accuracy, all the other calculated performance measures were also higher for the proposed method than for the compared method. 
The experimental results demonstrated the superiority of the proposed method over the method that used the pixel values of the original images as input to the CNN.</p><p>Furthermore, visualizing the features extracted from the region of interest created by the CNN shows that with the proposed method, i.e., using the wavelet coefficients of the entire image as input to the CNN without cropping away any part of the image, the network could correctly learn the extracted features. In addition, the basis of the judgment, i.e., what the CNN is looking at, became apparent by visualizing the ROI at the final classification stage. This kind of comprehensible classification model could give users insight into important predictive relationships in the image data. Model comprehensibility is considered very important for the model’s acceptance by users in classification and prediction applications. We believe that our proposed method will provide a promising computerized toolkit to help radiologists and serve as a second eye for them in classifying COVID-19 in CT screening examinations.</p></sec><sec id="s6"><title>ACKNOWLEDGEMENTS</title><p>This work was supported in part by JSPS KAKENHI (Grant-in-Aid for Scientific Research) Grant Number 18K15641. The author would like to thank Professor Du-Yih Tsai for his valuable comments and suggestions during the preparation of this paper.</p></sec><sec id="s7"><title>CONFLICTS OF INTEREST</title><p>The author declares no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s8"><title>REFERENCES</title></sec></body><back><ref-list><title>References</title><ref id="scirp.101777-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Tang, B., Wang, X., Li, Q., et al. (2020) Estimation of the Transmission Risk of the 2019-nCoV and Its Implication for Public Health Interventions. Journal of Clinical Medicine, 9, 462. 
https://doi.org/10.3390/jcm9020462</mixed-citation></ref><ref id="scirp.101777-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Tang, B., Xia, F., Bragazzi, N.L., et al. (2020) Lessons Drawn from China and South Korea for Managing COVID-19 Epidemic: Insights from a Comparative Modeling Study.  
https://doi.org/10.1101/2020.03.09.20033464</mixed-citation></ref><ref id="scirp.101777-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Ahmadi, A., Fadaei, Y., Shirani, M., et al. (2020) Modeling and Forecasting Trend of COVID-19 Epidemic in Iran. https://doi.org/10.1101/2020.03.17.20037671</mixed-citation></ref><ref id="scirp.101777-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Gao, Q., Bao, L., Mao, H., et al. (2020) Rapid Development of an Inactivated Vaccine for SARS-CoV-2.  
https://doi.org/10.1101/2020.04.17.046375</mixed-citation></ref><ref id="scirp.101777-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Corman, V.M., Landt, O., Kaiser, M., et al. (2020) Detection of 2019 Novel Coronavirus (2019-nCoV) by Real-Time RT-PCR. Eurosurveillance, 25, Article ID: 2000045. 
https://doi.org/10.2807/1560-7917.ES.2020.25.3.2000045</mixed-citation></ref><ref id="scirp.101777-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Chu, D.K.W., Pan, Y., Cheng, S.M.S., et al. (2020) Molecular Diagnosis of a Novel Coronavirus (2019-nCoV) Causing an Outbreak of Pneumonia. Clinical Chemistry, 66, 549-555. https://doi.org/10.1093/clinchem/hvaa029</mixed-citation></ref><ref id="scirp.101777-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Zhang, N., Wang, L., Deng, X., et al. (2020) Recent Advances in the Detection of Respiratory Virus Infection in Humans. Journal of Medical Virology, 92, 408-417. https://doi.org/10.1002/jmv.25674</mixed-citation></ref><ref id="scirp.101777-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Ai, T., Yang, Z., Hou, H., et al. (2020) Correlation of Chest CT and RT-PCR Testing in Coronavirus Disease 2019 (COVID-19) in China: A Report of 1014 Cases. Radiology. https://doi.org/10.1148/radiol.2020200642</mixed-citation></ref><ref id="scirp.101777-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Wang, S., Kang, B., Ma, J., et al. (2020) A Deep Learning Algorithm Using CT Images to Screen for Corona Virus Disease (COVID-19). https://doi.org/10.1101/2020.02.14.20023028</mixed-citation></ref><ref id="scirp.101777-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Choe, J., Lee, S.M., Do, K.H., et al. (2019) Deep Learning-Based Image Conversion of CT Reconstruction Kernels Improves Radiomics Reproducibility for Pulmonary Nodules or Masses. Radiology, 292, 365-373. 
https://doi.org/10.1148/radiol.2019181960</mixed-citation></ref><ref id="scirp.101777-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Kermany, D.S., Goldbaum, M., Cai, W., et al. (2018) Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning. Cell, 172, 1122-1131. https://doi.org/10.1016/j.cell.2018.02.010</mixed-citation></ref><ref id="scirp.101777-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Negassi, M., Suarez-Ibarrola, R., Hein, S., et al. (2020) Application of Artificial Neural Networks for Automated Analysis of Cystoscopic Images: A Review of the Current Status and Future Prospects. World Journal of Urology. 
https://doi.org/10.1007/s00345-019-03059-0</mixed-citation></ref><ref id="scirp.101777-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Wang, P., Xiao, X., Brown, J.R.G., et al. (2018) Development and Validation of a Deep-Learning Algorithm for the Detection of Polyps during Colonoscopy. Nature Biomedical Engineering, 2, 741-748.  
https://doi.org/10.1038/s41551-018-0301-3</mixed-citation></ref><ref id="scirp.101777-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Yan, Q., Wang, B., Gong, D., et al. (2020) COVID-19 Chest CT Image Segmentation—A Deep Convolutional Neural Network Solution. https://arxiv.org/abs/2004.10987</mixed-citation></ref><ref id="scirp.101777-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Ozturk, T., Talo, M., Yildirim, E.A., et al. (2020) Automated Detection of COVID-19 Cases Using Deep Neural Networks with X-Ray Images. Computers in Biology and Medicine, 121, Article ID: 103792.  
https://doi.org/10.1016/j.compbiomed.2020.103792</mixed-citation></ref><ref id="scirp.101777-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Ardakani, A.A., Kanafi, A.R., Acharya, U.R., et al. (2020) Application of Deep Learning Technique to Manage COVID-19 in Routine Clinical Practice Using CT Images: Results of 10 Convolutional Neural Networks. Computers in Biology and Medicine, 121, Article ID: 103795. https://doi.org/10.1016/j.compbiomed.2020.103795</mixed-citation></ref><ref id="scirp.101777-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Li, L., Qin, L., Xu, Z., et al. (2020) Artificial Intelligence Distinguishes COVID-19 from Community Acquired Pneumonia on Chest CT. Radiology.</mixed-citation></ref><ref id="scirp.101777-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Xu, X., Jiang, X., Ma, C., et al. (2020) Deep Learning System to Screen Coronavirus Disease 2019 Pneumonia.  
https://arxiv.org/abs/2002.09334</mixed-citation></ref><ref id="scirp.101777-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Chen, J., Wu, L., Zhang, J., et al. (2020) Deep Learning-Based Model for Detecting 2019 Novel Coronavirus Pneumonia on High-Resolution Computed Tomography: A Prospective Study.  
https://doi.org/10.1101/2020.02.25.20021568</mixed-citation></ref><ref id="scirp.101777-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">Matsuyama, E. and Tsai, D.-Y. (2018) Automated Classification of Lung Diseases in Computed Tomography Images Using a Wavelet Based Convolutional Neural Network. Journal of Biomedical Science and Engineering, 11, 263-274. https://doi.org/10.4236/jbise.2018.1110022</mixed-citation></ref><ref id="scirp.101777-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple">Matsuyama, E., Takehara, M. and Tsai, D.-Y. (2020) Using a Wavelet-Based and Fine-Tuned Convolutional Neural Network for Classification of Breast Density in Mammographic Images. Open Journal of Medical Imaging, 10, 17-29. https://doi.org/10.4236/ojmi.2020.101002</mixed-citation></ref><ref id="scirp.101777-ref22"><label>22</label><mixed-citation publication-type="other" xlink:type="simple">Narayanan, B.N., Silva, M.S.D., Hardie, R.C., et al. (2019) Understanding Deep Neural Network Predictions for Medical Imaging Applications.</mixed-citation></ref><ref id="scirp.101777-ref23"><label>23</label><mixed-citation publication-type="other" xlink:type="simple">Narayanan, B.N., Davuluru, V.S.P. and Hardie, R.C. (2020) Two-Stage Deep Learning Architecture for Pneumonia Detection and Its Diagnosis in Chest Radiographs. Proceedings of SPIE Medical Imaging 2020, Houston, 2 March 2020, 113180G, 1-10. https://doi.org/10.1117/12.2547635</mixed-citation></ref><ref id="scirp.101777-ref24"><label>24</label><mixed-citation publication-type="other" xlink:type="simple">ImageNet. http://www.image-net.org</mixed-citation></ref><ref id="scirp.101777-ref25"><label>25</label><mixed-citation publication-type="other" xlink:type="simple">Selvaraju, R.R., Cogswell, M., Das, A., et al. (2020) Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. International Journal of Computer Vision, 128, 336-359.  
https://doi.org/10.1007/s11263-019-01228-7</mixed-citation></ref><ref id="scirp.101777-ref26"><label>26</label><mixed-citation publication-type="other" xlink:type="simple">https://arxiv.org/pdf/2003.13865</mixed-citation></ref><ref id="scirp.101777-ref27"><label>27</label><mixed-citation publication-type="other" xlink:type="simple">https://github.com/UCSD-AI4H/COVID-CT</mixed-citation></ref><ref id="scirp.101777-ref28"><label>28</label><mixed-citation publication-type="other" xlink:type="simple">https://github.com/ieee8023/covid-chestxray-dataset/tree/master/images</mixed-citation></ref><ref id="scirp.101777-ref29"><label>29</label><mixed-citation publication-type="other" xlink:type="simple">Matsuyama, E., Tsai, D.-Y., Lee, Y., et al. (2013) A Modified Undecimated Discrete Wavelet Transform Based Approach to Mammographic Image Denoising. Journal of Digital Imaging, 26, 748-758.  
https://doi.org/10.1007/s10278-012-9555-6</mixed-citation></ref><ref id="scirp.101777-ref30"><label>30</label><mixed-citation publication-type="other" xlink:type="simple">Daubechies, I. (1992) Ten Lectures on Wavelets. The Society for Industrial and Applied Mathematics, Pennsylvania. https://doi.org/10.1137/1.9781611970104</mixed-citation></ref><ref id="scirp.101777-ref31"><label>31</label><mixed-citation publication-type="other" xlink:type="simple">Rajaraman, S., Silamut, K., Hossain, M.A., et al. (2018) Understanding the Learned Behavior of Customized Convolutional Neural Networks toward Malaria Parasite Detection in Thin Blood Smear Images. Journal of Medical Imaging, 5, Article ID: 034501. https://doi.org/10.1117/1.JMI.5.3.034501</mixed-citation></ref><ref id="scirp.101777-ref32"><label>32</label><mixed-citation publication-type="other" xlink:type="simple">Fletcher, R.H., Fletcher, S.W. and Fletcher, G.S. (2012) Clinical Epidemiology: The Essentials. 5th Edition, College Students Textbooks Pvt. Ltd., Huissen.</mixed-citation></ref></ref-list></back></article>