Engineering and Applied Science Letter
ISSN: 2617-9709 (Online) 2617-9695 (Print)
DOI: 10.30538/psrp-easl2020.0054
A holistic comparison between deep learning techniques to determine Covid-19 patients utilizing chest X-Ray images
Taki Hasan Rafi
Department of Electrical and Electronic Engineering, Ahsanullah University of Science and Technology, Dhaka-1208, Bangladesh.; takihasanrafi@gmail.com
Abstract
Keywords:
1. Introduction and Preliminaries
A novel Coronavirus or COVID-19 is an infectious ailment brought about by a recently distinguished infection which is known to be just transmitted through a set of all animals yet as of late influenced people too. Since December 2019, various instances of "obscure viral pneumonia" identified with a nearby Seafood Wholesale Market was accounted for in Wuhan City, China [1]. A Novel coronavirus fit for tainting people was officially affirmed on January 6, 2020 [2]. As indicated by nature, the spread of coronavirus ailment 2019 (COVID-19) is getting relentless and has just arrived at the important epidemiological measures for it to be announced a pandemic [3]. COVID-19 is an intense settled disease however it can likewise be lethal, with a 2% case casualty rate [4]. Like other coronaviral pneumonia, for example, a serious intense respiratory disorder brought about by coronavirus, COVID-19 can likewise prompt intense respiratory trouble condition [2].
There is a dire requirement for viable treatment. Current spotlight has been on the improvement of novel therapeutics, including antivirals and antibodies. Gathering proof recommends that a subgroup of patients with serious COVID-19 may have a cytokine storm condition [5]. The most widely recognized test method as of now utilized for COVID-19 determination is an ongoing converse interpretation polymerase chain response (RT-PCR) [6]. COVID-19 can cause intense heart injury. In the vast majority of the cases, the patients who have co-morbidity like diabetes, circulatory strain, coronary illness [7].
The side effects of these sicknesses resemble whatever another ordinary influenza which is a disadvantage of distinguishing the genuine influenced ones. The side effects can be demonstrated roughly in the middle of 14 days. As this COVID-19 is another infection for the clinical network, so still explicit treatment with respect to COVID-19 is difficult. There are some recognized side effects in regards to COVID-19, proposed by the World Health Organization (WHO). For example, high fever or mellow fever, hack, breathing problem, exhaustion, muscle or body throbs, migraine, loss of taste or smell, sore throat, clog or runny nose, spewing, diarrhoea. It straightforwardly influences the lung. X-Ray based images can assist us with knowing the lung condition so we can discover more COVID-19 cases as per the lung report. CT scan reports likewise can be utilized [8]. In spite of the fact that by far most of patients just have a typical, gentle type of sickness, around 15-20% of the patients fall into the serious gathering, which means they require helped oxygenation as a major aspect of treatment [9].
While it is about images-based problems, deep convolutional neural network can comprehend this all the more effectively these days. Deep neural-based frameworks can group images or related issues all the more precisely and productively by its condition of workmanship algorithmic strength. These are some enormous algorithms have been presented by deep learning researchers.
In this examination, author assesses the viability of cutting edge pre-trained convolutional neural systems proposed by established researchers, with respect to their mastery in the programmed analysis of COVID-19 from thoracic X-ray images. Author utilized pre-trained models, for example, DenseNet-121, ResNet-50, EfficientNet-B4 and base convolutional model. Our assessment is dependent on AUC.
Further part of this paper-situated as related works, dataset, methodology, results, discussions and conclusion.
2. Related works
The novel coronavirus is another new disease in the field of the clinical network. Clinical researchers, just as deep learning specialists, are attempting to determine this issue. The fundamental test is to distinguish COVID-19 cases in a less measure of time and minimal cost. So the AI research network has come up to handle this test all the more proficiently. Less measure of works has been done as such far. In this segment, we go over some past and effective works with respect to this challenge from AI specialists.Mangla et al., [10] have attempted to tackle COVID-19 case identification utilizing pre-prepared deep convolutional neural systems. Their model contains pre-prepared CheXNet, with a 121-layer Dense Convolutional Network (DenseNet) spine, trailed by a completely associated layer. They supplant CheXNet's last classifier of 14 classes with our characterization layer of 4 classes, each with a sigmoid actuation to deliver the last yield. They wound up with a consequence of AUROC 0.9994 and precision of 87.2% in 4 class grouping. They named their model as CovidAID. El Asnaou et al., [11] have attempted to discover a few inquiries in regards to COVID-19 early recognition utilizing deep learning methods. They executed a few generally utilized deep learning structures, for example, VGG16, VGG19, MobileNet V2, Resnet50, DenseNet201, Inception ResNet V2 and Inception V3 in X-Ray just as Ct-Scan images, where they infer that Inception ResNet V2 has performed superior to different architectures with a 92.18% accuray. Apostolopoulos et al., [12] have utilized pre-prepared deep learning models in their test. They tested in a dataset which contains 1427 X-Ray images, where 700 images are typical pneumonia, 224 images with affirmed Covid-19 cases and 504 images of ordinary conditions. They utilized MobileNet v2, VGG19, Inception, Xception and Inception ResNet v2 designs. Where VGG 19 has given the best yield 98.75% accuracy in 2-class order.
Abiyev et al., [13] conventional convolutional neural system to distinguish chest related ailment. They spoke to a correlation between the convolutional neural system, supervised back-propagation neural system and competitive neural system utilizing chest X-Ray images. Where the convolutional neural system has performed superior to different models. Abbas et al., [14] have actualized a tuned and altered deep neural system in X-ray images to distinguish COVID-19 cases all the more productively. They re-manufacture their model and named as DeTraC which contains 3 periods of layers. They built up this by utilizing ResNet-18 in backend and gets an accuracy of 95.12% in the X-Ray dataset. Rahimzadeh et al., [15] have actualized a connected of Xception and ResNet50V2 design to distinguish COVID-19 cases. In their trial, they utilized unbalanced X-Ray dataset. They observed numerous deep learning models look at the best result. The altered model which is a blend of Xception and ResNet50V2 has accomplished 91.40% accuracy on average. Naurin et al., [16] have executed convolutional neural systems, for example, Inception V3, Inception ResNetV3 and ResNet50 for the identification of COVID-19 cases by X-Ray images. They saw around 98% accuracy in pre-prepared ResNet50 model, which is higher than Inception V3 model.
Considering all references, author attempts to handle this continuous COVID-19 detection issue by various deep learning procedures. We executed EfficientNet-B4, ResNet-50, DenseNet-121 and base CNN model to legitimize which one performs better in this analysis.
3. Dataset
In this investigation, author has utilized and retrieved another arrangement of a dataset for the COVID-19 detection framework. It is accessible for theresearch community to battle against COVID-19 and quicken the exploration results. Later on, this dataset has been presented by Kaggle as an ongoing competition. The dataset contains a total of 5907 X-Ray images, where it has 5283 images for train purpose and 624 images for test purpose. It additionally has two classifications, for example, normal class and pneumonia class. Pneumonia class has four division, for example, SARS, COVID-19, ARDS and Streptococcus. The dataset can be downloaded from https://github.com/ieee8023/covid-chestxray-dataset. The sample of the dataset has appeared in Figure 1.Figure 1. (a) Samples of normal case X-Ray and (b) Samples of COVID-19 case X-Ray.
4. Methodology
In the dataset, 5907 images are utilized to investigate the examination, where the dataset has a few arrangements of images, for example, Normal, SARS, COVID-19, ARDS and Streptococcus. Be that as it may, in this analysis our primary goal to identify COVID cases. To identify all the more proficiently, in this manner author is going to build a model which separates between normal case, Pneumonia and COVID-19 cases. Author additionally lessen the classification number to two. Later on, author applied different pre-trained deep learning models, for instance, Base convolutional neural network, DenseNet-121, ResNet-50 and EfficientNet-B4 to distinguish COVID-19 cases and to locate the best exact outcome as indicated by the individual exhibitions.Figure 2. Block diagram of our methodology.
4.1. Data augmentation
Data augmentation is a strategy that enables practitioners to significantly increase the diversity of data available for training models, without actually collecting new data. Data augmentation techniques such as cropping, padding, and horizontal flipping are commonly used to train large neural networks.4.2. Base convolutional neural network
Convolutional neural networks are practically equivalent to conventional artificial neural networks in that they are included neurons that self-streamline through learning [17]. It fundamentally center around the premise that the info will be involved images. This centers the engineering to be set up in a manner to best suit the requirement for managing the particular sort of data. There are a few functionalities to explain convolutional neural network more briefly. As found in various kinds of artificial neural network, the information layer will hold the pixel estimations of the image. Convolutional layers will choose the yield of neurons of which are related to close-by regions of the commitment through the check of the scalar the thing between their heaps and the area related to the data volume.Figure 3. CNN Architecture.
The reviewed straight unit hopes to apply an elementwise inception capacity or initiation work, for instance, sigmoid to the yield of the sanctioning made by the past layer. Pooling layers will by then simply perform down-testing along the spatial the dimensionality of the given information, further reducing the amount of limits inside that incitation. completely associated layers will by then play out comparative commitments found in standard artificial neural networks and try to convey class scores from the institutions, to be used for portrayal. It is moreover suggested that ReLU may be used between these layers, as to improve execution. Zero-padding is the basic procedure of cushioning the outskirt of the information and is a compelling strategy to give further control with regards to the dimensionality of the yield volumes. Boundary sharing chips away at the supposition that in the event that one area highlight is helpful to register at a set spatial area, at that point, it is probably going to be valuable in another locale. In the event that we compel every individual initiation map inside the yield volume to similar loads and predisposition, at that point we will see a huge decrease in the quantity of boundaries being created by the convolutional layer [17].
4.3. Densely connected convolutional network
Densely connected convolutional network (DenseNet) interfaces each layer to each other layer in a feed-forward design [18], while convolutional neural systems with \(N\) layers have \(N\) associations, one between each layer and its ensuing layer. DenseNets have a few convincing points of interest: they lighten the vanishing gradient issue, fortify element spread, empower include reuse, and significantly diminish the quantity of boundaries. It has better accuracy than ResNet in object recognition [18]. DenseNets are worked from thick squares and pooling activities, where each thick square is an iterative connection of past element maps. This design can be viewed as an augmentation of ResNets [19], which performs an iterative summation of past component maps. In any case, this little change makes them intrigue suggestions such as, boundary proficiency, DenseNets are more efficient in the boundary use. Understood profound oversight, DenseNets perform profound management on account of short ways to all component maps in the design and highlight reuse, all layers can without much of a stretch access their first layers making it simple to reuse the data from recently figured element maps. The attributes of DenseNets make them an awesome fit for the semantic division as they normally actuate skip associations and multi-scale management. Fully connected DenseNets are worked from a downsampling way, an upsampling way and skip associations. Skip associations help the upsampling way recoup spatially point by point data from the downsampling way, by reusing highlights maps. The objective of our model is to additionally misuse the component reuse by broadening the more refined DenseNet engineering while at the same time maintaining a strategic distance from the element blast at the upsampling way of the system [18]. In this investigation, author has utilized pre-trained DenseNet-121 design to actualize in our dataset.4.4. Residual network
Residual network has been created and acquainted by Microsoft Research with handle image recognition all the more without any hurdle. ResNet has about 3.57% less error than VGGNet [19]. It has around 152 layers top to bottom, which is eight multiple times higher than VGGNet design. Its architecture has been inspired by VGGNets architecture. We are meaning the mapping as \(H(a)\), another non-direct mapping can be communicated as \(F(a) = H(a)- a\), the primary mapping can be communicated as \(F(a) + a\). We receive lingering figuring out how to each couple of stacked layers. The structure square can be defined as [19]:Figure 4. ResNet block diagram.
4.5. EficientNet
EfficientNet has been created and presented by Mingxing Tan, staff software engineer at Google. EfficientNet is a systematical model scaling and distinguishes that cautiously adjusting system profundity, width, and goals can prompt better performance [20]. It is propelled by ResNet and MobileNet, and scaling up or down to legitimize better exactness. This is a compound demonstrating framework. There are numerous approaches to scale a ConvNet for various asset limitations: ResNet [19] can be downsized for example ResNet-18 or up e.g., ResNet-200 by altering system profundity or the quantities of layers. A convolutional neural layer can be detailed as \(B_i\) = \(F_i(A_i)\), where, \(B_i\) is the yield tensor, \(A_i\) is the information tensor and \(F_i\) is the employable capacity. Tensor shape \((X_i, Y_i, Z_i)\) where \(X_i\) and \(Y_i\) are spatial measurements and \(Z_i\) is the channel measurement. There are three significant boundaries to consider for scaling reason, for example, profundity, width and goals. Scaling system profundity is the most well-known way utilized by numerous convolutional systems. The instinct is that more profound convolutional systems can catch more extravagant and progressively complex highlights, and sum up well on new errands. Notwithstanding, more profound systems are additionally progressively hard to prepare because of the disappearing inclination issue. The genuine errand of this model is to scaling the profundity, width and goals all the more productively to change the assignment prerequisite, goals with a lot of fixed scaling coefficients. For instance, on the off chance that we need to utilize \(2^n\) times progressively computational assets, at that point we can just build the system profundity by \(\alpha^n\), width by \(\beta^n\), and image size by \(\gamma^n\), where \(\alpha, \beta, \gamma\) are consistent coefficients controlled by a little lattice search on the first little model. In this analysis, author has utilized pre-trained EfficientNet-B4 model for COVID-19 for detecting purpose.4.6. Activation function
Actuation functions are numerical conditions that decide the yield of a neural system. The capacity is appended to every neuron in the system and decides if it ought to be actuated or not, founded on whether every neuron's input is significant for the model's expected output. In this investigation, we utilized two individual activation functions, for example, Sigmoid [21] and ReLU [22] activation function. We utilized Sigmoid in Base CNN model and ReLU in rest of different models as initiation work.- Sigmoid: The sigmoid activation function is here and there alluded to as the strategic capacity or crushing capacity in some literatures [21]. The Sigmoid is a non-direct enactment work utilized for the most part in feedforward neural systems.
\begin{equation} \text{Sigmoid} = [\frac{1}{(1+ exp ^-x)}]^-1.4\,. \end{equation}(3)
- ReLU: Rectified Linear Units (ReLU) as the arrangement work in a deep neural system [22]. Customarily, ReLU is utilized as an actuation work in deep neural systems, with Softmax work as their arrangement work. It works by thresholding values at 0, model \(f (a) = max(0, a)\). Basically, it yields 0 when \(a < 0\), and then again, it yields a straight capacity when \(a 0\). ReLU is not just as an activation function in each concealed layer of a neural system yet additionally as the grouping capacity at the last layer of a system. Thus, the anticipated class for ReLU classifier would be \(\hat{b}\) [22],
\begin{equation} \hat{b} = \text{arg max}_{i=1,.....,N} (0,0). \end{equation}(4)
4.7. Optimizer
The optimizer is a robust algorithm that helps to reduce the loss of a deep neural system by changing some attributes such as learning rate and changing weight and enhance the overall performance of the system. An optimizer can improve the performance of a neural system. It is essential to use an optimizer to reduce loss functions. In this experiment, author used two extensively used optimizers such as Stochastic Gradient Descent and Adam. We used Adam in EfficientNet-B4 algorithm and ResNet-50, on the other hand, author used Stochastic Gradient Descent (SGD) in DenseNet-121 and Base CNN model.- Stochastic gradient descent: Stochastic Gradient Descent is widely utilized optimizer, much of the time, it is utilized in traditional CNN model to streamline [23]. It is an updated form of Batch SGD. SGD gets rid of this repetition by performing each update in turn. It is subsequently generally a lot quicker and can likewise be utilized to learn on the web. SGD performs visit refreshes with a high change that prompt the target capacity to change intensely.
- Adam: Adam is an extensively utilized optimizer, which is 1st order gradient-based optimizer. It is a strategy for proficient stochastic advancement that just requires first-request slopes with little memory prerequisite. The technique registers individual versatile taking in rates for various boundaries from evaluations of first and second snapshots of the angles, the name Adam is gotten from versatile second estimation [24]. This can be formulated as:
\begin{equation} x_1 = (1 - \beta_1) \sum_{i=0}^t \beta_1^{t-1} y_1. \end{equation}(5)
5. Performance matrix
We evaluated our models by AUC, accuracy, precision, specificity and sensitivity.6. Results and discussions
In this analysis, author utilized a few convolutional neural system models to empower better outcome. Author executed EfficientNet-B4, ResNet-50, DenseNet-121 and Base convolutional neural Network model to identify COVID-19 cases all the more proficiently. There are a few works have been done before in an exceptionally brief timeframe to handle this obstacle by colossal scientists. Different specialists have created gathering algorithms for identifying COVID-19 cases.In this examination, EfficientNet-B4 has performed better. It has 98.86% accuracy and 0.996 AUC. Different models, for example, ResNet-50, DenseNet-121 and Base CNN have additionally performed well. Author set the epochs to 20 in every examination. Be that as it may, Base CNN has the most reduced accuracy of 84.50% where ResNet-50 has 97.31% and DenseNet-121 has an accuracy of 96.50%. Author executed the sigmoid activation function in Base CNN model and ReLU activation function in the remainder of the models. Then again, Author utilized stochastic gradient descent (SGD) optimizer in Base CNN model and DenseNet-121 model. Author also likewise utilized Adam optimizer in EfficientNet-B4 and ResNet-50 model. Table 1 has demonstrated the presentation examination of each algorithm. Figure 5, has shown performances of every algorithm with AUC, training loss, validation loss and validation AUC, and Figure 6, we have shown EfficientNet-B4's model accuracy by increasing epochs and model loss in order to increase epochs.
Table 1. Performance analysis of every algorithm (%).
Algorithm | AUC | Accuracy | Specificity | Sensitivity |
---|---|---|---|---|
Base CNN Model | 0.762 | 84.50 | 81.43 | 88.29 |
DenseNet-121 | 0.874 | 96.50 | 92.66 | 93.28 |
ResNet-50 | 0.967 | 97.31 | 97.78 | 96.12 |
EfficientNet-B4 | 0.997 | 98.87 | 99.46 | 98.77 |
Figure 5. (a) Base CNN performance graph; (b) ResNet152 performance graph; (c) DenseNet121 performance graph and (d) EfficientNetB4 performance graph.
Figure 6. (a) Model accuracy of EfficientNetB4 model and (b) Model loss of EfficientNetB4 model.
Finally, because of excellent outcome in chest X-Ray images, we propose and concoct the choice that EfficientNet-B4 model can be utilized for additional identifying limit with regards to clinical network to handle this hardest time for the world.
7. Conclusion
Conclusion of COVID-19 is basic to follow the influenced individuals and limit the transmission as it is a viral disease. RT-PCR method is costly and needs more time to detect COVID-19 cases. These days medical images preparing is one of the fundamental assignments for the scientists to foresee or recognize infections all the more productively. So as referenced, we attempted to handle COVID-19 discovery issue by utilizing medical images, all the more explicitly chest X-Ray images. In this investigation, author has actualized a few pre-tranied deep convolutional neural networks, for example, Base CNN, DenseNet-121, ResNet-50 and EfficientNet-B4 in chest X-Ray dataset. Convolutional neural systems can think of an effective and powerful result than any conceivable way. In this investigation, EfficientNet-B4 has performed superior to different models with an accuracy of 98.86% just as containing higher AUC of 0.996. Then again, ResNet-50 has likewise performed well with accuracy and AUC of 97.31% and 0.967 separately. Different models, for example, DenseNet-121 and Base CNN have an accuracy of 96.50% and 84.50%. So in rundown, author would recommend and proposed EfficientNet-B4 for additional headway of distinguishing COVID-19 cases by utilizing X-Ray images. For future improvement, a more sophisticated dataset is needed with more amount of images to train our model for better outcomes.Conflicts of interest
The author declares no conflict of interest.References
- Ai, T., Yang, Z., Hou, H., Zhan, C., Chen, C., Lv, W.& Xia, L. (2020). Correlation of chest CT and RT-PCR testing in coronavirus disease 2019 (COVID-19) in China: A report of 1014 cases. Radiology, 296(2), 1-8. [Google Scholor]
- Pan, F., Ye, T., & Sun, P. (2020). Time course of lung changes on chest CT during recovery from 2019 novel coronavirus (COVID-19) pneumonia [e-pub ahead of print]. Radiology, 295(3), 715-721.[Google Scholor]
- Remuzzi, A., & Remuzzi, G. (2020). COVID-19 and Italy: what next?. The Lancet, 395(10231), 1225-1228. [Google Scholor]
- Xu, Z., Shi, L., Wang, Y., Zhang, J., Huang, L., Zhang, C. & Tai, Y. (2020). Pathological findings of COVID-19 associated with acute respiratory distress syndrome. The Lancet Respiratory Medicine, 8(4), 420-422. [Google Scholor]
- Mehta, P., McAuley, D. F., Brown, M., Sanchez, E., Tattersall, R. S., Manson, J. J., & HLH Across Speciality Collaboration. (2020). COVID-19: consider cytokine storm syndromes and immunosuppression. Lancet (London, England), 395(10229), 1033-1034. [Google Scholor]
- Ozturk, T., Talo, M., Yildirim, E. A., Baloglu, U. B., Yildirim, O., & Acharya, U. R. (2020). Automated detection of COVID-19 cases using deep neural networks with X-ray images. Computers in Biology and Medicine, 121, 103792. [Google Scholor]
- Zheng, Y. Y., Ma, Y. T., Zhang, J. Y., & Xie, X. (2020). COVID-19 and the cardiovascular system. Nature Reviews Cardiology, 17(5), 259-260. [Google Scholor]
- Xu, X., Jiang, X., Ma, C., Du, P., Li, X., Lv, S., ... & Lang, G. (2020). A deep learning system to screen novel coronavirus disease 2019 pneumonia. Engineering, 6(10), 1122-1129. [Google Scholor]
- Li, X. K., Ji, W. J., Zhao, J., Wang, S. J., & Au, C. T. (2005). Ammonia decomposition over Ru and Ni catalysts supported on fumed SiO2, MCM-41, and SBA-15. Journal of Catalysis, 236(2), 181-189. [Google Scholor]
- Mangal, A., Kalia, S., Rajgopal, H., Rangarajan, K., Namboodiri, V., Banerjee, S., & Arora, C. (2020). CovidAID: COVID-19 Detection Using Chest X-Ray. ArXiv Preprint, arXiv:2004.09803. [Google Scholor]
- Elasnaoui, K., & Chawki, Y. (2020). Using X-ray images and deep learning for automated detection of coronavirus disease. Journal of Biomolecular Structure and Dynamics, (just-accepted), 1-22. [Google Scholor]
- Apostolopoulos, I. D., & Mpesiana, T. A. (2020). Covid-19: automatic detection from x-ray images utilizing transfer learning with convolutional neural networks. Physical and Engineering Sciences in Medicine, 43, 635-640. [Google Scholor]
- Abiyev, R. H., & Ma'aitah, M. K. S. (2018). Deep convolutional neural networks for chest diseases detection. Journal of Healthcare Engineering, 2018, Article ID 4168538.[Google Scholor]
- Abbas, A., Abdelsamea, M. M., & Gaber, M. M. (2020). Classification of COVID-19 in chest X-ray images using DeTraC deep convolutional neural network. ArXiv Preprint, arXiv:2003.13815. [Google Scholor]
- Rahimzadeh, M., & Attar, A. (2020). A modified deep convolutional neural network for detecting COVID-19 and pneumonia from chest X-ray images based on the concatenation of Xception and ResNet50V2. Informatics in Medicine Unlocked, 19, 100360.[Google Scholor]
- Narin, A., Kaya, C., & Pamuk, Z. (2020). Automatic detection of coronavirus disease (covid-19) using x-ray images and deep convolutional neural networks. ArXiv Preprint, arXiv:2003.10849. [Google Scholor]
- O'Shea, K., & Nash, R. (2015). An introduction to convolutional neural networks. ArXiv Preprint, arXiv:1511.08458. [Google Scholor]
- Yan, Q., Yang, L., Zhang, L., & Xiao, C. (2017). Distinguishing the indistinguishable: Exploring structural ambiguities via geodesic context. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (pp. 3836-3844). [Google Scholor]
- He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, (pp. 770-778). [Google Scholor]
- Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C., & Liu, C. (2018, October). A survey on deep transfer learning. In International conference on artificial neural networks (pp. 270-279). Springer, Cham. [Google Scholor]
- Khan, A. I., Shah, J. L., & Bhat, M. M. (2020). Coronet: A deep neural network for detection and diagnosis of COVID-19 from chest x-ray images. Computer Methods and Programs in Biomedicine, 105581.[Google Scholor]
- Agarap, A. F. (2018). Deep learning using rectified linear units (relu). ArXiv Preprint, arXiv:1803.08375. [Google Scholor]
- Ruder, S. (2016). An overview of gradient descent optimization algorithms. ArXiv Preprint, arXiv:1609.04747. [Google Scholor]
- Kingma, D. P., & Ba, J. (2014). Adam: A method for stochastic optimization. ArXiv Preprint, arXiv:1412.6980. [Google Scholor]