Improving the Performance of an Image Classification with Convolutional Neural Network Model by Using Image Augmentations Technique

Main Article Content

Pimpa Cheewaprakobkit

Abstract

        Image classification is one of the challenges for both humans and computers because it is a critical task. Researchers try to solve this task in many ways. In neural networks, Convolutional Neural Network (CNN) has been widely used for images recognition, images classifications, and objects detections. The process of CNN model includes training and testing. This paper aims to improve the performance of an image classification with Convolutional Neural Network model by using image augmentation technique and compare the accuracy of each technique. The dataset that we used in this paper was collected from Canadian Institute for Advanced Research (CIFAR-10) dataset which consists of a total of 60,000 images. The color images are of size 32x32 in 10 different classes. In the first stage, the 10- fold cross validation method is used to divide 50,000 images for training 300 rounds and 10,000 images for testing. In the second stage, in order to create artificial images, 10,000 images were randomly-selected from all categories and applied the image augmentation technique such as color adjustment, rotation, zoom, or flip image to the image. This image augmentations technique were randomly selected only one technique and applied to one image. The training images were 60,000 images in total; 50,000 images from the first stage plus 10,000 new images. Then the computer trained the images for 300 rounds. The evaluation shows that after using image augmentations technique by creating artificial images, the accuracy of Convolutional Neural Network model performs better from 84.79% up to 87.57%

Article Details

Section
Research Article

References

N. Dalal, and B. Triggs, “HISTOGRAMS OF ORIENTED GRADIENTS FOR HUMAN DETECTION,” IEEE Computer Society Conference On Computer Vision And Pattern Recognition, vol. 1, pp. 886–893, 2005.

M. A. Tanner, and W. H. Wong, “The Calculation of Posterior Distributions By Data Augmentation,” Journal Of The American Statistical Association, vol. 82 NO. 398, pp.528–540, 1987.

G. E. Hinton, S. Osindero, and Y.W. Teh, “A FAST LEARNING ALGORITHM FOR DEEP BELIEF NETS,” NEURAL COMPUTATION, vol.18, no.7, pp.1527 –1554, 2006.

Z. Zafrulla, H. Brashear, n T. Starner, H. Hamilton, and P. Presti, “American Sign Language Recognition With the Kinect,” ICMI '11 Proceedings of the 13th international conference on multimodal interfaces, pp. 279-286, 2011.

T. Siriborvornratanakul, “Five questions with Deep Learning: Automatic cucumber sorting system from pictures @Cucumber Farm in Japan (Part 2/2).”[Online]. Available: https://mgronline.com/daily/detail/9590000091327 [Accessed: 20-Dec-2018].

Z. Hussain, F. Gimenez, D. Yi and D. Rubin. “Differential Data Augmentation Techniques for Medical Imaging Classification Tasks,” AMIA Annual Symposium Proceedings Archive, pp. 979-984, 2018.

C. Niyomthum, “What is CNN?.” [Online]. Available: https://medium.com/@thebear19/neural-network-101-cnn-with-tensorflow-fd5d515e979b. [Accessed: 04-Oct-2018].

Mc. Ai, “Let's see how CNN Thinks!!!.” [Online]. Available: https://mc.ai/มาลองดูวิธีการคิดของ-cnn-ก/. [Accessed: 10-Nov-2018].

M. D. Bloice, C. Stocker, and A. Holzinger, “Augmentor: An Image Augmentation Library for Machine Learning,” The Journal of Open Source Software, vol.2. pp. 1-5, 2017.

C. Shorten, “Image Augmentation Examples in Python.” [Online]. Available: https://towardsdatascience.com/image-augmentation-examples-in-python-d552c26f2873. [Accessed: 10-Nov-2018].

P. Bee, “How to install Keras + Tensorflow (GPU version) on windows 10.” [Online]. Available: https://medium.com/boobeejung/วิธีการติดตั้ง-keras-tensorflow-gpu-version-บน-windows-10-บทความนี้เขียนขึ้นเพื่อหลายๆ-คนที่กำ-e8e3f5105baa. [Accessed: 04-Oct-2018].

K. Jarrett, K. Kavukcuoglu, M. Ranzato and Y. LeCun, “What Is The Best Multi-Stage Architecture for Object Recognition?,” in 2009 IEEE 12th International Conference On Computer Vision, pp. 2146–2153, 2009.