Effects of data count and image scaling on Deep Learning training
Background Deep learning using convolutional neural networks (CNN) has achieved significant results in various fields that use images. Deep learning can automatically extract features from data, and CNN extracts image features by convolution processing. We assumed that increasing the image size usin...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
PeerJ Inc.
2020-11-01
|
Series: | PeerJ Computer Science |
Subjects: | |
Online Access: | https://peerj.com/articles/cs-312.pdf |
id |
doaj-15402d9be32648cfa1e8df7729cff223 |
---|---|
record_format |
Article |
spelling |
doaj-15402d9be32648cfa1e8df7729cff2232020-11-25T04:00:35ZengPeerJ Inc.PeerJ Computer Science2376-59922020-11-016e31210.7717/peerj-cs.312Effects of data count and image scaling on Deep Learning trainingDaisuke Hirahara0Eichi Takaya1Taro Takahara2Takuya Ueda3Department of AI Research Lab, Harada Academy, Kagoshima, Kagoshima, JapanSchool of Science for Open and Environmental Systems, Graduate School of Science and Technology, Keio University, Yokohama, Kanagawa, JapanDepartment of Biological Engineering, School of Engineering, Tokai University, Isehara, Kanagawa, JapanDepartment of Clinical Imaging, Graduate School of Medicine, Tohoku University, Sendai, JapanBackground Deep learning using convolutional neural networks (CNN) has achieved significant results in various fields that use images. Deep learning can automatically extract features from data, and CNN extracts image features by convolution processing. We assumed that increasing the image size using interpolation methods would result in an effective feature extraction. To investigate how interpolation methods change as the number of data increases, we examined and compared the effectiveness of data augmentation by inversion or rotation with image augmentation by interpolation when the image data for training were small. Further, we clarified whether image augmentation by interpolation was useful for CNN training. To examine the usefulness of interpolation methods in medical images, we used a Gender01 data set, which is a sex classification data set, on chest radiographs. For comparison of image enlargement using an interpolation method with data augmentation by inversion and rotation, we examined the results of two- and four-fold enlargement using a Bilinear method. Results The average classification accuracy improved by expanding the image size using the interpolation method. The biggest improvement was noted when the number of training data was 100, and the average classification accuracy of the training model with the original data was 0.563. However, upon increasing the image size by four times using the interpolation method, the average classification accuracy significantly improved to 0.715. Compared with the data augmentation by inversion and rotation, the model trained using the Bilinear method showed an improvement in the average classification accuracy by 0.095 with 100 training data and 0.015 with 50,000 training data. Comparisons of the average classification accuracy of the chest X-ray images showed a stable and high-average classification accuracy using the interpolation method. Conclusion Training the CNN by increasing the image size using the interpolation method is a useful method. In the future, we aim to conduct additional verifications using various medical images to further clarify the reason why image size is important.https://peerj.com/articles/cs-312.pdfImage scaling NearestBilinear Hamming BicubicLanczos |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Daisuke Hirahara Eichi Takaya Taro Takahara Takuya Ueda |
spellingShingle |
Daisuke Hirahara Eichi Takaya Taro Takahara Takuya Ueda Effects of data count and image scaling on Deep Learning training PeerJ Computer Science Image scaling Nearest Bilinear Hamming Bicubic Lanczos |
author_facet |
Daisuke Hirahara Eichi Takaya Taro Takahara Takuya Ueda |
author_sort |
Daisuke Hirahara |
title |
Effects of data count and image scaling on Deep Learning training |
title_short |
Effects of data count and image scaling on Deep Learning training |
title_full |
Effects of data count and image scaling on Deep Learning training |
title_fullStr |
Effects of data count and image scaling on Deep Learning training |
title_full_unstemmed |
Effects of data count and image scaling on Deep Learning training |
title_sort |
effects of data count and image scaling on deep learning training |
publisher |
PeerJ Inc. |
series |
PeerJ Computer Science |
issn |
2376-5992 |
publishDate |
2020-11-01 |
description |
Background Deep learning using convolutional neural networks (CNN) has achieved significant results in various fields that use images. Deep learning can automatically extract features from data, and CNN extracts image features by convolution processing. We assumed that increasing the image size using interpolation methods would result in an effective feature extraction. To investigate how interpolation methods change as the number of data increases, we examined and compared the effectiveness of data augmentation by inversion or rotation with image augmentation by interpolation when the image data for training were small. Further, we clarified whether image augmentation by interpolation was useful for CNN training. To examine the usefulness of interpolation methods in medical images, we used a Gender01 data set, which is a sex classification data set, on chest radiographs. For comparison of image enlargement using an interpolation method with data augmentation by inversion and rotation, we examined the results of two- and four-fold enlargement using a Bilinear method. Results The average classification accuracy improved by expanding the image size using the interpolation method. The biggest improvement was noted when the number of training data was 100, and the average classification accuracy of the training model with the original data was 0.563. However, upon increasing the image size by four times using the interpolation method, the average classification accuracy significantly improved to 0.715. Compared with the data augmentation by inversion and rotation, the model trained using the Bilinear method showed an improvement in the average classification accuracy by 0.095 with 100 training data and 0.015 with 50,000 training data. Comparisons of the average classification accuracy of the chest X-ray images showed a stable and high-average classification accuracy using the interpolation method. Conclusion Training the CNN by increasing the image size using the interpolation method is a useful method. In the future, we aim to conduct additional verifications using various medical images to further clarify the reason why image size is important. |
topic |
Image scaling Nearest Bilinear Hamming Bicubic Lanczos |
url |
https://peerj.com/articles/cs-312.pdf |
work_keys_str_mv |
AT daisukehirahara effectsofdatacountandimagescalingondeeplearningtraining AT eichitakaya effectsofdatacountandimagescalingondeeplearningtraining AT tarotakahara effectsofdatacountandimagescalingondeeplearningtraining AT takuyaueda effectsofdatacountandimagescalingondeeplearningtraining |
_version_ |
1724449584965484544 |