Development of a combined image recognition model
The object of research is the processes of identification and classification of objects in computer vision tasks. Currently, for the recognition of images, the best results are demonstrated by artificial neural networks. However, learning neural networks is a poorly conditioned task. Poor conditioni...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
PC Technology Center
2019-06-01
|
Series: | Tehnologìčnij Audit ta Rezervi Virobnictva |
Subjects: | |
Online Access: | http://journals.uran.ua/tarp/article/view/173122 |
id |
doaj-426262b8ff0b4faf846573d6b01325b0 |
---|---|
record_format |
Article |
spelling |
doaj-426262b8ff0b4faf846573d6b01325b02020-11-25T01:57:17ZengPC Technology CenterTehnologìčnij Audit ta Rezervi Virobnictva2226-37802312-83722019-06-0132(47)91410.15587/2312-8372.2019.173122173122Development of a combined image recognition modelMykola Voloshyn0Pukhov Institute for Modelling in Energy Engineering of National Academy of Sciences of Ukraine, 15, Henerala Naumova str., Kyiv, Ukraine, 03164The object of research is the processes of identification and classification of objects in computer vision tasks. Currently, for the recognition of images, the best results are demonstrated by artificial neural networks. However, learning neural networks is a poorly conditioned task. Poor conditioning means that even a large data set can carry a small amount of information about a problem that is being solved. Therefore, a key role in the synthesis of parameters of a specific mathematical model of a neural network belongs to educational data. Selection of a representative training set is one of the most difficult tasks in machine learning and is not always possible in practice. The new combined model of image recognition using the non-force interaction theory proposed in the paper has the following key features: – designed to handle large amounts of data; – selects useful information from an arbitrary stream; – allows to naturally add new objects; – tolerant of errors and allows to quickly reprogram the behavior of the system. Compared to existing analogues, the recognition accuracy of the proposed model in all experimental studies was higher than the known recognition methods. The average recognition accuracy of the proposed model was 71.3 %; using local binary patterns – 59.9 %; the method of analysis of the main components – 65.2 %; by the method of linear discriminant analysis – 65.6 %. Such recognition accuracy in combination with computational complexity makes this method acceptable for use in systems operating in conditions close to real time. Also, this approach allows to manage the recognition accuracy. This is achieved by adjusting the number of sectors of the histograms of local binary patterns that are used in the description of images and the number of image fragments used in the classification stage by the introformation approach. To a large extent, the number of image fragments affects the time of classification, since in this case, it is necessary to calculate the matching of the system actions in each of the possible directions in pairs.http://journals.uran.ua/tarp/article/view/173122computer vision systemsimage analysisobject recognition and identification |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mykola Voloshyn |
spellingShingle |
Mykola Voloshyn Development of a combined image recognition model Tehnologìčnij Audit ta Rezervi Virobnictva computer vision systems image analysis object recognition and identification |
author_facet |
Mykola Voloshyn |
author_sort |
Mykola Voloshyn |
title |
Development of a combined image recognition model |
title_short |
Development of a combined image recognition model |
title_full |
Development of a combined image recognition model |
title_fullStr |
Development of a combined image recognition model |
title_full_unstemmed |
Development of a combined image recognition model |
title_sort |
development of a combined image recognition model |
publisher |
PC Technology Center |
series |
Tehnologìčnij Audit ta Rezervi Virobnictva |
issn |
2226-3780 2312-8372 |
publishDate |
2019-06-01 |
description |
The object of research is the processes of identification and classification of objects in computer vision tasks. Currently, for the recognition of images, the best results are demonstrated by artificial neural networks. However, learning neural networks is a poorly conditioned task. Poor conditioning means that even a large data set can carry a small amount of information about a problem that is being solved. Therefore, a key role in the synthesis of parameters of a specific mathematical model of a neural network belongs to educational data. Selection of a representative training set is one of the most difficult tasks in machine learning and is not always possible in practice.
The new combined model of image recognition using the non-force interaction theory proposed in the paper has the following key features:
– designed to handle large amounts of data;
– selects useful information from an arbitrary stream;
– allows to naturally add new objects;
– tolerant of errors and allows to quickly reprogram the behavior of the system.
Compared to existing analogues, the recognition accuracy of the proposed model in all experimental studies was higher than the known recognition methods. The average recognition accuracy of the proposed model was 71.3 %; using local binary patterns – 59.9 %; the method of analysis of the main components – 65.2 %; by the method of linear discriminant analysis – 65.6 %. Such recognition accuracy in combination with computational complexity makes this method acceptable for use in systems operating in conditions close to real time. Also, this approach allows to manage the recognition accuracy. This is achieved by adjusting the number of sectors of the histograms of local binary patterns that are used in the description of images and the number of image fragments used in the classification stage by the introformation approach. To a large extent, the number of image fragments affects the time of classification, since in this case, it is necessary to calculate the matching of the system actions in each of the possible directions in pairs. |
topic |
computer vision systems image analysis object recognition and identification |
url |
http://journals.uran.ua/tarp/article/view/173122 |
work_keys_str_mv |
AT mykolavoloshyn developmentofacombinedimagerecognitionmodel |
_version_ |
1724974994962776064 |