MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL

Subject of Research. The paper deals with methods and algorithms for mutual transformation of related pairs of images in order to enhance the capabilities of cross-modal multimedia retrieval (CMMR) technologies. We have thoroughly studied the problem of mutual transformation of face images of variou...

Bibliographic Details
Main Authors: G. A. Kukharev, Y. N. Matveev, A. L. Oleinik
Format: Article
Language: English
Published: Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University) 2017-01-01
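Series: Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki
Subjects: cross-modal multimedia retrieval; principal component analysis; face images; sketch; facial composite
Online Access: http://ntv.ifmo.ru/file/article/16407.pdf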
id doaj-c8e4a01a7beb4951987e07f998f937e5
record_format Article
spelling doaj-c8e4a01a7beb4951987e07f998f937e5
2020-11-24T20:59:46Z
eng
Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)
Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki
ISSN 2226-1494, 2500-0373
2017-01-01
vol. 17, no. 1, pp. 62-74
DOI 10.17586/2226-1494-2017-17-1-62-74
MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL
G. A. Kukharev: D.Sc., Professor, Westpomeranian University of Technology, 70-310, Szczecin, Poland
Y. N. Matveev: D.Sc., Chief Scientific Officer, Head of the Chair, “STC-Innovation”, Saint Petersburg, 196084, Russian Federation; ITMO University, Saint Petersburg, 197101, Russian Federation
A. L. Oleinik: postgraduate, ITMO University, Saint Petersburg, 197101, Russian Federation
collection DOAJ
language English
format Article
sources DOAJ
author G. A. Kukharev
Y. N. Matveev
A. L. Oleinik
title MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL
publisher Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)
series Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki
issn 2226-1494
2500-0373
publishDate 2017-01-01
description Subject of Research. The paper deals with methods and algorithms for the mutual transformation of related pairs of images, with the aim of enhancing the capabilities of cross-modal multimedia retrieval (CMMR) technologies. We have thoroughly studied the problem of mutual transformation of face images of various kinds (e.g. photos and drawn pictures). This problem arises frequently in practice, and research in this area is based on existing datasets. Owing to their unified mathematical specification, the algorithms proposed in this paper can be applied to arbitrary pairs of related images. Method. We present three image transformation algorithms. The first is based on principal component analysis and the Karhunen-Loève transform (1DPCA/1DKLT). Unlike the existing solution, it does not use the training set during the transformation process. The second algorithm involves generating an image population. The third algorithm performs the transformation using two-dimensional principal component analysis and the Karhunen-Loève transform (2DPCA/2DKLT). Main Results. Experiments on image transformation and population generation have revealed the main features of each algorithm. The first algorithm allows the construction of an accurate and stable model of the transition between two given sets of images. The second algorithm can be used to add new images to existing databases, and the third algorithm is capable of performing the transformation on images outside the training dataset. Practical Relevance. Taking into account the properties of the proposed algorithms, we provide recommendations concerning their application. Possible scenarios include the construction of a transition model for related pairs of images, mutual transformation of images inside and outside the dataset, and population generation to increase the representativeness of existing datasets. Thus, the proposed algorithms can be used to improve the reliability of face recognition performed on images of various kinds. Moreover, these techniques can be applied to a wide variety of other CMMR problems.
topic cross-modal multimedia retrieval
principal component analysis
face images
sketch
facial composite
url http://ntv.ifmo.ru/file/article/16407.pdf