MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL
Subject of Research. The paper deals with methods and algorithms for mutual transformation of related pairs of images in order to enhance the capabilities of cross-modal multimedia retrieval (CMMR) technologies. We have thoroughly studied the problem of mutual transformation of face images of variou...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)
2017-01-01
|
Series: | Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki |
Subjects: | |
Online Access: | http://ntv.ifmo.ru/file/article/16407.pdf |
id |
doaj-c8e4a01a7beb4951987e07f998f937e5 |
---|---|
record_format |
Article |
spelling |
doaj-c8e4a01a7beb4951987e07f998f937e52020-11-24T20:59:46ZengSaint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki2226-14942500-03732017-01-01171627410.17586/2226-1494-2017-17-1-62-74MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVALG. A. Kukharev0Y. N. Matveev1A. L. Oleinik2D.Sc., Professor, Westpomeranian University of Technology, 70-310, Szczecin, PolandD.Sc., Chief Scientific Officer, Head of the Chair, “STC-Innovation”, Saint Petersburg, 196084, Russian Federation; ITMO University, Saint Petersburg, 197101, Russian Federationpostgraduate, ITMO University, Saint Petersburg, 197101, Russian FederationSubject of Research. The paper deals with methods and algorithms for mutual transformation of related pairs of images in order to enhance the capabilities of cross-modal multimedia retrieval (CMMR) technologies. We have thoroughly studied the problem of mutual transformation of face images of various kinds (e.g. photos and drawn pictures). This problem is widely represented in practice. Research is this area is based on existing datasets. The algorithms we have proposed in this paper can be applied to arbitrary pairs of related images due to the unified mathematical specification. Method. We have presented three image transformation algorithms. The first one is based on principal component analysis and Karhunen-Loève transform (1DPCA/1DKLT). Unlike the existing solution, it does not use the training set during the transformation process. The second algorithm assumes generation of an image population. The third algorithm performs the transformation based on two-dimensional principal component analysis and Karhunen-Loève transform (2DPCA/2DKLT). Main Results. The experiments on image transformation and population generation have revealed the main features of each algorithm. The first algorithm allows construction of an accurate and stable model of transition between two given sets of images. The second algorithm can be used to add new images to existing bases and the third algorithm is capable of performing the transformation outside the training dataset. Practical Relevance. Taking into account the qualities of the proposed algorithms, we have provided recommendations concerning their application. Possible scenarios include construction of a transition model for related pairs of images, mutual transformation of the images inside and outside the dataset as well as population generation in order to increase representativeness of existing datasets. Thus, the proposed algorithms can be used to improve reliability of face recognition performed on images of various kinds. Moreover, these techniques can be applied to address a wide variety of other CMMR problems.http://ntv.ifmo.ru/file/article/16407.pdfcross-modal multimedia retrievalprincipal component analysisface imagessketchfacial composite |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
G. A. Kukharev Y. N. Matveev A. L. Oleinik |
spellingShingle |
G. A. Kukharev Y. N. Matveev A. L. Oleinik MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki cross-modal multimedia retrieval principal component analysis face images sketch facial composite |
author_facet |
G. A. Kukharev Y. N. Matveev A. L. Oleinik |
author_sort |
G. A. Kukharev |
title |
MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL |
title_short |
MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL |
title_full |
MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL |
title_fullStr |
MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL |
title_full_unstemmed |
MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL |
title_sort |
mutual image transformation algorithms for visual information processing and retrieval |
publisher |
Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University) |
series |
Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki |
issn |
2226-1494 2500-0373 |
publishDate |
2017-01-01 |
description |
Subject of Research. The paper deals with methods and algorithms for mutual transformation of related pairs of images in order to enhance the capabilities of cross-modal multimedia retrieval (CMMR) technologies. We have thoroughly studied the problem of mutual transformation of face images of various kinds (e.g. photos and drawn pictures). This problem is widely represented in practice. Research is this area is based on existing datasets. The algorithms we have proposed in this paper can be applied to arbitrary pairs of related images due to the unified mathematical specification. Method. We have presented three image transformation algorithms. The first one is based on principal component analysis and Karhunen-Loève transform (1DPCA/1DKLT). Unlike the existing solution, it does not use the training set during the transformation process. The second algorithm assumes generation of an image population. The third algorithm performs the transformation based on two-dimensional principal component analysis and Karhunen-Loève transform (2DPCA/2DKLT). Main Results. The experiments on image transformation and population generation have revealed the main features of each algorithm. The first algorithm allows construction of an accurate and stable model of transition between two given sets of images. The second algorithm can be used to add new images to existing bases and the third algorithm is capable of performing the transformation outside the training dataset. Practical Relevance. Taking into account the qualities of the proposed algorithms, we have provided recommendations concerning their application. Possible scenarios include construction of a transition model for related pairs of images, mutual transformation of the images inside and outside the dataset as well as population generation in order to increase representativeness of existing datasets. Thus, the proposed algorithms can be used to improve reliability of face recognition performed on images of various kinds. Moreover, these techniques can be applied to address a wide variety of other CMMR problems. |
topic |
cross-modal multimedia retrieval principal component analysis face images sketch facial composite |
url |
http://ntv.ifmo.ru/file/article/16407.pdf |
work_keys_str_mv |
AT gakukharev mutualimagetransformationalgorithmsforvisualinformationprocessingandretrieval AT ynmatveev mutualimagetransformationalgorithmsforvisualinformationprocessingandretrieval AT aloleinik mutualimagetransformationalgorithmsforvisualinformationprocessingandretrieval |
_version_ |
1716781562730643456 |