A Comprehensive Performance Evaluation of Image Quality Assessment Algorithms

Image quality assessment (IQA) algorithms aim to predict perceived image quality by human observers. Over the last two decades, a large amount of work has been carried out in the field. New algorithms are being developed at a rapid rate in different areas of IQA, but are often tested and compared wi...

Full description

Bibliographic Details
Main Authors:	Shahrukh Athar, Zhou Wang
Format:	Article
Language:	English
Published:	IEEE 2019-01-01
Series:	IEEE Access
Subjects:	Image quality assessment performance evaluation image quality study full-reference IQA no-reference IQA FR fusion
Online Access:	https://ieeexplore.ieee.org/document/8847307/

id	doaj-31dbcc6d686a4d2285f5eae19565d8de
record_format	Article
spelling	doaj-31dbcc6d686a4d2285f5eae19565d8de2021-03-29T23:55:03ZengIEEEIEEE Access2169-35362019-01-01714003014007010.1109/ACCESS.2019.29433198847307A Comprehensive Performance Evaluation of Image Quality Assessment AlgorithmsShahrukh Athar0https://orcid.org/0000-0002-8871-669XZhou Wang1Department of Electrical and Computer Engineering, University of Waterloo, Waterloo, CanadaDepartment of Electrical and Computer Engineering, University of Waterloo, Waterloo, CanadaImage quality assessment (IQA) algorithms aim to predict perceived image quality by human observers. Over the last two decades, a large amount of work has been carried out in the field. New algorithms are being developed at a rapid rate in different areas of IQA, but are often tested and compared with limited existing models using out-of-date test data. There is a significant gap when it comes to large-scale performance evaluation studies that include a wide variety of test data and competing algorithms. In this work we aim to fill this gap by carrying out the largest performance evaluation study so far. We test the performance of 43 full-reference (FR), seven fused FR (22 versions), and 14 no-reference (NR) methods on nine subject-rated IQA datasets, of which five contain singly distorted images and four contain multiply distorted content. We use a variety of performance evaluation and statistical significance testing criteria. Our findings not only point to the top performing FR and NR IQA methods, but also highlight the performance gap between them. In addition, we have also conducted a comparative study on FR fusion methods, and an important discovery is that rank aggregation based FR fusion is able to outperform not only other FR fusion approaches but also the top performing FR methods. It may be used to annotate IQA datasets as a possible alternative to subjective ratings, especially in situations where it is not possible to obtain human opinions, such as in the case of large-scale datasets composed of thousands or even millions of images.https://ieeexplore.ieee.org/document/8847307/Image quality assessmentperformance evaluationimage quality studyfull-reference IQAno-reference IQAFR fusion
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Shahrukh Athar Zhou Wang
spellingShingle	Shahrukh Athar Zhou Wang A Comprehensive Performance Evaluation of Image Quality Assessment Algorithms IEEE Access Image quality assessment performance evaluation image quality study full-reference IQA no-reference IQA FR fusion
author_facet	Shahrukh Athar Zhou Wang
author_sort	Shahrukh Athar
title	A Comprehensive Performance Evaluation of Image Quality Assessment Algorithms
title_short	A Comprehensive Performance Evaluation of Image Quality Assessment Algorithms
title_full	A Comprehensive Performance Evaluation of Image Quality Assessment Algorithms
title_fullStr	A Comprehensive Performance Evaluation of Image Quality Assessment Algorithms
title_full_unstemmed	A Comprehensive Performance Evaluation of Image Quality Assessment Algorithms
title_sort	comprehensive performance evaluation of image quality assessment algorithms
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2019-01-01
description	Image quality assessment (IQA) algorithms aim to predict perceived image quality by human observers. Over the last two decades, a large amount of work has been carried out in the field. New algorithms are being developed at a rapid rate in different areas of IQA, but are often tested and compared with limited existing models using out-of-date test data. There is a significant gap when it comes to large-scale performance evaluation studies that include a wide variety of test data and competing algorithms. In this work we aim to fill this gap by carrying out the largest performance evaluation study so far. We test the performance of 43 full-reference (FR), seven fused FR (22 versions), and 14 no-reference (NR) methods on nine subject-rated IQA datasets, of which five contain singly distorted images and four contain multiply distorted content. We use a variety of performance evaluation and statistical significance testing criteria. Our findings not only point to the top performing FR and NR IQA methods, but also highlight the performance gap between them. In addition, we have also conducted a comparative study on FR fusion methods, and an important discovery is that rank aggregation based FR fusion is able to outperform not only other FR fusion approaches but also the top performing FR methods. It may be used to annotate IQA datasets as a possible alternative to subjective ratings, especially in situations where it is not possible to obtain human opinions, such as in the case of large-scale datasets composed of thousands or even millions of images.
topic	Image quality assessment performance evaluation image quality study full-reference IQA no-reference IQA FR fusion
url	https://ieeexplore.ieee.org/document/8847307/
work_keys_str_mv	AT shahrukhathar acomprehensiveperformanceevaluationofimagequalityassessmentalgorithms AT zhouwang acomprehensiveperformanceevaluationofimagequalityassessmentalgorithms AT shahrukhathar comprehensiveperformanceevaluationofimagequalityassessmentalgorithms AT zhouwang comprehensiveperformanceevaluationofimagequalityassessmentalgorithms
_version_	1724188854919888896

A Comprehensive Performance Evaluation of Image Quality Assessment Algorithms

Similar Items