A Novel Focal Phi Loss for Power Line Segmentation with Auxiliary Classifier U-Net

The segmentation of power lines (PLs) from aerial images is a crucial task for the safe navigation of unmanned aerial vehicles (UAVs) operating at low altitudes. Despite the advances in deep learning-based approaches for PL segmentation, these models are still vulnerable to the class imbalance prese...

Full description

Bibliographic Details
Main Authors:	Rabeea Jaffari, Manzoor Ahmed Hashmani, Constantino Carlos Reyes-Aldasoro
Format:	Article
Language:	English
Published:	MDPI AG 2021-04-01
Series:	Sensors
Subjects:	power lines semantic segmentation Matthews correlation coefficient loss function data imbalance
Online Access:	https://www.mdpi.com/1424-8220/21/8/2803

id	doaj-f72ebe78993641109ab86ae7d16b3d0d
record_format	Article
spelling	doaj-f72ebe78993641109ab86ae7d16b3d0d2021-04-16T23:01:32ZengMDPI AGSensors1424-82202021-04-01212803280310.3390/s21082803A Novel Focal Phi Loss for Power Line Segmentation with Auxiliary Classifier U-NetRabeea Jaffari0Manzoor Ahmed Hashmani1Constantino Carlos Reyes-Aldasoro2Department of Computer and Information Sciences, Universiti Teknologi PETRONAS (UTP), Seri Iskandar 32610, MalaysiaDepartment of Computer and Information Sciences, Universiti Teknologi PETRONAS (UTP), Seri Iskandar 32610, MalaysiagiCentre, Department of Computer Science, University of London, London EC1V 0HB, UKThe segmentation of power lines (PLs) from aerial images is a crucial task for the safe navigation of unmanned aerial vehicles (UAVs) operating at low altitudes. Despite the advances in deep learning-based approaches for PL segmentation, these models are still vulnerable to the class imbalance present in the data. The PLs occupy only a minimal portion (1–5%) of the aerial images as compared to the background region (95–99%). Generally, this class imbalance problem is addressed via the use of PL-specific detectors in conjunction with the popular class balanced cross entropy (BBCE) loss function. However, these PL-specific detectors do not work outside their application areas and a BBCE loss requires hyperparameter tuning for class-wise weights, which is not trivial. Moreover, the BBCE loss results in low dice scores and precision values and thus, fails to achieve an optimal trade-off between dice scores, model accuracy, and precision–recall values. In this work, we propose a generalized focal loss function based on the Matthews correlation coefficient (MCC) or the Phi coefficient to address the class imbalance problem in PL segmentation while utilizing a generic deep segmentation architecture. We evaluate our loss function by improving the vanilla U-Net model with an additional convolutional auxiliary classifier head (ACU-Net) for better learning and faster model convergence. The evaluation of two PL datasets, namely the Mendeley Power Line Dataset and the Power Line Dataset of Urban Scenes (PLDU), where PLs occupy around 1% and 2% of the aerial images area, respectively, reveal that our proposed loss function outperforms the popular BBCE loss by 16% in PL dice scores on both the datasets, 19% in precision and false detection rate (FDR) values for the Mendeley PL dataset and 15% in precision and FDR values for the PLDU with a minor degradation in the accuracy and recall values. Moreover, our proposed ACU-Net outperforms the baseline vanilla U-Net for the characteristic evaluation parameters in the range of 1–10% for both the PL datasets. Thus, our proposed loss function with ACU-Net achieves an optimal trade-off for the characteristic evaluation parameters without any bells and whistles. Our code is available at Github.https://www.mdpi.com/1424-8220/21/8/2803power linessemantic segmentationMatthews correlation coefficientloss functiondata imbalance
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Rabeea Jaffari Manzoor Ahmed Hashmani Constantino Carlos Reyes-Aldasoro
spellingShingle	Rabeea Jaffari Manzoor Ahmed Hashmani Constantino Carlos Reyes-Aldasoro A Novel Focal Phi Loss for Power Line Segmentation with Auxiliary Classifier U-Net Sensors power lines semantic segmentation Matthews correlation coefficient loss function data imbalance
author_facet	Rabeea Jaffari Manzoor Ahmed Hashmani Constantino Carlos Reyes-Aldasoro
author_sort	Rabeea Jaffari
title	A Novel Focal Phi Loss for Power Line Segmentation with Auxiliary Classifier U-Net
title_short	A Novel Focal Phi Loss for Power Line Segmentation with Auxiliary Classifier U-Net
title_full	A Novel Focal Phi Loss for Power Line Segmentation with Auxiliary Classifier U-Net
title_fullStr	A Novel Focal Phi Loss for Power Line Segmentation with Auxiliary Classifier U-Net
title_full_unstemmed	A Novel Focal Phi Loss for Power Line Segmentation with Auxiliary Classifier U-Net
title_sort	novel focal phi loss for power line segmentation with auxiliary classifier u-net
publisher	MDPI AG
series	Sensors
issn	1424-8220
publishDate	2021-04-01
description	The segmentation of power lines (PLs) from aerial images is a crucial task for the safe navigation of unmanned aerial vehicles (UAVs) operating at low altitudes. Despite the advances in deep learning-based approaches for PL segmentation, these models are still vulnerable to the class imbalance present in the data. The PLs occupy only a minimal portion (1–5%) of the aerial images as compared to the background region (95–99%). Generally, this class imbalance problem is addressed via the use of PL-specific detectors in conjunction with the popular class balanced cross entropy (BBCE) loss function. However, these PL-specific detectors do not work outside their application areas and a BBCE loss requires hyperparameter tuning for class-wise weights, which is not trivial. Moreover, the BBCE loss results in low dice scores and precision values and thus, fails to achieve an optimal trade-off between dice scores, model accuracy, and precision–recall values. In this work, we propose a generalized focal loss function based on the Matthews correlation coefficient (MCC) or the Phi coefficient to address the class imbalance problem in PL segmentation while utilizing a generic deep segmentation architecture. We evaluate our loss function by improving the vanilla U-Net model with an additional convolutional auxiliary classifier head (ACU-Net) for better learning and faster model convergence. The evaluation of two PL datasets, namely the Mendeley Power Line Dataset and the Power Line Dataset of Urban Scenes (PLDU), where PLs occupy around 1% and 2% of the aerial images area, respectively, reveal that our proposed loss function outperforms the popular BBCE loss by 16% in PL dice scores on both the datasets, 19% in precision and false detection rate (FDR) values for the Mendeley PL dataset and 15% in precision and FDR values for the PLDU with a minor degradation in the accuracy and recall values. Moreover, our proposed ACU-Net outperforms the baseline vanilla U-Net for the characteristic evaluation parameters in the range of 1–10% for both the PL datasets. Thus, our proposed loss function with ACU-Net achieves an optimal trade-off for the characteristic evaluation parameters without any bells and whistles. Our code is available at Github.
topic	power lines semantic segmentation Matthews correlation coefficient loss function data imbalance
url	https://www.mdpi.com/1424-8220/21/8/2803
work_keys_str_mv	AT rabeeajaffari anovelfocalphilossforpowerlinesegmentationwithauxiliaryclassifierunet AT manzoorahmedhashmani anovelfocalphilossforpowerlinesegmentationwithauxiliaryclassifierunet AT constantinocarlosreyesaldasoro anovelfocalphilossforpowerlinesegmentationwithauxiliaryclassifierunet AT rabeeajaffari novelfocalphilossforpowerlinesegmentationwithauxiliaryclassifierunet AT manzoorahmedhashmani novelfocalphilossforpowerlinesegmentationwithauxiliaryclassifierunet AT constantinocarlosreyesaldasoro novelfocalphilossforpowerlinesegmentationwithauxiliaryclassifierunet
_version_	1721524293812092928

A Novel Focal Phi Loss for Power Line Segmentation with Auxiliary Classifier U-Net

Similar Items