Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery

This work introduces a method that combines remote sensing and deep learning into a framework that is tailored for accurate, reliable and efficient counting and sizing of plants in aerial images. The investigated task focuses on two low-density crops, potato and lettuce. This double objective of cou...

Full description

Bibliographic Details
Main Authors:	Mélissande Machefer, François Lemarchand, Virginie Bonnefond, Alasdair Hitchins, Panagiotis Sidiropoulos
Format:	Article
Language:	English
Published:	MDPI AG 2020-09-01
Series:	Remote Sensing
Subjects:	UAV crop mapping image analysis precision agriculture deep learning individual plant segmentation
Online Access:	https://www.mdpi.com/2072-4292/12/18/3015

id	doaj-466e63d7f510409aa39201c24c2d0725
record_format	Article
spelling	doaj-466e63d7f510409aa39201c24c2d07252020-11-25T03:19:28ZengMDPI AGRemote Sensing2072-42922020-09-01123015301510.3390/rs12183015Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV ImageryMélissande Machefer0François Lemarchand1Virginie Bonnefond2Alasdair Hitchins3Panagiotis Sidiropoulos4Hummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKThis work introduces a method that combines remote sensing and deep learning into a framework that is tailored for accurate, reliable and efficient counting and sizing of plants in aerial images. The investigated task focuses on two low-density crops, potato and lettuce. This double objective of counting and sizing is achieved through the detection and segmentation of individual plants by fine-tuning an existing deep learning architecture called Mask R-CNN. This paper includes a thorough discussion on the optimal parametrisation to adapt the Mask R-CNN architecture to this novel task. As we examine the correlation of the Mask R-CNN performance to the annotation volume and granularity (coarse or refined) of remotely sensed images of plants, we conclude that transfer learning can be effectively used to reduce the required amount of labelled data. Indeed, a previously trained Mask R-CNN on a low-density crop can improve performances after training on new crops. Once trained for a given crop, the Mask R-CNN solution is shown to outperform a manually-tuned computer vision algorithm. Model performances are assessed using intuitive metrics such as Mean Average Precision (mAP) from Intersection over Union (IoU) of the masks for individual plant segmentation and Multiple Object Tracking Accuracy (MOTA) for detection. The presented model reaches an mAP of <inline-formula><math display="inline"><semantics><mrow><mn>0.418</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.660</mn></mrow></semantics></math></inline-formula> for lettuces for the individual plant segmentation task. In detection, we obtain a MOTA of <inline-formula><math display="inline"><semantics><mrow><mn>0.781</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.918</mn></mrow></semantics></math></inline-formula> for lettuces.https://www.mdpi.com/2072-4292/12/18/3015UAVcrop mappingimage analysisprecision agriculturedeep learningindividual plant segmentation
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Mélissande Machefer François Lemarchand Virginie Bonnefond Alasdair Hitchins Panagiotis Sidiropoulos
spellingShingle	Mélissande Machefer François Lemarchand Virginie Bonnefond Alasdair Hitchins Panagiotis Sidiropoulos Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery Remote Sensing UAV crop mapping image analysis precision agriculture deep learning individual plant segmentation
author_facet	Mélissande Machefer François Lemarchand Virginie Bonnefond Alasdair Hitchins Panagiotis Sidiropoulos
author_sort	Mélissande Machefer
title	Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
title_short	Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
title_full	Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
title_fullStr	Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
title_full_unstemmed	Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
title_sort	mask r-cnn refitting strategy for plant counting and sizing in uav imagery
publisher	MDPI AG
series	Remote Sensing
issn	2072-4292
publishDate	2020-09-01
description	This work introduces a method that combines remote sensing and deep learning into a framework that is tailored for accurate, reliable and efficient counting and sizing of plants in aerial images. The investigated task focuses on two low-density crops, potato and lettuce. This double objective of counting and sizing is achieved through the detection and segmentation of individual plants by fine-tuning an existing deep learning architecture called Mask R-CNN. This paper includes a thorough discussion on the optimal parametrisation to adapt the Mask R-CNN architecture to this novel task. As we examine the correlation of the Mask R-CNN performance to the annotation volume and granularity (coarse or refined) of remotely sensed images of plants, we conclude that transfer learning can be effectively used to reduce the required amount of labelled data. Indeed, a previously trained Mask R-CNN on a low-density crop can improve performances after training on new crops. Once trained for a given crop, the Mask R-CNN solution is shown to outperform a manually-tuned computer vision algorithm. Model performances are assessed using intuitive metrics such as Mean Average Precision (mAP) from Intersection over Union (IoU) of the masks for individual plant segmentation and Multiple Object Tracking Accuracy (MOTA) for detection. The presented model reaches an mAP of <inline-formula><math display="inline"><semantics><mrow><mn>0.418</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.660</mn></mrow></semantics></math></inline-formula> for lettuces for the individual plant segmentation task. In detection, we obtain a MOTA of <inline-formula><math display="inline"><semantics><mrow><mn>0.781</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.918</mn></mrow></semantics></math></inline-formula> for lettuces.
topic	UAV crop mapping image analysis precision agriculture deep learning individual plant segmentation
url	https://www.mdpi.com/2072-4292/12/18/3015
work_keys_str_mv	AT melissandemachefer maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery AT francoislemarchand maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery AT virginiebonnefond maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery AT alasdairhitchins maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery AT panagiotissidiropoulos maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery
_version_	1724622210838036480

Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery

Similar Items