Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery

This work introduces a method that combines remote sensing and deep learning into a framework that is tailored for accurate, reliable and efficient counting and sizing of plants in aerial images. The investigated task focuses on two low-density crops, potato and lettuce. This double objective of cou...

Full description

Bibliographic Details
Main Authors: Mélissande Machefer, François Lemarchand, Virginie Bonnefond, Alasdair Hitchins, Panagiotis Sidiropoulos
Format: Article
Language:English
Published: MDPI AG 2020-09-01
Series:Remote Sensing
Subjects:
UAV
Online Access:https://www.mdpi.com/2072-4292/12/18/3015
id doaj-466e63d7f510409aa39201c24c2d0725
record_format Article
spelling doaj-466e63d7f510409aa39201c24c2d07252020-11-25T03:19:28ZengMDPI AGRemote Sensing2072-42922020-09-01123015301510.3390/rs12183015Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV ImageryMélissande Machefer0François Lemarchand1Virginie Bonnefond2Alasdair Hitchins3Panagiotis Sidiropoulos4Hummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKThis work introduces a method that combines remote sensing and deep learning into a framework that is tailored for accurate, reliable and efficient counting and sizing of plants in aerial images. The investigated task focuses on two low-density crops, potato and lettuce. This double objective of counting and sizing is achieved through the detection and segmentation of individual plants by fine-tuning an existing deep learning architecture called Mask R-CNN. This paper includes a thorough discussion on the optimal parametrisation to adapt the Mask R-CNN architecture to this novel task. As we examine the correlation of the Mask R-CNN performance to the annotation volume and granularity (coarse or refined) of remotely sensed images of plants, we conclude that transfer learning can be effectively used to reduce the required amount of labelled data. Indeed, a previously trained Mask R-CNN on a low-density crop can improve performances after training on new crops. Once trained for a given crop, the Mask R-CNN solution is shown to outperform a manually-tuned computer vision algorithm. Model performances are assessed using intuitive metrics such as Mean Average Precision (mAP) from Intersection over Union (IoU) of the masks for individual plant segmentation and Multiple Object Tracking Accuracy (MOTA) for detection. The presented model reaches an mAP of <inline-formula><math display="inline"><semantics><mrow><mn>0.418</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.660</mn></mrow></semantics></math></inline-formula> for lettuces for the individual plant segmentation task. In detection, we obtain a MOTA of <inline-formula><math display="inline"><semantics><mrow><mn>0.781</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.918</mn></mrow></semantics></math></inline-formula> for lettuces.https://www.mdpi.com/2072-4292/12/18/3015UAVcrop mappingimage analysisprecision agriculturedeep learningindividual plant segmentation
collection DOAJ
language English
format Article
sources DOAJ
author Mélissande Machefer
François Lemarchand
Virginie Bonnefond
Alasdair Hitchins
Panagiotis Sidiropoulos
spellingShingle Mélissande Machefer
François Lemarchand
Virginie Bonnefond
Alasdair Hitchins
Panagiotis Sidiropoulos
Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
Remote Sensing
UAV
crop mapping
image analysis
precision agriculture
deep learning
individual plant segmentation
author_facet Mélissande Machefer
François Lemarchand
Virginie Bonnefond
Alasdair Hitchins
Panagiotis Sidiropoulos
author_sort Mélissande Machefer
title Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
title_short Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
title_full Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
title_fullStr Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
title_full_unstemmed Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
title_sort mask r-cnn refitting strategy for plant counting and sizing in uav imagery
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2020-09-01
description This work introduces a method that combines remote sensing and deep learning into a framework that is tailored for accurate, reliable and efficient counting and sizing of plants in aerial images. The investigated task focuses on two low-density crops, potato and lettuce. This double objective of counting and sizing is achieved through the detection and segmentation of individual plants by fine-tuning an existing deep learning architecture called Mask R-CNN. This paper includes a thorough discussion on the optimal parametrisation to adapt the Mask R-CNN architecture to this novel task. As we examine the correlation of the Mask R-CNN performance to the annotation volume and granularity (coarse or refined) of remotely sensed images of plants, we conclude that transfer learning can be effectively used to reduce the required amount of labelled data. Indeed, a previously trained Mask R-CNN on a low-density crop can improve performances after training on new crops. Once trained for a given crop, the Mask R-CNN solution is shown to outperform a manually-tuned computer vision algorithm. Model performances are assessed using intuitive metrics such as Mean Average Precision (mAP) from Intersection over Union (IoU) of the masks for individual plant segmentation and Multiple Object Tracking Accuracy (MOTA) for detection. The presented model reaches an mAP of <inline-formula><math display="inline"><semantics><mrow><mn>0.418</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.660</mn></mrow></semantics></math></inline-formula> for lettuces for the individual plant segmentation task. In detection, we obtain a MOTA of <inline-formula><math display="inline"><semantics><mrow><mn>0.781</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.918</mn></mrow></semantics></math></inline-formula> for lettuces.
topic UAV
crop mapping
image analysis
precision agriculture
deep learning
individual plant segmentation
url https://www.mdpi.com/2072-4292/12/18/3015
work_keys_str_mv AT melissandemachefer maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery
AT francoislemarchand maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery
AT virginiebonnefond maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery
AT alasdairhitchins maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery
AT panagiotissidiropoulos maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery
_version_ 1724622210838036480