Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery
This work introduces a method that combines remote sensing and deep learning into a framework that is tailored for accurate, reliable and efficient counting and sizing of plants in aerial images. The investigated task focuses on two low-density crops, potato and lettuce. This double objective of cou...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-09-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/12/18/3015 |
id |
doaj-466e63d7f510409aa39201c24c2d0725 |
---|---|
record_format |
Article |
spelling |
doaj-466e63d7f510409aa39201c24c2d07252020-11-25T03:19:28ZengMDPI AGRemote Sensing2072-42922020-09-01123015301510.3390/rs12183015Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV ImageryMélissande Machefer0François Lemarchand1Virginie Bonnefond2Alasdair Hitchins3Panagiotis Sidiropoulos4Hummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKHummingbird Technologies, Aviation House, 125 Kingsway, Holborn, London WC2B 6NH, UKThis work introduces a method that combines remote sensing and deep learning into a framework that is tailored for accurate, reliable and efficient counting and sizing of plants in aerial images. The investigated task focuses on two low-density crops, potato and lettuce. This double objective of counting and sizing is achieved through the detection and segmentation of individual plants by fine-tuning an existing deep learning architecture called Mask R-CNN. This paper includes a thorough discussion on the optimal parametrisation to adapt the Mask R-CNN architecture to this novel task. As we examine the correlation of the Mask R-CNN performance to the annotation volume and granularity (coarse or refined) of remotely sensed images of plants, we conclude that transfer learning can be effectively used to reduce the required amount of labelled data. Indeed, a previously trained Mask R-CNN on a low-density crop can improve performances after training on new crops. Once trained for a given crop, the Mask R-CNN solution is shown to outperform a manually-tuned computer vision algorithm. Model performances are assessed using intuitive metrics such as Mean Average Precision (mAP) from Intersection over Union (IoU) of the masks for individual plant segmentation and Multiple Object Tracking Accuracy (MOTA) for detection. The presented model reaches an mAP of <inline-formula><math display="inline"><semantics><mrow><mn>0.418</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.660</mn></mrow></semantics></math></inline-formula> for lettuces for the individual plant segmentation task. In detection, we obtain a MOTA of <inline-formula><math display="inline"><semantics><mrow><mn>0.781</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.918</mn></mrow></semantics></math></inline-formula> for lettuces.https://www.mdpi.com/2072-4292/12/18/3015UAVcrop mappingimage analysisprecision agriculturedeep learningindividual plant segmentation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mélissande Machefer François Lemarchand Virginie Bonnefond Alasdair Hitchins Panagiotis Sidiropoulos |
spellingShingle |
Mélissande Machefer François Lemarchand Virginie Bonnefond Alasdair Hitchins Panagiotis Sidiropoulos Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery Remote Sensing UAV crop mapping image analysis precision agriculture deep learning individual plant segmentation |
author_facet |
Mélissande Machefer François Lemarchand Virginie Bonnefond Alasdair Hitchins Panagiotis Sidiropoulos |
author_sort |
Mélissande Machefer |
title |
Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery |
title_short |
Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery |
title_full |
Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery |
title_fullStr |
Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery |
title_full_unstemmed |
Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery |
title_sort |
mask r-cnn refitting strategy for plant counting and sizing in uav imagery |
publisher |
MDPI AG |
series |
Remote Sensing |
issn |
2072-4292 |
publishDate |
2020-09-01 |
description |
This work introduces a method that combines remote sensing and deep learning into a framework that is tailored for accurate, reliable and efficient counting and sizing of plants in aerial images. The investigated task focuses on two low-density crops, potato and lettuce. This double objective of counting and sizing is achieved through the detection and segmentation of individual plants by fine-tuning an existing deep learning architecture called Mask R-CNN. This paper includes a thorough discussion on the optimal parametrisation to adapt the Mask R-CNN architecture to this novel task. As we examine the correlation of the Mask R-CNN performance to the annotation volume and granularity (coarse or refined) of remotely sensed images of plants, we conclude that transfer learning can be effectively used to reduce the required amount of labelled data. Indeed, a previously trained Mask R-CNN on a low-density crop can improve performances after training on new crops. Once trained for a given crop, the Mask R-CNN solution is shown to outperform a manually-tuned computer vision algorithm. Model performances are assessed using intuitive metrics such as Mean Average Precision (mAP) from Intersection over Union (IoU) of the masks for individual plant segmentation and Multiple Object Tracking Accuracy (MOTA) for detection. The presented model reaches an mAP of <inline-formula><math display="inline"><semantics><mrow><mn>0.418</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.660</mn></mrow></semantics></math></inline-formula> for lettuces for the individual plant segmentation task. In detection, we obtain a MOTA of <inline-formula><math display="inline"><semantics><mrow><mn>0.781</mn></mrow></semantics></math></inline-formula> for potato plants and <inline-formula><math display="inline"><semantics><mrow><mn>0.918</mn></mrow></semantics></math></inline-formula> for lettuces. |
topic |
UAV crop mapping image analysis precision agriculture deep learning individual plant segmentation |
url |
https://www.mdpi.com/2072-4292/12/18/3015 |
work_keys_str_mv |
AT melissandemachefer maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery AT francoislemarchand maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery AT virginiebonnefond maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery AT alasdairhitchins maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery AT panagiotissidiropoulos maskrcnnrefittingstrategyforplantcountingandsizinginuavimagery |
_version_ |
1724622210838036480 |