TasselNetv2: in-field counting of wheat spikes with context-augmented local regression networks

Abstract Background Grain yield of wheat is greatly associated with the population of wheat spikes, i.e., $$spike~number~\text {m}^{-2}$$ spikenumberm-2 . To obtain this index in a reliable and efficient way, it is necessary to count wheat spikes accurately and automatically. Currently computer visi...

Full description

Bibliographic Details
Main Authors: Haipeng Xiong, Zhiguo Cao, Hao Lu, Simon Madec, Liang Liu, Chunhua Shen
Format: Article
Language:English
Published: BMC 2019-12-01
Series:Plant Methods
Subjects:
Online Access:https://doi.org/10.1186/s13007-019-0537-2
id doaj-bf846ae2fbeb46b7be3e7febbece132a
record_format Article
spelling doaj-bf846ae2fbeb46b7be3e7febbece132a2020-12-13T12:09:26ZengBMCPlant Methods1746-48112019-12-0115111410.1186/s13007-019-0537-2TasselNetv2: in-field counting of wheat spikes with context-augmented local regression networksHaipeng Xiong0Zhiguo Cao1Hao Lu2Simon Madec3Liang Liu4Chunhua Shen5National Key Laboratory of Science and Technology on Multi-Spectral Information Processing, School of Artificial Intelligence and Automation, Huazhong University of Science and TechnologyNational Key Laboratory of Science and Technology on Multi-Spectral Information Processing, School of Artificial Intelligence and Automation, Huazhong University of Science and TechnologyNational Key Laboratory of Science and Technology on Multi-Spectral Information Processing, School of Artificial Intelligence and Automation, Huazhong University of Science and TechnologyINRA-EMMAH-CAPTENational Key Laboratory of Science and Technology on Multi-Spectral Information Processing, School of Artificial Intelligence and Automation, Huazhong University of Science and TechnologySchool of Computer Science, The University of AdelaideAbstract Background Grain yield of wheat is greatly associated with the population of wheat spikes, i.e., $$spike~number~\text {m}^{-2}$$ spikenumberm-2 . To obtain this index in a reliable and efficient way, it is necessary to count wheat spikes accurately and automatically. Currently computer vision technologies have shown great potential to automate this task effectively in a low-end manner. In particular, counting wheat spikes is a typical visual counting problem, which is substantially studied under the name of object counting in Computer Vision. TasselNet, which represents one of the state-of-the-art counting approaches, is a convolutional neural network-based local regression model, and currently benchmarks the best record on counting maize tassels. However, when applying TasselNet to wheat spikes, it cannot predict accurate counts when spikes partially present. Results In this paper, we make an important observation that the counting performance of local regression networks can be significantly improved via adding visual context to the local patches. Meanwhile, such context can be treated as part of the receptive field without increasing the model capacity. We thus propose a simple yet effective contextual extension of TasselNet—TasselNetv2. If implementing TasselNetv2 in a fully convolutional form, both training and inference can be greatly sped up by reducing redundant computations. In particular, we collected and labeled a large-scale wheat spikes counting (WSC) dataset, with 1764 high-resolution images and 675,322 manually-annotated instances. Extensive experiments show that, TasselNetv2 not only achieves state-of-the-art performance on the WSC dataset ($$91.01\%$$ 91.01% counting accuracy) but also is more than an order of magnitude faster than TasselNet (13.82 fps on $$912\times 1216$$ 912×1216 images). The generality of TasselNetv2 is further demonstrated by advancing the state of the art on both the Maize Tassels Counting and ShanghaiTech Crowd Counting datasets. Conclusions This paper describes TasselNetv2 for counting wheat spikes, which simultaneously addresses two important use cases in plant counting: improving the counting accuracy without increasing model capacity, and improving efficiency without sacrificing accuracy. It is promising to be deployed in a real-time system with high-throughput demand. In particular, TasselNetv2 can achieve sufficiently accurate results when training from scratch with small networks, and adopting larger pre-trained networks can further boost accuracy. In practice, one can trade off the performance and efficiency according to certain application scenarios. Code and models are made available at: https://tinyurl.com/TasselNetv2.https://doi.org/10.1186/s13007-019-0537-2Wheat spikesObject countingConvolutional modelsLocal regression networksContext fusion
collection DOAJ
language English
format Article
sources DOAJ
author Haipeng Xiong
Zhiguo Cao
Hao Lu
Simon Madec
Liang Liu
Chunhua Shen
spellingShingle Haipeng Xiong
Zhiguo Cao
Hao Lu
Simon Madec
Liang Liu
Chunhua Shen
TasselNetv2: in-field counting of wheat spikes with context-augmented local regression networks
Plant Methods
Wheat spikes
Object counting
Convolutional models
Local regression networks
Context fusion
author_facet Haipeng Xiong
Zhiguo Cao
Hao Lu
Simon Madec
Liang Liu
Chunhua Shen
author_sort Haipeng Xiong
title TasselNetv2: in-field counting of wheat spikes with context-augmented local regression networks
title_short TasselNetv2: in-field counting of wheat spikes with context-augmented local regression networks
title_full TasselNetv2: in-field counting of wheat spikes with context-augmented local regression networks
title_fullStr TasselNetv2: in-field counting of wheat spikes with context-augmented local regression networks
title_full_unstemmed TasselNetv2: in-field counting of wheat spikes with context-augmented local regression networks
title_sort tasselnetv2: in-field counting of wheat spikes with context-augmented local regression networks
publisher BMC
series Plant Methods
issn 1746-4811
publishDate 2019-12-01
description Abstract Background Grain yield of wheat is greatly associated with the population of wheat spikes, i.e., $$spike~number~\text {m}^{-2}$$ spikenumberm-2 . To obtain this index in a reliable and efficient way, it is necessary to count wheat spikes accurately and automatically. Currently computer vision technologies have shown great potential to automate this task effectively in a low-end manner. In particular, counting wheat spikes is a typical visual counting problem, which is substantially studied under the name of object counting in Computer Vision. TasselNet, which represents one of the state-of-the-art counting approaches, is a convolutional neural network-based local regression model, and currently benchmarks the best record on counting maize tassels. However, when applying TasselNet to wheat spikes, it cannot predict accurate counts when spikes partially present. Results In this paper, we make an important observation that the counting performance of local regression networks can be significantly improved via adding visual context to the local patches. Meanwhile, such context can be treated as part of the receptive field without increasing the model capacity. We thus propose a simple yet effective contextual extension of TasselNet—TasselNetv2. If implementing TasselNetv2 in a fully convolutional form, both training and inference can be greatly sped up by reducing redundant computations. In particular, we collected and labeled a large-scale wheat spikes counting (WSC) dataset, with 1764 high-resolution images and 675,322 manually-annotated instances. Extensive experiments show that, TasselNetv2 not only achieves state-of-the-art performance on the WSC dataset ($$91.01\%$$ 91.01% counting accuracy) but also is more than an order of magnitude faster than TasselNet (13.82 fps on $$912\times 1216$$ 912×1216 images). The generality of TasselNetv2 is further demonstrated by advancing the state of the art on both the Maize Tassels Counting and ShanghaiTech Crowd Counting datasets. Conclusions This paper describes TasselNetv2 for counting wheat spikes, which simultaneously addresses two important use cases in plant counting: improving the counting accuracy without increasing model capacity, and improving efficiency without sacrificing accuracy. It is promising to be deployed in a real-time system with high-throughput demand. In particular, TasselNetv2 can achieve sufficiently accurate results when training from scratch with small networks, and adopting larger pre-trained networks can further boost accuracy. In practice, one can trade off the performance and efficiency according to certain application scenarios. Code and models are made available at: https://tinyurl.com/TasselNetv2.
topic Wheat spikes
Object counting
Convolutional models
Local regression networks
Context fusion
url https://doi.org/10.1186/s13007-019-0537-2
work_keys_str_mv AT haipengxiong tasselnetv2infieldcountingofwheatspikeswithcontextaugmentedlocalregressionnetworks
AT zhiguocao tasselnetv2infieldcountingofwheatspikeswithcontextaugmentedlocalregressionnetworks
AT haolu tasselnetv2infieldcountingofwheatspikeswithcontextaugmentedlocalregressionnetworks
AT simonmadec tasselnetv2infieldcountingofwheatspikeswithcontextaugmentedlocalregressionnetworks
AT liangliu tasselnetv2infieldcountingofwheatspikeswithcontextaugmentedlocalregressionnetworks
AT chunhuashen tasselnetv2infieldcountingofwheatspikeswithcontextaugmentedlocalregressionnetworks
_version_ 1724385216664961024