Adaptive Image Thresholding of Yellow Peppers for a Harvesting Robot
The presented work is part of the H2020 project SWEEPER with the overall goal to develop a sweet pepper harvesting robot for use in greenhouses. As part of the solution, visual servoing is used to direct the manipulator towards the fruit. This requires accurate and stable fruit detection based on vi...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2018-02-01
|
Series: | Robotics |
Subjects: | |
Online Access: | http://www.mdpi.com/2218-6581/7/1/11 |
id |
doaj-6c2fd8e7f3b54fc0992cda0bdb1a8294 |
---|---|
record_format |
Article |
spelling |
doaj-6c2fd8e7f3b54fc0992cda0bdb1a82942020-11-25T00:02:14ZengMDPI AGRobotics2218-65812018-02-01711110.3390/robotics7010011robotics7010011Adaptive Image Thresholding of Yellow Peppers for a Harvesting RobotAhmad Ostovar0Ola Ringdahl1Thomas Hellström2Department of Computing Science, Umeå University, Umeå 901 87, SwedenDepartment of Computing Science, Umeå University, Umeå 901 87, SwedenDepartment of Computing Science, Umeå University, Umeå 901 87, SwedenThe presented work is part of the H2020 project SWEEPER with the overall goal to develop a sweet pepper harvesting robot for use in greenhouses. As part of the solution, visual servoing is used to direct the manipulator towards the fruit. This requires accurate and stable fruit detection based on video images. To segment an image into background and foreground, thresholding techniques are commonly used. The varying illumination conditions in the unstructured greenhouse environment often cause shadows and overexposure. Furthermore, the color of the fruits to be harvested varies over the season. All this makes it sub-optimal to use fixed pre-selected thresholds. In this paper we suggest an adaptive image-dependent thresholding method. A variant of reinforcement learning (RL) is used with a reward function that computes the similarity between the segmented image and the labeled image to give feedback for action selection. The RL-based approach requires less computational resources than exhaustive search, which is used as a benchmark, and results in higher performance compared to a Lipschitzian based optimization approach. The proposed method also requires fewer labeled images compared to other methods. Several exploration-exploitation strategies are compared, and the results indicate that the Decaying Epsilon-Greedy algorithm gives highest performance for this task. The highest performance with the Epsilon-Greedy algorithm ( ϵ = 0.7) reached 87% of the performance achieved by exhaustive search, with 50% fewer iterations than the benchmark. The performance increased to 91.5% using Decaying Epsilon-Greedy algorithm, with 73% less number of iterations than the benchmark.http://www.mdpi.com/2218-6581/7/1/11reinforcement learningQ-Learningimage thresholdingϵ-greedy strategies |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ahmad Ostovar Ola Ringdahl Thomas Hellström |
spellingShingle |
Ahmad Ostovar Ola Ringdahl Thomas Hellström Adaptive Image Thresholding of Yellow Peppers for a Harvesting Robot Robotics reinforcement learning Q-Learning image thresholding ϵ-greedy strategies |
author_facet |
Ahmad Ostovar Ola Ringdahl Thomas Hellström |
author_sort |
Ahmad Ostovar |
title |
Adaptive Image Thresholding of Yellow Peppers for a Harvesting Robot |
title_short |
Adaptive Image Thresholding of Yellow Peppers for a Harvesting Robot |
title_full |
Adaptive Image Thresholding of Yellow Peppers for a Harvesting Robot |
title_fullStr |
Adaptive Image Thresholding of Yellow Peppers for a Harvesting Robot |
title_full_unstemmed |
Adaptive Image Thresholding of Yellow Peppers for a Harvesting Robot |
title_sort |
adaptive image thresholding of yellow peppers for a harvesting robot |
publisher |
MDPI AG |
series |
Robotics |
issn |
2218-6581 |
publishDate |
2018-02-01 |
description |
The presented work is part of the H2020 project SWEEPER with the overall goal to develop a sweet pepper harvesting robot for use in greenhouses. As part of the solution, visual servoing is used to direct the manipulator towards the fruit. This requires accurate and stable fruit detection based on video images. To segment an image into background and foreground, thresholding techniques are commonly used. The varying illumination conditions in the unstructured greenhouse environment often cause shadows and overexposure. Furthermore, the color of the fruits to be harvested varies over the season. All this makes it sub-optimal to use fixed pre-selected thresholds. In this paper we suggest an adaptive image-dependent thresholding method. A variant of reinforcement learning (RL) is used with a reward function that computes the similarity between the segmented image and the labeled image to give feedback for action selection. The RL-based approach requires less computational resources than exhaustive search, which is used as a benchmark, and results in higher performance compared to a Lipschitzian based optimization approach. The proposed method also requires fewer labeled images compared to other methods. Several exploration-exploitation strategies are compared, and the results indicate that the Decaying Epsilon-Greedy algorithm gives highest performance for this task. The highest performance with the Epsilon-Greedy algorithm ( ϵ = 0.7) reached 87% of the performance achieved by exhaustive search, with 50% fewer iterations than the benchmark. The performance increased to 91.5% using Decaying Epsilon-Greedy algorithm, with 73% less number of iterations than the benchmark. |
topic |
reinforcement learning Q-Learning image thresholding ϵ-greedy strategies |
url |
http://www.mdpi.com/2218-6581/7/1/11 |
work_keys_str_mv |
AT ahmadostovar adaptiveimagethresholdingofyellowpeppersforaharvestingrobot AT olaringdahl adaptiveimagethresholdingofyellowpeppersforaharvestingrobot AT thomashellstrom adaptiveimagethresholdingofyellowpeppersforaharvestingrobot |
_version_ |
1725438753937293312 |