Design of an Active Vision System for High-Level Isolation Units through Q-Learning

The inspection of Personal Protective Equipment (PPE) is one of the most necessary measures when treating patients affected by infectious diseases, such as Ebola or COVID-19. Assuring the integrity of health personnel in contact with infected patients has become an important concern in developed cou...

Full description

Bibliographic Details
Main Authors: Andrea Gil Ruiz, Juan G. Victores, Bartek Łukawski, Carlos Balaguer
Format: Article
Language:English
Published: MDPI AG 2020-08-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/10/17/5927
id doaj-2a5c6332a7414117a9b5b19b9c1ec6de
record_format Article
spelling doaj-2a5c6332a7414117a9b5b19b9c1ec6de2020-11-25T03:52:04ZengMDPI AGApplied Sciences2076-34172020-08-01105927592710.3390/app10175927Design of an Active Vision System for High-Level Isolation Units through Q-LearningAndrea Gil Ruiz0Juan G. Victores1Bartek Łukawski2Carlos Balaguer3RoboticsLab Research Group, University Carlos III of Madrid, 28911 Leganés, SpainRoboticsLab Research Group, University Carlos III of Madrid, 28911 Leganés, SpainRoboticsLab Research Group, University Carlos III of Madrid, 28911 Leganés, SpainRoboticsLab Research Group, University Carlos III of Madrid, 28911 Leganés, SpainThe inspection of Personal Protective Equipment (PPE) is one of the most necessary measures when treating patients affected by infectious diseases, such as Ebola or COVID-19. Assuring the integrity of health personnel in contact with infected patients has become an important concern in developed countries. This work focuses on the study of Reinforcement Learning (RL) techniques for controlling a scanner prototype in the presence of blood traces on the PPE that could arise after contact with pathological patients. A preliminary study on the design of an agent-environment system able to simulate the required task is presented. The task has been adapted to an environment for the OpenAI Gym toolkit. The evaluation of the agent’s performance has considered the effects of different topological designs and tuning hyperparameters of the Q-Learning model-free algorithm. Results have been evaluated on the basis of average reward and timesteps per episode. The sample-average method applied to the learning rate parameter, as well as a specific epsilon decaying method worked best for the trained agents. The obtained results report promising outcomes of an inspection system able to center and magnify contaminants in the real scanner system.https://www.mdpi.com/2076-3417/10/17/5927reinforcement learningpersonal protective equipmentQ-Learningreward shapinggrid searchhealthcare
collection DOAJ
language English
format Article
sources DOAJ
author Andrea Gil Ruiz
Juan G. Victores
Bartek Łukawski
Carlos Balaguer
spellingShingle Andrea Gil Ruiz
Juan G. Victores
Bartek Łukawski
Carlos Balaguer
Design of an Active Vision System for High-Level Isolation Units through Q-Learning
Applied Sciences
reinforcement learning
personal protective equipment
Q-Learning
reward shaping
grid search
healthcare
author_facet Andrea Gil Ruiz
Juan G. Victores
Bartek Łukawski
Carlos Balaguer
author_sort Andrea Gil Ruiz
title Design of an Active Vision System for High-Level Isolation Units through Q-Learning
title_short Design of an Active Vision System for High-Level Isolation Units through Q-Learning
title_full Design of an Active Vision System for High-Level Isolation Units through Q-Learning
title_fullStr Design of an Active Vision System for High-Level Isolation Units through Q-Learning
title_full_unstemmed Design of an Active Vision System for High-Level Isolation Units through Q-Learning
title_sort design of an active vision system for high-level isolation units through q-learning
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2020-08-01
description The inspection of Personal Protective Equipment (PPE) is one of the most necessary measures when treating patients affected by infectious diseases, such as Ebola or COVID-19. Assuring the integrity of health personnel in contact with infected patients has become an important concern in developed countries. This work focuses on the study of Reinforcement Learning (RL) techniques for controlling a scanner prototype in the presence of blood traces on the PPE that could arise after contact with pathological patients. A preliminary study on the design of an agent-environment system able to simulate the required task is presented. The task has been adapted to an environment for the OpenAI Gym toolkit. The evaluation of the agent’s performance has considered the effects of different topological designs and tuning hyperparameters of the Q-Learning model-free algorithm. Results have been evaluated on the basis of average reward and timesteps per episode. The sample-average method applied to the learning rate parameter, as well as a specific epsilon decaying method worked best for the trained agents. The obtained results report promising outcomes of an inspection system able to center and magnify contaminants in the real scanner system.
topic reinforcement learning
personal protective equipment
Q-Learning
reward shaping
grid search
healthcare
url https://www.mdpi.com/2076-3417/10/17/5927
work_keys_str_mv AT andreagilruiz designofanactivevisionsystemforhighlevelisolationunitsthroughqlearning
AT juangvictores designofanactivevisionsystemforhighlevelisolationunitsthroughqlearning
AT bartekłukawski designofanactivevisionsystemforhighlevelisolationunitsthroughqlearning
AT carlosbalaguer designofanactivevisionsystemforhighlevelisolationunitsthroughqlearning
_version_ 1724484495249244160