A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix
碩士 === 國立中山大學 === 電機工程學系研究所 === 107 === The behavior of traditional image-based visual servoing method is unsatisfactory when current camera pose is very different from the desired, especially the rotational error along or around the optical axis. During the movement, it might encounter the singular...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2018
|
Online Access: | http://ndltd.ncl.edu.tw/handle/uj9h2d |
id |
ndltd-TW-107NSYS5442009 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107NSYS54420092019-05-16T01:40:48Z http://ndltd.ncl.edu.tw/handle/uj9h2d A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix 時變影像賈可賓矩陣的加強式學習方法 Chi-yuan Tai 戴啓原 碩士 國立中山大學 電機工程學系研究所 107 The behavior of traditional image-based visual servoing method is unsatisfactory when current camera pose is very different from the desired, especially the rotational error along or around the optical axis. During the movement, it might encounter the singular point of image Jacobian matrix, causing the robot arm to lose control, or to leave the feature points out of FOV, resulting servoing failure. Therefore, this thesis propose a reinforcement learning approach to time varying image Jacobian matrix. It is implemented by Q-learning, because Q-learning is easy to realize and model-free. This thesis discretize image plane into state space according to the location of feature points, and the action space is composed of the linear combination of image Jacobian matrix. Then, according to current state, learning agent choose an action by ε-greedy policy. The agent finally get a reward by environment, and use it to update the policy. The agent will learn a policy approximate to the best solution through fully interaction with the environment. In order to verify the method proposed in this paper, a six-axis robot arm and a single-lens camera are used to form a visual servoing system. The method proposed in this thesis is verified by comparing the result with visual servo system using fixed image Jacobian matrix in the Webots simulation software and real world environment, respectively. Kao-Shing Hwang 黃國勝 2018 學位論文 ; thesis 54 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中山大學 === 電機工程學系研究所 === 107 === The behavior of traditional image-based visual servoing method is unsatisfactory when current camera pose is very different from the desired, especially the rotational error along or around the optical axis. During the movement, it might encounter the singular point of image Jacobian matrix, causing the robot arm to lose control, or to leave the feature points out of FOV, resulting servoing failure. Therefore, this thesis propose a reinforcement learning approach to time varying image Jacobian matrix. It is implemented by Q-learning, because Q-learning is easy to realize and model-free. This thesis discretize image plane into state space according to the location of feature points, and the action space is composed of the linear combination of image Jacobian matrix. Then, according to current state, learning agent choose an action by ε-greedy policy. The agent finally get a reward by environment, and use it to update the policy. The agent will learn a policy approximate to the best solution through fully interaction with the environment. In order to verify the method proposed in this paper, a six-axis robot arm and a single-lens camera are used to form a visual servoing system. The method proposed in this thesis is verified by comparing the result with visual servo system using fixed image Jacobian matrix in the Webots simulation software and real world environment, respectively.
|
author2 |
Kao-Shing Hwang |
author_facet |
Kao-Shing Hwang Chi-yuan Tai 戴啓原 |
author |
Chi-yuan Tai 戴啓原 |
spellingShingle |
Chi-yuan Tai 戴啓原 A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix |
author_sort |
Chi-yuan Tai |
title |
A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix |
title_short |
A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix |
title_full |
A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix |
title_fullStr |
A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix |
title_full_unstemmed |
A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix |
title_sort |
reinforcement learning approach to timing varying image jacobian matrix |
publishDate |
2018 |
url |
http://ndltd.ncl.edu.tw/handle/uj9h2d |
work_keys_str_mv |
AT chiyuantai areinforcementlearningapproachtotimingvaryingimagejacobianmatrix AT dàiqǐyuán areinforcementlearningapproachtotimingvaryingimagejacobianmatrix AT chiyuantai shíbiànyǐngxiàngjiǎkěbīnjǔzhèndejiāqiángshìxuéxífāngfǎ AT dàiqǐyuán shíbiànyǐngxiàngjiǎkěbīnjǔzhèndejiāqiángshìxuéxífāngfǎ AT chiyuantai reinforcementlearningapproachtotimingvaryingimagejacobianmatrix AT dàiqǐyuán reinforcementlearningapproachtotimingvaryingimagejacobianmatrix |
_version_ |
1719178965440004096 |