A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix

碩士 === 國立中山大學 === 電機工程學系研究所 === 107 === The behavior of traditional image-based visual servoing method is unsatisfactory when current camera pose is very different from the desired, especially the rotational error along or around the optical axis. During the movement, it might encounter the singular...

Full description

Bibliographic Details
Main Authors:	Chi-yuan Tai, 戴啓原
Other Authors:	Kao-Shing Hwang
Format:	Others
Language:	zh-TW
Published:	2018
Online Access:	http://ndltd.ncl.edu.tw/handle/uj9h2d

id	ndltd-TW-107NSYS5442009
record_format	oai_dc
spelling	ndltd-TW-107NSYS54420092019-05-16T01:40:48Z http://ndltd.ncl.edu.tw/handle/uj9h2d A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix 時變影像賈可賓矩陣的加強式學習方法 Chi-yuan Tai 戴啓原碩士國立中山大學電機工程學系研究所 107 The behavior of traditional image-based visual servoing method is unsatisfactory when current camera pose is very different from the desired, especially the rotational error along or around the optical axis. During the movement, it might encounter the singular point of image Jacobian matrix, causing the robot arm to lose control, or to leave the feature points out of FOV, resulting servoing failure. Therefore, this thesis propose a reinforcement learning approach to time varying image Jacobian matrix. It is implemented by Q-learning, because Q-learning is easy to realize and model-free. This thesis discretize image plane into state space according to the location of feature points, and the action space is composed of the linear combination of image Jacobian matrix. Then, according to current state, learning agent choose an action by ε-greedy policy. The agent finally get a reward by environment, and use it to update the policy. The agent will learn a policy approximate to the best solution through fully interaction with the environment. In order to verify the method proposed in this paper, a six-axis robot arm and a single-lens camera are used to form a visual servoing system. The method proposed in this thesis is verified by comparing the result with visual servo system using fixed image Jacobian matrix in the Webots simulation software and real world environment, respectively. Kao-Shing Hwang 黃國勝 2018 學位論文 ; thesis 54 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立中山大學 === 電機工程學系研究所 === 107 === The behavior of traditional image-based visual servoing method is unsatisfactory when current camera pose is very different from the desired, especially the rotational error along or around the optical axis. During the movement, it might encounter the singular point of image Jacobian matrix, causing the robot arm to lose control, or to leave the feature points out of FOV, resulting servoing failure. Therefore, this thesis propose a reinforcement learning approach to time varying image Jacobian matrix. It is implemented by Q-learning, because Q-learning is easy to realize and model-free. This thesis discretize image plane into state space according to the location of feature points, and the action space is composed of the linear combination of image Jacobian matrix. Then, according to current state, learning agent choose an action by ε-greedy policy. The agent finally get a reward by environment, and use it to update the policy. The agent will learn a policy approximate to the best solution through fully interaction with the environment. In order to verify the method proposed in this paper, a six-axis robot arm and a single-lens camera are used to form a visual servoing system. The method proposed in this thesis is verified by comparing the result with visual servo system using fixed image Jacobian matrix in the Webots simulation software and real world environment, respectively.
author2	Kao-Shing Hwang
author_facet	Kao-Shing Hwang Chi-yuan Tai 戴啓原
author	Chi-yuan Tai 戴啓原
spellingShingle	Chi-yuan Tai 戴啓原 A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix
author_sort	Chi-yuan Tai
title	A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix
title_short	A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix
title_full	A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix
title_fullStr	A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix
title_full_unstemmed	A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix
title_sort	reinforcement learning approach to timing varying image jacobian matrix
publishDate	2018
url	http://ndltd.ncl.edu.tw/handle/uj9h2d
work_keys_str_mv	AT chiyuantai areinforcementlearningapproachtotimingvaryingimagejacobianmatrix AT dàiqǐyuán areinforcementlearningapproachtotimingvaryingimagejacobianmatrix AT chiyuantai shíbiànyǐngxiàngjiǎkěbīnjǔzhèndejiāqiángshìxuéxífāngfǎ AT dàiqǐyuán shíbiànyǐngxiàngjiǎkěbīnjǔzhèndejiāqiángshìxuéxífāngfǎ AT chiyuantai reinforcementlearningapproachtotimingvaryingimagejacobianmatrix AT dàiqǐyuán reinforcementlearningapproachtotimingvaryingimagejacobianmatrix
_version_	1719178965440004096

A Reinforcement Learning Approach to Timing Varying Image Jacobian Matrix

Similar Items