6-DOF Manipulator Path Planning Based on Reinforcement Learning
碩士 === 國立彰化師範大學 === 機電工程學系 === 107 === The purpose of this paper is to develop a robotic autonomous learning system based on virtual modeling and reinforcement learning. Its main function is given by the mechanical arm model, the environmental obstacle, the initial coordinate of the robot arm and...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/bx72t5 |
id |
ndltd-TW-107NCUE5489024 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107NCUE54890242019-11-06T03:33:28Z http://ndltd.ncl.edu.tw/handle/bx72t5 6-DOF Manipulator Path Planning Based on Reinforcement Learning 強化學習應用於六軸機械手臂路徑規劃 Tsai,Han-Hsien 蔡瀚賢 碩士 國立彰化師範大學 機電工程學系 107 The purpose of this paper is to develop a robotic autonomous learning system based on virtual modeling and reinforcement learning. Its main function is given by the mechanical arm model, the environmental obstacle, the initial coordinate of the robot arm and the target point, and then the system will be automatically generated a set of motion corners to enable the robot to smoothly avoid the obstacle and reach the target. The program development in this article will be divided into two parts. The first part includes arm simulation and collision detection. Firstly, the VTK visualization tool library is utilized, and the six-axis robotic arm files and obstacles are imported, and then the motion of the robot arm and the surrounding environment are visualized. Finally, it is judged whether or not the collision is caused by the directional bounding box algorithm. The second part includes the machine learning for the path planning. The DDPG model is established through Tensorflow package. Next, the reinforcement learning is used to interact with the environmental variables. The different reward functions are designed to test and discuss. Finally, the feasibility is verified by the actual machine operation. This experiment finally proves the feasibility of reinforcement learning applied to the path planning of the robot arm. Through this method, the robot arm can be autonomously planned, and the target point is within 10 mm of the error. Chen,Ming-Fei 陳明飛 2019 學位論文 ; thesis 82 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立彰化師範大學 === 機電工程學系 === 107 === The purpose of this paper is to develop a robotic autonomous learning system based on virtual modeling and reinforcement learning. Its main function is given by the mechanical arm model, the environmental obstacle, the initial coordinate of the robot arm and the target point, and then the system will be automatically generated a set of motion corners to enable the robot to smoothly avoid the obstacle and reach the target.
The program development in this article will be divided into two parts. The first part includes arm simulation and collision detection. Firstly, the VTK visualization tool library is utilized, and the six-axis robotic arm files and obstacles are imported, and then the motion of the robot arm and the surrounding environment are visualized. Finally, it is judged whether or not the collision is caused by the directional bounding box algorithm. The second part includes the machine learning for the path planning. The DDPG model is established through Tensorflow package. Next, the reinforcement learning is used to interact with the environmental variables. The different reward functions are designed to test and discuss. Finally, the feasibility is verified by the actual machine operation. This experiment finally proves the feasibility of reinforcement learning applied to the path planning of the robot arm. Through this method, the robot arm can be autonomously planned, and the target point is within 10 mm of the error.
|
author2 |
Chen,Ming-Fei |
author_facet |
Chen,Ming-Fei Tsai,Han-Hsien 蔡瀚賢 |
author |
Tsai,Han-Hsien 蔡瀚賢 |
spellingShingle |
Tsai,Han-Hsien 蔡瀚賢 6-DOF Manipulator Path Planning Based on Reinforcement Learning |
author_sort |
Tsai,Han-Hsien |
title |
6-DOF Manipulator Path Planning Based on Reinforcement Learning |
title_short |
6-DOF Manipulator Path Planning Based on Reinforcement Learning |
title_full |
6-DOF Manipulator Path Planning Based on Reinforcement Learning |
title_fullStr |
6-DOF Manipulator Path Planning Based on Reinforcement Learning |
title_full_unstemmed |
6-DOF Manipulator Path Planning Based on Reinforcement Learning |
title_sort |
6-dof manipulator path planning based on reinforcement learning |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/bx72t5 |
work_keys_str_mv |
AT tsaihanhsien 6dofmanipulatorpathplanningbasedonreinforcementlearning AT càihànxián 6dofmanipulatorpathplanningbasedonreinforcementlearning AT tsaihanhsien qiánghuàxuéxíyīngyòngyúliùzhóujīxièshǒubìlùjìngguīhuà AT càihànxián qiánghuàxuéxíyīngyòngyúliùzhóujīxièshǒubìlùjìngguīhuà |
_version_ |
1719287444610744320 |