Path planning of wafer probe by reinforcement learning

碩士 === 國立中山大學 === 電機工程學系研究所 === 107 === At present, industries generally conduct wafer probing with the use of automated test equipment, and switches to manual processing when testing on odd dice. This way of testing reduces the efficiency of the test equipment. This thesis uses the method of reinfo...

Full description

Bibliographic Details
Main Authors:	Yu-Syuan Weng, 翁雨玄
Other Authors:	Kao-Shing Hwang
Format:	Others
Language:	zh-TW
Published:	2019
Online Access:	http://ndltd.ncl.edu.tw/handle/8g6s64

id	ndltd-TW-107NSYS5442031
record_format	oai_dc
spelling	ndltd-TW-107NSYS54420312019-05-16T01:40:51Z http://ndltd.ncl.edu.tw/handle/8g6s64 Path planning of wafer probe by reinforcement learning 以加強式學習實現晶圓測試之路徑規劃 Yu-Syuan Weng 翁雨玄碩士國立中山大學電機工程學系研究所 107 At present, industries generally conduct wafer probing with the use of automated test equipment, and switches to manual processing when testing on odd dice. This way of testing reduces the efficiency of the test equipment. This thesis uses the method of reinforcement learning that considers the wafer as the environment and moves the testing probe by an agent. By training the agent, it is capable of testing all the dice on the wafer with the least number of actions. When the number of dice increases, however, the agent will face the problem of the state space becoming massive. In order to mitigate this problem, this thesis proposes a method called “spotlight”. This method changes the state of the environment from the testing situation of all dice to only the ones surrounding the probe in order to reduce the impact caused by the increase of the number of dice. This thesis uses the convolutional neural network to deal with the state of the environment, and the Asynchronous advantage actor-critic algorithm to train the agent to move the probe and spotlight. The agent is first trained in a smaller environment. The neural network will load the network parameters previously trained when training in a bigger environment later on. This will provide it with the experience learned in a smaller environment. In the empty environment testing, it shows that the method of spotlight can be applied to several kinds of probes, and multiple sizes of the environment. In the odd environment testing, it also shows that the method of spotlight can be applied to the odd environment. Kao-Shing Hwang 黃國勝 2019 學位論文 ; thesis 67 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立中山大學 === 電機工程學系研究所 === 107 === At present, industries generally conduct wafer probing with the use of automated test equipment, and switches to manual processing when testing on odd dice. This way of testing reduces the efficiency of the test equipment. This thesis uses the method of reinforcement learning that considers the wafer as the environment and moves the testing probe by an agent. By training the agent, it is capable of testing all the dice on the wafer with the least number of actions. When the number of dice increases, however, the agent will face the problem of the state space becoming massive. In order to mitigate this problem, this thesis proposes a method called “spotlight”. This method changes the state of the environment from the testing situation of all dice to only the ones surrounding the probe in order to reduce the impact caused by the increase of the number of dice. This thesis uses the convolutional neural network to deal with the state of the environment, and the Asynchronous advantage actor-critic algorithm to train the agent to move the probe and spotlight. The agent is first trained in a smaller environment. The neural network will load the network parameters previously trained when training in a bigger environment later on. This will provide it with the experience learned in a smaller environment. In the empty environment testing, it shows that the method of spotlight can be applied to several kinds of probes, and multiple sizes of the environment. In the odd environment testing, it also shows that the method of spotlight can be applied to the odd environment.
author2	Kao-Shing Hwang
author_facet	Kao-Shing Hwang Yu-Syuan Weng 翁雨玄
author	Yu-Syuan Weng 翁雨玄
spellingShingle	Yu-Syuan Weng 翁雨玄 Path planning of wafer probe by reinforcement learning
author_sort	Yu-Syuan Weng
title	Path planning of wafer probe by reinforcement learning
title_short	Path planning of wafer probe by reinforcement learning
title_full	Path planning of wafer probe by reinforcement learning
title_fullStr	Path planning of wafer probe by reinforcement learning
title_full_unstemmed	Path planning of wafer probe by reinforcement learning
title_sort	path planning of wafer probe by reinforcement learning
publishDate	2019
url	http://ndltd.ncl.edu.tw/handle/8g6s64
work_keys_str_mv	AT yusyuanweng pathplanningofwaferprobebyreinforcementlearning AT wēngyǔxuán pathplanningofwaferprobebyreinforcementlearning AT yusyuanweng yǐjiāqiángshìxuéxíshíxiànjīngyuáncèshìzhīlùjìngguīhuà AT wēngyǔxuán yǐjiāqiángshìxuéxíshíxiànjīngyuáncèshìzhīlùjìngguīhuà
_version_	1719178976134430720

Path planning of wafer probe by reinforcement learning

Similar Items