Path planning of wafer probe by reinforcement learning
碩士 === 國立中山大學 === 電機工程學系研究所 === 107 === At present, industries generally conduct wafer probing with the use of automated test equipment, and switches to manual processing when testing on odd dice. This way of testing reduces the efficiency of the test equipment. This thesis uses the method of reinfo...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/8g6s64 |
id |
ndltd-TW-107NSYS5442031 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107NSYS54420312019-05-16T01:40:51Z http://ndltd.ncl.edu.tw/handle/8g6s64 Path planning of wafer probe by reinforcement learning 以加強式學習實現晶圓測試之路徑規劃 Yu-Syuan Weng 翁雨玄 碩士 國立中山大學 電機工程學系研究所 107 At present, industries generally conduct wafer probing with the use of automated test equipment, and switches to manual processing when testing on odd dice. This way of testing reduces the efficiency of the test equipment. This thesis uses the method of reinforcement learning that considers the wafer as the environment and moves the testing probe by an agent. By training the agent, it is capable of testing all the dice on the wafer with the least number of actions. When the number of dice increases, however, the agent will face the problem of the state space becoming massive. In order to mitigate this problem, this thesis proposes a method called “spotlight”. This method changes the state of the environment from the testing situation of all dice to only the ones surrounding the probe in order to reduce the impact caused by the increase of the number of dice. This thesis uses the convolutional neural network to deal with the state of the environment, and the Asynchronous advantage actor-critic algorithm to train the agent to move the probe and spotlight. The agent is first trained in a smaller environment. The neural network will load the network parameters previously trained when training in a bigger environment later on. This will provide it with the experience learned in a smaller environment. In the empty environment testing, it shows that the method of spotlight can be applied to several kinds of probes, and multiple sizes of the environment. In the odd environment testing, it also shows that the method of spotlight can be applied to the odd environment. Kao-Shing Hwang 黃國勝 2019 學位論文 ; thesis 67 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中山大學 === 電機工程學系研究所 === 107 === At present, industries generally conduct wafer probing with the use of automated test equipment, and switches to manual processing when testing on odd dice. This way of testing reduces the efficiency of the test equipment. This thesis uses the method of reinforcement learning that considers the wafer as the environment and moves the testing probe by an agent. By training the agent, it is capable of testing all the dice on the wafer with the least number of actions. When the number of dice increases, however, the agent will face the problem of the state space becoming massive. In order to mitigate this problem, this thesis proposes a method called “spotlight”. This method changes the state of the environment from the testing situation of all dice to only the ones surrounding the probe in order to reduce the impact caused by the increase of the number of dice. This thesis uses the convolutional neural network to deal with the state of the environment, and the Asynchronous advantage actor-critic algorithm to train the agent to move the probe and spotlight. The agent is first trained in a smaller environment. The neural network will load the network parameters previously trained when training in a bigger environment later on. This will provide it with the experience learned in a smaller environment. In the empty environment testing, it shows that the method of spotlight can be applied to several kinds of probes, and multiple sizes of the environment. In the odd environment testing, it also shows that the method of spotlight can be applied to the odd environment.
|
author2 |
Kao-Shing Hwang |
author_facet |
Kao-Shing Hwang Yu-Syuan Weng 翁雨玄 |
author |
Yu-Syuan Weng 翁雨玄 |
spellingShingle |
Yu-Syuan Weng 翁雨玄 Path planning of wafer probe by reinforcement learning |
author_sort |
Yu-Syuan Weng |
title |
Path planning of wafer probe by reinforcement learning |
title_short |
Path planning of wafer probe by reinforcement learning |
title_full |
Path planning of wafer probe by reinforcement learning |
title_fullStr |
Path planning of wafer probe by reinforcement learning |
title_full_unstemmed |
Path planning of wafer probe by reinforcement learning |
title_sort |
path planning of wafer probe by reinforcement learning |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/8g6s64 |
work_keys_str_mv |
AT yusyuanweng pathplanningofwaferprobebyreinforcementlearning AT wēngyǔxuán pathplanningofwaferprobebyreinforcementlearning AT yusyuanweng yǐjiāqiángshìxuéxíshíxiànjīngyuáncèshìzhīlùjìngguīhuà AT wēngyǔxuán yǐjiāqiángshìxuéxíshíxiànjīngyuáncèshìzhīlùjìngguīhuà |
_version_ |
1719178976134430720 |