Path planning of wafer probe by reinforcement learning

碩士 === 國立中山大學 === 電機工程學系研究所 === 107 === At present, industries generally conduct wafer probing with the use of automated test equipment, and switches to manual processing when testing on odd dice. This way of testing reduces the efficiency of the test equipment. This thesis uses the method of reinfo...

Full description

Bibliographic Details
Main Authors: Yu-Syuan Weng, 翁雨玄
Other Authors: Kao-Shing Hwang
Format: Others
Language:zh-TW
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/8g6s64
id ndltd-TW-107NSYS5442031
record_format oai_dc
spelling ndltd-TW-107NSYS54420312019-05-16T01:40:51Z http://ndltd.ncl.edu.tw/handle/8g6s64 Path planning of wafer probe by reinforcement learning 以加強式學習實現晶圓測試之路徑規劃 Yu-Syuan Weng 翁雨玄 碩士 國立中山大學 電機工程學系研究所 107 At present, industries generally conduct wafer probing with the use of automated test equipment, and switches to manual processing when testing on odd dice. This way of testing reduces the efficiency of the test equipment. This thesis uses the method of reinforcement learning that considers the wafer as the environment and moves the testing probe by an agent. By training the agent, it is capable of testing all the dice on the wafer with the least number of actions. When the number of dice increases, however, the agent will face the problem of the state space becoming massive. In order to mitigate this problem, this thesis proposes a method called “spotlight”. This method changes the state of the environment from the testing situation of all dice to only the ones surrounding the probe in order to reduce the impact caused by the increase of the number of dice. This thesis uses the convolutional neural network to deal with the state of the environment, and the Asynchronous advantage actor-critic algorithm to train the agent to move the probe and spotlight. The agent is first trained in a smaller environment. The neural network will load the network parameters previously trained when training in a bigger environment later on. This will provide it with the experience learned in a smaller environment. In the empty environment testing, it shows that the method of spotlight can be applied to several kinds of probes, and multiple sizes of the environment. In the odd environment testing, it also shows that the method of spotlight can be applied to the odd environment. Kao-Shing Hwang 黃國勝 2019 學位論文 ; thesis 67 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立中山大學 === 電機工程學系研究所 === 107 === At present, industries generally conduct wafer probing with the use of automated test equipment, and switches to manual processing when testing on odd dice. This way of testing reduces the efficiency of the test equipment. This thesis uses the method of reinforcement learning that considers the wafer as the environment and moves the testing probe by an agent. By training the agent, it is capable of testing all the dice on the wafer with the least number of actions. When the number of dice increases, however, the agent will face the problem of the state space becoming massive. In order to mitigate this problem, this thesis proposes a method called “spotlight”. This method changes the state of the environment from the testing situation of all dice to only the ones surrounding the probe in order to reduce the impact caused by the increase of the number of dice. This thesis uses the convolutional neural network to deal with the state of the environment, and the Asynchronous advantage actor-critic algorithm to train the agent to move the probe and spotlight. The agent is first trained in a smaller environment. The neural network will load the network parameters previously trained when training in a bigger environment later on. This will provide it with the experience learned in a smaller environment. In the empty environment testing, it shows that the method of spotlight can be applied to several kinds of probes, and multiple sizes of the environment. In the odd environment testing, it also shows that the method of spotlight can be applied to the odd environment.
author2 Kao-Shing Hwang
author_facet Kao-Shing Hwang
Yu-Syuan Weng
翁雨玄
author Yu-Syuan Weng
翁雨玄
spellingShingle Yu-Syuan Weng
翁雨玄
Path planning of wafer probe by reinforcement learning
author_sort Yu-Syuan Weng
title Path planning of wafer probe by reinforcement learning
title_short Path planning of wafer probe by reinforcement learning
title_full Path planning of wafer probe by reinforcement learning
title_fullStr Path planning of wafer probe by reinforcement learning
title_full_unstemmed Path planning of wafer probe by reinforcement learning
title_sort path planning of wafer probe by reinforcement learning
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/8g6s64
work_keys_str_mv AT yusyuanweng pathplanningofwaferprobebyreinforcementlearning
AT wēngyǔxuán pathplanningofwaferprobebyreinforcementlearning
AT yusyuanweng yǐjiāqiángshìxuéxíshíxiànjīngyuáncèshìzhīlùjìngguīhuà
AT wēngyǔxuán yǐjiāqiángshìxuéxíshíxiànjīngyuáncèshìzhīlùjìngguīhuà
_version_ 1719178976134430720