Safe Reinforcement Learning based Sequential Perturbation Learning Algorithm
碩士 === 國立交通大學 === 電機與控制工程系所 === 97 === This article is about sequential perturbation learning architecture through safe reinforcement learning (SRL-SP) which based on the concept of linear search to apply perturbations on each weight value of the neural network. The evaluation of value of function b...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2009
|
Online Access: | http://ndltd.ncl.edu.tw/handle/63234750154932788712 |