An Online Control Approach for Forging Machine Using Reinforcement Learning and Taboo Search
It is noticed that offline-training and online-implementation method is dominant in the data-driven control. However, the inconsistence existing in offline data and online data may degrade the control performance. To address the aforementioned issue, an online control strategy is developed so that t...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9181549/ |
id |
doaj-db2ae3af09384019bb4c50c2bcf11970 |
---|---|
record_format |
Article |
spelling |
doaj-db2ae3af09384019bb4c50c2bcf119702021-03-30T03:42:57ZengIEEEIEEE Access2169-35362020-01-01815866615867810.1109/ACCESS.2020.30205509181549An Online Control Approach for Forging Machine Using Reinforcement Learning and Taboo SearchDapeng Zhang0https://orcid.org/0000-0002-9657-0088Zhiwei Gao1https://orcid.org/0000-0001-5464-3288Zhiling Lin2School of Electrical and Information Engineering, Tianjin University, Tianjin, ChinaFaculty of Engineering and Environment, University of Northumbria, Newcastle upon Tyne, U.K.School of Electrical Engineering, Tianjin University of Technology, Tianjin, ChinaIt is noticed that offline-training and online-implementation method is dominant in the data-driven control. However, the inconsistence existing in offline data and online data may degrade the control performance. To address the aforementioned issue, an online control strategy is developed so that the control parameters can be updated online based on the real-time data measured to ensure satisfactory control performance in this study. Specifically, an online control algorithm is addressed to control the pressing-down speed of the forging machine based on the framework of the reinforcement learning that has a capability of building a complete mapping from state space to action space only according to the neighbour samples. Rather than using the way of trials and errors which is too slow to be online implementation, a taboo search is addressed to speed up the learning-working process by directly searching the control on the current states, followed by the stability conditions, derived from Lyapunov stability theory. A coarse model that is limited to get the cost information of the reinforcement learning is used to make the best of mechanism information, which prevents the occurrence of the invalid states that do not conform to system characteristics. The effectiveness of the algorithm is demonstrated by an ultra-low forging machine, which outperforms the conventional approaches such as PID and neural network control approaches. The proposed algorithm has advantages in parameter adjustments so that it is easier to implement in a practical system.https://ieeexplore.ieee.org/document/9181549/Online controlreinforcement learningtaboo searchforging machine |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Dapeng Zhang Zhiwei Gao Zhiling Lin |
spellingShingle |
Dapeng Zhang Zhiwei Gao Zhiling Lin An Online Control Approach for Forging Machine Using Reinforcement Learning and Taboo Search IEEE Access Online control reinforcement learning taboo search forging machine |
author_facet |
Dapeng Zhang Zhiwei Gao Zhiling Lin |
author_sort |
Dapeng Zhang |
title |
An Online Control Approach for Forging Machine Using Reinforcement Learning and Taboo Search |
title_short |
An Online Control Approach for Forging Machine Using Reinforcement Learning and Taboo Search |
title_full |
An Online Control Approach for Forging Machine Using Reinforcement Learning and Taboo Search |
title_fullStr |
An Online Control Approach for Forging Machine Using Reinforcement Learning and Taboo Search |
title_full_unstemmed |
An Online Control Approach for Forging Machine Using Reinforcement Learning and Taboo Search |
title_sort |
online control approach for forging machine using reinforcement learning and taboo search |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
It is noticed that offline-training and online-implementation method is dominant in the data-driven control. However, the inconsistence existing in offline data and online data may degrade the control performance. To address the aforementioned issue, an online control strategy is developed so that the control parameters can be updated online based on the real-time data measured to ensure satisfactory control performance in this study. Specifically, an online control algorithm is addressed to control the pressing-down speed of the forging machine based on the framework of the reinforcement learning that has a capability of building a complete mapping from state space to action space only according to the neighbour samples. Rather than using the way of trials and errors which is too slow to be online implementation, a taboo search is addressed to speed up the learning-working process by directly searching the control on the current states, followed by the stability conditions, derived from Lyapunov stability theory. A coarse model that is limited to get the cost information of the reinforcement learning is used to make the best of mechanism information, which prevents the occurrence of the invalid states that do not conform to system characteristics. The effectiveness of the algorithm is demonstrated by an ultra-low forging machine, which outperforms the conventional approaches such as PID and neural network control approaches. The proposed algorithm has advantages in parameter adjustments so that it is easier to implement in a practical system. |
topic |
Online control reinforcement learning taboo search forging machine |
url |
https://ieeexplore.ieee.org/document/9181549/ |
work_keys_str_mv |
AT dapengzhang anonlinecontrolapproachforforgingmachineusingreinforcementlearningandtaboosearch AT zhiweigao anonlinecontrolapproachforforgingmachineusingreinforcementlearningandtaboosearch AT zhilinglin anonlinecontrolapproachforforgingmachineusingreinforcementlearningandtaboosearch AT dapengzhang onlinecontrolapproachforforgingmachineusingreinforcementlearningandtaboosearch AT zhiweigao onlinecontrolapproachforforgingmachineusingreinforcementlearningandtaboosearch AT zhilinglin onlinecontrolapproachforforgingmachineusingreinforcementlearningandtaboosearch |
_version_ |
1724183007966789632 |