Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems
An Aloha-like spectrum access scheme without negotiation is considered for multiuser and multichannel cognitive radio systems. To avoid collisions incurred by the lack of coordination, each secondary user learns how to select channels according to its experience. Multiagent reinforcement leaning (MA...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2010-01-01
|
Series: | EURASIP Journal on Wireless Communications and Networking |
Online Access: | http://dx.doi.org/10.1155/2010/876216 |
id |
doaj-bb427ed4ecfa4658a2962fdf48141481 |
---|---|
record_format |
Article |
spelling |
doaj-bb427ed4ecfa4658a2962fdf481414812020-11-24T21:15:20ZengSpringerOpenEURASIP Journal on Wireless Communications and Networking1687-14721687-14992010-01-01201010.1155/2010/876216Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio SystemsHusheng LiAn Aloha-like spectrum access scheme without negotiation is considered for multiuser and multichannel cognitive radio systems. To avoid collisions incurred by the lack of coordination, each secondary user learns how to select channels according to its experience. Multiagent reinforcement leaning (MARL) is applied for the secondary users to learn good strategies of channel selection. Specifically, the framework of Q-learning is extended from single user case to multiagent case by considering other secondary users as a part of the environment. The dynamics of the Q-learning are illustrated using a Metrick-Polak plot, which shows the traces of Q-values in the two-user case. For both complete and partial observation cases, rigorous proofs of the convergence of multiagent Q-learning without communications, under certain conditions, are provided using the Robins-Monro algorithm and contraction mapping, respectively. The learning performance (speed and gain in utility) is evaluated by numerical simulations. http://dx.doi.org/10.1155/2010/876216 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Husheng Li |
spellingShingle |
Husheng Li Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems EURASIP Journal on Wireless Communications and Networking |
author_facet |
Husheng Li |
author_sort |
Husheng Li |
title |
Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems |
title_short |
Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems |
title_full |
Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems |
title_fullStr |
Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems |
title_full_unstemmed |
Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems |
title_sort |
multiagent q-learning for aloha-like spectrum access in cognitive radio systems |
publisher |
SpringerOpen |
series |
EURASIP Journal on Wireless Communications and Networking |
issn |
1687-1472 1687-1499 |
publishDate |
2010-01-01 |
description |
An Aloha-like spectrum access scheme without negotiation is considered for multiuser and multichannel cognitive radio systems. To avoid collisions incurred by the lack of coordination, each secondary user learns how to select channels according to its experience. Multiagent reinforcement leaning (MARL) is applied for the secondary users to learn good strategies of channel selection. Specifically, the framework of Q-learning is extended from single user case to multiagent case by considering other secondary users as a part of the environment. The dynamics of the Q-learning are illustrated using a Metrick-Polak plot, which shows the traces of Q-values in the two-user case. For both complete and partial observation cases, rigorous proofs of the convergence of multiagent Q-learning without communications, under certain conditions, are provided using the Robins-Monro algorithm and contraction mapping, respectively. The learning performance (speed and gain in utility) is evaluated by numerical simulations. |
url |
http://dx.doi.org/10.1155/2010/876216 |
work_keys_str_mv |
AT hushengli multiagentqlearningforalohalikespectrumaccessincognitiveradiosystems |
_version_ |
1716745693818781696 |