Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems

An Aloha-like spectrum access scheme without negotiation is considered for multiuser and multichannel cognitive radio systems. To avoid collisions incurred by the lack of coordination, each secondary user learns how to select channels according to its experience. Multiagent reinforcement leaning (MA...

Full description

Bibliographic Details
Main Author: Husheng Li
Format: Article
Language:English
Published: SpringerOpen 2010-01-01
Series:EURASIP Journal on Wireless Communications and Networking
Online Access:http://dx.doi.org/10.1155/2010/876216
id doaj-bb427ed4ecfa4658a2962fdf48141481
record_format Article
spelling doaj-bb427ed4ecfa4658a2962fdf481414812020-11-24T21:15:20ZengSpringerOpenEURASIP Journal on Wireless Communications and Networking1687-14721687-14992010-01-01201010.1155/2010/876216Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio SystemsHusheng LiAn Aloha-like spectrum access scheme without negotiation is considered for multiuser and multichannel cognitive radio systems. To avoid collisions incurred by the lack of coordination, each secondary user learns how to select channels according to its experience. Multiagent reinforcement leaning (MARL) is applied for the secondary users to learn good strategies of channel selection. Specifically, the framework of Q-learning is extended from single user case to multiagent case by considering other secondary users as a part of the environment. The dynamics of the Q-learning are illustrated using a Metrick-Polak plot, which shows the traces of Q-values in the two-user case. For both complete and partial observation cases, rigorous proofs of the convergence of multiagent Q-learning without communications, under certain conditions, are provided using the Robins-Monro algorithm and contraction mapping, respectively. The learning performance (speed and gain in utility) is evaluated by numerical simulations. http://dx.doi.org/10.1155/2010/876216
collection DOAJ
language English
format Article
sources DOAJ
author Husheng Li
spellingShingle Husheng Li
Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems
EURASIP Journal on Wireless Communications and Networking
author_facet Husheng Li
author_sort Husheng Li
title Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems
title_short Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems
title_full Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems
title_fullStr Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems
title_full_unstemmed Multiagent Q-Learning for Aloha-Like Spectrum Access in Cognitive Radio Systems
title_sort multiagent q-learning for aloha-like spectrum access in cognitive radio systems
publisher SpringerOpen
series EURASIP Journal on Wireless Communications and Networking
issn 1687-1472
1687-1499
publishDate 2010-01-01
description An Aloha-like spectrum access scheme without negotiation is considered for multiuser and multichannel cognitive radio systems. To avoid collisions incurred by the lack of coordination, each secondary user learns how to select channels according to its experience. Multiagent reinforcement leaning (MARL) is applied for the secondary users to learn good strategies of channel selection. Specifically, the framework of Q-learning is extended from single user case to multiagent case by considering other secondary users as a part of the environment. The dynamics of the Q-learning are illustrated using a Metrick-Polak plot, which shows the traces of Q-values in the two-user case. For both complete and partial observation cases, rigorous proofs of the convergence of multiagent Q-learning without communications, under certain conditions, are provided using the Robins-Monro algorithm and contraction mapping, respectively. The learning performance (speed and gain in utility) is evaluated by numerical simulations.
url http://dx.doi.org/10.1155/2010/876216
work_keys_str_mv AT hushengli multiagentqlearningforalohalikespectrumaccessincognitiveradiosystems
_version_ 1716745693818781696