bullet Sensors & Transducers Journal

    (ISSN: 2306-8515, e-ISSN 1726-5479)


2008 e-Impact Factor

25 Top Downloaded Articles

Best Selling Articles 2012

Journal Subscription 2014

Editorial Calendar

Submit an Article

Editorial Board

Current Issue

S&T journal's cover

Sensors & Transducers Journal 2011

Sensors & Transducers Journal 2010

Sensors & Transducers Journal 2009

Sensors & Transducers Journal 2008

Sensors & Transducers Journal 2007

2000-2002 S&T e-Digest Contents

2003 S&T e-Digest Contents

2004 S&T e-Digest Contents

2005 S&T e-Digest Contents

2006 S&T e-Digest Contents


Best Articles 2011




Vol. 164, Issue 2, February 2014, pp. 170-175




Multiagent Reinforcement Learning Dynamic Spectrum Access in Cognitive Radios

1 Wu Chun, 2 Yin Mingyong, 2 Ma Shaoliang, 1 Jiang Hong

1 School of National Defense Technology, Southwest University of Science and Technology, Mianyang 621000, Sichuan, China
2 Institute of Computer Application, China Academy of Engineering Physics, Mianyang 621900, Sichuan, China
1 Tel.: 86-816089890, fax: 86-816089890

1 E-mail: soldier_wu@163.com


Received: 28 November 2013 /Accepted: 28 January 2014 /Published: 28 February 2014

Digital Sensors and Sensor Sysstems


Abstract: A multiuser independent Q-learning method which does not need information interaction is proposed for multiuser dynamic spectrum accessing in cognitive radios. The method adopts self-learning paradigm, in which each CR user performs reinforcement learning only through observing individual performance reward without spending communication resource on information interaction with others. The reward is defined suitably to present channel quality and channel conflict status. The learning strategy of sufficient exploration, preference for good channel, and punishment for channel conflict is designed to implement multiuser dynamic spectrum accessing. In two users two channels scenario, a fast learning algorithm is proposed and the convergence to maximal whole reward is proved. The simulation results show that, with the proposed method, the CR system can obtain convergence of Nash equilibrium with large probability and achieve great performance of whole reward.


Keywords: Cognitive radios, Multiagent reinforcement learning, Q-learning, Dynamic spectrum access.


Acrobat reader logo Click <here> or title of paper to download the full pages article in pdf format



Subscribe the full-page Sensors & Transducers journal in print (paper) or pdf formats

(shipping cost by standard mail for paper version is included)

(25 % discount for IFSA Members)




Alternatively we accept a money transfer to our bank account. Please contact for details: sales@sensorsportal.com



Download <here> the Library Journal Recommendation Form



Read more about Wireless Sensor Networks






1999 - 2014 Copyright , International Frequency Sensor Association (IFSA) Publishing, S.L. All Rights Reserved.

Home - News - Links - Archives - Tools - Voltage-to-Frequency Converters - Standardization - Patents - Marketplace - Projects - Wish List - e-Shop - Sensor Jobs - Membership - Videos - Publishing - Site Map - Subscribe - Search

 Members Area -Sensors Portal -Training Courses - S&T Digest - For advertisers - Bookstore - Forums - Polls - Submit Press Release - Submit White Paper - Testimonies - Twitter - Facebook - LinkedIn