Improving Reinforcement Learning Algorithms for Dynamic Spectrum Allocation in Cognitive Sensor Networks

Similar documents
Power Optimization in a Non-Coordinated Secondary Infrastructure in a Heterogeneous Cognitive Radio Network

Chapter 12. Cross-Layer Optimization for Multi- Hop Cognitive Radio Networks

MIMO-aware Cooperative Cognitive Radio Networks. Hang Liu

Cognitive Radio: Brain-Empowered Wireless Communcations

Dynamic Spectrum Access in Cognitive Radio Networks. Xiaoying Gan 09/17/2009

Beamforming and Binary Power Based Resource Allocation Strategies for Cognitive Radio Networks

Outline for this presentation. Introduction I -- background. Introduction I Background

Tips for making accurate rise / fall time measurements for radar signals

Distributed spectrum sensing in unlicensed bands using the VESNA platform. Student: Zoltan Padrah Mentor: doc. dr. Mihael Mohorčič

A Novel Cognitive Anti-jamming Stochastic Game

LTE in Unlicensed Spectrum

Efficient Method of Secondary Users Selection Using Dynamic Priority Scheduling

Journal of Asian Scientific Research DEVELOPMENT OF A COGNITIVE RADIO MODEL USING WAVELET PACKET TRANSFORM - BASED ENERGY DETECTION TECHNIQUE

Chapter 6. Agile Transmission Techniques

Optimal Power Control in Cognitive Radio Networks with Fuzzy Logic

Joint Spectrum and Power Allocation for Inter-Cell Spectrum Sharing in Cognitive Radio Networks

Optimum Power Allocation in Cooperative Networks

Harmonized Q-Learning for Radio Resource Management in LTE Based Networks

An Uplink Resource Allocation Algorithm For OFDM and FBMC Based Cognitive Radio Systems

Single channel noise reduction

Energy-Efficient Gaming on Mobile Devices using Dead Reckoning-based Power Management

Power Control Optimization of Code Division Multiple Access (CDMA) Systems Using the Knowledge of Battery Capacity Of the Mobile.

Dynamic Fair Channel Allocation for Wideband Systems

Distributed Game Theoretic Optimization Of Frequency Selective Interference Channels: A Cross Layer Approach

Overview. Cognitive Radio: Definitions. Cognitive Radio. Multidimensional Spectrum Awareness: Radio Space

Performance Analysis of Power Control and Cell Association in Heterogeneous Cellular Networks

Evaluation of spectrum opportunities in the GSM band

Achievable Transmission Capacity of Cognitive Radio Networks with Cooperative Relaying

Sensing-based Opportunistic Channel Access

Chapter 1 Introduction

Vietnam Spectrum Occupancy Measurements and Analysis for Cognitive Radio Applications

Dynamic Power Pricing using Distributed Resource Allocation for Large-Scale DSA Systems

COGNITIVE RADIO TECHNOLOGY. Chenyuan Wang Instructor: Dr. Lin Cai November 30, 2009

Adaptive Scheduling of Collaborative Sensing in Cognitive Radio Networks

A Novel SINR Estimation Scheme for WCDMA Receivers

Genetic Algorithm-Based Approach to Spectrum Allocation and Power Control with Constraints in Cognitive Radio Networks

Energy-aware Task Scheduling in Wireless Sensor Networks based on Cooperative Reinforcement Learning

DYNAMIC SPECTRUM ACCESS AND SHARING USING 5G IN COGNITIVE RADIO

Network Slicing with Mobile Edge Computing for Micro-Operator Networks in Beyond 5G

Common Control Channel Allocation in Cognitive Radio Networks through UWB Multi-hop Communications

SPECTRUM DECISION MODEL WITH PROPAGATION LOSSES

Partial overlapping channels are not damaging

Hedonic Coalition Formation for Distributed Task Allocation among Wireless Agents

PRIMARY USER BEHAVIOR ESTIMATION AND CHANNEL ASSIGNMENT FOR DYNAMIC SPECTRUM ACCESS IN ENERGY-CONSTRAINED COGNITIVE RADIO SENSOR NETWORKS

Chapter 10. User Cooperative Communications

Joint Subcarrier Pairing and Power Loading in Relay Aided Cognitive Radio Networks

Power Allocation Strategy for Cognitive Radio Terminals

Opportunistic Communications under Energy & Delay Constraints

Energy Efficient Spectrum Sensing and Accessing Scheme for Zigbee Cognitive Networks

Grey Wolf Optimized SVD based Spectrum Sensing

A Colored Petri Net Model of Simulation for Performance Evaluation for IEEE based Network

A Novel Utility Function of Power Control Game in Multi-Channel Cognitive Femtocell Network

User Guide for the Calculators Version 0.9

Candidate: Dragan Trajkov. Mentor: Dr. Jim Roberts

Adaptive Rate Transmission for Spectrum Sharing System with Quantized Channel State Information

Data-driven approach to wireless spectrum crunch

Efficient space time combination technique for unsynchronized cooperative MISO transmission

Cooperative Spectrum Sharing in Cognitive Radio Networks: A Game-Theoretic Approach

Efficient Data and Energy Transfer in IoT with a Mobile Cognitive Base Station

Spectrum Management of Cognitive Radio Using Multi-agent Reinforcement Learning

Full-Duplex Cognitive Radio: A New Design Paradigm for Enhancing Spectrum Usage

Geometric Programming and its Application in Network Resource Allocation. Presented by: Bin Wang

College of Engineering

Downlink Erlang Capacity of Cellular OFDMA

OFDM Pilot Optimization for the Communication and Localization Trade Off

Sound Parameter Estimation in a Security System

Speech Enhancement Based On Noise Reduction

Pareto Optimization for Uplink NOMA Power Control

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 15, NO. 5, MAY

Advanced Self-Interference Cancellation and Multiantenna Techniques for Full-Duplex Radios

A Weighted Least Squares Algorithm for Passive Localization in Multipath Scenarios

COGNITIVE Radio (CR) [1] has been widely studied. Tradeoff between Spoofing and Jamming a Cognitive Radio

Performance Analysis of Cognitive Radio based on Cooperative Spectrum Sensing

Ultra Dense Network: Techno- Economic Views. By Mostafa Darabi 5G Forum, ITRC July 2017

Optimal power allocation in cognitive radio based machine-to-machine network

On the Value of Coherent and Coordinated Multi-point Transmission

ENERGY EFFICIENT CHANNEL SELECTION FRAMEWORK FOR COGNITIVE RADIO WIRELESS SENSOR NETWORKS

Color of Interference and Joint Encoding and Medium Access in Large Wireless Networks

GSM FREQUENCY PLANNING

Research on Asymmetric Characteristics of Mobile Communications System Based on Electromagnetic Radiation

Resource Allocation Strategies Based on the Signal-to-Leakage-plus-Noise Ratio in LTE-A CoMP Systems

Aadptive Subcarrier Allocation for Multiple Cognitive Users over Fading Channels

Spectrum Requirements for 4G Wireless Systems

Backhaul Link Impact on the Admission Control in LTE-A Relay Deployment

Resource Allocation Challenges in Future Wireless Networks

Deep Learning for Launching and Mitigating Wireless Jamming Attacks

A CR-related concept: the Cognitive Pilot Channel (CPC)

Mobile Base Stations Placement and Energy Aware Routing in Wireless Sensor Networks

A New Analysis of the DS-CDMA Cellular Uplink Under Spatial Constraints

Cooperative Compressed Sensing for Decentralized Networks

Submission on Proposed Methodology for Engineering Licenses in Managed Spectrum Parks

Analysis of Distributed Dynamic Spectrum Access Scheme in Cognitive Radios

Energy-Aware Cognitive Radio Systems

Traffic Control for a Swarm of Robots: Avoiding Target Congestion

Wireless Network Security Spring 2012

Optimization of On-line Appointment Scheduling

Spectrum Sharing and Flexible Spectrum Use

IEEE/ACM TRANSACTIONS ON NETWORKING, VOL. 17, NO. 6, DECEMBER /$ IEEE

A Practical Resource Allocation Approach for Interference Management in LTE Uplink Transmission

Gaussian Random Field Approximation for Exclusion Zones in Cognitive Radio Networks

Transcription:

Improving Reinforcement Learning Algorithms for Dynamic Spectrum Allocation in Cognitive Sensor Networks Wireless Communications and Networking Conference Leonardo Faganello, Rafael Kunst, Cristiano Both, Lisandro Granville, Juergen Rochol `

Outline Introduction Motivation Background on Cognitive Radio System Model Spectrum Decision Algorithms Improving Reinforcement Learning Algorithms in DSA Context Q-Learning+, Q-Noise, and Q-Noise+ Performance Evaluation Evaluation Methodology Results Conclusion 2

Introduction Cognitive Radio Networks Cognitive Radio Networks have been proposed to deal with the scarcity of frequency spectrum Possible applications include spectrum sharing among wireless sensors IEEE 802.15.4 Industrial Networks Health care sensors 3

Motivation Dynamic Spectrum Allocation Algorithms State-of-the-art for spectrum allocation mainly considers: Probabilistic resource allocation algorithms Genetic algorithms These algorithms have two frequent constraints: The model of users behavior is not properly defined Channel conditions are not considered in the allocation algorithms 4

Background on Cognitive Radio System Model: Industrial Scenario Spectrum sharing between sensors and wireless-enabled devices 5

Background on Cognitive Radio Spectrum Decision Algorithm: Q-Learning Reinforcement Learning algorithm Based on rewards Reward for each channel is based on successful transmissions Each channel has a Q-Value associated Rewards obtained with Q-Learning increase or decrease the channel s Q- Value 8

Q-Learning Channel Q-Value 1 0.54 2 0.68...... N-1 0.32 N 0.14 9

Q-Learning Channel Q-Value 1 0.54 2 0.68...... N-1 0.32 N 0.14 Selects channel 2 10

Q-Learning Channel Q-Value 1 0.54 2 0.68...... N-1 0.32 N 0.14 Selects channel 2 1 2... N-1 N Epoch 11

Q-Learning Channel Q-Value 1 0.54 2 0.68...... N-1 0.32 N 0.14 Selects channel 2 1 2... N-1 N Epoch 12

Q-Learning Channel Q-Value 1 0.54 2 0.68...... N-1 0.32 N 0.14 Selects channel 2 1 2... N-1 N Epoch Reward 13

Q-Learning Channel Q-Value 1 0.54 2 0.68...... N-1 0.32 N 0.14 Selects channel 2 1 2... N-1 N Epoch Reward r t = N D t D = 0, 667 14

Q-Learning Channel Q-Value 1 0.54 1 2 2 0.68...... N-1 0.32 Selects channel 2... N-1 N N 0.14 Epoch Reward Q t+1 a t = 1 α Q t + r t α New Q-Value r t = N D t D = 0, 667 15

Q-Learning Channel Q-Value 1 0.54 2 0.68...... N-1 0.32 N 0.14 Selects channel 2 1 2... N-1 N Epoch Reward Q t+1 a t = 1 α Q t + r t α New Q-Value Q t+1 a t = 1 0, 3 0, 68 + 0, 667. 0, 3 Q t+1 a t = 0, 6761 r t = N D t D = 0, 667 16

Q-Learning Channel Q-Value 1 0.54 2 0.68...... N-1 0.32 N 0.14 Selects channel 2 Update Q-Value For channel 2 1 2... N-1 N Epoch Reward Q t+1 a t = 1 α Q t + r t α New Q-Value Q t+1 a t = 1 0, 3 0, 68 + 0, 667. 0, 3 Q t+1 a t = 0, 6761 r t = N D t D = 0, 667 17

Q-Learning Channel Q-Value 1 0.54 2 0.6761...... N-1 0.32 N 0.14 Selects channel 2 Update Q-Value For channel 2 1 2... N-1 N Epoch Reward Q t+1 a t = 1 α Q t + r t α New Q-Value Q t+1 a t = 1 0, 3 0, 68 + 0, 667. 0, 3 Q t+1 a t = 0, 6761 r t = N D t D = 0, 667 18

Improving Reinforcement Learning Algorithms in DSA Context Q-Learning+ Accurate historic behavior of the channel Lookback and Historic Weight Q-Noise Considers the channel s quality for transmission Noise weight Q-Noise+ Accurate historic behavior of the channel while considering the channel s quality for transmission Lookback, Historic Weight and Noise Weight 19

Improving Reinforcement Learning Algorithms in DSA Context Q-Learning+ Accurate historic behavior of the channel Lookback and Historic Weight Q-Noise Considers the channel s quality for transmission Noise weight Lookback Q t+1 a t = (1 α) w t i r t i a t + αr t (a t ) i=1 Q-Noise+ Accurate historic behavior of the channel while considering the channel s quality for transmission Lookback, Historic Weight and Noise Weight 20

Improving Reinforcement Learning Algorithms in DSA Context Q-Learning+ Accurate historic behavior of the channel Lookback and Historic Weight Q-Noise Considers the channel s quality for transmission Noise weight Q t+1 a t = 1 α Q t + αr t a t + (S w η) Q-Noise+ Accurate historic behavior of the channel while considering the channel s quality for transmission Lookback, Historic Weight and Noise Weight 21

Improving Reinforcement Learning Algorithms in DSA Context Q-Learning+ SINR Value η Accurate historic behavior of the channel Lookback and Historic Weight Q-Noise Considers the channel s quality for transmission SINR < 15dB 0.00 15dB SINR < 17dB 0.25 17dB SINR< 20dB 0.50 20dB SINR < 25dB 0.75 SINR 25dB 1.00 Noise weight Q t+1 a t = 1 α Q t + αr t a t + (S w η) Q-Noise+ Accurate historic behavior of the channel while considering the channel s quality for transmission Lookback, Historic Weight and Noise Weight 22

Improving Reinforcement Learning Algorithms in DSA Context Q-Learning+ Accurate historic behavior of the channel Lookback and Historic Weight Q-Noise Considers the channel s quality for transmission Noise weight Q-Noise+ Lookback Q t+1 a t = 1 α w t i r t i a t + αr t (a t ) + (S w η) i=1 Accurate historic behavior of the channel while considering the channel s quality for transmission Lookback, Historic Weight and Noise Weight 23

Performance Evaluation Evaluation Methodology Parameter Default value Learning rate (α) 0.6 Number of channels 5 Transmission attempts 100 Exploration coefficient (ε) 0.25 Epoch duration 5 Threshold between Q-Values for spectrum hand-off (β) 0.1 Lookback 3 Historic weight [0.7 0.2 0.1] SINR weight 0.7 Confidence Interval 95% 24

Results 25

Results 26

Results 27

Results 28

Results 29

Results 30

Conclusions The Q-Learning+ and Q-Noise+ are able to improve the results obtained by reinforcement learning algorithm Better performance for a few number of transmissions For a great number of transmissions, the difference decreases, but always remains better The Q-Noise and Q-Noise+ are capable to transmit with more quality (better SINR) 31

Thank you! Questions? Leonardo Roveda Faganello www.inf.ufrgs.br/~lrfaganello lrfaganello@inf.ufrgs.br Cristiano Both www.inf.ufrgs.br/~cbboth cbboth@inf.ufrgs.br IEEE WCNC - Wireless Communications and Networking Conference Shangai, China, 7-10 April 2013 `