A Novel Cognitive Anti-jamming Stochastic Game

Similar documents
Multi-agent Reinforcement Learning Based Cognitive Anti-jamming

MACHINE LEARNING AIDED EFFICIENT AND ROBUST ALGORITHMS FOR SPECTRUM KNOWLEDGE ACQUISITION IN WIDEBAND AUTONOMOUS COGNITIVE RADIOS

Fast Online Learning of Antijamming and Jamming Strategies

Software Defined Radio: Enabling technologies and Applications

COGNITIVE Radio (CR) [1] has been widely studied. Tradeoff between Spoofing and Jamming a Cognitive Radio

Algorithm and Experimentation of Frequency Hopping, Band Hopping, and Transmission Band Selection Using a Cognitive Radio Test Bed

Cognitive Radio: Brain-Empowered Wireless Communcations

LOCALIZATION AND ROUTING AGAINST JAMMERS IN WIRELESS NETWORKS

Cooperative Wireless Networking Using Software Defined Radio

Jamming mitigation in cognitive radio networks using a modified Q-learning algorithm

ETSI Standards and the Measurement of RF Conducted Output Power of Wi-Fi ac Signals

A review paper on Software Defined Radio

Using the Time Dimension to Sense Signals with Partial Spectral Overlap. Mihir Laghate and Danijela Cabric 5 th December 2016

Analysis of cognitive radio networks with imperfect sensing

CHAPTER 10 CONCLUSIONS AND FUTURE WORK 10.1 Conclusions

/13/$ IEEE

GNSS Jamming: A Short Survival Guide

ABSTRACT 1. INTRODUCTION

Digital Modulation Recognition Based on Feature, Spectrum and Phase Analysis and its Testing with Disturbed Signals

COGNITIVE RADIO AND DYNAMIC SPECTRUM SHARING

Address: 9110 Judicial Dr., Apt. 8308, San Diego, CA Phone: (240) URL:

Overview: Trends and Implementation Challenges for Multi-Band/Wideband Communication

COGNITIVE RADIO TECHNOLOGY. Chenyuan Wang Instructor: Dr. Lin Cai November 30, 2009

Adaptive Multi-Coset Sampler

Spectrum accessing optimization in congestion times in radio cognitive networks based on chaotic neural networks

Multiband NFC for High-Throughput Wireless Computer Vision Sensor Network

COGNITIVE RADIO. Priyesh V.P.

Q. No. BT Level. Question. Domain

T software-defined radio (SDR) receivers

Environment-Adaptable Efficient Optimization for Programming of Reconfigurable Radio Frequency (RF) Receivers

Power Control Optimization of Code Division Multiple Access (CDMA) Systems Using the Knowledge of Battery Capacity Of the Mobile.

Overcoming Interference is Critical to Success in a Wireless IoT World

Cognitive Radio for Future Internet Survey on CR Testbed & Product

Cognitive Radio Systems: A Network Technology Assessment

Intelligent Adaptation And Cognitive Networking

Cognitive Radio: Fundamentals and Opportunities

Digitally Enhanced Inter-modulation Distortion Compensation in Wideband Spectrum Sensing. Han Yan and Prof. Danijela Cabric Nov.

PAR4CR: THE DEVELOPMENT OF A NEW SDR-BASED PLATFORM TOWARDS COGNITIVE RADIO

Testing and Measurement of Cognitive Radio and Software Defined Radio Systems

Cognitive multi-mode and multi-standard base stations: architecture and system analysis

BY INJECTING faked or replayed signals, a jammer aims

Learning via Delayed Knowledge A Case of Jamming. SaiDhiraj Amuru and R. Michael Buehrer

Improving Reinforcement Learning Algorithms for Dynamic Spectrum Allocation in Cognitive Sensor Networks

Sequential Multi-Channel Access Game in Distributed Cognitive Radio Networks

GAJET, a DRDC Evaluation Testbed for Navigation Electronic Warfare. Michel Clénet

OFDM and MC-CDMA A Primer

Wireless Systems Laboratory Stanford University Pontifical Catholic University Rio de Janiero Oct. 13, 2011

Cognitive Radio Techniques

Sang-Tae Kim, Seong-Yun Lee. Radio Technology Research Department

IMPROVED PROBABILITY OF DETECTION AT LOW SNR IN COGNITIVE RADIOS

The Framework of the Integrated Power Line and Visible Light Communication Systems

Power Optimization in a Non-Coordinated Secondary Infrastructure in a Heterogeneous Cognitive Radio Network

Cognitive Ultra Wideband Radio

Distributed Power Control in Cellular and Wireless Networks - A Comparative Study

Design & Implementation of an Adaptive Delta Sigma Modulator

Cell Selection Using Distributed Q-Learning in Heterogeneous Networks

Cognitive Radio: Smart Use of Radio Spectrum

A Novel Design In Digital Communication Using Software Defined Radio

Population Adaptation for Genetic Algorithm-based Cognitive Radios

Resource Allocation in a Cognitive Digital Home

UNEQUAL POWER ALLOCATION FOR JPEG TRANSMISSION OVER MIMO SYSTEMS. Muhammad F. Sabir, Robert W. Heath Jr. and Alan C. Bovik

Downlink Erlang Capacity of Cellular OFDMA

Wideband Autonomous Cognitive Radios: Spectrum Awareness and PHY/MAC Decision Making

VITA 49 VITA Radio Transport (VRT) A Spectrum Language for Software Defined Radios

Wideband Direct Digital Radio Modeling and Verification Rulon VanDyke 1, David Leiss 2

Spectrum Detector for Cognitive Radios. Andrew Tolboe

Adaptive Threshold for Energy Detector Based on Discrete Wavelet Packet Transform

Analysis of Interference from Secondary System in TV White Space

ni.com Redefining RF and Microwave Instruments

DESIGN AND IMPLEMENTATION OF AN ALGORITHM FOR MODULATION IDENTIFICATION OF ANALOG AND DIGITAL SIGNALS

Distributed Game Theoretic Optimization Of Frequency Selective Interference Channels: A Cross Layer Approach

3 USRP2 Hardware Implementation

UWB Radio: Issues and Opportunities Jim Lansford, Ph.D. CTO, Alereon

A Multicarrier CDMA Based Low Probability of Intercept Network

UNDERSTANDING AND MITIGATING

Review on Design and Implementation of DSSS-CDMA Transmitter using HDL with Raised Cosine Filter to Minimize ISI

ANTI-JAMMING PERFORMANCE OF COGNITIVE RADIO NETWORKS. Xiaohua Li and Wednel Cadeau

Researches in Broadband Single Carrier Multiple Access Techniques

Programmable Wireless Networking Overview

WAVELET AND S-TRANSFORM BASED SPECTRUM SENSING IN COGNITIVE RADIO

Ultra Wideband Signals and Systems in Communication Engineering

PhyCloak: Obfuscating Sensing from Communication Signals

Accident prevention and detection using internet of Things (IOT)

Outline for this presentation. Introduction I -- background. Introduction I Background

MASTER THESIS PROJECT PROPOSALS: SIGNAL PROCESSING FOR WIRELESS AND SATELLITE COMMUNICATIONS

Networking Devices over White Spaces

Adaptive Modulation with Customised Core Processor

INTRODUCTION TO SOFTWARE RADIO CONCEPTS

Cognitive Radios Games: Overview and Perspectives

Optimized BPSK and QAM Techniques for OFDM Systems

Wi-Fi Localization and its

Urban WiMAX response to Ofcom s Spectrum Commons Classes for licence exemption consultation

Acquisition and Tracking of IRNSS Receiver on MATLAB and Xilinx

MULTIBAND PATCH ANTENNA FOR WIRELESS COMMUNICATION SYSTEM

Control issues in cognitive networks. Marko Höyhtyä and Tao Chen CWC-VTT-Gigaseminar 4th December 2008

Addressing the Design-to-Test Challenges for SDR and Cognitive Radio

Avoid Impact of Jamming Using Multipath Routing Based on Wireless Mesh Networks

Jager UAVs to Locate GPS Interference

Demonstration of Real-time Spectrum Sensing for Cognitive Radio

A Novel SINR Estimation Scheme for WCDMA Receivers

Transcription:

A Novel Cognitive Anti-jamming Stochastic Game Mohamed Aref and Sudharman K. Jayaweera Communication and Information Sciences Laboratory (CISL) ECE, University of New Mexico, Albuquerque, NM and Bluecom Systems & Consulting LLC, Albuquerque, NM 1

Outline 1 Introduction 2 Problem formulation 3 System model 4 Q-learning-aided cognitive anti-jamming algorithm 5 Proposed anti-jamming stochastic game 6 Simulation results 2

Introduction Cognitive radio as an evolution of software-defined radio (SDR) Software controllable analog frontend ADC DAC Software based digital radio platform (SDR) Cognitive Engine A cognitive radio is a multiband, multimode, wideband software-defined radio (SDR) with autonomous decision-making and learning abilities that can optimally reconfigure its operation mode in response to its surrounding RF environment and user needs. 3

dbm Introduction Wideband Autonomous Cognitive Radios (WACR) Senses a wide frequency range. Comprehend its operating RF environment. Autonomous operation. Learn communication protocols and policies. WiFi GPS Mobile Wideband Spectrum Instantaneous BW Bluetooth Freq. 4

Dynamic spectrum sharing (DSS) power Frequency Introduction Vehicular networks Holes Space Time Cognitive Radio Applications Military Health care Smart grid Source: http://mil-embedded.com/articles/evolving- technology-sdr-cognitive-radio/ 5

Basic Cognitive Radio Functions RF Environment Spectrum Knowledge Acquisition Decisions Making Learning Source: S. K. Jayaweera, Signal Processing for Cognitive Radio, John Wiley & Sons, Hoboken, NJ, USA. 6

Wideband Spectrum Knowledge Acquisition Spectrum Knowledge Acquisition Wideband spectrum scanning Spectral activity detection Wideband Signal classification Instantaneous BW Hardware constraints limit the instantaneous sensing bandwidth of most state-of-theart software-defined radio (SDR) platforms to about 100MHz. There is a need to design an efficient scheme to achieve realtime sensing over a wide spectrum range. 7 Frequency

Power spectrum density Wideband Spectrum Knowledge Acquisition Spectrum Knowledge Acquisition Wideband spectrum scanning Spectral activity detection Signal classification Detection can be done by defining a threshold (simple). Any power spectral activity above this threshold is considered as an active signal. Threshold Frequency 8

Wideband Spectrum Knowledge Acquisition Spectrum Knowledge Acquisition Wideband spectrum scanning Spectral activity detection Signal classification Detected signals may belong to different radio systems. Classification Wi-Fi Bluetooth Mobile Others 9

Power spectrum Problem formulation Deliberate radio jammers and unintentional interference can disrupt communication systems. In both commercial and military systems WACR1 signal Jammer signal Frequency 10

Power spectrum Problem formulation In practice, this will result in a complicated multi-agent environment. WACR1 signal WACR2 signal WACR3 signal Jammer signal Frequency Goal: find optimal anti-jamming and interference avoidance policies for the WACRs that switches transmission before getting jammed. 11

Spectrum is divided in to N b sub-bands. Wideband spectrum of interest System model Sub-band Frequency Sub-band dynamics: Single sub-band has 2 Markov states: available/not-available. If the sub-band is jammed or faces interference, it is considered to be in state 0 (not-available). Otherwise, it is considered to be in state 1 (available). The set of sub-band states can be denoted by.. i P0,0 0 Not- Available i P0,1 i P1,0 1 Available i P1,1 12

System model Proposed cognitive framework Sensing Transmission Track the jammed subbands Determine when and where to switch the operating sub-band Each operation will have its own learning algorithm with different targets, but they both will experience the same RF environment. Essentially, if the sensing operation were to learn an optimal policy, the WACR would be able to accurately predict the jammed/interfered sub-bands. This will help the transmission operation as follows: if the current operating sub-band is predicted to be jammed during the next time instant by the sensing policy, the WACR will switch to another sub-band thereby avoiding the possibility of getting jammed. 13

System model For the game state, we choose a simple definition for both sensing and transmission operations, where and represent the index of selected sub-bands for sensing and transmission, respectively, at time n. Thus, the state space is given by. At any time instant, the state of operating sub-bands for both sensing and transmission (the value of for sub-band index ) has to be identified. During sensing operation: the WARC will perform spectral activity detection (spectrum sensing) to detect any active signals in the sensed sub-band and hence identify whether the sub-band is available or not. During transmission operation: the communications link quality will determine if transmission over the current operating sub-band is acceptable. After determining the states of both operating sub-bands, the WACR will select and execute actions for both operations. We define actions and as the indices of the selected new operating sub-bands for sensing and transmission, respectively, at time n. The action space can thus be defined as. 14

Q-learning-aided Cognitive Anti-jamming Learning parameters and Q-table initialization. 15

Q-learning-aided Cognitive Anti-jamming Algorithm Identify the state of the current operating subband. If the sub-band state is 1 (available), no further action is required. 16

Q-learning-aided Cognitive Anti-jamming Algorithm If the sub-band state is 0 (not-available), the WACR updates the Q- table based on a certain observed reward (r). 17

Q-learning-aided Cognitive Anti-jamming Algorithm Once the Q-table is updated, the WACR selects a new action a representing the new operating sub-band. 18

Proposed Anti-jamming Stochastic Game 19

Proposed Anti-jamming Stochastic Game 20

Performance metric: Normalized accumulated reward Simulation results Jammer model: Learning parameters: :immediate non-negative reward for transmission operation at time n N: number of iterations Sweeps the spectrum of interest from the lower to the higher frequency. ϒ=0.8 ϵ=0.9, α =0.4 ϵ=0.01, α =0.1 Before Q-table convergence After Q-table convergence 21

Simulation results Experiment 1: 1 WACR and 5 Sub-bands 22

Simulation results Experiment 2: 2 WACRs and 6 Sub-bands 23

Simulation results Experiment 3: 4 WACRs and 16 Sub-bands 24

Simulation results 25

Simulation results 26

Conclusions Proposed a novel cognitive anti-jamming stochastic game based on Q-learning for WACRs to avoid a dynamic jammer signal as well as unintentional interference from other WACRs. Developed new definitions for state, actions and rewards that enable the WACR to switch its operating sub-band before getting jammed, compared to previously proposed anti-jamming techniques in literature that switch the operating sub-band only after getting jammed. The cognitive framework is divided into two operations: sensing and transmission. Each is helped by its own learning algorithm based on Q-learning. The objective of the sensing operation is to track the jammed sub-bands. On the other hand, the transmission operation determines when and where to switch the operating sub-band. The key difference from the previous work is that the radio will switch the sub-band before getting jammed. This can be especially useful against a smart jammer since it will prevent the jammer from learning the radio s behavior. Simulation results showed that the proposed cognitive protocol has a very low probability of getting jammed and acceptable value for accumulated reward. 27

28

References 1. M. A. Aref, S. K. Jayaweera and S. Machuzak, Multi-agent Reinforcement Learning Based Cognitive Antijamming, IEEE Wireless Communications and Networking Conference (WCNC 17), San Francisco, CA, Mar. 2017. 2. H. M. Schwartz, Multi-Agent Machine Learning: A Reinforcement Approach, John Wiley & Sons, ISBN: 978-1-118-36208-2, 2014. 3. B. Wang, Y. Wu, K. Liu, and T. Clancy, An anti-jamming stochastic game for cognitive radio networks, IEEE Journal on Selected Areas in Communications, vol. 29, no. 4, Apr. 2011. 4. Y. Gwon, S. Dastangoo, C. Fossa, and H. T. Kung, Competing mobile network game: Embracing antijamming and jamming strategies with reinforcement learning, IEEE Conference in Communications and Network Security (CNS 13), National Harbor, MD, Oct. 2013. 5. M. Bowling and M. Veloso, Rational and Convergent Learning in Stochastic Games, 17th international joint conference on Artificial intelligence (IJCAI 01), Seattle, WA, Aug. 2001. 6. S. K. Jayaweera, Signal Processing for Cognitive Radio, John Wiley & Sons, ISBN: 978-1-118-82493-1, 2014. 7. R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, MIT Press, 1998. 29