Resource Management in QoS-Aware Wireless Cellular Networks

Similar documents
Downlink Scheduler Optimization in High-Speed Downlink Packet Access Networks

Q-Learning Algorithms for Constrained Markov Decision Processes with Randomized Monotone Policies: Application to MIMO Transmission Control

Optimal Utility-Based Resource Allocation for OFDM Networks with Multiple Types of Traffic

Resource Allocation in Energy-constrained Cooperative Wireless Networks

Dynamic Fair Channel Allocation for Wideband Systems

Framework for Performance Analysis of Channel-aware Wireless Schedulers

Delay-Aware Fair Scheduling in Relay-Assisted High-Speed Railway Networks

Opportunistic Scheduling: Generalizations to. Include Multiple Constraints, Multiple Interfaces,

Multiuser Scheduling and Power Sharing for CDMA Packet Data Systems

Optimal Rate Control in Wireless Networks with Fading Channels

Optimal Resource Allocation in Multihop Relay-enhanced WiMAX Networks

Finite Horizon Opportunistic Multicast Beamforming

Book Title: XXXXXXXXXXXXXXXXXXXXXXXXXX. Editors

Policy Teaching. Through Reward Function Learning. Haoqi Zhang, David Parkes, and Yiling Chen

Joint Optimization of Relay Strategies and Resource Allocations in Cooperative Cellular Networks

Opportunistic Communications under Energy & Delay Constraints

Performance Analysis of Multiuser MIMO Systems with Scheduling and Antenna Selection

Delay Constrained Point to Multi-Point Scheduling in Wireless Fading Channels

Multi-user Space Time Scheduling for Wireless Systems with Multiple Antenna

ADAPTIVE RESOURCE ALLOCATION FOR WIRELESS MULTICAST MIMO-OFDM SYSTEMS

An Efficient Fixed Rate Transmission Scheme over Delay-Constrained Wireless Fading Channels

An Effective Subcarrier Allocation Algorithm for Future Wireless Communication Systems

Opportunistic scheduling policies for improved throughput guarantees in wireless networks

How user throughput depends on the traffic demand in large cellular networks

Balance Queueing and Retransmission: Latency-Optimal Massive MIMO Design

NETWORK COOPERATION FOR ENERGY SAVING IN GREEN RADIO COMMUNICATIONS. Muhammad Ismail and Weihua Zhuang IEEE Wireless Communications Oct.

Dynamic Bandwidth Allocation for Low Power Devices With Random Connectivity

Traffic-Aware Transmission Mode Selection in D2D-enabled Cellular Networks with Token System

Online Algorithms for Delay Constrained Scheduling over a Fading Channel

Low Complexity Power and Scheduling Policies for Wireless Networks to Provide Quality of Service

Fairness and Delay in Heterogeneous Half- and Full-Duplex Wireless Networks

Optimal Coded Information Network Design and Management via Improved Characterizations of the Binary Entropy Function

Antennas and Propagation. Chapter 6b: Path Models Rayleigh, Rician Fading, MIMO

Power Controlled Random Access

A Backlog-Based CSMA Mechanism to Achieve Fairness and Throughput-Optimality in Multihop Wireless Networks

Power Control and Scheduling for Guaranteeing Quality of Service in Cellular Networks

Optimal Radio Access Technology Selection Algorithm for LTE-WiFi Network

Scheduling Algorithms For Policy Driven QoS Support in HSDPA Networks

Downlink Erlang Capacity of Cellular OFDMA

Learning via Delayed Knowledge A Case of Jamming. SaiDhiraj Amuru and R. Michael Buehrer

Multicast beamforming and admission control for UMTS-LTE and e

Diversity Techniques

IEEE/ACM TRANSACTIONS ON NETWORKING, VOL. XX, NO. X, AUGUST 20XX 1

Distributed Approaches for Exploiting Multiuser Diversity in Wireless Networks

Optimal Foresighted Multi-User Wireless Video

IEEE/ACM TRANSACTIONS ON NETWORKING, VOL. 17, NO. 6, DECEMBER /$ IEEE

TRANSMISSION STRATEGIES FOR SINGLE-DESTINATION WIRELESS NETWORKS

Scheduling in WiMAX Networks

TRIAL-BASED HEURISTIC TREE SEARCH FOR FINITE HORIZON MDPS. Thomas Keller and Malte Helmert Presented by: Ryan Berryhill

Optimal Downlink Power Allocation in. Cellular Networks

Proportional Fair Scheduling for Wireless Communication with Multiple Transmit and Receive Antennas 1

Wireless communications: from simple stochastic geometry models to practice III Capacity

Service Differentiation in Multi-Rate Wireless Networks with Weighted Round-Robin Scheduling and ARQ-Based Error Control


CROSS-LAYER DESIGN OF ADMISSION CONTROL POLICIES IN CODE DIVISION MULTIPLE ACCESS COMMUNICATIONS SYSTEMS UTILIZING BEAMFORMING

A Time-scale Decomposition Approach to Optimize Wireless Packet Resource Allocation and Scheduling

ISSN: (Online) Volume 2, Issue 6, June 2014 International Journal of Advance Research in Computer Science and Management Studies

Beamforming and Binary Power Based Resource Allocation Strategies for Cognitive Radio Networks

Hype, Myths, Fundamental Limits and New Directions in Wireless Systems

Mobile Terminal Energy Management for Sustainable Multi-homing Video Transmission

Opportunistic Communication in Wireless Networks

Long Term Evolution (LTE) and 5th Generation Mobile Networks (5G) CS-539 Mobile Networks and Computing

Modeling the impact of buffering on

Cross-layer Scheduling and Resource Allocation in Wireless Communication Systems

QoS-based Dynamic Channel Allocation for GSM/GPRS Networks

A Dynamic Relay Selection Scheme for Mobile Users in Wireless Relay Networks

Capacity and Optimal Resource Allocation for Fading Broadcast Channels Part I: Ergodic Capacity

Low-Complexity OFDMA Channel Allocation With Nash Bargaining Solution Fairness

OPPORTUNISTIC SPECTRUM ACCESS IN MULTI-USER MULTI-CHANNEL COGNITIVE RADIO NETWORKS

Opportunistic Communication: From Theory to Practice

Planning and Optimization of Broadband Power Line Communications Access Networks: Analysis, Modeling and Solution

Power Minimization for Multi-Cell OFDM Networks Using Distributed Non-cooperative Game Approach

Design of a TDD Multisector TDM MAC for the WiFiRe Proposal for Rural Broadband Access

Optimal Scheduling and Power Allocation in Cooperate-to-Join Cognitive Radio Networks

A Reinforcement Learning Scheme for Adaptive Link Allocation in ATM Networks

Multiple Antennas in Wireless Communications

3644 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 57, NO. 6, JUNE 2011

Subcarrier Based Resource Allocation

New Cross-layer QoS-based Scheduling Algorithm in LTE System

Communication over a Time Correlated Channel with an Energy Harvesting Transmitter

Optimal Radio Access Technology Selection in an SDN based LTE-WiFi Network

LTE in Unlicensed Spectrum

Optimal Scheduling Policy Determination for High Speed Downlink Packet Access

Optimal Bandwidth Allocation with Dynamic Service Selection in Heterogeneous Wireless Networks

Combined Opportunistic Beamforming and Receive Antenna Selection

A Dynamic Resource Sharing Mechanism for Cloud Radio Access Networks

Transmission Scheduling for Efficient Wireless Utilization

Resource Allocation for HARQ based Mobile Ad hoc Networks

Cross-Layer Design and Analysis of Wireless Networks Using the Effective Bandwidth Function

IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 58, NO. 3, MARCH

Stability Analysis for Network Coded Multicast Cell with Opportunistic Relay

Throughput-optimal number of relays in delaybounded multi-hop ALOHA networks

Dynamic Resource Allocation for Efficient Wireless Packet Data Communcations

ABSTRACT ALGORITHMS IN WIRELESS NETWORKS WITH ANTENNA ARRAYS

IN recent years, multicasting data to mobile users (e.g.,

844 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 8, NO. 4, AUGUST 2006

Downlink Scheduling of Heterogeneous Traffic

Full-Duplex Machine-to-Machine Communication for Wireless-Powered Internet-of-Things

Spectrum Sharing with Adjacent Channel Constraints

Channel Probing in Communication Systems: Myopic Policies Are Not Always Optimal

Transcription:

Resource Management in QoS-Aware Wireless Cellular Networks Zhi Zhang Dept. of Electrical and Computer Engineering Colorado State University April 24, 2009 Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 1 / 38

Thesis Work A framework for opportunistic scheduling in multiuser OFDM systems Tradeoff between system performance and QoS/fairness for individual users in multiple-channel systems. A generalized framework for opportunistic scheduling Incorporate heterogeneous QoS support into opportunistic scheduling, provide an efficient tool (problem formulation and solution) to design and analyze this category of fair scheduling. Stochastic dynamic programming for opportunistic scheduling An approximate dynamic programming approach for opportunistic scheduling in a more generalized network model (finite queue backlogs and channel memory). Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 2 / 38

Publications Journal Z. Zhang, Y. He, and E. K. P. Chong, Opportunistic scheduling for OFDM systems with fairness constraints, EURASIP Journal on Wireless Communications and Networking, vol. 2008, Article ID 215939, 12 pages, 2008. Z. Zhang, S. Moola, and E. K. P. Chong, Opportunistic fair sheduling in wireless networks: an approximate dynamic programming approach, ACM Mobile Networks and Applications (MONET), in revision, 2008. Z. Zhang and E. K. P. Chong, Opportunistic scheduling with heterogeneous QoS support, in preparation. Z. Zhang, L. L. Scharf, and E. K. P. Chong, Algebraic equivalence of matrix conjugate direction and matrix multistage Wiener filters, in preparation. Conference Z. Zhang, S. Moola, and E. K. P. Chong, Approximate stochastic dynamic programming for opportunistic fair scheduling in wireless networks, 47th IEEE CDC, Cancun, Mexico, December 9 11, 2008, pp. 1404 1409. Z. Zhang, Y. He, and E. K. P. Chong, Opportunistic downlink scheduling for Multiuser OFDM systems, IEEE WCNC 05, New Orleans, LA, March 13 17, 2005,pp. 1206 1212 (Invited paper). L. L. Scharf, E. K. P. Chong, and Z. Zhang, Algebraic equivalence of matrix conjugate direction and matrix multistage filters for estimating random vectors, 43th IEEE CDC, Paradise Island, Bahamas, December 14 17, 2004, pp. 4175 4179. L. L. Scharf, E. K. P. Chong, L. T. McWhorter, and Z. Zhang, Algebraic equivalence of block conjugate direction and block multistage Wiener filters for estimating random vectors, Lake Louise Workshop on The Future of Signal Processing in the 21st Century, Lake Louise, Canada, October 5 10, 2003. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 3 / 38

Outline Outline 1 Background 2 A Generalized Framework for Opportunistic Scheduling 3 Stochastic Dynamic Programming for Opportunistic Scheduling 4 Summary Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 4 / 38

Background Outline 1 Background 2 A Generalized Framework for Opportunistic Scheduling 3 Stochastic Dynamic Programming for Opportunistic Scheduling 4 Summary Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 5 / 38

Background Challenges for Future Broadband Wireless Networks Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 6 / 38

Background Challenges for Future Broadband Wireless Networks High-speed data rate: 100Mbps to 1Gbps Heterogeneous Quality of Service (QoS) provisioning Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 6 / 38

Background Challenges for Future Broadband Wireless Networks High-speed data rate: 100Mbps to 1Gbps Heterogeneous Quality of Service (QoS) provisioning Flexible and efficient radio resource management: Scheduling, admission control, power control, etc. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 6 / 38

Background Characteristics of Wireless Channels Figure: Rayleigh fading with maximum 100Hz Doppler shift Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 7 / 38

Background Characteristics of Wireless Channels Figure: Rayleigh fading with maximum 100Hz Doppler shift Radio propagation: Path loss, shadowing, and multipath fading Time-varying and location-dependent channel conditions Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 7 / 38

Background Opportunistic Scheduling: An Illustration One base station and several active users in a cell. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 8 / 38

Background Opportunistic Scheduling: An Illustration One base station and several active users in a cell. At each time, scheduler picks one user to transmit. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 8 / 38

Background Opportunistic Scheduling: An Illustration One base station and several active users in a cell. At each time, scheduler picks one user to transmit. Channel conditions are time varying and location dependent. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 8 / 38

Background Opportunistic Scheduling: An Illustration One base station and several active users in a cell. At each time, scheduler picks one user to transmit. Channel conditions are time varying and location dependent. Scheduling decision is based on channel condition, input queues, QoS constraints, etc. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 8 / 38

A Generalized Framework for Opportunistic Scheduling Outline 1 Background 2 A Generalized Framework for Opportunistic Scheduling 3 Stochastic Dynamic Programming for Opportunistic Scheduling 4 Summary Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 9 / 38

A Generalized Framework for Opportunistic Scheduling Motivation Opportunistic scheduling exploits time-varying channel conditions to achieve multiuser diversity. Scheduler also should maintain specific QoS/fairness constraints for individual users. Previous work treats different QoS constraints individually as different problems. Future wireless multimedia networks require heterogeneous QoS support for individual users. A single user could require multiple different QoS/fairness constraints. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 10 / 38

A Generalized Framework for Opportunistic Scheduling Motivation for Minimal and Maximal Constraints Minimal constraints A minimal guarantee is the natural and simplest QoS guarantee. Bandwidth-sensitive applications need a minimal rate: VoIP and streaming video. Ensure premium customers receive better service than regular customers. Maximal constraints Give users an incentive to upgrade to expensive premium services. Decrease the subscribers QoS sensitivity to the number of subscribers in the network. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 11 / 38

A Generalized Framework for Opportunistic Scheduling Fairness and QoS Requirements Scheduler must allocate resources fairly among users under specific QoS/fairness constraints. Examples of (long-term) minimal constraints: Temporal fairness: User i is scheduled at least ri of the time. Utilitarian fairness: User i receives at least ai of the overall system utility. Minimum-performance guarantee: User i receives at least a utility of C i. Proportional fairness: Aggregate of proportional change in utility is non-positive. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 12 / 38

A Generalized Framework for Opportunistic Scheduling Scheduling with Generalized QoS constraints Ui t : channel utility for user i at time t. Example: instantaneous throughput. The better the channel condition, the larger the value of U t i. For simplicity, assume that channels are stationary and ergodic. Utility vector: U = (U 1,, U N ), N: number of users, U i : utility of user i at a generic time-slot. Scheduling policy π: a rule that specifies the action at each time. Policy π schedules user π( U) = i to transmit, receives reward U i. A feasible policy satisfies specific fairness/qos constraints. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 13 / 38

A Generalized Framework for Opportunistic Scheduling Problem Formulation max E N f i (U i )1 π {π( U)=i} (1) i=1 ) subject to E (h ji (U i)1 {π( U)=i} H j i 0, i = 1, 2,..., N, j = 1, 2,..., J, ( ) E gi k (U i )1 {π( U)=i} Gi k 0, i = 1, 2,..., N, k = 1, 2,..., K. f i : Functions of utility associated with user i. h j i, g i k : Constraint functions associated with user i. We assume that h j i and gi k are convex. H j i, G i k : Minimum and maximum predetermined constraint requirements associated with user i respectively. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 14 / 38

A Generalized Framework for Opportunistic Scheduling Optimal Scheduling Policy Define the policy π as follows: { } π ( U) J K = argmax f i (U i ) + λ j i hj i (U i) ρ k i gi k (U i ), (2) i where the control parameters λ j i and ρ k i are chosen such that: 1 λ j i 0, ρ k i 0, i, j, k; ( ) 2 E h j i (U i)1 {π ( U)=i} H j i 0, i, j; ( ) 3 If E h j i (U i)1 {π ( U)=i} H j i > 0, then λ j i = 0, i, j; ( ) 4 E gi k (U i )1 {π ( U)=i} Gi k 0, i, k; ( ) 5 If E gi k (U i )1 {π ( U)=i} Gi k < 0, then ρ k i = 0, i, k. Theorem The policy π defined in (2), if it exists, is an optimal solution to the problem defined in (1), i.e., it maximizes the system performance while satisfying the general fairness constraints for individual users. j=1 Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 15 / 38 k=1

A Generalized Framework for Opportunistic Scheduling An Example Scheduling with Minimal and Maximal Data Rates N ) max E (U i 1 π {π( U)=i} i=1 ) subject to E (U i 1 {π( U)=i} C i, i = 1, 2,..., N, ) E (U i 1 {π( U)=i} D i, i = 1, 2,..., N. C = (C 1, C 2,..., C N ): a feasible predetermined minimal data rate requirement vector. D = (D 1, D 2,..., D N ): a feasible predetermined maximal data rate requirement vector. D i C i 0, i. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 16 / 38

A Generalized Framework for Opportunistic Scheduling Optimal Scheduling Policy The following policy π is optimal: π ( U) = argmax {(θ i µ i )U i }, i where the control parameters θ i and µ i are chosen such that: 1 θ i 1, µ i 0, i; ( ) 2 E U i 1 {π ( U)=i} C i, i; ( ) 3 If E U i 1 {π ( U)=i} > C i, then θ i = 1, i; ( ) 4 E U i 1 {π ( U)=i} D i, i; ( ) 5 If E U i 1 {π ( U)=i} < D i, then µ i = 0, i. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 17 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Outline 1 Background 2 A Generalized Framework for Opportunistic Scheduling 3 Stochastic Dynamic Programming for Opportunistic Scheduling 4 Summary Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 18 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Overview Opportunistic scheduling: Multiuser diversity vs. QoS/fairness. Memoryless channels vs. channels with memory. Delay insensitive traffic vs. delay sensitive traffic. Markov decision processes (MDPs) and dynamic programming: Sequential decision making under uncertainty. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 19 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling A TDM Uplink Queueing Model Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 20 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Opportunistic Fair Scheduling Goal: Maximize system performance under certain QoS requirements of users. Throughput maximization. Delay minimization: Average system queue length. Want to determine optimal scheduler. Scheduling decision at time t depends on Instantaneous channel conditions H i (t). Packet queue lengths Qi (t). Exogenous packet arrivals Ai (t). Specific fairness/qos requirements. Fairness constraint considered here: Temporal fairness (long-term). Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 21 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Problem Formulation as MDP Opportunistic scheduling problem formulated as an MDP. Consider two creteria: Infinite horizon expected discounted reward and expected average reward. MDP is specified by State space Action space Transition function Reward function Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 22 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Problem Formulation as MDP (Cont d) State space: S R 2K. State of system at time t is X t = (Q 1 (t), Q 2 (t),..., Q N (t), H 1 (t), H 2 (t),..., H N (t)), Q i (t) and H i (t) are queue length and channel state of user i at time t. Generic notation of state: s, s S. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 23 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Problem Formulation as MDP (Cont d) State space: S R 2K. State of system at time t is X t = (Q 1 (t), Q 2 (t),..., Q N (t), H 1 (t), H 2 (t),..., H N (t)), Q i (t) and H i (t) are queue length and channel state of user i at time t. Generic notation of state: s, s S. Action space: A = {1, 2,..., N}. Action at time t is the selected user i = π t. Generic notation of action: a A. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 23 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Problem Formulation as MDP (Cont d) Transition function: Determined by Queue-length evolution: Q i (t + 1) = Q i (t) + A i (t) min (Q i (t), H i (t)) 1 {πt=i}. Dynamics of the channels (Markov chain). Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 24 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Problem Formulation as MDP (Cont d) Transition function: Determined by Queue-length evolution: Q i (t + 1) = Q i (t) + A i (t) min (Q i (t), H i (t)) 1 {πt=i}. Dynamics of the channels (Markov chain). Reward functions: Throughput maximization: N r(x t, π t ) = min (Q i (t), H i (t)) 1 {πt=i}. i=1 Delay minimization: N r(x t, π t ) = Q i (t). i=1 Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 24 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Problem Formulation as MDP (Cont d) Given a policy π, the expected average reward is [ ] T 1 1 J π(s) = lim T Eπ r(x t, π t) T X0 = s, s S. t=0 Want to maximize J π (s) with respect to π. Our MDP is nonstandard because of constraint on π corresponding to temporal fairness. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 25 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Problem Formulation as MDP (Cont d) Expected average temporal fairness constraint: [ ] T 1 1 lim T Eπ 1 {πt =a} T X0 = s C(a), a A, (3) t=0 C(a): minimum relative frequency at which action (user) a should be selected, where C(a) 0 and a A C(a) 1. Let Π be set of all policies satisfying expected average temporal fairness constraint. A policy π is average-reward-optimal if J π (s) = max Jπ(s), s S. (4) π Π Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 26 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Temporal Fair Scheduling Problem Goal Find an average-reward-optimal policy π subject to the expected average temporal fairness constraint. Opportunistic scheduling problem posed as an MDP with expected average temporal fairness constraint. Scheduler corresponds to optimal policy. How to compute scheduler? Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 27 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Optimal Scheduling Policy Suppose the system is unichain. Suppose we have a bounded function h : S R, a function u : A R, a constant g, and a stationary policy π such that for s S, 1 a A, u(a) 0; 2 a A, lim T E π 3 a A, if lim T E π 4 [ 1 T T 1 [ 1 T t=0 1 ] {πt X0 =a} = s C(a); T 1 t=0 1 ] {π t =a} X 0 = s > C(a), then u(a) = 0; g + h(s) = max {r(s, a) + u(a) + P(s s, a)h(s )}; (5) a A s S 5 π is a policy which, for each s, prescribes an action which maximizes the right-side of (5): π (s) = argmax{r(s, a) + u(a) + P(s s, a)h(s )}. a A s S Then π is an average-reward-optimal policy as defined by (4) subject to (3). The corresponding optimal average reward is J π (s) = g a A u(a)c(a), s S. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 28 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Remarks on Optimal Scheduling Policy 1 Main result: An explicit Bellman s equation for the general temporal fairness constrained MDP. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 29 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Remarks on Optimal Scheduling Policy 1 Main result: An explicit Bellman s equation for the general temporal fairness constrained MDP. 2 Proved sufficiency of Bellman s equation for optimality. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 29 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Remarks on Optimal Scheduling Policy 1 Main result: An explicit Bellman s equation for the general temporal fairness constrained MDP. 2 Proved sufficiency of Bellman s equation for optimality. 3 Obtained optimal scheduling policy based on Bellman s equation. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 29 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Remarks on Optimal Scheduling Policy 1 Main result: An explicit Bellman s equation for the general temporal fairness constrained MDP. 2 Proved sufficiency of Bellman s equation for optimality. 3 Obtained optimal scheduling policy based on Bellman s equation. 4 Developed a novel approximation method: Temporal fair rollout. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 29 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Temporal Fair Rollout An approximation method based on Monte Carlo sampling. Builds on [Bertsekas & Castañon 1999] by incorporating temporal fairness constraints. Optimal value function is approximated by some base policy: Typically heuristic and suboptimal. Rollout achieves better performance than base policy reward-improvement property. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 30 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Stochastic Approximation for Parameter Estimation Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 31 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Scheduling Schemes for Evaluation 1 Round-robin: Schedule users in predetermined order. 2 Opportunistic scheduling-1: Infinite-backlog temporal fairness policy [Liu, Chong, Shroff 2003]. 3 Opportunistic scheduling-2: Variation of opportunistic scheduling-1 with consideration of queue lengths. 4 Temporal fair rollout: Base policy is opportunistic scheduling-2. 5 Greedy: Schedule user with best channel condition ( Π). 6 Rollout (unconstrained): Base policy is greedy policy ( Π). Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 32 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Performance for Throughput Maximization Problem Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 33 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Fairness in Throughput Maximization Problem Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 34 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Performance for Delay Minimization Problem Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 35 / 38

Stochastic Dynamic Programming for Opportunistic Scheduling Fairness in Delay Minimization Problem Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 36 / 38

Summary Outline 1 Background 2 A Generalized Framework for Opportunistic Scheduling 3 Stochastic Dynamic Programming for Opportunistic Scheduling 4 Summary Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 37 / 38

Summary Summary Opportunistic scheduling exploits time-varying, location-dependent channel conditions to achieve mulituser diversity. Our framework for opportunistic scheduling in multiuser OFDM systems exploits both multiuser diversity and frequency diversity opportunisitically while maintaining certain QoS/fairness constraints for individual users. Our generalized framework for opportunistic scheduling encompasses most previous opportunistic fair scheduling formulation and also provide an efficient tool to design and analyze the scheduling problems with general heterogeneous QoS constraints. We reformulate the fair scheduling problem as a constrained MDP in a more general setting. The proposed approximate dynamic programming approach can easily be extended to fit different objective functions and other fairness measures. Zhi Zhang (ECE CSU) Resource Management in Wireless Networks April 24, 2009 38 / 38