GWiQ-P: : An Efficient, Decentralized Quota Enforcement Protocol

Similar documents
BASIC CONCEPTS OF HSPA

Wireless Internet Routing. IEEE s

Global State and Gossip

Author: Yih-Yih Lin. Correspondence: Yih-Yih Lin Hewlett-Packard Company MR Forest Street Marlboro, MA USA

Diffracting Trees and Layout

ETI2511-WIRELESS COMMUNICATION II HANDOUT I 1.0 PRINCIPLES OF CELLULAR COMMUNICATION

olsr.org 'Optimized Link State Routing' and beyond December 28th, 2005 Elektra

Introduction. Introduction ROBUST SENSOR POSITIONING IN WIRELESS AD HOC SENSOR NETWORKS. Smart Wireless Sensor Systems 1

Potential areas of industrial interest relevant for cross-cutting KETs in the Electronics and Communication Systems domain

Volume 2, Issue 9, September 2014 International Journal of Advance Research in Computer Science and Management Studies

DISTRIBUTED DYNAMIC CHANNEL ALLOCATION ALGORITHM FOR CELLULAR MOBILE NETWORK

g GETTING STARTED D PC System Requirements Computer: Pentium 90 MHz processor or equivalent.

Energy-Efficient Data Management for Sensor Networks

CHANNEL ASSIGNMENT IN MULTI HOPPING CELLULAR NETWORK

CSE6488: Mobile Computing Systems

AS-MAC: An Asynchronous Scheduled MAC Protocol for Wireless Sensor Networks

13. Before making a service call Trip information and remedies

I. General V

Multihop Routing in Ad Hoc Networks

NetApp Sizing Guidelines for MEDITECH Environments

Automatic power/channel management in Wi-Fi networks

Microwave Radio Rapid Ring Protection in Pubic Safety P-25 Land Mobile Radio Systems

Scaling Out your ProjectWise System

Gateways Placement in Backbone Wireless Mesh Networks

Aizaz U Chaudhry *, Nazia Ahmad and Roshdy HM Hafez. Abstract

AGENTLESS ARCHITECTURE

CSCI 445 Laurent Itti. Group Robotics. Introduction to Robotics L. Itti & M. J. Mataric 1

IMPORTANT NOTICE: PLEASE READ CAREFULLY BEFORE INSTALLING THE SOFTWARE: THIS LICENCE AGREEMENT (LICENCE) IS A LEGAL AGREEMENT BETWEEN

Fiber Bragg Grating Dispersion Compensation Enables Cost-Efficient Submarine Optical Transport

POWER GATING. Power-gating parameters

Power Theft Identification system using Power Line Carrier Communication (PLCC) technique in Distribution system based on Binary Search Algorithm

CS 457 Lecture 16 Routing Continued. Spring 2010

BridgeWave AdaptRate Solutions. White Paper

Scalable Routing Protocols for Mobile Ad Hoc Networks

Link-state protocols and Open Shortest Path First (OSPF)

Chapter 8 Traffic Channel Allocation

Time Distribution for Application Level 2 and 3 Linking Protection

TIME- OPTIMAL CONVERGECAST IN SENSOR NETWORKS WITH MULTIPLE CHANNELS

Optimized Periodic Broadcast of Non-linear Media

Energy-Efficient Gaming on Mobile Devices using Dead Reckoning-based Power Management

Composite Event Detection in Wireless Sensor Networks

Distributed Power Control in Cellular and Wireless Networks - A Comparative Study

PAPER AVIAT NETWORKS FOUR RECOMMENDATIONS FOR FIRSTNET BACKHAUL

Advanced Modeling and Simulation of Mobile Ad-Hoc Networks

CHANNEL ASSIGNMENT AND LOAD DISTRIBUTION IN A POWER- MANAGED WLAN

Analysis of Microprocessor Based Protective Relay s (MBPR) Differential Equation Algorithms

Autonomous Self-deployment of Wireless Access Networks in an Airport Environment *

Scalable and Lightweight CTF Infrastructures Using Application Containers

In the event of a failure, the inverter switches off and a fault code appears on the display.

Energy-Scalable Protocols for Battery-Operated MicroSensor Networks

Real-Time Spectrum Monitoring System Provides Superior Detection And Location Of Suspicious RF Traffic

Common Control Channel Allocation in Cognitive Radio Networks through UWB Multi-hop Communications

Solipsis: A Decentralized Architecture for Virtual Environments

Design of Parallel Algorithms. Communication Algorithms

A Location-Aware Routing Metric (ALARM) for Multi-Hop, Multi-Channel Wireless Mesh Networks

Localization in Wireless Sensor Networks

AERIAL PLATFORM PLACEMENT ALGORITHMS TO SATISFY CONNECTIVITY, CAPACITY AND SURVIVABILITY CONSTRAINTS IN WIRELESS AD-HOC NETWORKS

Mesh Networks with Two-Radio Access Points

Mobile Ad Hoc Networks Theory of Interferences, Trade-Offs between Energy, Congestion and Delay

The Message Passing Interface (MPI)

Clock Synchronization

Task Allocation: Motivation-Based. Dr. Daisy Tang

Foundations of Distributed Systems: Tree Algorithms

OSPF Fundamentals. Agenda. OSPF Principles. L41 - OSPF Fundamentals. Open Shortest Path First Routing Protocol Internet s Second IGP

OSPF - Open Shortest Path First. OSPF Fundamentals. Agenda. OSPF Topology Database

Increasing Broadcast Reliability for Vehicular Ad Hoc Networks. Nathan Balon and Jinhua Guo University of Michigan - Dearborn

Hitachi P1 Closed Loop Hoist Basic Instruc on Manual

Low-Latency Multi-Source Broadcast in Radio Networks

OSPF Nonstop Routing. Finding Feature Information. Prerequisites for OSPF NSR

Reliable Videos Broadcast with Network Coding and Coordinated Multiple Access Points

Static Power and the Importance of Realistic Junction Temperature Analysis

PIVX Zerocoin (zpiv) Technical Paper

XPT Digital Trunking Decentralized and Cost-Effective Digital Trunking Solution

ROUTING PROTOCOLS. Dr. Ahmed Khattab. EECE Department Cairo University Fall 2012 ELC 659/ELC724

An Adaptable Energy-Efficient Medium Access Control Protocol for Wireless Sensor Networks

Efficient Channel Allocation for Wireless Local-Area Networks

The Case for Optimum Detection Algorithms in MIMO Wireless Systems. Helmut Bölcskei

Link State Routing. Stefano Vissicchio UCL Computer Science CS 3035/GZ01

UMTS to WLAN Handover based on A Priori Knowledge of the Networks

Power Distribution Paths in 3-D ICs

IEEE Wireless Access Method and Physical Layer Specification. Proposal For the Use of Packet Detection in Clear Channel Assessment

An SDN QoE-Service for Dynamically Enhancing the Performance of OTT Applications

Investigation of Timescales for Channel, Rate, and Power Control in a Metropolitan Wireless Mesh Testbed1

IMPROVING SCALABILITY IN MMOGS - A NEW ARCHITECTURE -

Data acquisition and Trigger (with emphasis on LHC)

Ultra-Low Duty Cycle MAC with Scheduled Channel Polling

Location Discovery in Sensor Network

ON THE CONCEPT OF DISTRIBUTED DIGITAL SIGNAL PROCESSING IN WIRELESS SENSOR NETWORKS

On Regulation of Spectrum Sharing: An Analysis Supporting a Distributed Pilot Channel

Traffic Monitoring and Management for UCS

Using Network Traffic to Infer Power Levels in Wireless Sensor Nodes

Boundary Helps: Efficient Routing Protocol using Directional Antennas in Cognitive Radio Networks

AI Approaches to Ultimate Tic-Tac-Toe

Module 3 Greedy Strategy

Achievable Transmission Capacity of Cognitive Radio Networks with Cooperative Relaying

Joint Relaying and Network Coding in Wireless Networks

Monitoring power quality beyond EN and IEC

PoC #1 On-chip frequency generation

REIHE INFORMATIK TR Studying Vehicle Movements on Highways and their Impact on Ad-Hoc Connectivity

Closing the loop around Sensor Networks

Transcription:

GWiQ-P: : An Efficient, Decentralized Grid-Wide Quota Enforcement Protocol Kfir Karmon, Liran Liss and Assaf Schuster Technion Israel Institute of Technology SYSTOR 2007 IBM HRL, Haifa, Israel

Background Grid Resources Sum of all resources of the same type constitutes a: Grid Wide Resource Same resource type scattered over the grid

Background Grid Resources Grid Wide Resources CPU hours Disk space DB Connections Outbound traffic Concurrent number of CPUs Allocated RAM Floating Software Licenses Open sockets Etc

GWiQ Motivation A grid wide resource tends to be huge and can be exploited Grid Wide Quota Enforcement is vital: Security: Prohibit malicious use Fail Safe: Prevent resource leaks (bugs) Financial: Moderate use per paid share

Centralized GWiQ Enforcement Central server holds the GWiQ bounds for each (user, resource) tuple Per request, resource usage permits are leased until the GWiQ is exhausted. Congestion Scalability Latency

Objectives We strive for a Grid Wide Quota enforcement protocol that is: Decentralized: No hotspots, No single point of failure. Efficient: Overcome latency caused by grid s physical distribution. Scalable: Can handle Mega-Grids

GWiQ-P: Grid Wide Quota enforcement Protocol

GWiQ-P: Basic Concept GWiQ Enforcement At all times the sum of all local quotas < GWiQ Using sandboxes to enforce local quotas Given a attempt to access the resource: If (local-quota >= request) then Grant access local-quota = local-quota request Else Freeze job execution until local-quota reinforced

GWiQ-P: Resource Coins A resource coin denotes the smallest consumable portion of a grid resource. Each (user, resource) GWiQ is broken down to coins. A user s job may use the resource up to the amount that the coins are worth. i.e. Depositing four 1MB coins grants the job (another) 4MB to use Local Quota = Hosting SBox s resident coins GWiQ

GWiQ-P Spanning Forest Using a BF-based alg we build a spanning forest. A sandbox hosting a needy job will start forming a tree around itself. At all times, each neighbor will join the tree to which it is closest to its root. Member of one tree at a time. Surplus coins will be transferred to the root. Need 3 coins

GWiQ-P : In action 1/5 1 2 3 4 5 I need 6 7 1 coins 8 9 10 11 12 13 14 I need coins 15 16

GWiQ-P : In action 2/5 1 2 3 4 5 I need 6 7 1 coins 8 9 10 11 12 13 14 I need coins 15 16

GWiQ-P : In action 3/5 1 2 3 4 5 I need 6 7 1 coins 8 9 10 11 12 13 14 I need 3 coins 15 16

GWiQ-P : In action 4/5 1 2 3 4 5 6 7 8 9 10 11 12 13 14 I need coins 15 16

GWiQ-P : In action 5/5 1 2 3 4 5 6 7 8 9 10 11 12 13 14 I need coins 15 16

GWiQ-P: Fault Intolerance Links and nodes may crash Link crash Passing coins are lost Node crash Local coins are lost Lost coins Unfair GWiQ reduction Can t be sure how many coins are on the wire Did my neighbor send something? Have the coins I sent reached already? How do we restore unknown lost coins???

FT Solution1: Transactional Coin transfers are done in Transactions Pros Coins never lost on transfer Cons Slowdown Needs persistent storage Node failure (temporarily?) reduces GWiQ by local quota

FT Solution2: Light Weight 1/6 Machine down All of its links are down Each node holds a counter for every link. FTCounter = Sent coins Received coins FTCounter = Net number of transferred coins When a link goes down: If FTCounter < 0 Else Create a fictive demand for FTCounter coins FTCounter coins are added to the node s surplus

FT Solution2: Light Weight 2/6 Example 1: time FTCounter=0 FTCounter=0 I need 2 coins FTCounter=+2 FTCounter=0 I need 2 coins I need 2 coins

FT Solution2: Light Weight 3/6 Example 2: time FTCounter=+1 FTCounter=-1 I need 2 coins FTCounter=+3 FTCounter=-1 I need 2 coins Temporary breach Fictive Demand I need 1 coins I need 2 coins

FT Solution2: Light Weight 4/6 Example 3: time FTCounter=-1 FTCounter=+1 I need 2 coins FTCounter=+1 FTCounter=+1 I need 2 coins I need 2 coins

FT Solution2: Light Weight 5/6 Example 4: time FTCounter=0 FTCounter=0 FTCounter=0 FTCounter=0 FTCounter=+2 FTCounter=0 FTCounter=-2 FTCounter=0 FTCounter=+2 FTCounter=0 FTCounter=-2 FTCounter=0 Temporary breach Fictive Demand I need 2 coins

FT Solution2: Light Weight 6/6 Pro No latency No need for persistent storage GWiQ never unfairly reduced Con May introduce temporary GWiQ breaches

FT Solution3: Hybrid 1/2 Use the Light-Weight solution regularly Only con to deal with: GWiQ breaches Caused by loaded FTCounters Issue FTCounter balancing Transactions If FTCounter i,j > Threshold Disallow link usage. Start transaction. FTCounter i,j = FTCounter j,i = 0 End transaction. Resume link usage. Or, issue periodically.

FT Solution3: Hybrid 2/2 Due to Transactions Due to Light Weight Pro Reduce breaches size No slowdowns (mostly) No persistence storage (mostly) Node crash fully remedied immediately also Con See other side of the Pro -coin ;) Play with the tradeoff using parameterization

GWiQ-P: Properties Small trees form around demand Requests are remedied locally Coins are drawn towards hot areas Auto-Adaptable Fully distributed No hot-spots & single points of failure Low latency No Congestion Infinitely Scalable Fault Tolerant

Simulations (default) Properties Toplogy = BriteAS Fast LANs, slower intercon/ NetSize = 10K Q/D=1 Lan Lan Q for GWiQ; D for Overall demand Change Rate = 1%*D/E[EdgeDelay] Demanders = 1%*NetSize Fail Rate = 1%*Edges/E[EdgeDelay] Applicable for FT scenarios Lan Lan Lan Lan

Simulations 1/6 Topology: BriteAS Change rate: 1%*D/E[EdgeDelay] Q/D=1 Demanders: 1% GWiQ-P scalability due to locality

Simulations 2/6 Topology: 10K BriteAS Demanders: 1%

Simulations 3/6 Topology: 10K BriteAS E[EdgeDelay] ~= 3 ms Change rate: One Time 1%*D Demanders: 1% Return to 99% sat after =~ 30ms

Simulations 4/6 Excess coin exploitation Topology: 10K BriteAS Change rate: 1%*D/E[EdgeDelay] Fail rate: 1%*Edges/E[EdgeDelay] Q/D=1 Demanders: 1% Plane depicts sat in no-faults scenario Threshold=0 Transactions

Simulations 5/6 Topology: 10K BriteAS Change rate: 1%*D/E[EdgeDelay] Fail rate: 1%*Edges/E[EdgeDelay] Q/D=0.5 Demanders: 1%

Simulations 6/6 Topology: BriteAS Change rate: 1%*D/E[EdgeDelay] Fail rate: 1%*Edges/E[EdgeDelay] Q/D=1 Demanders: 1% Same distribution, growing network. Locality in FT

Conclusion We displayed GWiQ-P, a Grid Wide Quota enforcement Protocol GWiQ-P is infinitely scalable GWiQ-P is fully distributed GWiQ-P is local hence very efficient GWiQ-P is fault tolerant

Thank You! Contact: Kfir Karmon karmon@cs.technion.ac.il