Power Optimization Techniques Using Multiple VDD

Similar documents
Minimum Energy CMOS Design with Dual Subthreshold Supply and Multiple Logic-Level Gates

ISSN:

Optimal Module and Voltage Assignment for Low-Power

Design of Variable Input Delay Gates for Low Dynamic Power Circuits

Activity-Aware Registers Placement for Low Power Gated Clock Tree Construction

Delay of different load cap. v.s. different sizes of cells 1.6. Delay of different cells (ns)

Analysis and design of a low voltage low power lector inverter based double tail comparator

ELEC Digital Logic Circuits Fall 2015 Delay and Power

Low-Power Digital CMOS Design: A Survey

THERE is a growing need for high-performance and. Static Leakage Reduction Through Simultaneous V t /T ox and State Assignment

An Implementation of a 32-bit ARM Processor Using Dual Power Supplies and Dual Threshold Voltages

Blockage and Voltage Island-Aware Dual-VDD Buffered Tree Construction

Minimization of Dynamic and Static Power Through Joint Assignment of Threshold Voltages and Sizing Optimization

Low Power and High Speed Multi Threshold Voltage Interface Circuits Sherif A. Tawfik and Volkan Kursun, Member, IEEE

Low Power High Performance 10T Full Adder for Low Voltage CMOS Technology Using Dual Threshold Voltage

EECS 427 Lecture 22: Low and Multiple-Vdd Design

Low-Power VLSI. Seong-Ook Jung VLSI SYSTEM LAB, YONSEI University School of Electrical & Electronic Engineering

A Literature Survey on Low PDP Adder Circuits

[Deepika* et al., 5(7): July, 2016] ISSN: IC Value: 3.00 Impact Factor: 4.116

Sleepy Keeper Approach for Power Performance Tuning in VLSI Design

Low Power and High Performance Level-up Shifters for Mobile Devices with Multi-V DD

Power Optimization of Delay Constrained Circuits

Voltage Island Aware Floorplanning for Power and Timing Optimization

Power-conscious High Level Synthesis Using Loop Folding

Design and Analysis of Low-Power 11- Transistor Full Adder

New Approaches to Total Power Reduction Including Runtime Leakage. Leakage

ALPS: An Automatic Layouter for Pass-Transistor Cell Synthesis

An Optimized Design System for Flip-Flop Grouping Using Low Power Clock Gating

Low Power Optimization Of Full Adder, 4-Bit Adder And 4-Bit BCD Adder

Dual-Threshold Voltage Assignment with Transistor Sizing for Low Power CMOS Circuits

A Low Complexity and Highly Robust Multiplier Design using Adaptive Hold Logic Vaishak Narayanan 1 Mr.G.RajeshBabu 2

DYNAMIC VOLTAGE FREQUENCY SCALING (DVFS) FOR MICROPROCESSORS POWER AND ENERGY REDUCTION

FOR HIGH SPEED LOW POWER APPLICATIONS USING RADIX-4 MODIFIED BOOTH ENCODER

Low-power Full Adder array-based Multiplier with Domino Logic

Ultra-low voltage high-speed Schmitt trigger circuit in SOI MOSFET technology

DESIGN OF LOW POWER MULTIPLIER USING COMPOUND CONSTANT DELAY LOGIC STYLE

Low-Power Technology Mapping for FPGA Architectures with Dual Supply Voltages

MICROPROCESSORS LEAKAGE POWER REDUCTION USING DUAL SUPPLY VOLTAGE SCALING

Low-power Full Adder array-based Multiplier with Domino Logic

Optimizing addition for sub-threshold logic

An energy efficient full adder cell for low voltage

DESIGN AND ANALYSIS OF LOW POWER CHARGE PUMP CIRCUIT FOR PHASE-LOCKED LOOP

Short-Circuit Power Reduction by Using High-Threshold Transistors

MULTITHRESHOLD CMOS SLEEP STACK AND LOGIC STACK TECHNIQUE FOR DIGITAL CIRCUIT DESIGN

Pramoda N V Department of Electronics and Communication Engineering, MCE Hassan Karnataka India

Optimization of power in different circuits using MTCMOS Technique

Low Power Design for Systems on a Chip. Tutorial Outline

Design of an Energy Efficient, Low Power Dissipation Full Subtractor Using GDI Technique

A Unified Optimal Voltage Selection Methodology for Low-power Systems

Comparison of High Speed & Low Power Techniques GDI & McCMOS in Full Adder Design

Novel Buffer Design for Low Power and Less Delay in 45nm and 90nm Technology

MTCMOS Hierarchical Sizing Based on Mutual Exclusive Discharge Patterns

Dual Threshold Voltage Design for Low Power VLSI Circuits

DESIGN OF ADIABATIC LOGIC BASED COMPARATOR FOR LOW POWER AND HIGH SPEED APPLICATIONS

A Low Power Array Multiplier Design using Modified Gate Diffusion Input (GDI)

Low Power VLSI Circuit Synthesis: Introduction and Course Outline

SURVEY AND EVALUATION OF LOW-POWER FULL-ADDER CELLS

A Novel Low Power, High Speed 14 Transistor CMOS Full Adder Cell with 50% Improvement in Threshold Loss Problem

Design of High Performance Arithmetic and Logic Circuits in DSM Technology

the cascading of two stages in CMOS domino logic[7,8]. The operating period of a cell when its input clock and output are low is called the precharge

ISSN Vol.04, Issue.05, May-2016, Pages:

Fast Statistical Timing Analysis By Probabilistic Event Propagation

Optimal Simultaneous Module and Multivoltage Assignment for Low Power

An Efficient Hybrid Voltage/Current mode Signaling Scheme for On-Chip Interconnects

Comparative Analysis of Array Multiplier Using Different Logic Styles

Near-threshold Computing of Single-rail MOS Current Mode Logic Circuits

IN digital circuits, reducing the supply voltage is one of

Glitch Power Minimization by Selective Gate Freezing

A High Performance IDDQ Testable Cache for Scaled CMOS Technologies

Reduce Power Consumption for Digital Cmos Circuits Using Dvts Algoritham

IJMIE Volume 2, Issue 3 ISSN:

Two New Low Power High Performance Full Adders with Minimum Gates

Two New Low Power High Performance Full Adders with Minimum Gates

Minimization of Area and Power in Digital System Design for Digital Combinational Circuits

An Overview of Static Power Dissipation

Low Power Realization of Subthreshold Digital Logic Circuits using Body Bias Technique

Designing Of A New Low Voltage CMOS Schmitt Trigger Circuit And Its Applications on Reduce Power Dissipation

VLSI Designed Low Power Based DPDT Switch

Hierarchical Power Optimization for System-on-a-Chip (SoC) through CMOS Technology Scaling

Data Word Length Reduction for Low-Power DSP Software

Design and Optimization of Half Subtractor Circuits for Low-Voltage Low-Power Applications

Leakage Power Reduction Through Hybrid Multi-Threshold CMOS Stack Technique In Power Gating Switch

Design and Analysis of Sram Cell for Reducing Leakage in Submicron Technologies Using Cadence Tool

Static Timing Analysis Taking Crosstalk into Account 1

Aarthi.P, Suresh Kumar.R, Muniraj N. J. R, International Journal of Advance Research, Ideas and Innovations in Technology.

Design of Adders with Less number of Transistor

DESIGN & IMPLEMENTATION OF SELF TIME DUMMY REPLICA TECHNIQUE IN 128X128 LOW VOLTAGE SRAM

EFFECTING POWER CONSUMPTION REDUCTION IN DIGITAL CMOS CIRCUITS BY A HYBRID LOGIC SYNTHESIS TECHNIQUE

Study of High Speed Buffer Amplifier using Microwind

LOW POWER VLSI TECHNIQUES FOR PORTABLE DEVICES Sandeep Singh 1, Neeraj Gupta 2, Rashmi Gupta 2

Design and Analysis of CMOS based Low Power Carry Select Full Adder

Abstract. 1 Introduction

Energy Reduction of Ultra-Low Voltage VLSI Circuits by Digit-Serial Architectures

A Dual-V DD Low Power FPGA Architecture

Gdi Technique Based Carry Look Ahead Adder Design

A Low Power Switching Power Supply for Self-Clocked Systems 1. Gu-Yeon Wei and Mark Horowitz

Transistor Network Restructuring Against NBTI Degradation. P. F. Butzen a, V. Dal Bem a, A. I. Reis b, R. P. Ribas b.

Leakage Power Reduction Using Power Gated Sleep Method

Circuit For Mems Application

High Performance and Low power VLSI CMOS Circuit Designs using ONOFIC Approach

Transcription:

Power Optimization Techniques Using Multiple VDD Presented by: Rajesh Panda LOW POWER VLSI DESIGN (EEL 6936-002) Dr. Sanjukta Bhanja

Literature Review 1) M. Donno, L. Macchiarulo, A. Macii, E. Macii and, M. Poncino, Enhanced Clustered Voltage Scaling for Low Power, GLSVLSI 02, 2002, New York, USA. 2) K. Usami and M. Horowitz, Clustered Voltage Scaling technique for low-power design, in proc. Proc. ISLPD, April 1995. 3) Y. Yeh, S. Kyo, and J. Jou, Converter-Free Multiple-Supply Voltage Scaling Techniques for Low-Power CMOS Digital Design, IEEE Trans., vol. 20, NO.1, 2001. 4) A. Chandrakasan, S. Sherng, and R. Broderson, Low-Power CMOS digital design, IEEE J.Solid State Circuits, vol. 27, April 1992. 5) J.M. Chang and M. Pedram, Energy minimization using multiple supply voltages, in proc.islped, 1996. 6) N.H.E. Weste and K. Eshraghian, Priciples of CMOS VLSI Design- A Systems Perspective, 2 nd ed. Reading, MA: Addison-Wesley, 1992. 7) S.Raje and M. Sarrafzadeh, Variable Voltage Scheduling, in proc. ISLPD, Apr. 1995. 8) C. Yeh, M. Chang Gate-Level Voltage Scaling for Low-Power Design Using Multiple Supply Voltages, IEE Proceedings, vol.146, No. 6, 1999. 9) V. Sunderarajan, K.K. Parhi Synthesis of Low Power CMOS VLSI Circuits using Dual Supply Voltages DAC-36. 10) J.M. Chang and M. Pedram, Energy minimization using multiple supply voltages, IEEE Transactions on VLSI Systems, vol. 5, 1997.

INTRODUCTION Power Optimization has always been a major goal in designing digital circuits. All of the circuit determines power dissipation but only a small fraction of the gates determine circuit performance. We should use high performance devices on critical path. Circuit Design Techniques: 1) Multiple Vdd. 2) Multiple Threshold voltages. 3) Gate Resizing.

Close up Look on Slack The average distribution of gates with different slack for 16 benchmark circuits. Reference : Chunhong Chen, Member, IEEE, Ankur Srivastava, Student Member, IEEE, and Majid Sarrafzadeh, Fellow, IEEE

Multiple Vdd Approach Idea : Determine what supply voltage level will allow the results to arrive just in time. Scale down Vdd Quadratic Reduction in Power: P = C L. Vdd 2.A.f Reduces Speed: t d = ½. C L. Vdd [ 1/ C 1 (Vdd - V tn ) 2 + 1/ C 2 (Vdd + V tp ) 2 ] Dual Vdd to maintain performance: Critical Path is assigned High Vdd and Gates on the noncritical paths are assigned Low Vdd.

Level Converter Low Vdd gates cannot drive High Vdd gates: PMOS does not turn off Results in flow of static current Insertion of Level converters required: Similar to amplifiers in memories

Problem with Level Converters Level converters introduce a new source of power dissipation. They take more silicon area. They add delay to the circuit. Approach: We need a strategy to limit the number of Level Converters!

Clustered Voltage Scaling Usami and Horowitz proposed Clustered Voltage Scaling Structure to limit the number of Level converters. CVS results in the clustering of gates in two sets: A set of gates at high Vdd and a set of gates at low Vdd. CVS structure: Primary I/p -> High Vdd cells -> Low Vdd cells -> Level Converters -> Primary O/p. CVS Algorithm is a search algorithm which tries to substitute as many cells as possible with low Vdd cells while maintaining the required performance.

CVS Structure Vdd H Vdd L Primary I/p VH Cluster VL Cluster LC Primary O/p

CVS Algorithm 1. Pick a new cell C connected to a primary output. 2. Substitute it with a VDD L analogous cell. 3. Perform a new static timing analysis. 4. If the new timing worsen the original one, go back to step 1. 5. Pick a cell feeding the last substituted. 6. Verify it s viability for substitution through a DFS. 7. If the new timing worsen the original one, go back to step 5. 8. If there are unanalyzed PO cells, go back to step 1. Reference: Monica Donno et al.

Application of Original CVS Algorithm This is the algorithm which was used in the CVS structure proposed by Usami and Horowitz. 2 7 10 3 5 1 6 9 4 8 Reference: Monica Donno et al.

Partial DFS Algorithm Forward DFS -> Checks whether substitution is feasible for all the transitive fanouts of a node or not -> Might take a long time! Donno et al. proposed alternative implementation to improve results and/or execution time without changing the basic CVS. They Proposed Partial DFS Algorithm. Partial DFS Algorithm -> Stops the search whenever a node is declared unfeasible -> Skips to the following PO -> Search space is reduced by cutting substitutions which are not likely to affect the results substantially -> Saves Computation time!

Application of Partial DFS Algorithm 2 7 10 3 5 1 6 9 4 8 Reference: Monica Donno et al.

Results for two Algorithms The following result for c6288 is the biggest benchmark circuit the authors have considered. (Monica et al.) Algorithm Circuit Power Red. CPU Time DFS C6288 0.35% 20 Min. Partial DFS C6288 0.35% 8 Min.

CFMV Scaling Y.J. Yeh, S.Y. Kuo and J.Y.Jou proposed converter free multiple voltage scaling technique. Approach: No level converters at all! How? -> Put constraints on the voltage differences between adjacent gates! Idea -> No static current if, Vdd R > Vdd l V tp І Vdd R : Reduced supply voltage V tp : Threshold voltage of PMOS

How to Determine Vdd R Subthreshold effect makes the prediction of Vdd R imprecise. Solution : Determine Vdd R by a circuit simulator, such as HSPICE, when the acceptable value of static current is given.

Arrangement of Supply Voltages Vdd n-1 Vdd 1 Vdd 0 Primary I/p C n-1 Cluster C 1 Cluster C 1 Cluster Primary O/p Vss n-1 Vss 1 Vss 0 Vdd 0 > Vdd 1 > Vdd n-1 and ( Vdd i Vdd i+1 ) > V st

CFMV Structure A combinational circuit can be represented as a directed acyclic graph G = ( V,E ). Proper Directed Cut: [ V 1, V 2 ] is a proper directed cut of G if V 2 contains all the sinks of G, all the boundary vertices of G and all the vertices in their reachable set. C1 is a proper directed cut but not C2

Algorithm for 2 supply voltages DFS (m) 1 For (each vertex v with voltage level m) Do 2 DFS-Visit (v,m); DFS-Visit (v,m) 1 If (v is marked) Then 2 return; 3 If (v is a sink or boundary vertex) Then 4 Mark v; 5 Else 6 For (each fanin vertex u of v) Do 7 DFS-Visit (v,m); 8 If (all the voltage levels of v s fanins are (m+1) ) Then 9 set v s voltage level to (m+1); 10 If (there exists negative slack) Then 11 set v s voltage level back to m; 12 Mark v; Reference: Yeh et al.

Results of CFMV Circuit CVS(5,3) CVS(5,3) CFMV(2 way) CFMV(2 way) Power Red. CPU time Power Red. CPU time C432 0.11% 0.01 4.18% 0.02 C880 17.08% 0.07 14.25% 0.10 C1908 6.53% 0.06 17.36% 0.41 C6288 1.69% 0.44 8.63% 1.97

Summary According to Yeh et al., on average, 9 18% power reduction can be obtained using the CFMV technique. We can observe that the CPU time in this case is more than CVS. I wonder, if we can we improve the CPU time by using partial DFS algorithm here too, without substantially affecting the results.??? This is indeed a very challenging research topic!