Glitch Power Reduction for Low Power IC Design

This document is an author-formatted work. The definitive version for citation appears as: N. Weng, J. S. Yuan, R. F. DeMara, D. Ferguson, and M. Hagedorn, "Glitch Power Reduction for Low Power IC Design," in Proceedings of the Ninth Annual NASA Symposium on VLSI Design, pp. 7.5.1-7.5.7, Albuquerque, New Mexico, U.S.A., November 8-9, 2000.

N. Weng (1), J. S. Yuan (1), Scott C. Smith (1), R. DeMara (1), D. Ferguson (2), and M. Hagedorn (2)

(1) Chip Design and Reliability Laboratory, University of Central Florida, Orlando, FL 32816; tel: (407) 823-5719, fax: (407) 823-5835, email: yuanj@mail.ucf.edu
(2) Theseus Logic, Inc., 3501 Quadrangle Blvd., Suite 100, Orlando, FL 32817

Abstract: The glitch power of an NCL multiplier is studied. The threshold gates with hysteresis largely eliminate spurious power glitches. The supply voltage signal bounce of the NCL circuit is also reduced drastically compared to its Boolean counterpart.

1 Introduction

Because of the rapid growth of portable electronics, high-density integrated circuits with low energy consumption and low electromagnetic interference (EMI) at high speeds are needed. It is well known that dynamic power dissipation is directly related to the number of signal transitions in the circuit. Functional signal transitions are desirable, whereas spurious transitions (or glitches), caused by unequal propagation delays of the input signals to a gate, are not. Glitches multiply as they propagate through a combinational logic block and can account for 20 to 70 percent of signal transitions [1]. In Boolean design, gate sizing and gate registration are used to reduce glitches. However, these techniques are delay sensitive and cannot truly eliminate glitches, because layout parasitics in deep submicron CMOS technology still produce unequal propagation delays of the input signals.
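As an illustration of how unequal input arrival times create a spurious transition, the following minimal Python sketch samples an ideal two-input XOR gate whose inputs both rise from 0 to 1. It is not taken from the paper: the function names, the integer time steps, and the choice of an XOR gate are assumptions made only for this example.

```python
def xor_output_trace(arrival_a, arrival_b, steps=6):
    """Sample the output of an ideal XOR gate at integer time steps.

    Both inputs rise from 0 to 1; arrival_a and arrival_b are the
    (hypothetical) time steps at which each edge reaches the gate.
    """
    trace = []
    for t in range(steps):
        a = 1 if t >= arrival_a else 0
        b = 1 if t >= arrival_b else 0
        trace.append(a ^ b)
    return trace


def count_transitions(trace):
    """Count output toggles; each toggle charges or discharges the load."""
    return sum(1 for prev, curr in zip(trace, trace[1:]) if prev != curr)


# Equal arrival times: the output never moves.
balanced = xor_output_trace(arrival_a=2, arrival_b=2)
# Input b arrives two steps late: the output pulses high, then returns low.
skewed = xor_output_trace(arrival_a=2, arrival_b=4)

print(balanced, count_transitions(balanced))
print(skewed, count_transitions(skewed))
```

With equal arrival times the output stays at 0 and makes no transitions. With the skewed arrival the output makes two transitions even though its final value is unchanged, and both of them dissipate dynamic power without doing useful work.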
In this paper, we propose the use of NULL Convention Logic (NCL) threshold gates and NCL design paradigms to eliminate spurious signal transitions. A 4-bit by 4-bit multiplier is designed for the evaluation of glitch power and of signal bounce caused by supply voltage variation. The effect of voltage scaling on the clockless circuit is also evaluated.

2 NULL Convention Logic

NCL, patented by Theseus Logic, Inc., integrates control within the data path [2]. A combination of NCL threshold gates and NCL design paradigms ensures delay insensitivity. The design paradigms include monotonic data transitions, mutually exclusive assertion groups (MEAGs), completion of DATA, and completion of NULL. NCL is inherently robust and reliable against temperature, process, and voltage variations. This enables NCL circuits to run faster when conditions are favorable, while remaining functionally correct under adverse conditions. Clocked designs, in contrast, must be developed for the worst-case condition and generally operate at the worst-case rate.

The fundamental building blocks of NCL are threshold M-of-N gates with hysteresis [3]. A threshold M-of-N gate operates as follows. For positive threshold gate logic, if the initial output state is 0, the gate requires at least M of its N inputs to be 1 before asserting 1 on its output. Once the output asserts 1, it remains in the 1 state until all N inputs become 0 [4].

In NCL design, all glitches are eliminated because of monotonic data transitions and completion of DATA and NULL. In addition, NCL circuits operate with alternating wavefronts of DATA and NULL. The randomly distributed switching activities of clockless NCL circuits spread the power spectrum on the power bus, thus reducing noise spikes and EMI.

3 NCL Multiplier Design

Figure 1 shows a 4x4 multiplier designed using NCL. In Fig. 1, FA represents a full adder, [...] represents a half adder, and [...] represents the completion circuitry. In addition, 8-bit, 12-bit, and 13-bit NCL registers are used in the pipelined multiplier to improve throughput.
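The behaviour of the threshold M-of-N gate with hysteresis described in Section 2 can be sketched as a small behavioural model. This is an illustrative Python sketch only, not the CMOS implementation of [4]; the class name, the `update` method, and the assumed reset state of 0 are hypothetical choices made for the example.

```python
class ThresholdGate:
    """Behavioural model of a positive M-of-N threshold gate with hysteresis."""

    def __init__(self, m, n):
        if not 1 <= m <= n:
            raise ValueError("require 1 <= m <= n")
        self.m = m
        self.n = n
        self.output = 0  # assumption: the gate starts in the 0 (NULL) state

    def update(self, inputs):
        """Apply one set of N input values and return the gate output."""
        if len(inputs) != self.n:
            raise ValueError("expected %d inputs" % self.n)
        ones = sum(1 for bit in inputs if bit)
        if ones >= self.m:
            self.output = 1   # threshold reached: assert the output
        elif ones == 0:
            self.output = 0   # all inputs back at 0: release the output
        # otherwise hold the previous output (hysteresis)
        return self.output


# A 2-of-3 gate driven through a monotonic rise and fall of its inputs.
gate = ThresholdGate(2, 3)
stimulus = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (1, 0, 0), (0, 0, 0)]
trace = [gate.update(vector) for vector in stimulus]
print(trace)
```

The fourth vector has only one input at 1, yet the output stays at 1 because the gate has not yet seen all inputs return to 0. That state-holding is the hysteresis that keeps a partially arrived wavefront from toggling the output.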

Fig. 1. 4x4 NCL multiplier (inputs X3-X0, Y3-Y0, and Reset; outputs S7-S0; full adders (FA) pipelined through 8-bit, 13-bit, 12-bit, and 8-bit NCL registers)

4 Power Evaluation Technique

To accurately determine the power dissipation of the NCL multiplier, the test bench circuit shown in Fig. 2 is adopted. An 8-bit timer (counter) is connected to the input of the multiplier, and the output of the multiplier is connected to a detection circuit. The timer starts from 00000000 and generates all 256 possible DATA input combinations as well as the intermediate NULL inputs. The detection circuit functions only when the output of the multiplier changes. It acknowledges complete DATA and NULL outputs and requests the next NULL and DATA output, respectively. The schematic of the timer is shown in Fig. 3.

Fig. 2. Self-timed multiplier (blocks Timer8, MUL4x4, and Detect; signals Reset, x, y, S, ki, and ko)

Fig. 3. NCL 8-bit timer using three register stages (increment circuitry and three NCL registers on 8-bit buses; one register resets to DATA 0 and two reset to NULL)
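The stimulus sequence produced by the test bench can be sketched in software as follows. This is a hedged behavioural sketch, not a model of the NCL circuits themselves: the `NULL` marker, the function names, and in particular the assignment of the low four counter bits to X and the high four bits to Y are assumptions, since the text does not state the bit mapping.

```python
NULL = None  # hypothetical marker for a NULL wavefront


def timer_wavefronts(bits=8):
    """Yield each counter value as a DATA wavefront, followed by a NULL."""
    for count in range(2 ** bits):
        yield count   # DATA wavefront
        yield NULL    # intermediate NULL wavefront


def split_operands(count, width=4):
    """Split the counter value into two operands.

    Assumption: the low `width` bits drive X and the next `width` bits drive Y.
    """
    mask = (1 << width) - 1
    x = count & mask
    y = (count >> width) & mask
    return x, y


def run_testbench():
    """Collect the product for every DATA wavefront, skipping NULL wavefronts."""
    products = []
    for wave in timer_wavefronts():
        if wave is NULL:
            continue  # a NULL wavefront carries no data
        x, y = split_operands(wave)
        products.append(x * y)
    return products


waves = list(timer_wavefronts())
products = run_testbench()
print(len(waves), len(products), max(products))
```

Starting the counter at 0 and stepping through all 8-bit values exercises every one of the 256 operand pairs exactly once, with a NULL wavefront between consecutive DATA wavefronts.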

5 Results and Discussions

The NCL multiplier is simulated in Cadence SPICE using the 0.18 µm CMOS technology models from TSMC. The simulated power current waveforms are shown in Fig. 4.

Fig. 4. Power current curves of the NCL multiplier

In Fig. 4, the upper left plot covers the entire time window of 256 data patterns, the upper right plot focuses on the window of one NULL-DATA-NULL cycle, and the bottom plot shows the switching activities. The current waveforms in Fig. 4 are obtained from the supply voltage source in the Cadence simulation. The switching current consists of the short-circuit current, which flows when both the p-channel and n-channel transistors are on, and the capacitive current resulting from the charging and discharging of capacitances. The power is evaluated by integrating the product of current and voltage over the 256 input combinations and then averaging the result.

The effect of signal bounce from Vdd on the NCL design is evaluated. The power bus line is modeled by a simple lumped RLC network to produce voltage variation. As shown in Fig. 5, the internal Vdd of the multiplier dips in the initial transient and then returns to normal. The responses of the Boolean and NCL multipliers to the Vdd variation are recorded in Figs. 6 and 7, respectively. In the NCL multiplier, threshold gates and NCL registers are used. In the Boolean multiplier, standard Boolean logic gates and equivalent registers are used.
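The power evaluation described above (integrating the product of current and voltage and then averaging) can be sketched numerically. The Python below is a minimal illustration assuming sampled waveforms and trapezoidal integration; the function names are hypothetical, and the sample values in the usage lines are made up for the example and are not results from the paper.

```python
def supply_energy(times, currents, voltages):
    """Integrate i(t) * v(t) over time with the trapezoidal rule (joules)."""
    if not (len(times) == len(currents) == len(voltages)):
        raise ValueError("waveforms must have the same number of samples")
    if len(times) < 2:
        raise ValueError("need at least two samples")
    energy = 0.0
    for k in range(1, len(times)):
        dt = times[k] - times[k - 1]
        p_prev = currents[k - 1] * voltages[k - 1]
        p_curr = currents[k] * voltages[k]
        energy += 0.5 * (p_prev + p_curr) * dt
    return energy


def average_power(times, currents, voltages):
    """Average power over the whole simulation window (watts)."""
    return supply_energy(times, currents, voltages) / (times[-1] - times[0])


# Illustrative samples only: a constant 2 mA drawn from 1.8 V for 1 microsecond.
t = [0.0, 0.5e-6, 1.0e-6]
i = [2e-3, 2e-3, 2e-3]
v = [1.8, 1.8, 1.8]
print(supply_energy(t, i, v), average_power(t, i, v))
```

In practice the sample arrays would come from the simulator's exported supply current and Vdd waveforms covering all 256 input patterns.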

Fig. 5. Vdd signal variation

Fig. 6. Power current curves of Boolean circuit

Fig. 7. Power current curves of NCL circuit

It is clear from Figs. 6 and 7 that the signal bounce due to Vdd variation is significant for the Boolean circuit, while the NCL circuit suppresses the power bus signal bounce within a very short period of time.

The adaptability of NCL circuits to voltage scaling is also evaluated. The following Table shows the current, energy per input pattern (or instruction), and performance of the NCL multiplier at Vdd = 1.5, 2, 2.5, 3, and 3.3 V.

In the Table, current is the average value obtained from the supply voltage source in the Cadence simulation. Energy is calculated by integrating the product of current and voltage over a period of time. Performance, in millions of instructions per second (MIPS), is the total number of instructions (or multiplications) divided by the time needed to complete them, scaled by 10^-6. In the 4x4 multiplier, the total number of instructions is 256. Using more Vdd data points, smoother curves of current, energy per instruction, and performance are displayed in Fig. 8.

Fig. 8. Current, energy, and performance versus supply voltage (plot title: Performance Adaptation vs. Vdd; x-axis: supply voltage [V], 1.5 to 3.5; left y-axis: energy per instruction [nJ], 0 to 1.4; right y-axis: performance [MIPS] and current [mA], 0 to 12)

It is clear from Fig. 8 that the power current and energy per instruction increase quickly as the supply voltage increases. When the supply voltage decreases, the decrease in energy per instruction is much larger than the decrease in performance (MIPS). This implies a significant advantage for NCL design techniques in low-voltage and low-power applications.
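The two figures of merit used in this comparison can be written as short helper functions. This is a sketch assuming SI units for the inputs; the function names are hypothetical, and the numbers in the usage lines are placeholders chosen for easy arithmetic, not values from the paper's Table.

```python
def performance_mips(instructions, total_time_s):
    """Millions of instructions per second for a completed run."""
    if total_time_s <= 0:
        raise ValueError("total_time_s must be positive")
    return instructions / total_time_s / 1e6


def energy_per_instruction_nj(total_energy_j, instructions):
    """Average energy per instruction, converted from joules to nanojoules."""
    if instructions <= 0:
        raise ValueError("instructions must be positive")
    return total_energy_j / instructions * 1e9


# Placeholder run: 256 multiplications in 25.6 microseconds using 256 nJ in total.
mips = performance_mips(256, 25.6e-6)
nj = energy_per_instruction_nj(256e-9, 256)
print(mips, nj)
```

Evaluating both functions at each supply voltage gives the data points needed to compare how quickly energy per instruction and performance fall as Vdd is scaled down.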

6 Conclusion

The power glitches, signal bounce, and supply voltage scaling effects on the NCL multiplier are evaluated. The SPICE simulation results show that the hysteresis threshold gates of NCL circuits, together with the NCL design paradigms, eliminate glitches caused by skew between input signals. The NCL design shows robustness in low-voltage and low-power applications. The randomly distributed switching activities of clockless NCL circuits generally spread the signal spectrum on the power bus, which reduces power bus spikes and EMI. Further study will be conducted to demonstrate the low EMI emission of NCL circuits by experiment.

References

[1] A. Shen, A. Ghosh, S. Devadas, and K. Keutzer, Proc. ICCAD, pp. 402-407, 1992.
[2] Karl M. Fant and Scott A. Brandt, NULL Convention Logic: A Complete and Consistent Logic for Asynchronous Digital Circuit Synthesis, International Conference on Application Specific Systems, Architectures, and Processors, pp. 261-273, 1996.
[3] C. Wang et al., Technology Independent Design Using NULL Convention Logic, www.theseus.com/techind/index.html, 1998.
[4] Gerald E. Sobelman and Karl M. Fant, CMOS Circuit Design of Threshold Gates with Hysteresis, IEEE International Symposium on Circuits and Systems (II), pp. 61-65, 1998.
[5] G. K. Yeap, Practical Low Power Digital VLSI Design, Kluwer Academic: Boston, 1998.
[6] R. Jacob Baker, Harry W. Li, and David E. Boyce, CMOS Circuit Design, Layout, and Simulation, IEEE Press: New Jersey, 1998.