Available online at ScienceDirect. Procedia Computer Science 46 (2015 )

Similar documents
Unique Journal of Engineering and Advanced Sciences Available online: Research Article

Wide Fan-In Gates for Combinational Circuits Using CCD

Double Stage Domino Technique: Low- Power High-Speed Noise-tolerant Domino Circuit for Wide Fan-In Gates

[Sri*, 4.(12): December, 2015] ISSN: (I2OR), Publication Impact Factor: 3.785

High-Performance of Domino Logic Circuit for Wide Fan-In Gates Using Mentor Graphics Tools

Power Efficient and Noise Immune Domino Logic for Wide Fan in Gates

High Speed Low Power Noise Tolerant Multiple Bit Adder Circuit Design Using Domino Logic

Noise Tolerance Dynamic CMOS Logic Design with Current Mirror Circuit

EEC 118 Lecture #12: Dynamic Logic

[Vivekanand*, 4.(12): December, 2015] ISSN: (I2OR), Publication Impact Factor: 3.785

Dynamic-static hybrid near-threshold-voltage adder design for ultra-low power applications

VLSI Designed Low Power Based DPDT Switch

Comparative Study of Different Low Power Design Techniques for Reduction of Leakage Power in CMOS VLSI Circuits

A Low-Power 12 Transistor Full Adder Design using 3 Transistor XOR Gates

Low Power &High Speed Domino XOR Cell

UNIT-III GATE LEVEL DESIGN

ISSN: [Kumar* et al., 6(5): May, 2017] Impact Factor: 4.116

Low Power Design of Schmitt Trigger Based SRAM Cell Using NBTI Technique

CHAPTER 3 PERFORMANCE OF A TWO INPUT NAND GATE USING SUBTHRESHOLD LEAKAGE CONTROL TECHNIQUES

Performance Comparison of High-Speed Adders Using 180nm Technology

A HIGH SPEED DYNAMIC RIPPLE CARRY ADDER

Design of High Performance Arithmetic and Logic Circuits in DSM Technology

UNIT-II LOW POWER VLSI DESIGN APPROACHES

Design and Analysis of Sram Cell for Reducing Leakage in Submicron Technologies Using Cadence Tool

LOW POWER CMOS CELL STRUCTURES BASED ON ADIABATIC SWITCHING

the cascading of two stages in CMOS domino logic[7,8]. The operating period of a cell when its input clock and output are low is called the precharge

Low-Power VLSI. Seong-Ook Jung VLSI SYSTEM LAB, YONSEI University School of Electrical & Electronic Engineering

Design and Implementation of Digital CMOS VLSI Circuits Using Dual Sub-Threshold Supply Voltages

Designing of Low-Power VLSI Circuits using Non-Clocked Logic Style

Ultra Low Power Design and High Speed Design of Domino Logic Circuit

1. Short answer questions. (30) a. What impact does increasing the length of a transistor have on power and delay? Why? (6)

ESD-Transient Detection Circuit with Equivalent Capacitance-Coupling Detection Mechanism and High Efficiency of Layout Area in a 65nm CMOS Technology

Power-Area trade-off for Different CMOS Design Technologies

Available online at ScienceDirect. Procedia Computer Science 57 (2015 )

A Novel Approach for High Speed and Low Power 4-Bit Multiplier

Leakage Control Techniques for Designing Robust, Low Power Wide-OR Domino Logic for Sub-130nm CMOS Technologies

Low Power High Performance 10T Full Adder for Low Voltage CMOS Technology Using Dual Threshold Voltage

Design and Analysis of Energy Efficient MOS Digital Library Cell Based on Charge Recovery Logic

IJMIE Volume 2, Issue 3 ISSN:

Technology, Jabalpur, India 1 2

Design of Low Power Vlsi Circuits Using Cascode Logic Style

Read/Write Stability Improvement of 8T Sram Cell Using Schmitt Trigger

Ultra Low Power VLSI Design: A Review

Leakage Power Reduction by Using Sleep Methods

LEAKAGE POWER REDUCTION IN CMOS CIRCUITS USING LEAKAGE CONTROL TRANSISTOR TECHNIQUE IN NANOSCALE TECHNOLOGY

Comparison of High Speed & Low Power Techniques GDI & McCMOS in Full Adder Design

A NOVEL 4-Bit ARITHMETIC LOGIC UNIT DESIGN FOR POWER AND AREA OPTIMIZATION

Performance Analysis of Novel Domino XNOR Gate in Sub 45nm CMOS Technology

ESTIMATION OF LEAKAGE POWER IN CMOS DIGITAL CIRCUIT STACKS

Analysis of Different Topologies of Inverter in 0.18µm CMOS Technology and its Comparision

Design & Analysis of Low Power Full Adder

STATIC POWER OPTIMIZATION USING DUAL SUB-THRESHOLD SUPPLY VOLTAGES IN DIGITAL CMOS VLSI CIRCUITS

Domino CMOS Implementation of Power Optimized and High Performance CLA adder

International Journal of Engineering Trends and Technology (IJETT) Volume 45 Number 5 - March 2017

Ultra-low voltage high-speed Schmitt trigger circuit in SOI MOSFET technology

Design of a Low Voltage low Power Double tail comparator in 180nm cmos Technology

DESIGN OF PARALLEL MULTIPLIERS USING HIGH SPEED ADDER

Implementation of dual stack technique for reducing leakage and dynamic power

Analysis & Design of low Power Dynamic Latched Double-Tail Comparator

International Journal of Scientific & Engineering Research, Volume 6, Issue 7, July ISSN

Analysis of Low Power-High Speed Sense Amplifier in Submicron Technology

FTL Based Carry Look ahead Adder Design Using Floating Gates

A Low Power Array Multiplier Design using Modified Gate Diffusion Input (GDI)

Implementation of High Performance Carry Save Adder Using Domino Logic

2-Bit Magnitude Comparator Design Using Different Logic Styles

Sleepy Keeper Approach for Power Performance Tuning in VLSI Design

A Literature Review on Leakage and Power Reduction Techniques in CMOS VLSI Design

Low Power Adiabatic Logic Design

PROCESS and environment parameter variations in scaled

A HIGH SPEED & LOW POWER 16T 1-BIT FULL ADDER CIRCUIT DESIGN BY USING MTCMOS TECHNIQUE IN 45nm TECHNOLOGY

Investigation on Performance of high speed CMOS Full adder Circuits

Implementation of Carry Select Adder using CMOS Full Adder

International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May ISSN

RELIABILITY ANALYSIS OF DYNAMIC LOGIC CIRCUITS UNDER TRANSISTOR AGING EFFECTS IN NANOTECHNOLOGY

ISSN:

A Case Study of Nanoscale FPGA Programmable Switches with Low Power

A DUAL-EDGED TRIGGERED EXPLICIT-PULSED LEVEL CONVERTING FLIP-FLOP WITH A WIDE OPERATION RANGE

DESIGN & IMPLEMENTATION OF SELF TIME DUMMY REPLICA TECHNIQUE IN 128X128 LOW VOLTAGE SRAM

EFFICIENT VLSI IMPLEMENTATION OF A SEQUENTIAL FINITE FIELD MULTIPLIER USING REORDERED NORMAL BASIS IN DOMINO LOGIC

EE 330 Lecture 43. Digital Circuits. Other Logic Styles Dynamic Logic Circuits

A Literature Survey on Low PDP Adder Circuits

PERFORMANCE ANALYSIS ON VARIOUS LOW POWER CMOS DIGITAL DESIGN TECHNIQUES

Domino Static Gates Final Design Report

EE 330 Lecture 43. Digital Circuits. Other Logic Styles Dynamic Logic Circuits

Design of Robust and power Efficient 8-Bit Ripple Carry Adder using Different Logic Styles

EE 330 Lecture 42. Other Logic Styles Digital Building Blocks

An energy efficient full adder cell for low voltage

COMPREHENSIVE ANALYSIS OF ENHANCED CARRY-LOOK AHEAD ADDER USING DIFFERENT LOGIC STYLES

12-nm Novel Topologies of LPHP: Low-Power High- Performance 2 4 and 4 16 Mixed-Logic Line Decoders

A Survey of the Low Power Design Techniques at the Circuit Level

SCALING power supply has become popular in lowpower

Leakage Power Reduction in 5-Bit Full Adder using Keeper & Footer Transistor

Design and Implementation of ALU Chip using D3L Logic and Ancient Mathematics

Low Power Design for Systems on a Chip. Tutorial Outline

Figure.1. Schematic of 4-bit CLA JCHPS Special Issue 9: June Page 101

Power Efficiency of Half Adder Design using MTCMOS Technique in 35 Nanometre Regime

Design of an Energy Efficient, Low Power Dissipation Full Subtractor Using GDI Technique

Active Decap Design Considerations for Optimal Supply Noise Reduction

Module-3: Metal Oxide Semiconductor (MOS) & Emitter coupled logic (ECL) families

Power Optimization for Ripple Carry Adder with Reduced Transistor Count

Transcription:

Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 46 (2015 ) 1261 1267 International Conference on Information and Communication Technologies (ICICT 2014) Low leakage and high performance tag comparator implemented in 180nm CMOS technology Lidiya Mariam Koshy a,*,jyothish Chandran G a a Department of Electronics and Communication Engineering, Saintgits College of Engineering, Kottayam-689648, India Abstract The parasitic capacitance in the dynamic node increases for wide tag comparator increasing the delay and power consumption. The output signal may also degrade with high performance computing that uses clock frequency over 1GHz. In this paper a circuit is proposed to reduce the delay of the evaluation phase of the 64 bit tag comparator by reducing the parasitic capacitance at the dynamic node. The circuit is applicable for wide fan-in gates. The performance has enhanced by 60-70% with reduced leakage providing reduced voltage degradation of the output signal when compared to the rest of the dynamic circuit under study. The Mentor Graphics tool kit is used to perform pre-layout simulations, circuit layout generation and physical verification of the layout. 2015 2014 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Peer-review under responsibility of organizing committee of the International Conference on Information and Communication Peer-review Technologies under (ICICT responsibility 2014). of organizing committee of the International Conference on Information and Communication Technologies (ICICT 2014) Keywords: Domino logic; tag comparator; evaluation network; wide fan-in gate; current mirror 1. Introduction The proportional scaling of all devices in a circuit definitely results in a reduction of the total silicon area occupied by the circuit, enhancing the overall packing density of the chip. In constant-field scaling the scaling affects the threshold voltage 6. * Corresponding author. Tel.: +914692672236. E-mail address: lidiyamkoshy@gmail.com 1877-0509 2015 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Peer-review under responsibility of organizing committee of the International Conference on Information and Communication Technologies (ICICT 2014) doi:10.1016/j.procs.2015.01.046

1262 Lidiya Mariam Koshy and G. Jyothish Chandran / Procedia Computer Science 46 ( 2015 ) 1261 1267 As a result the subthreshold current may rise exponentially. This becomes critical where small amounts of current may significantly disturb the operation of the circuit degrading its noise immunity. Moreover using wide gates greatly degrades the noise margin and increases the leakage current 11. Device scaling scales down the gate oxide enhancing the short channel effects. Nevertheless, thinner gate oxide leads to exponentially higher gate leakage 9. Demands for high performance computing have boosted the clock frequency over 1 GHz and physical address space has extended up to 50 bit for 64 bit microprocessor. Dynamic logic design is broadly implemented in conventional tag comparator circuits. As the output signals from the SRAM are precharged signals, tag comparator is appropriate for a footless dynamic circuit style 1. Because each 2-input XOR consists of 2 legs, the 50-bit comparator is composed of hundred legs. The large number of legs causes the parasitic capacitance on the dynamic domino node to rise considerably. The dynamic node capacitance increases with the number of nmos transistors in the evaluation network resulting in a dramatic reduction of overall speed of the circuit. The drain capacitance of the multiple nmos contributes the dynamic node capacitance. Moreover, the multiple parallel leaky paths in wide fan-in gates decrease the noise immunity of the circuit. Excessive leakage of the dynamic node may degrade the output signal and makes the circuit unsuitable for high frequency operation. This motivates the need to develop circuit techniques to implement wide fan-in gate. In the worst case input pattern, only one out of eight nmos paths discharge the domino node. Increase in the number of legs in the evaluation path worsens the noise immunity intensifying the subthreshold conduction. Besides, unnecessary leakage may result in false evaluation. The wide fan-in gates are typically employed in the read path of register files, flash memories, tag comparators, programmable logic arrays, and wide multiplexer and De-multiplexer 2. 2. Existing architectures 2.1. Footed domino logic (FDL) Figure 1 shows a domino architecture of an OR gate using a keeper transistor. A clock signal is provided at the gate of the precharge transistor. The clock signal schedules the domino circuit in two phases: precharge phase and evaluation phase. When the clock signal is in a low state, the dynamic node is precharged to a high level. This phase represents precharge phase. During this phase, the signal at the output node does not vary with the input signal. When the clock signal makes a 0 to 1 transition, there are two possibilities: the dynamic node is either discharged to a low level through the pull-down leg or remains high. The potential problem due to charge sharing is that that the voltage at the dynamic node may reduce during evaluation phase. A pmos keeper restores the voltage at the dynamic node. Fig. 1. Footed Domino Logic implementing a wide fan-in OR gate

Lidiya Mariam Koshy and G. Jyothish Chandran / Procedia Computer Science 46 ( 2015 ) 1261 1267 1263 As the size of the keeper transistor increases, the noise immunity increases; however, the performance degrades, and the power consumption increases. The reduction in leakage immunity and lower performance makes keeper upsizing an incompatible solution for scaled domino circuit 5. It is clearly noted that the nmos circuitry of dynamic logic connected in parallel reduces the performance and increases the leakage. Hence, in order to achieve a tradeoff between performance and robustness, the number of nmos transistors in the pull-down leg must be limited 2. Besides, a large keeper is not applicable for high performance application. 2.2. High Speed Domino (HSD) The High Speed Domino 3 logic extends the operation of the domino logic utilizing a different configuration of control circuitry 11. The circuit schematic of the HSD circuit implementing tag comparator is shown in figure 2(a). The precharge phase begins with the rising transition of clock signal. The pull up transistor MP1 precharges the dynamic node when the clock signal goes low. At the inception of evaluation phase, the keeper is turned OFF thereby solving the problem due to contention. 2.3. Leakage current replica (LCR) keeper domino The leakage current replica keeper domino logic is similar to footless dynamic circuit and has a controlled keeper. The circuit schematic of the LCR circuit implementing tag comparator is shown in figure 2(b). Current mirror comprises of transistor Mp2 in diode configuration and transistor Mn1. The size of the transistor Mn1 depends on the size of n-transistors in the PDN structure. The worst case leakage current is replicated by implementing the current mirror. It tracks the leakage current and copies it into the domino logic through the keeper transistor Mk2 4. 3. Proposed Architecture Fig. 2. (a) HSD circuit implementing a tag comparator; (b) LCR circuit implementing a tag comparator. The schematic structure of the proposed circuit to realize wide gate is shown in fig. 3. The evaluation transistors in the pull-up network (PUN) implementing the logical function, is separated from the keeper transistor. The current mirror is connected to the gate of keeper transistor Mk1. One typical situation in a MOSFET is due to subthreshold leakage current that flows when V GS < V Tn. It occurs under the influence of increased drain voltage. Decreasing the channel length increases subthreshold conduction which may become

1264 Lidiya Mariam Koshy and G. Jyothish Chandran / Procedia Computer Science 46 ( 2015 ) 1261 1267 rather critical in short channel MOS transistors which has channel length L<1μm 10. The diode transistor (Md1) forms a serial connection with the evaluation network. During evaluation phase the voltage drop across the diode configuration turning the gate to source voltage of the OFF evaluation MOSFET negative 5. Consequently, the leakage current is reduced. Fig. 3. Proposed circuit implementing tag comparator The first stage comprises of the pull-up network consisting of evaluation transistors. The second stage comprises of the keeper network, which is like a single input footless domino. For the previous circuits, the parasitic capacitance at the node depends on the number of nmos transistors in the evaluation network. The overall performance can be poor owing to large parasitic capacitance at the dynamic node. But in case of the proposed circuit, only one transistor (Mp3) connects the dynamic node to the pull up logic. 4. Simulation and Analysis The dynamic performance of the simulated schematic file can be estimated by Eldo simulator at a nominal temperature of 27 C and 1.8V power supply implemented in Taiwan Semiconductor Manufacturing Company (TSMC) 180-nm process technology. The circuits were simulated using several clock frequency ranging from 0.1 to 1.1 GHz. IC workstation (layout editor tool) is used to generate circuit layout. In order to satisfy the layout design rules, many iterations are performed on the layout ensuring that its topology do not vary considerably. Subsequently, a circuit extraction procedure is accomplished so as to determine the actual transistor sizes and the parasitic capacitances at each node. 4.1. Waveform analysis EZ Wave Viewer displayed the pre-layout simulation waveforms. Figure 4 illustrates the simulated input and output voltage waveforms of FDL circuit realizing tag comparator. If the two inputs to the XOR-logic remains the same, the output remains the same state as that in precharged phase.

Lidiya Mariam Koshy and G. Jyothish Chandran / Procedia Computer Science 46 ( 2015 ) 1261 1267 1265 Fig. 4. Simulated voltage waveforms of FDL circuit realizing tag comparator. Using the proposed logic, when the input signals are applied, evaluation of the output signal takes place when clock is high. V(A0) and V(B0) corresponds to the first bit in the comparator. V(OP) represents the output signal. The circuits are simulated in Eldo platform in the 180-nm technology models at 27 C and 1.8 V supply. Among other domino circuits under study the proposed logic was the only circuit that exhibited true evaluation for all input pattern when simulated at 1GHz clock frequency. The other circuit simulation indicated false evaluation. 4.2. Sizing of keeper transistor The keeper must be strong enough to compensate for any leakage current drawn when the output is floating and the pull-down stack is OFF. Keepers may need to be stronger on wide NOR gates or multiplexers in particularly leaky processes. However, strong keepers also increase delay. For small dynamic gates the keeper must be weaker than a minimum sized transistor 8. Table 1 illustrates the upshot of testing FDL circuit and proposed circuit with various sizes of keeper. Simulation of 64 input FDL circuit resulted in false evaluation as keeper transistor width was quite low (< 2W). Table 1. Testing FDL and proposed logic by keeper upsizing Pull up transistor size (width) Footed Domino Proposed logic W Fail Pass 2W Pass Pass 3W Pass Pass 4.3. Performance and power utilization Using Eldo platform a comparative study relating to the delay, size and power utilization is carried out. The overall performance depends on the dynamic node capacitance that loads the dynamic gate. So as to avoid contention of the keeper transistor with evaluation network, the keeper transistor should be turned OFF quite sooner. The table 2 gives a comparative analysis of area, power delay estimates of the circuits under study. From the table it can be noted that the performance of the proposed circuit has been increased by 60-70% at the cost of higher power consumption. Voltage degradation of the output signal does not occur due to the minimum leakage current. Domino dissipates large power in driving the capacitance of the clock lines, which switch at full rate 7.

1266 Lidiya Mariam Koshy and G. Jyothish Chandran / Procedia Computer Science 46 ( 2015 ) 1261 1267 1 PDP 0.8 0.6 0.4 0.2 FDL HSD LCR Proposed circuit 0 Domino circuit Fig. 5. Power delay product of the domino architectures for 64 bit tag comparator Table 2. Comparison of delay and power for 64 bit tag comparator Footed Domino High Speed Domino Leakage current Replica Proposed circuit Transistor count 261 267 264 276 Clock to output delay (ps) 381 552 333 115 Normalized CQ delay 1 1.45 0.87 0.35 Power consumed(nw) 1.43 1.63 1.72 2.1 Another significant parameter is PDP or power delay product, which is equal to Pavg t DQ where Pavg is the circuit power consumption and t DQ represents the DQ circuit delay. The High Speed Domino logic shows a high value of PDP in the range of 10-18 Ws. Figure 5 illustrates the PDP of the domino circuits under study. As the proposed circuit has the least PDP, it can be used to implement low leakage and high performance gates. 4.4. Performance variation due to varied temperature The FDL and the proposed circuit were simulated under wide degree of temperature and noted the delay of the circuits. The temperature was varied between 10 to 115 C. Figure 6 illustrates the delay variation for the conventional FDL circuit and the proposed circuit logic for varying temperature range. There is a wide deviation in delay for the existing FDL circuit when compared to the proposed circuit for higher temperatures which restricts its application in high temperature applications. Delay (ps) 1000 500 0 10 46 82 118 FDL 393 487 653 773 Proposed circuit 81 91 103 114 Temperature ( C) FDL Proposed circuit Fig. 6. Delay variation under varying temperature range

Lidiya Mariam Koshy and G. Jyothish Chandran / Procedia Computer Science 46 ( 2015 ) 1261 1267 1267 A layout editor being an interactive graphic platform, allows the creation and deletion of layout elements. The layout of the simulated circuits can be created from the viewpoints of the circuits created by the Eldo simulator. The circuit passed the Design Rule Check. Design rules allow the scaling of geometrical features linearly and proportionally. The layout uses only two metal for metal to metal interconnections. Layout contains information on what patterns have to be made on the wafer. Wires cannot be drawn at arbitrary angles only horizontal and vertical wire segments are allowed 12. Masks are created using the layout information provided by the designer. The circuit layout was implemented in IC workstation in TSMC 180nm technology. 5. Conclusion The charge sharing problem degrades the output signal while implementing wide tag comparator. The 64 bit tag comparator was implemented using various domino approaches and simulated. The schematic entry of the 64 bit tag comparator was done using Mentor graphics Design Architect tool. Simulation was done with Mentor graphics Eldo using TSMC 180nm CMOS technology at 1.8V supply voltage. It is found that the proposed circuit has 70% lower C-Q delay than conventional FDL. The PDP is the lowest for the proposed logic and highest for HSD logic. The size of the pull up transistor of proposed circuit is relatively much smaller when compared to the conventional dynamic logic. Even during the variation of temperature there is least variation of performance. Hence this logic is better circuit for implementing wide fan-in gates. When certain input signal was applied to the existing domino logic a degraded output waveforms were obtained at a clock frequency of 1GHz. If these circuits are cascaded with any other circuit false evaluation may arise as a result of degraded waveforms of the domino logic. There is nonconformity in delay during temperature variation for the conventional dynamic circuit. The proposed circuit sustains a persistent delay upto 115 C temperature rise. References 1. Hiroaki Suzuki, Kim CH, Kaushik Roy. Fast tag comparator using diode partitioned domino for 64-bit microprocessors. IEEE Trans on Circuits Syst I Reg Papers 2007;54:322-8. 2. Ali Peiravi, Mohammad Asyaei. Current-comparison-based domino: new low-leakage high-speed domino circuit for wide fan-in gates. IEEE Trans on Very Large Scale (VLSI) Syst 2013;21:934-943. 3. Anis MH, Allam MW, Elmasry MI. Energy-efficient noise-tolerant dynamic styles for scaled-down CMOS and MTCMOS technologies. IEEE Trans Very Large Scale (VLSI) Syst 2002;10:71-8. 4. Lih Y, Tzartzanis N, Walker WW. A leakage current replica keeper for dynamic circuits. IEEE J Solid-State Circuits 2007;42:48-55. 5. Hamid Mahmoodi, Kaushik Roy. Diode-footed domino: A leakage-tolerant high fan-in dynamic circuit design style. IEEE Trans Circuits Syst I Reg Papers 2004;51:495-503. 6. Sung Mo Kang, Yusuf Leblebici. CMOS digital integrated circuits analysis and design. 3rd ed. New York: Mc Graw Hill; 2003. 7. Anis MH, Allam MW, Elmasry MI. Impact of technology scaling on CMOS logic styles. IEEE Trans on Circuits Syst II Analog and Digital Signal Processing 2002;49:577-588 8. Neil HE Weste, David Harris, Ayan Banerjee. CMOS VLSI design- a circuits and systems perspective. 3rd ed. Pearson publication. 9. Moradi F et al. Domino logic designs for high-performance and leakage-tolerant applications. The VLSI Journal 2012. 10. John P Uyemura. CMOS logic circuit design. Springer US; 2001. 11. Lidiya Mariam Koshy, Jyothish CG. Sub- 0.18μm low leakage and high performance dynamic logic wide fan-in gates. Proc AICERA 2014. 12. Wayne Wolf. Modern VLSI design- systems on silicon. 2nd ed. Prentice Hall; 1998.