A gate sizing and transistor fingering strategy for

Similar documents
NOVEL OSCILLATORS IN SUBTHRESHOLD REGIME

SUBTHRESHOLD CIRCUIT DESIGN FOR HIGH PERFORMANCE

Low Power Realization of Subthreshold Digital Logic Circuits using Body Bias Technique

1. Short answer questions. (30) a. What impact does increasing the length of a transistor have on power and delay? Why? (6)

Highly linear common-gate mixer employing intrinsic second and third order distortion cancellation

CHAPTER 3 PERFORMANCE OF A TWO INPUT NAND GATE USING SUBTHRESHOLD LEAKAGE CONTROL TECHNIQUES

ECEN 474/704 Lab 7: Operational Transconductance Amplifiers

A NEW APPROACH FOR DELAY AND LEAKAGE POWER REDUCTION IN CMOS VLSI CIRCUITS

Energy Efficiency of Power-Gating in Low-Power Clocked Storage Elements

An Efficient Design of CMOS based Differential LC and VCO for ISM and WI-FI Band of Applications

Circuit level, 32 nm, 1-bit MOSSI-ULP adder: power, PDP and area efficient base cell for unsigned multiplier

Microelectronics Circuit Analysis and Design. MOS Capacitor Under Bias: Electric Field and Charge. Basic Structure of MOS Capacitor 9/25/2013

problem grade total

A new 6-T multiplexer based full-adder for low power and leakage current optimization

Lecture 4. The CMOS Inverter. DC Transfer Curve: Load line. DC Operation: Voltage Transfer Characteristic. Noise in Digital Integrated Circuits

Microelectronics Circuit Analysis and Design

Conduction Characteristics of MOS Transistors (for fixed Vds)! Topic 2. Basic MOS theory & SPICE simulation. MOS Transistor

Topic 2. Basic MOS theory & SPICE simulation

Conduction Characteristics of MOS Transistors (for fixed Vds) Topic 2. Basic MOS theory & SPICE simulation. MOS Transistor

5. CMOS Gates: DC and Transient Behavior

Near-threshold Computing of Single-rail MOS Current Mode Logic Circuits

Design of an Efficient Phase Frequency Detector for a Digital Phase Locked Loop

SCALING power supply has become popular in lowpower

Performance Analysis of Energy Efficient and Charge Recovery Adiabatic Techniques for Low Power Design

Novel Buffer Design for Low Power and Less Delay in 45nm and 90nm Technology

Leakage Current Analysis

Impact of Logic and Circuit Implementation on Full Adder Performance in 50-NM Technologies

Design of Multiplier using Low Power CMOS Technology

Written Examination on. Wednesday October 17, 2007,

Electronics Basic CMOS digital circuits

Design cycle for MEMS

Comparison of High Speed & Low Power Techniques GDI & McCMOS in Full Adder Design

Session 10: Solid State Physics MOSFET

Sub-threshold Logic Circuit Design using Feedback Equalization

A CMOS CURRENT CONTROLLED RING OSCILLATOR WITH WIDE AND LINEAR TUNING RANGE

DIGITAL VLSI LAB ASSIGNMENT 1

Aarthi.P, Suresh Kumar.R, Muniraj N. J. R, International Journal of Advance Research, Ideas and Innovations in Technology.

A Study on the Characteristics of a Temperature Sensor with an Improved Ring Oscillator

Ultra-low voltage high-speed Schmitt trigger circuit in SOI MOSFET technology

An Analytical Model for Current, Delay, and Power Analysis of Submicron CMOS Logic Circuits

PERFORMANCE ANALYSIS ON VARIOUS LOW POWER CMOS DIGITAL DESIGN TECHNIQUES

DESIGN AND ANALYSIS OF LOW POWER CHARGE PUMP CIRCUIT FOR PHASE-LOCKED LOOP

DESIGN AND ANALYSIS OF SUB 1-V BANDGAP REFERENCE (BGR) VOLTAGE GENERATORS FOR PICOWATT LSI s.

Leakage Power Reduction for Logic Circuits Using Variable Body Biasing Technique

Sub-threshold Leakage Current Reduction Using Variable Gate Oxide Thickness (VGOT) MOSFET

Jan Rabaey, «Low Powere Design Essentials," Springer tml

LOW POWER VLSI TECHNIQUES FOR PORTABLE DEVICES Sandeep Singh 1, Neeraj Gupta 2, Rashmi Gupta 2

EEC 118 Spring 2010 Lab #1: NMOS and PMOS Transistor Parameters

A Low Power Array Multiplier Design using Modified Gate Diffusion Input (GDI)

ECE520 VLSI Design. Lecture 5: Basic CMOS Inverter. Payman Zarkesh-Ha

Energy Reduction of Ultra-Low Voltage VLSI Circuits by Digit-Serial Architectures

Design of Multiplier Using CMOS Technology

Chapter 2 CMOS at Millimeter Wave Frequencies

Design of a Folded Cascode Operational Amplifier in a 1.2 Micron Silicon-Carbide CMOS Process

Robust Ultra-Low Power Sub-threshold DTMOS Logic Λ

[Singh*, 5(3): March, 2016] ISSN: (I2OR), Publication Impact Factor: 3.785

Transistor Network Restructuring Against NBTI Degradation. P. F. Butzen a, V. Dal Bem a, A. I. Reis b, R. P. Ribas b.

Minimizing the Sub Threshold Leakage for High Performance CMOS Circuits Using Stacked Sleep Technique

Design of Two High Performance 1-Bit CMOS Full Adder Cells

INTERNATIONAL JOURNAL OF APPLIED ENGINEERING RESEARCH, DINDIGUL Volume 1, No 3, 2010

Sub-threshold Design using SCL for Low Power Applications

Low-Power VLSI. Seong-Ook Jung VLSI SYSTEM LAB, YONSEI University School of Electrical & Electronic Engineering

Process-sensitive Monitor Circuits for Estimation of Die-to-Die Process Variability

IMPLEMENTATION OF POWER GATING TECHNIQUE IN CMOS FULL ADDER CELL TO REDUCE LEAKAGE POWER AND GROUND BOUNCE NOISE FOR MOBILE APPLICATION

ECE 3110: Engineering Electronics II Fall Final Exam. Dec. 16, 8:00-10:00am. Name: (78 points total)

Analysis and Design of Low Power Ring Oscillators with Frequency ~ khz

Total reduction of leakage power through combined effect of Sleep stack and variable body biasing technique

Stacked-FET linear SOI CMOS SPDT antenna switch with input P1dB greater than

Chapter 3 DESIGN OF ADIABATIC CIRCUIT. 3.1 Introduction

Leakage Power Reduction in CMOS VLSI Circuits

Low Power Design of Schmitt Trigger Based SRAM Cell Using NBTI Technique

An Analysis of Novel CMOS Ring Oscillator Using LECTOR Technique with Minimum Leakage

Digital Electronics Part II - Circuits

CPE/EE 427, CPE 527 VLSI Design I CMOS Inverter. CMOS Inverter: A First Look

PROCESS and environment parameter variations in scaled

Week 9a OUTLINE. MOSFET I D vs. V GS characteristic Circuit models for the MOSFET. Reading. resistive switch model small-signal model

2-Bit Magnitude Comparator Design Using Different Logic Styles

Low Power, Area Efficient FinFET Circuit Design

UNIT 3: FIELD EFFECT TRANSISTORS

Design of Single Phase Continuous Clock Signal Set D-FF for Ultra Low Power VLSI Applications

Temperature-adaptive voltage tuning for enhanced energy efficiency in ultra-low-voltage circuits

Chapter 6 DIFFERENT TYPES OF LOGIC GATES

Power Spring /7/05 L11 Power 1

DIGITAL INTEGRATED CIRCUITS A DESIGN PERSPECTIVE 2 N D E D I T I O N

Design Analysis of 1-bit Comparator using 45nm Technology

Designing of Low-Power VLSI Circuits using Non-Clocked Logic Style

A low-variation on-resistance CMOS sampling switch for high-speed high-performance applications

Design of Low Power Vlsi Circuits Using Cascode Logic Style

Low Voltage SC Circuit Design with Low - V t MOSFETs

An energy efficient full adder cell for low voltage

CHAPTER 6 PHASE LOCKED LOOP ARCHITECTURE FOR ADC

Lecture 13 CMOS Power Dissipation

Robust Subthreshold Circuit Designing Using Sub-threshold Source Coupled Logic (STSCL)

Design of High Performance Arithmetic and Logic Circuits in DSM Technology

3.CMOS Inverter-homework

ISSN Vol.04, Issue.05, May-2016, Pages:

Comparative Study of Different Low Power Design Techniques for Reduction of Leakage Power in CMOS VLSI Circuits

Study and Analysis of CMOS Carry Look Ahead Adder with Leakage Power Reduction Approaches

EE 42/100 Lecture 23: CMOS Transistors and Logic Gates. Rev A 4/15/2012 (10:39 AM) Prof. Ali M. Niknejad

Jack Keil Wolf Lecture. ESE 570: Digital Integrated Circuits and VLSI Fundamentals. Lecture Outline. MOSFET N-Type, P-Type.

Transcription:

LETTER IEICE Electronics Express, Vol.9, No.19, 1550 1555 A gate sizing and transistor fingering strategy for subthreshold CMOS circuits Morteza Nabavi a) and Maitham Shams b) Department of Electronics, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario K1S 5B6, Canada a) nabavi@doe.carleton.ca b) shams@doe.carleton.ca Abstract: Parallel Transistor Stacks (PTS) has been shown to be an effective technique for improving the speed of digital circuits operating in the subthreshold region which comes at the cost of power consumption and area. However, our experience shows that using PTS is not beneficial in all cases. In this paper, we present a methodology to identify whether using PTS is beneficial (or not) in a particular CMOS technology and what transistor sizing can be employed to maximize the circuit speed. Our technique is based on analyzing the Current-Over- Capacitance (COC) ratio of PMOS and NMOS transistors. The results of incorporating the proposed methodology in a 4-bit comparator and a 19-stage inverter ring oscillator, using 90 nm CMOS technology, illustrate 26% and 40% extra improvement compared to the blind use of PTS, respectively. Keywords: VLSI, CMOS, logic design, subthreshold circuits Classification: Integrated circuits References [1] B. H. Calhoun, A. Wang, and A. P. Chandrakasan, Modeling and Sizing for Minimum Energy Operation in Sub-threshold Circuits, IEEE J. Solid- State Circuits, vol. 40, no. 9, pp. 1778 1786, Sept. 2005. [2] M. Muker and M. Shams, Designing digital subthreshold CMOS circuits using parallel transistor stacks, IET Electronics Letter, vol. 47, no. 6, pp. 372 374, March 2011. Also see the featured interviews on page 354 in the same issue. [3] J. M. Rabaey, A. Chandrakasn, and B. Nikolic, Digital Integrated Crcuits: A Design Perspective, 2nd ed., Pearson Education, Toronto, 2003. 1 Introduction Digital circuits operating in the subthreshold region benefit from very low power consumption at the cost of speed [1]. In order to improve the speed of subthreshold circuits, Muker and Shams [2] introduced the Parallel Transistor Stacks (PTS) structure that, for example, enhances the speed of a 9-stage 1550

ring oscillator by 2.63 times compared to the conventional transistor sizing optimization method at a supply voltage of 0.3 V in a 65 nm LP CMOS technology. However, there are CMOS technologies in which using PTS blindly may not improve the speed of a subthreshold circuit. In this paper, we describe a methodology to identify the case when using PTS is beneficial for improving the speed of a digital circuit operating in the subthreshold region. We also elaborate on obtaining the appropriate transistor sizing where applying PTS is advised. 2 Propagation delay The propagation delay (D) for charging or discharging a node is given by [3]: D = CV DD, (1) 2I where C is the load capacitance, I denotes the average current, and V DD is the supply voltage. As (1) indicates, higher speeds (i.e., lower propagation delays) can be achieved by increasing the current and lowering the capacitance. Therefore, the highest operating frequency is attained when the Current- Over-Capacitance (COC) ratio is maximized. This is often exactly or close to the transistor s minimum width in modern nanometer CMOS technologies, as a result of the so-called Inverse-Narrow-Width-Effect (INWE) [2]. Fig. 1 shows the COC ratio versus the width of both PMOS and NMOS transistors in a 130 nm CMOS technology. The transistors are biased for maximum current drive in the subthreshold region (i.e., V DS = V GS ). The capacitance includes the total gate and junction capacitance of the transistor. Based on this figure, in order to achieve a higher COC ratio we either use the minimum transistor width (point A) or use PTS locked to the minimum width. Nevertheless, the maximum COC ratio is not always associated with the minimum transistor width, as shown in Fig. 1 (b). The COC ratio for the PMOS transistor shown in this figure consists of two regions. In Region 1 Fig. 1. Current over capacitance ratio as a function of width for transistors in a 130 nm CMOS technology: a) NMOS, b) PMOS. 1551

the COC ratio corresponding to the widths larger than the minimum width have a lower value compared to the COC ratio at the minimum width. On the other hand, the widths located in Region 2 have a COC ratio larger than the minimum width. Based on this figure, we use PTS for transistor widths in Region 1 and avoid PTS for transistor widths in Region 2. Applying PTS on the transistors in Region 2 will result in a lower circuit speed. 3 Methodology The following steps explain our methodology on how to identify the cases when to use and when not to use PTS in designing a logic circuit. 1- Plot the COC ratio versus width for each transistor type (PMOS and NMOS) for the given technology kit (e.g., Fig. 1). 2- Based on Step 1, identify the transistor width (W INWE ) associated with the local maximum COC ratio, which typically appears near the minimum width due to INWE [2]. As shown in Fig. 1, point A denotes W INWE for NMOS transistor and point C denotes W INWE for PMOS transistor. 3- Apply a transistor sizing optimization method such as Logical Effort [3] to optimize the circuit performance. 4- Find the optimum width (W opt ) for each transistor in the circuit based on Step 3. 5- Compare COC opt, the COC ratio corresponding to W opt (Step 4), with COC INWE, the COC ratio corresponding to W INWE (Step 2). a. If COC opt < COC INWE, use PTS, and split the transistor of width W opt into multiple (K) transistors of width W INWE, where K = W opt /W INWE rounded to the nearest digit. For example, if Logical Effort finds that an NMOS transistor should have a width of 1.6 µm (point B in Fig. 1 (a)) we split it into K = W opt /W INWE = 10 transistors at the width of 160 nm (point A in Fig. 1 (a)). b. Else If COC opt COC INWE, using PTS results in a slower circuit. In this case we keep the transistor width at W opt. For example, if Logical Effort finds a transistor of width 8 µm (point E in Fig. 1 (b)), we don t use PTS, as the COC ratio at point E has already a higher value compared to the COC ratio at 160 nm (point C in Fig. 1 (b)). Note that Logical Effort (LE) is a much simpler optimization method compared to exhaustive blind simulations. Although LE may not result in the exact optimum solution, especially after extracting layout parasitic, it gives a very good initial point to start with. Iterative applications of LE may be 1552

needed for fine tuning. However, LE assumes that the current of a transistor is linearly proportional to its width. This assumption is not valid for MOSFETs operating in the subthreshold region in most modern CMOS technologies due to the INWE [2]. By using PTS, a linear-relationship between the current of the transistor and its width is established, enabling one to use LE for transistor sizing of subthreshold circuits [2]. 4 Results A 19-stage inverter ring oscillator running at 0.2 V has been examined to verify the correctness of the proposed methodology in 130 nm and 90 nm CMOS technologies. We explored the effect of incorporating PTS on each inverter as shown in Fig. 2 (b-d). The results are shown in Table I. Considering a standard inverter shown in Fig. 2 (a), the simulation results for the 130 nm technology shown in Table I reveals that applying PTS on only PMOS transistors (column P-PTS) of each inverter in the ring oscillator reduces the frequency of oscillation by 3%. On the other hand, applying PTS on only NMOS transistors (column N-PTS) improves the frequency by 120%, while applying PTS on both NMOS and PMOS transistors increases the frequency only by 100%. This result is expected as the COC ratio corresponding to the PMOS transistor of 8 µm width (point E in Fig. 1 (b)) is more than the COC ratio compared to 160 nm (point C in Fig. 1 (b)), and the NMOS transistor at the width of 4 µm (point D in Fig. 1 (a)) has much lower COC ratio compared to point A. Therefore, in this case, the optimum sizing Fig. 2. Inverter configurations for a 19-inverter ring oscillator: (a) Standard, (b) PMOS PTS, (c) NMOS PTS, (d) PMOS and NMOS PTS. 1553

Table I. Frequency, Delay, and Power of 19-stage ring oscillator and a 4-bit comparator in 130 and 90 nm technology with the standard, only PMOS PTS (P-PTS), only NMOS PTS (N-PTS), both NMOS and PMOS PTS (PN-PTS), and PTS based on the proposed methodology (OPT-PTS). (column OPT-PTS) is achieved by applying PTS on NMOS transistors, and avoiding applying PTS on PMOS transistors. Note that by applying PTS the power consumed by the ring oscillator increases by 127%. In the 90 nm technology, the COC ratio for both transistor types follow the same behavior as that of the 130 nm technology. Simulation illustrates that the corresponding COC ratio of the NMOS transistor versus width has a sharp decay, whereas the COC ratio corresponding to the PMOS transistor increases after a certain width (1 µm). The simulation results on a 19-stage ring oscillator shown in Table I for the 90 nm confirm that applying PTS on all PMOS transistors of 8 µm width results in 16% lower frequency compared to that of the standard sizing. On the other hand, applying PTS on only NMOS transistors improves the speed almost by 81%. Applying PTS on both PMOS and NMOS transistors increases the speed only by 41% compared to the standard case. Again here, for achieving the maximum speed we advise applying PTS just on NMOS transistors and not PMOS transistors. As shown in Table I, the power consumed by the ring oscillator increases compared to the standard case from 165.9 nw to 327.6 nw. We also applied our methodology on implementing a 4-bit comparator. The simulation results after Logical Effort optimization are shown in Table I (column Standard). 1554

By running Logical Effort on this complex circuit, it suggests PMOS transistors with widths in both regions (Fig. 1 (b)). According to our methodology, for NMOS transistors, as all suggested widths have lower COC ratio than the minimum width, we apply PTS on all NMOS transistors in both CMOS technologies. Applying PTS on PMOS transistors depends on whether the Logical Effort locates the width sizes in Region 1 or Region 2. Regardless the fact that PTS should be applied on certain transistors, in Table I we list the simulation results of applying PTS only on all PMOS and only on all NMOS transistors in the P-PTS, and N-PTS columns, respectively. Column PN- PTS lists the simulation results for applying PTS on both NMOS and PMOS transistors. The last column, OPT-PTS, illustrates the results of optimum sizing (maximum speed) based on our methodology. This column shows 41% increase of speed compared to the standard sizing, which is 3% more than applying PTS on all transistors blindly. The reason that our methodology in this case doesn t show considerable improvement compared to the blind PTS is that most of the PMOS transistors are in Region 1. The result of applying the proposed methodology, i.e. selective application of PTS, on the 4-bit comparator in the 90 nm technology is also shown in OPT-PTS column of Table I. The speed is increased by around 39% compared to standard sizing, while applying blind PTS only improves the speed by 13%. Note that applying PTS only on all PMOS transistors (P-PTS) degrades the speed by 33%, while applying PTS only on all NMOS transistors (N-PTS) improves the speed by 35% compared to the standard sizing. The power consumed by the 4-bit comparator in both 130 nm and 90 nm technologies shows 80% and 72% increase when PTS is applied compared to the standard case. 5 Conclusion Although using parallel transistor stacks (PTS) gives rise to considerable speed benefit in the subthreshold circuits, there are cases where PTS should be avoided. In this paper we proposed a methodology to advise selective application of PTS on some transistors in a circuit. Applying the proposed methodology on different circuits illustrates up to 40% increase in speed compared to when PTS is applied on all transistors blindly. Incorporating our methodology shows 38% to 121% speed improvement compared to the conventional transistor sizing optimization method. Besides, it is important to note that using PTS increases the circuit area up to 50% and according to our simulations the power consumption is linearly proportional to the speed. 1555