Incorporating Variability into Design

Similar documents
EECS 427 Lecture 13: Leakage Power Reduction Readings: 6.4.2, CBF Ch.3. EECS 427 F09 Lecture Reminders

Lecture 9: Clocking for High Performance Processors

EECS 427 Lecture 22: Low and Multiple-Vdd Design

EE241 - Spring 2004 Advanced Digital Integrated Circuits. Announcements. Borivoje Nikolic. Lecture 15 Low-Power Design: Supply Voltage Scaling

Interconnect-Power Dissipation in a Microprocessor

Lecture 11: Clocking

Lecture 10. Circuit Pitfalls

HIGH-SPEED LOW-POWER ON-CHIP GLOBAL SIGNALING DESIGN OVERVIEW. Xi Chen, John Wilson, John Poulton, Rizwan Bashirullah, Tom Gray

Deep Trench Capacitors for Switched Capacitor Voltage Converters

Seong-Ook Jung VLSI SYSTEM LAB, YONSEI University

EE241 - Spring 2013 Advanced Digital Integrated Circuits. Announcements. Lecture 16: Power and Performance

Announcements. Advanced Digital Integrated Circuits. Midterm feedback mailed back Homework #3 posted over the break due April 8

Probabilistic and Variation- Tolerant Design: Key to Continued Moore's Law. Tanay Karnik, Shekhar Borkar, Vivek De Circuit Research, Intel Labs

EE-382M-8 VLSI II. Early Design Planning: Back End. Mark McDermott. The University of Texas at Austin. EE 382M-8 VLSI-2 Page Foil # 1 1

Power Spring /7/05 L11 Power 1

Lecture 10 Circuit Design Rules of Thumb. Overview

Ruixing Yang

DESIGNING powerful and versatile computing systems is

Leakage Power Minimization in Deep-Submicron CMOS circuits

Low Power System-On-Chip-Design Chapter 12: Physical Libraries

Lecture 19: Design for Skew

DESIGN & IMPLEMENTATION OF SELF TIME DUMMY REPLICA TECHNIQUE IN 128X128 LOW VOLTAGE SRAM

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

A Novel Low-Power Scan Design Technique Using Supply Gating

Low Power Design of Successive Approximation Registers

EE290C - Spring 2004 Advanced Topics in Circuit Design High-Speed Electrical Interfaces. Announcements

Introduction to CMOS VLSI Design (E158) Lecture 9: Cell Design

Lecture 9: Cell Design Issues

Reliability Enhancement of Low-Power Sequential Circuits Using Reconfigurable Pulsed Latches

CMOS Process Variations: A Critical Operation Point Hypothesis

Low Power Design in VLSI

High Temperature Mixed Signal Capabilities

Disseny físic. Disseny en Standard Cells. Enric Pastor Rosa M. Badia Ramon Canal DM Tardor DM, Tardor

EECS 427 Lecture 21: Design for Test (DFT) Reminders

Lecture 23: PLLs. Office hour on Monday moved to 1-2pm and 3:30-4pm Final exam next Wednesday, in class

RECENT technology trends have lead to an increase in

04/29/03 EE371 Power Delivery D. Ayers 1. VLSI Power Delivery. David Ayers

DESIGN OF MULTIPLYING DELAY LOCKED LOOP FOR DIFFERENT MULTIPLYING FACTORS

Timing analysis can be done right after synthesis. But it can only be accurately done when layout is available

Active Decap Design Considerations for Optimal Supply Noise Reduction

DESIGN FOR LOW-POWER USING MULTI-PHASE AND MULTI- FREQUENCY CLOCKING

Leakage Control Techniques for Designing Robust, Low Power Wide-OR Domino Logic for Sub-130nm CMOS Technologies

Trends and Challenges in VLSI Technology Scaling Towards 100nm

DIGITAL INTEGRATED CIRCUITS A DESIGN PERSPECTIVE 2 N D E D I T I O N

Announcements. Advanced Digital Integrated Circuits. Quiz #3 today Homework #4 posted This lecture until 4pm

EEC 216 Lecture #10: Ultra Low Voltage and Subthreshold Circuit Design. Rajeevan Amirtharajah University of California, Davis

Low-Power CMOS VLSI Design

Low-Power Digital CMOS Design: A Survey

UNIT-II LOW POWER VLSI DESIGN APPROACHES

Fully Integrated Switched-Capacitor DC-DC Conversion

Energy Efficiency of Power-Gating in Low-Power Clocked Storage Elements

Optimization of Overdrive Signoff

High Speed Communication Circuits and Systems Lecture 14 High Speed Frequency Dividers

ECEN 720 High-Speed Links Circuits and Systems

To appear in IEEE International Solid-State Circuits Conference (ISSCC) Digest of Technical Papers, San Francisco, February 2002.

Accomplishment and Timing Presentation: Clock Generation of CMOS in VLSI

Geared Oscillator Project Final Design Review. Nick Edwards Richard Wright

Homework 10 posted just for practice. Office hours next week, schedule TBD. HKN review today. Your feedback is important!

Jack Keil Wolf Lecture. ESE 570: Digital Integrated Circuits and VLSI Fundamentals. Lecture Outline. MOSFET N-Type, P-Type.

! Is it feasible? ! How do we decompose the problem? ! Vdd. ! Topology. " Gate choice, logical optimization. " Fanin, fanout, Serial vs.

EDA Challenges for Low Power Design. Anand Iyer, Cadence Design Systems

ICS2510C. 3.3V Phase-Lock Loop Clock Driver. Integrated Circuit Systems, Inc. General Description. Pin Configuration.

ECEN 720 High-Speed Links: Circuits and Systems

COMPREHENSIVE ANALYSIS OF ENHANCED CARRY-LOOK AHEAD ADDER USING DIFFERENT LOGIC STYLES

Novel Low-Overhead Operand Isolation Techniques for Low-Power Datapath Synthesis

Comparative Study of Different Low Power Design Techniques for Reduction of Leakage Power in CMOS VLSI Circuits

Nanowire-Based Programmable Architectures

EECS150 - Digital Design Lecture 19 CMOS Implementation Technologies. Recap and Outline

Impact of Leakage on IC Testing?

Mitigating Parameter Variation with Dynamic Fine-Grain Body Biasing

Opportunities and Challenges in Ultra Low Voltage CMOS. Rajeevan Amirtharajah University of California, Davis

Characterization and Variation Modeling for 22FDX. Ning Jin Digital Design Methodology Team

A Low Power Single Phase Clock Distribution Multiband Network

Low-Power VLSI. Seong-Ook Jung VLSI SYSTEM LAB, YONSEI University School of Electrical & Electronic Engineering

Lecture 04 CSE 40547/60547 Computing at the Nanoscale Interconnect

A Survey of the Low Power Design Techniques at the Circuit Level

Overview ECE 553: TESTING AND TESTABLE DESIGN OF DIGITAL SYSTES. Motivation. Modeling Levels. Hierarchical Model: A Full-Adder 9/6/2002

An Optimal Design of Ring Oscillator and Differential LC using 45 nm CMOS Technology

Development of a 20 GS/s Sampling Chip in 130nm CMOS Technology

FinFET-based Design for Robust Nanoscale SRAM

DESIGNING OF SRAM USING LECTOR TECHNIQUE TO REDUCE LEAKAGE POWER

I DDQ Current Testing

Chapter 6 Combinational CMOS Circuit and Logic Design. Jin-Fu Li Department of Electrical Engineering National Central University Jungli, Taiwan

CS250 VLSI Systems Design. Lecture 3: Physical Realities: Beneath the Digital Abstraction, Part 1: Timing

Preface to Third Edition Deep Submicron Digital IC Design p. 1 Introduction p. 1 Brief History of IC Industry p. 3 Review of Digital Logic Gate

Bridging the Gap between Dreams and Nano-Scale Reality

High-speed Serial Interface

Single-Ended to Differential Converter for Multiple-Stage Single-Ended Ring Oscillators

ECE 471/571 The CMOS Inverter Lecture-6. Gurjeet Singh

Microprocessor Design in the Nanoscale Era

CMPEN 411 VLSI Digital Circuits Spring Lecture 24: Peripheral Memory Circuits

Lecture 8: Memory Peripherals

Low Transistor Variability The Key to Energy Efficient ICs

2009 Spring CS211 Digital Systems & Lab 1 CHAPTER 3: TECHNOLOGY (PART 2)

DATE 2016 Early Reliability Modeling for Aging and Variability in Silicon System (ERMAVSS Workshop)

A Static Power Model for Architects

1. Short answer questions. (30) a. What impact does increasing the length of a transistor have on power and delay? Why? (6)

An Interconnect-Centric Approach to Cyclic Shifter Design

Low Power VLSI Circuit Synthesis: Introduction and Course Outline

Power Considerations in the Design of the Alpha Microprocessor

Transcription:

Incorporating Variability into Design Jim Farrell, AMD Designing Robust Digital Circuits Workshop UC Berkeley 28 July 2006

Outline Motivation Hierarchy of Design tradeoffs Design Infrastructure for variability Circuits techniques for variability Architectural techniques for variability Conclusion 2

Motivation Beyond basic yield, Variation costs frequency, which reduces revenue. Example of how we might partition a lot into frequency bins: Top Bin Fmax Mid Bin Bottom Bin Sidd 3

How Variability affects Circuit Delay Which components are more affected by variations, LARGE gates or SMALL gates? Example: Inverter Chain with different inverter sizes Delay Variations (%) 40 35 30 25 20 15 Large Gates vs. Small Gates 0 0.5 1 1.5 2 2.5 3 3.5 Gate Width (um) Gate Width (um) 0.24 0.5 1.0 1.5 2.0 3.0 Variation (%) 35 29 26 20 20 20 4

1.4 1.2 1 0.8 0.6 0.4 0.2 0 Impact of Variation on Leakage Power -3 sigma Nom Le +3 sigma leakage device distribution leakage distribution Relative Speed Part Frequency P C TOT α F Vdd 2 + N TOT α F Vdd I CO + N ON I LEAK Vdd Selective use of High Vt and longer channel devices shifts leakage profile to lower values But, poly bias is limited in CPP From Naffziger, 2006 VLSI Ckt Symp Switching Power Crossover Power Leakage Power 5

Limits to poly biasing for leakage variation reduction Contacted poly pitch includes: PC width CA width Spacer and overlay tolerances CA Competing requirements for: CA resistance CA-PC capacitance CA-PC breakdown voltage And, maximum poly bias for leakage PC RX 6

Infrastructure to Reduce Variability This is the Boring stuff Classic engineering isolation of problem areas Is this a problem that should addressed? Simulation, data and judgment Critical work at the beginning of the project Examples: Contact vs Salicide resistance Feedback sizing in M/S flops 7

Infrastructure: Transistor R,C model Question do I care about the variation in contact or salicide resistance? Rc Rc Rc Rdiff Rdiff Rdiff Rdiff Rc Rc Rc 8

Comparing Source vs Contact effect on transistor delay. Delay Diff Normalized (%) 11 10 9 8 7 6 5 4 3 2 1 0 N Width Contact resistance dominates over source resistance Conclusion: Concentrate on minimizing variation impact of contacts Rc(min)-Rc(0) Rc(nom)-Rc(0) Rc(max)-Rc(0) Ra(min)-Ra(0) Ra(nom)-Ra(0) Ra(max)-Ra(0) 9

Choices for M/S Flops: clocked feedback safer for variation Din Qb CLK Ability to use ratioed feedback severely compromised by writeability vs noise margin 10

Consider variation in design of Flops Din Qb CLK Noise pulse on clock requires analysis of sizing of keeper devices vs data path ->Reducing keeper stack sizing by half for clock power reduces noise margin by from 14 to 9% at 6sigma due to Vt and Leff variation. 11

Circuit Techniques for Variability Based on triage from infrastructure work Focus on the goals of the design: Power Timing Voltage range Look for opportunities to reduce constraints imposed by standard circuits 12

Latching In 45/32nm with a 16 FO4 T CYCLE and the standard MSFF design, the time available for logic is 16-1.5-1.5-1.5 = 11.5 usable FO4 delays. T SU +T CQ = 3 FO4s, T JITTER 1.5 FO4s (~25ps typical) Jitter is likely to be larger than this given process variability trends The jitter budget must comprehend path variability due to supply droop and device/wire variations A flop that tolerates variations in clock and data arrival time without hurting frequency is highly desirable Should also provide low flow-through time Should provide the ability to incorporate logic for reduced delay Multiple flavors of flops is a usage and support cost (i.e. SA flop, fqbx, msfqbx etc.) If a single flop can provide good performance and robustness, that s better AMD s eight generation processors have ~140,000 flops. At 10fC switching cap on average (10% data activity) and assuming 25% avg. clock gating, 15% of our C AC budget is spent on flops T JIT T CQ T LOGIC T LOGIC = T CYCLE T CQ T SU - T JIT T SU TCQ From Naffziger, 2006 VLSI Ckt Symp 13

Some Ways to Shoulder the Variation Burden: Adaptive clocking Programmable Delay Buffers Empirically set the clock edge to optimize frequency Higher granularity more variation tolerance From Naffziger, 2006 VLSI Ckt Symp L BIST and GA search algorithms show promise for per-part optimization 14

Logic vs Metal Delay variations Logic deltal dominated Interconnect linewidth and material dominated 65nm timing tools dominated by worst case assumptions Next generation needs to consider the statistical variation of the logic and the interconnect. Biasing metal linewidth may provide margin for critical interconnect much like biasing Lpoly for transistors. Ideally consider layout of interconnect to incorporate dependencies. Metal fill improves variation. 15

Metal fill tradeoffs Opportunity to improve variation from processing Designer needs to be cognizant of o additional capacitance (and coupling if not tied fill) o cad tool burden datasize, fill generation o data prep time Before fill After fill 16

Clkin from PLL Adaptive Circuits Example: duty cycle correction for process variation through training cycle (Agarwal, 2006 VLSI) Delay Line c d Clkout: Corrected duty cycle from variation a b Ctrl bits From detector/cntr 17

Architectural Techniques to reduce variability Goal change the problem to one that can be solved. Example: Rather than make every cell perfect, redundancy reduces effect of weak bitcells Consider the larger issue of identifying weak cells Anticipate problems during the full life of the product 18

Architectural Options separate voltage planes VDD1 L2$ L3 Cache VDD2 VDD3 L2$ L2$ VDD4 VDD0 L2$ Solving problem with how to increase headroom for small memory cells Common Technique but difficulty in supplying multiple supplies and architectural latency crossing Boundaries (Kanno, 2006 ISSCC) Core VDD5 Core Core Core VDD6 VDD7 VDD8 Architectural choices in how many supplies can be supported or tied together. 19

Adaptive Supply Voltage Energy / Operation Per-part and dynamic voltage management are key More range flexibility and finer grain response will provide differentiation Short Nom High Channel Length Long Low Vdd From Naffziger, 2006 VLSI Ckt Symp 20

Options for weak memory elements Cache Datapath1 RegFile Datapath2 Ctrl Identify weakness at test Redundancy in cache Fusing off bad entries in Register file (virtual vs Physical registers Datapath/ctrl can use redundant storage elements at significant area cost (similar to ultradrowsy from L. Clark, et.al, JSSC Feb 2005) Difficult problem for test: BIST for weakness Ideally probe for weakness over life of product and softfuse off any failing cells 21

Conclusions Variation requires a new body of infrastructure that must be analyzed for critical requirements Minimizing the impact through circuit techniques is the first answer Consider the overall problem: architectural tradeoffs can also minimize the effect of variation 2006, Advanced Micro Devices. AMD, the AMD Arrow logo and combinations thereof are trademarks of Advanced Micro Devices, Inc. 22