Contents CONTRIBUTING FACTORS. Preface. List of trademarks 1. WHY ARE CUSTOM CIRCUITS SO MUCH FASTER?

Similar documents
Closing the Gap Between ASIC & Custom. Tools and Techniques for High-Performance ASIC Design

Closing the Power Gap between ASIC and Custom: An ASIC Perspective

DIGITAL INTEGRATED CIRCUITS A DESIGN PERSPECTIVE 2 N D E D I T I O N

Leakage Power Minimization in Deep-Submicron CMOS circuits

LOW POWER DESIGN METHODOLOGIES

Technology Timeline. Transistors ICs (General) SRAMs & DRAMs Microprocessors SPLDs CPLDs ASICs. FPGAs. The Design Warrior s Guide to.

Lecture 3, Handouts Page 1. Introduction. EECE 353: Digital Systems Design Lecture 3: Digital Design Flows, Simulation Techniques.

EECS150 - Digital Design Lecture 28 Course Wrap Up. Recap 1

Lecture 9: Clocking for High Performance Processors

FPGA Based System Design

Overview. 1 Trends in Microprocessor Architecture. Computer architecture. Computer architecture

Wave Pipelined Circuit with Self Tuning for Clock Skew and Clock Period Using BIST Approach

Low-Power Digital CMOS Design: A Survey

AN EFFICIENT APPROACH TO MINIMIZE POWER AND AREA IN CARRY SELECT ADDER USING BINARY TO EXCESS ONE CONVERTER

Computer Aided Design of Electronics

MS Project :Trading Accuracy for Power with an Under-designed Multiplier Architecture Parag Kulkarni Adviser : Prof. Puneet Gupta Electrical Eng.

Low-Power VLSI. Seong-Ook Jung VLSI SYSTEM LAB, YONSEI University School of Electrical & Electronic Engineering

Design of an Efficient Phase Frequency Detector for a Digital Phase Locked Loop

PE713 FPGA Based System Design

CMOS Test and Evaluation

SIGNED PIPELINED MULTIPLIER USING HIGH SPEED COMPRESSORS

Course Outcome of M.Tech (VLSI Design)

Low Power System-On-Chip-Design Chapter 12: Physical Libraries

The challenges of low power design Karen Yorav

Low Power Design Methods: Design Flows and Kits

Video Enhancement Algorithms on System on Chip

Design of High Speed Power Efficient Combinational and Sequential Circuits Using Reversible Logic

A Case Study of Nanoscale FPGA Programmable Switches with Low Power

On Chip Active Decoupling Capacitors for Supply Noise Reduction for Power Gating and Dynamic Dual Vdd Circuits in Digital VLSI

Datorstödd Elektronikkonstruktion

A Survey of the Low Power Design Techniques at the Circuit Level

EE241 - Spring 2004 Advanced Digital Integrated Circuits. Announcements. Borivoje Nikolic. Lecture 15 Low-Power Design: Supply Voltage Scaling

Probabilistic and Variation- Tolerant Design: Key to Continued Moore's Law. Tanay Karnik, Shekhar Borkar, Vivek De Circuit Research, Intel Labs

Power Spring /7/05 L11 Power 1

STM RH-ASIC capability

EECS 427 Lecture 22: Low and Multiple-Vdd Design

Low Power Design Part I Introduction and VHDL design. Ricardo Santos LSCAD/FACOM/UFMS

Digital Systems Design

POWER GATING. Power-gating parameters

ASICs Concept to Product

EDA for IC System Design, Verification, and Testing

In 1951 William Shockley developed the world first junction transistor. One year later Geoffrey W. A. Dummer published the concept of the integrated

Preface to Third Edition Deep Submicron Digital IC Design p. 1 Introduction p. 1 Brief History of IC Industry p. 3 Review of Digital Logic Gate

Sensing Voltage Transients Using Built-in Voltage Sensor

CIRCUITS. Raj Nair Donald Bennett PRENTICE HALL

EECS 270 Schedule and Syllabus for Fall 2011 Designed by Prof. Pinaki Mazumder

Intel's 65 nm Logic Technology Demonstrated on 0.57 µm 2 SRAM Cells

The backend duplication method

Mohit Arora. The Art of Hardware Architecture. Design Methods and Techniques. for Digital Circuits. Springer

Low Power Techniques for SoC Design: basic concepts and techniques

CS 6135 VLSI Physical Design Automation Fall 2003

Ramon Canal NCD Master MIRI. NCD Master MIRI 1

EE-382M-8 VLSI II. Early Design Planning: Back End. Mark McDermott. The University of Texas at Austin. EE 382M-8 VLSI-2 Page Foil # 1 1

International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May ISSN

EDA Challenges for Low Power Design. Anand Iyer, Cadence Design Systems

1 FUNDAMENTAL CONCEPTS What is Noise Coupling 1

Practical Information

Lecture 1: Digital Systems and VLSI

2 Assoc Prof, Dept of ECE, George Institute of Engineering & Technology, Markapur, AP, India,

CS250 VLSI Systems Design. Lecture 3: Physical Realities: Beneath the Digital Abstraction, Part 1: Timing

SRAM SYSTEM DESIGN FOR MEMORY BASED COMPUTING

HIGH-PERFORMANCE HYBRID WAVE-PIPELINE SCHEME AS IT APPLIES TO ADDER MICRO-ARCHITECTURES

ECE 5745 Complex Digital ASIC Design Topic 2: CMOS Devices

Low Power VLSI Circuit Synthesis: Introduction and Course Outline

L15: VLSI Integration and Performance Transformations

Design of Adjustable Reconfigurable Wireless Single Core

Mixed Signal Virtual Components COLINE, a case study

Introduction to co-simulation. What is HW-SW co-simulation?

A Flying-Adder Architecture of Frequency and Phase Synthesis With Scalability

A HIGH PERFORMANCE LOW POWER MESOCHRONOUS PIPELINE ARCHITECTURE FOR COMPUTER SYSTEMS

Digital Integrated Circuits 1: Fundamentals

Energy Efficiency of Power-Gating in Low-Power Clocked Storage Elements

EECS150 - Digital Design Lecture 15 - CMOS Implementation Technologies. Overview of Physical Implementations

EECS150 - Digital Design Lecture 9 - CMOS Implementation Technologies

CSE241 VLSI Digital Circuits Winter Lecture 06: Timing

Enhancing Power, Performance, and Energy Efficiency in Chip Multiprocessors Exploiting Inverse Thermal Dependence

Lecture 19: Design for Skew

LEAKAGE IN NANOMETER CMOS TECHNOLOGIES

Design of Mixed-Signal Microsystems in Nanometer CMOS

CONTENTS PREFACE. Part One THE DESIGN PROCESS: PROPERTIES, PARADIGMS AND THE EVOLUTIONARY STRUCTURE

L15: VLSI Integration and Performance Transformations

An Optimized Design System for Flip-Flop Grouping Using Low Power Clock Gating

EE434 ASIC & Digital Systems. Partha Pande School of EECS Washington State University

EECS 427 Lecture 13: Leakage Power Reduction Readings: 6.4.2, CBF Ch.3. EECS 427 F09 Lecture Reminders

Overview of Design Methodology. A Few Points Before We Start 11/4/2012. All About Handling The Complexity. Lecture 1. Put things into perspective

An Energy Scalable Computational Array for Energy Harvesting Sensor Signal Processing. Rajeevan Amirtharajah University of California, Davis

EDA for IC Implementation, Circuit Design, and Process Technology

Architecture Design and Validation Methods

On the Rules of Low-Power Design

Low Power Radiation Tolerant CMOS Design using Commercial Fabrication Processes

Fundamentals of Industrial Control

nmos, pmos - Enhancement and depletion MOSFET, threshold voltage, body effect

Lecture #2 Solving the Interconnect Problems in VLSI

DFT for Testing High-Performance Pipelined Circuits with Slow-Speed Testers

Timing analysis can be done right after synthesis. But it can only be accurately done when layout is available

High Speed Binary Counters Based on Wallace Tree Multiplier in VHDL

What this paper is about:

A Level-Encoded Transition Signaling Protocol for High-Throughput Asynchronous Global Communication

Multiprocessor System-on-Chip

CMOS Process Variations: A Critical Operation Point Hypothesis

Transcription:

Contents Preface List of trademarks xi xv Introduction and Overview of the Book WHY ARE CUSTOM CIRCUITS SO MUCH FASTER? WHO SHOULD CARE? DEFINITIONS: ASIC, CUSTOM, ETC. THE 35,000 FOOT VIEW: WHY IS CUSTOM FASTER? MICROARCHITECTURE TIMING OVERHEAD: CLOCK TREE DESIGN AND REGISTERS LOGIC STYLE 8. LOGIC DESIGN 9. CELL DESIGN AND WIRE SIZING 0. LAYOUT: FLOORPLANNING AND PLACEMENT TO MANAGE WIRES PROCESS VARIATION AND IMPROVEMENT SUMMARY AND WHAT S NOT IN THE BOOK ORGANIZATION OF THE REST OF THE BOOK 3 4 9 2 5 7 8 20 22 26 28 28 CONTRIBUTING FACTORS Improving Performance through Microarchitecture EXAMPLES OF MICRO ARCHITECTURAL TECHNIQUES TO INCREASE SPEED MEMORY ACCESS TIME AND THE CLOCK PERIOD SPEEDUP FROM PIPELINING Reducing the Timing Overhead CHARACTERISTICS OF SYNCHRONOUS SEQUENTIAL LOGIC 33 34 44 45 57 58

vi Contents EXAMPLE WHERE LATCHES ARE FASTER OPTIMAL LATCH POSITIONS WITH TWO CLOCK PHASES EXAMPLE WHERE LATCHES ARE SLOWER PIPELINE DELAY WITH LATCHES VS. PIPELINE DELAY WITH FLIP-FLOPS CUSTOM VERSUS ASIC TIMING OVERHEAD 77 8 83 87 90 High-Speed Logic, Circuits, Libraries and Layout Andrew Chang, William J. Dally Stanford University David Chinnery, Kurt Keutzer, Radu Zlatanovici UC Berkeley TECHNOLOGY INDEPENDENT METRICS PERFORMANCE PENALTIES IN ASIC DESIGNS FROM LOGIC STYLE, LOGIC DESIGN, CELL DESIGN, AND LAYOUT COMPARISON OF ASIC AND CUSTOM CELL AREAS ENERGY TRADEOFFS BETWEEN ASIC CELLS AND CUSTOM CELLS FUTURE TRENDS SUMMARY Finding Peak Performance in a Process PROCESS AND OPERATING CONDITIONS CHIP SPEED VARIATION DUE TO STATISTICAL PROCESS VARIATION CONTINUOUS PROCESS IMPROVEMENT SPEED DIFFERENCES DUE TO ALTERNATIVE PROCESS IMPLEMENTATIONS PROCESS TECHNOLOGY FOR ASICS POTENTIAL IMPROVEMENTS FOR ASICS 0 0 02 08 29 34 38 39 45 46 55 57 59 6 64 DESIGN TECHNIQUES Physical Prototyping Plans for High Performance Michel Courtoy, Pinhong Chen, Xiaoping Tang, Chin-Chi Teng, Yuji Kukimoto Silicon Perspective, a Cadence Company FLOORPLANNING PHYSICAL PROTOTYPING 69 69 70 72

TECHNIQUES IN PHYSICAL PROTOTYPING 80 85 vii 8. 9. Automatic Replacement of Flip-Flops by Latches in ASICs Jagesh Sanghavi, Earl Killian, Kaushik Sheth Tensilica THEORY ALGORITHM RESULTS CONCLUSION Useful-Skew Clock Synthesis Boosts ASIC Performance Wayne Dai UC Santa Cruz David Staepelaere Celestry Design Technologies 8. 9. IS CLOCK SKEW REALLY GLOBAL? PERMISSIBLE RANGE SKEW CONSTRAINTS WHY CLOCK SKEW MAY BE USEFUL USEFUL SKEW DESIGN METHODOLOGY USEFUL SKEW CASE STUDY CLOCK AND LOGIC CO-DESIGN SIMULTANEOUS CLOCK SKEW OPTIMIZATION AND GATE SIZING CONCLUSION Faster and Lower Power Cell-Based Designs with Transistor-Level Cell Sizing Michel Côté, Philippe Hurat Cadabra, a Numerical Technologies Company OPTIMIZED CELLS FOR BETTER POWER AND PERFORMANCE PPO FLOW PPO EXAMPLES FLOW CHALLENGES AND ADOPTION 87 87 9 99 203 207 209 209 20 2 23 26 28 220 220 22 225 225 226 228 235 238 239 0. Design Optimization with Automated Flex-Cell Creation Debashis Bhattacharya, Vamsi Boppana Zenasis Technologies FLEX-CELL BASED OPTIMIZATION OVERVIEW MINIMIZING THE NUMBER OF NEW FLEX-CELLS CREATED 24 244 249

viii CELL LAYOUT SYNTHESIS IN FLEX-CELL BASED OPTIMIZATION GREATER PERFORMANCE THROUGH BETTER CHARACTERIZATION PHYSICAL DESIGN AND FLEX-CELL BASED OPTIMIZATION CASE STUDIES WITH RESULTS Contents 254 255 259 26 266 Exploiting Structure and Managing Wires to Increase Density and Performance Andrew Chang, William J. Dally Stanford University INHERENT DESIGN STRUCTURE SUCCESSIVE CUSTOM TECHNIQUES FOR EXPLOITING STRUCTURE FUTURE DIRECTIONS SUMMARY Semi-Custom Methods in a High-Performance Microprocessor Design Gregory A. Northrop IBM CUSTOM PROCESSOR DESIGN SEMI-CUSTOM DEISGN FLOW DESIGN EXAMPLE 24 BIT ADDER OVERALL IMPACT ON CHIP DESIGN Controlling Uncertainty in High Frequency Designs Stephen E. Rich, Matthew J. Parker, Jim Schwartz Intel FREQUENCY TERMINOLOGY UNCERTAINTY DEFINED WHY UNCERTAINTY REDUCES THE MAXIMUM POSSIBLE FREQUENCY PRACTICAL EXAMPLE OF TOOL UNCERTAINTY FOCUSED METHODOLOGY DEVELOPMENT METHODS FOR REMOVING PATHS FROM THE UNCERTAINTY WINDOW 8. THE UNCERTAINTY LIFECYCLE 9. CONCLUSION 269 269 27 285 285 289 289 290 29 297 300 306 309 32 34 35 37 32

ix Increasing Circuit Performance through Statistical Design Techniques Michael Orshansky UC Berkeley PROCESS VARIABILITY AND ITS IMPACT ON TIMING INCREASING PERFORMANCE THROUGH PROBABILISTIC TIMING MODELING INCREASING PERFORMANCE THROUGH DESIGN FOR MANUFACTURABILITY TECHNIQUES ACCOUNTING FOR IMPACT OF GATE LENGTH VARIATION ON CIRCUIT PERFORMANCE: A CASE STUDY CONCLUSION 323 324 329 334 338 342 DESIGN EXAMPLES Achieving 550MHz in a Standard Cell ASIC Methodology David Chinnery, Kurt Keutzer UC Berkeley A DESIGN BRIDGING THE SPEED GAP BETWEEN ASIC AND CUSTOM MICROARCHITECTURE: PIPELINING AND LOGIC DESIGN REGISTER DESIGN CLOCK TREE INSERTION AND CLOCK DISTRIBUTION CUSTOM LOGIC VERSUS SYNTHESIS REDUCING UNCERTAINTY 8. SUMMARY AND The icore 520MHz Synthesizable CPU Core Nick Richardson, Lun Bin Huang, Razak Hossain, Julian Lewis, Tommy Zounes, Naresh Soni STMicroelectronics OPTIMIZING THE MICROARCHITECTURE OPTIMIZING THE IMPLEMENTATION PHYSICAL DESIGN STRATEGY RESULTS 345 345 346 348 353 356 357 358 358 36 36 363 375 378 379 380