Design Methodologies. Digital Integrated Circuits A Design Perspective. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.

Similar documents
Design Methodologies. Digital Integrated Circuits A Design Perspective. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic.

Lecture Perspectives. Administrivia

Lecture 30. Perspectives. Digital Integrated Circuits Perspectives

Digital Integrated Circuits Perspectives. Administrivia

18nm FinFET. Lecture 30. Perspectives. Administrivia. Power Density. Power will be a problem. Transistor Count

EECS 427 Lecture 21: Design for Test (DFT) Reminders

Lecture 3, Handouts Page 1. Introduction. EECE 353: Digital Systems Design Lecture 3: Digital Design Flows, Simulation Techniques.

PROGRAMMABLE ASICs. Antifuse SRAM EPROM

2009 Spring CS211 Digital Systems & Lab 1 CHAPTER 3: TECHNOLOGY (PART 2)

Homework 10 posted just for practice. Office hours next week, schedule TBD. HKN review today. Your feedback is important!

Disseny físic. Disseny en Standard Cells. Enric Pastor Rosa M. Badia Ramon Canal DM Tardor DM, Tardor

Technology Timeline. Transistors ICs (General) SRAMs & DRAMs Microprocessors SPLDs CPLDs ASICs. FPGAs. The Design Warrior s Guide to.

FPGA Based System Design

Lecture 12 Memory Circuits. Memory Architecture: Decoders. Semiconductor Memory Classification. Array-Structured Memory Architecture RWM NVRWM ROM

Very Large Scale Integration (VLSI)

PE713 FPGA Based System Design

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

Introduction. Digital Integrated Circuits A Design Perspective. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic. July 30, 2002

ECE260B CSE241A Winter Design Styles Multi-Vdd/ Vth Designs. Website: / vlsicad.ucsd.edu/ courses/ ece260bw05

Reference. Wayne Wolf, FPGA-Based System Design Pearson Education, N Krishna Prakash,, Amrita School of Engineering

Design Methodologies. Design Trade-offs. System Design to Hardware. Design Gap. Speed (throughput and clock frequency) Area and

Engr354: Digital Logic Circuits

DIGITAL INTEGRATED CIRCUITS A DESIGN PERSPECTIVE 2 N D E D I T I O N

PROGRAMMABLE ASIC INTERCONNECT

EC 1354-Principles of VLSI Design

CMPEN 411 VLSI Digital Circuits Spring Lecture 24: Peripheral Memory Circuits

ECE380 Digital Logic

Lecture 9: Cell Design Issues

Leakage Power Minimization in Deep-Submicron CMOS circuits

Low-Power Digital CMOS Design: A Survey

L15: VLSI Integration and Performance Transformations

EE 434 ASIC and Digital Systems. Prof. Dae Hyun Kim School of Electrical Engineering and Computer Science Washington State University.

Introduction to CMOS VLSI Design (E158) Lecture 9: Cell Design

1. Introduction. Institute of Microelectronic Systems. Status of Microelectronics Technology. (nm) Core voltage (V) Gate oxide thickness t OX

Memory Basics. historically defined as memory array with individual bit access refers to memory with both Read and Write capabilities

EEC 118 Lecture #12: Dynamic Logic

Topics. Memory Reliability and Yield Control Logic. John A. Chandy Dept. of Electrical and Computer Engineering University of Connecticut

CS 6135 VLSI Physical Design Automation Fall 2003

Datorstödd Elektronikkonstruktion

EMT 251 Introduction to IC Design

PROGRAMMABLE ASIC INTERCONNECT

Computer Aided Design of Electronics

+1 (479)

Lecture 1. Tinoosh Mohsenin

Lecture 4&5 CMOS Circuits

Preface to Third Edition Deep Submicron Digital IC Design p. 1 Introduction p. 1 Brief History of IC Industry p. 3 Review of Digital Logic Gate

Digital Design and System Implementation. Overview of Physical Implementations

L15: VLSI Integration and Performance Transformations

FPGA Circuits. na A simple FPGA model. nfull-adder realization

INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad

Low Power VLSI Circuit Synthesis: Introduction and Course Outline

Jan Rabaey, «Low Powere Design Essentials," Springer tml

Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic. July 30, Digital EE141 Integrated Circuits 2nd Introduction

Policy-Based RTL Design

Learning Outcomes. Spiral 2 8. Digital Design Overview LAYOUT

Memory, Latches, & Registers

Novel Low-Overhead Operand Isolation Techniques for Low-Power Datapath Synthesis

Jack Keil Wolf Lecture. ESE 570: Digital Integrated Circuits and VLSI Fundamentals. Lecture Outline. MOSFET N-Type, P-Type.

CMOS VLSI IC Design. A decent understanding of all tasks required to design and fabricate a chip takes years of experience

WHAT ARE FIELD PROGRAMMABLE. Audible plays called at the line of scrimmage? Signaling for a squeeze bunt in the ninth inning?

Propagation Delay, Circuit Timing & Adder Design. ECE 152A Winter 2012

Propagation Delay, Circuit Timing & Adder Design

Programmable Interconnect. CPE/EE 428, CPE 528: Session #13. Actel Programmable Interconnect. Actel Programmable Interconnect

Introduction to Electronic Design Automation

CMOS Digital Logic Design with Verilog. Chapter1 Digital IC Design &Technology

VALLIAMMAI ENGINEERING COLLEGE SRM Nagar, Kattankulathur

A 0.9 V Low-power 16-bit DSP Based on a Top-down Design Methodology

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

VALLIAMMAI ENGINEERING COLLEGE

Chapter 3 Digital Logic Structures

CHAPTER III THE FPGA IMPLEMENTATION OF PULSE WIDTH MODULATION

DESIGN & IMPLEMENTATION OF SELF TIME DUMMY REPLICA TECHNIQUE IN 128X128 LOW VOLTAGE SRAM

Advanced FPGA Design. Tinoosh Mohsenin CMPE 491/691 Spring 2012

EECS150 - Digital Design Lecture 15 - CMOS Implementation Technologies. Overview of Physical Implementations

EECS150 - Digital Design Lecture 9 - CMOS Implementation Technologies

! Review: MOS IV Curves and Switch Model. ! MOS Device Layout. ! Inverter Layout. ! Gate Layout and Stick Diagrams. ! Design Rules. !

Embedded System Hardware - Reconfigurable Hardware -

Introduction to CMOS VLSI Design (E158) Lecture 5: Logic

Multi-Channel FIR Filters

EECS150 - Digital Design Lecture 19 CMOS Implementation Technologies. Recap and Outline

ALPS: An Automatic Layouter for Pass-Transistor Cell Synthesis

Sticks Diagram & Layout. Part II

EE141-Spring 2007 Digital Integrated Circuits

Lecture 0: Introduction

Electronic Design Automation at Transistor Level by Ricardo Reis. Preamble

EE 330 Lecture 44. Digital Circuits. Dynamic Logic Circuits. Course Evaluation Reminder - All Electronic

PHYSICAL STRUCTURE OF CMOS INTEGRATED CIRCUITS. Dr. Mohammed M. Farag

Digital Systems Design

! Review: MOS IV Curves and Switch Model. ! MOS Device Layout. ! Inverter Layout. ! Gate Layout and Stick Diagrams. ! Design Rules. !

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

2 MARK QUESTIONS & ANSWERS UNIT1-MOS TRANSISTOR PRINCIPLE

CS302 - Digital Logic Design Glossary By

EE 434 ASIC & Digital Systems

CPE/EE 427, CPE 527 VLSI Design I: Homeworks 3 & 4

Microelectronics, BSc course

I/O Design EE141. Announcements. EE141-Fall 2006 Digital Integrated Circuits. Class Material. Pads + ESD Protection.

Digital Systems Laboratory

LOW-POWER SOFTWARE-DEFINED RADIO DESIGN USING FPGAS

IES Digital Mock Test

EE 330 Lecture 44. Digital Circuits. Ring Oscillators Sequential Logic Array Logic Memory Arrays. Final: Tuesday May 2 7:30-9:30

Transcription:

Digital Integrated Circuits A Design Perspective Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic Design Methodologies December 10, 2002

L o g i c T r a n s i s t o r s p e r C h i p ( K ) 1 9 8 1 1 9 8 3 1 9 8 5 1 9 8 7 1 9 8 9 1 9 9 1 1 9 9 3 1 9 9 5 1 9 9 7 1 9 9 9 2 0 0 1 2 0 0 3 2 0 0 5 2 0 0 7 2 0 0 9 P r o d u c t i v i t y ( T r a n s. / S t a f f - M o n t h ) The Design Productivity Challenge Logic Transistors per Chip (K) 10,000,000.10m 1,000,000.35m 2.5m 100,000 10,000 1,000 100 10 1 Logic Transistors/Chip Transistor/Staff Month 58%/Yr. compound Complexity growth rate X x X X X X X 21%/Yr. compound Productivity growth rate 100,000,000 10,000,000 1,000,000 100,000 10,000 1,000 100 10 1981 Productivity (Trans./Staff-Month) 1983 1985 1987 1989 1991 1993 1995 1997 1999 2001 2003 2005 2007 2009 A growing gap between design complexity and design productivity Source: sematech97

I N P U T - O U T P U T A Simple Processor MEMORY INPUT/OUTPUT CONTROL DATAPATH

A System-on on-a-chip: Example Courtesy: Philips

Impact of Implementation Choices 100-1000 10-100 1-10 0.1-1 Energy Efficiency (in MOPS/mW) Hardwired custom Configurable/Parameterizable Domain-specific processor (e.g. DSP) Embedded microprocessor None Somewhat Fully flexible flexible Flexibility (or application scope)

Design Methodology Design process traverses iteratively between three abstractions: behavior, structure, and geometry More and more automation for each of these steps

Implementation Choices Digital Circuit Implementation Approaches Custom Semicustom Cell-based Array-based Standard Cells Compiled Cells Macro Cells Pre-diffused (Gate Arrays) Pre-wired (FPGA's)

The Custom Approach Intel 4004

Transition to Automation and Regular Structures Intel 4004 ( 71) Intel 8080 Intel 8286 Digital Integrated Circuits Intel 8085 Intel 8486 Courtesy Intel

Cell-based Design (or standard cells) Feedthrough cell Logic cell Rows of cells Functional module (RAM, multiplier, ) Routing channel Routing channel requirements are reduced by presence of more interconnect layers

Standard Cell Example [Brodersen92]

Standard Cell The New Generation Cell-structure hidden under interconnect layers

Standard Cell - Example 3-input NAND cell (from ST Microelectronics): C = Load capacitance T = input rise/fall time

Automatic Cell Generation Initial transistor geometries Placed transistors Routed cell Compacted cell Finished cell

A Historical Perspective: the PLA x 0 x 1 Product terms AND plane x 2 OR plane f 0 f 1 x 0 x 1 x 2

Two-Level Logic Every logic function can be expressed in sum-of-products format (AND-OR) minterm Inverting format (NOR- NOR) more effective

PLA Layout Exploiting Regularity V DD And-Plane Or-Plane φ GND x 0 x 0 x 1 x 1 x 2 x 2 Pull-up devices f 0 f 1 Pull-up devices

Breathing Some New Life in PLAs River PLAs A cascade of multiple-output PLAs. Adjacent PLAs are connected via river routing. PRE-CHARGE BUFFER PRE-CHARGE BUFFER PRE-CHARGE PRE- CHARGE BUFFER BUFFER BUFFER PRE-CHARGE BUFFER PRE-CHARGE PRE- CHARGE BUFFER BUFFER PRE-CHARGE No placement and routing needed. Output buffers and the input buffers of the next stage are shared.

Experimental Results Area: RPLAs (2 layers) 1.23 SCs (3 layers) - 1.00, NPLAs (4 layers) 1.31 Delay RPLAs 1.04 SCs 1.00 NPLAs 1.09 Synthesis time: for RPLA, synthesis time equals design time; SCs and NPLAs still need P&R. Also: RPLAs are regular and predictable Layout of C2670 delay 1.4 1 0.6 0.2 0 2 4 6 area SC NPLA RPLA Standard cell, 2 layers channel routing Standard cell, 3 layers OTC Network of PLAs, 4 layers OTC River PLA, 2 layers no additional routing

MacroModules 256 32 (or 8192 bit) SRAM Generated by hard-macro module generator

Soft MacroModules

Intellectual Property A Protocol Processor for Wireless

Semicustom Design Flow Design Capture Behavioral Design Iteration Pre-Layout Simulation Post-Layout Simulation HDL HDL Logic Logic Synthesis Floorplanning Placement Structural Physical Circuit Circuit Extraction Routing Tape-out

The Design Closure Problem Iterative Removal of Timing Violations (white lines)

Integrating Synthesis with Physical Design RTL (Timing) Constraints Physical Synthesis Macromodules Fixed netlists Netlist with Place-and-Route Info Place-and-Route Optimization Artwork

Late-Binding Implementation Array-based Pre-diffused (Gate Arrays) Pre-wired (FPGA's)

Gate Array Sea-of of-gates polysilicon V DD rows of uncommitted cells GND metal possible contact Uncommited Cell In1 In2 In3 In4 routing channel Committed Cell (4-input NOR Out

Sea-of of-gate Primitive Cells Oxide-isolation PMOS PMOS NMOS NMOS NMOS Using oxide-isolation Using gate-isolation

Example: Base Cell of Gate-Isolated GA continuous p-diff strip continuous n-diff strip contact for isolator VDD GND 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 n-well p-well n-diff p-diff poly m1 m2 contact

Example: Flip-Flop Flop in Gate-Isolated GA VDD CLR Q CLK Q D GND

Sea-of-gates Random Logic Memory Subsystem LSI Logic LEA300K (0.6 µm CMOS) Digital Integrated Circuits Courtesy LSI Logic

The return of gate arrays? Via programmable gate array (VPGA) Via-programmable cross-point metal-5 metal-6 programmable via Exploits regularity of interconnect [Pileggi02]

Prewired Arrays Classification of prewired arrays (or fieldprogrammable devices): Based on Programming Technique Fuse-based (program-once) Non-volatile EPROM based RAM based Programmable Logic Style Array-Based Look-up Table Programmable Interconnect Style Channel-routing Mesh networks

Fuse-Based FPGA antifuse polysilicon ONO dielectric n + antifuse diffusion 2 l Open by default, closed by applying current pulse

Array-Based Programmable Logic I 5 I 4 I 3 I 2 I 1 I 0 Programmable OR array I 3 I 2 I 1 I 0 Programmable OR array I 5 I 4 I 3 I 2 I 1 I 0 Fixed OR array Programmable AND array Fixed AND array Programmable AND array O 3 O 2 O 1 O 0 O 3 O 2 O 1 O 0 O 3 O 2 O 1 O 0 PLA PROM PAL Indicates programmable connection Indicates fixed connection

Programming a PROM 1 X 2 X 1 X 0 : programmed node NA NA f 1 f 0

More Complex PAL programmable AND array (2 i 3 jk) k macrocells 1 product terms j -wide OR array j D Q OUT j CLK macrocell A B C i i inputs i inputs, j minterms/macrocell, k macrocells

2-input mux as programmable logic block Configuration A B S F= A 0 B 1 S F 0 0 0 0 0 X 1 X 0 Y 1 Y 0 Y X XY X 0 Y XY Y 0 X XY Y 1 X X 1 Y 1 0 X X 1 0 Y Y 1 1 1 1

Logic Cell of Actel Fuse-Based FPGA A B 1 SA C 1 Y D 1 SB S0 S1

Look-up Table Based Logic Cell Memory Out In Out 00 00 01 1 10 1 11 0 ln1 ln2

LUT-Based Logic Cell C 1...C 4 4 Figure must be updated xx xxxx xxxx xxxx D 4 D 3 D 2 Logic function of xxx xx xx xx xx Bits control x xx x xxxx xx D 1 F 4 F 3 F 2 Logic function of xxx Logic function x of xxx x xx xx xx xx x Bits control xx x xx xx x xxxx x xx F 1 x xxxxx Xilinx 4000 Series H P x xx xx Multiplexer Controlled by Configuration Program x

Array-Based Programmable Wiring M Interconnect Point Programmed interconnection Input/output pin Cell Horizontal tracks Vertical tracks

Mesh-based Interconnect Network Switch Box Connect Box Interconnect Point

Transistor Implementation of Mesh

Hierarchical Mesh Network Use overlayed mesh to support longer connections Reduced fanout and reduced resistance

EPLD Block Diagram Primary inputs Macrocell

Altera MAX

Altera MAX Interconnect Architecture column channel row channel t PIA LAB1 LAB2 LAB PIA t PIA LAB6 Array-based (MAX 3000-7000) Mesh-based (MAX 9000)

Field-Programmable Gate Arrays Fuse-based I/O Buffers Program/ Test/Diagnostics Vertical routes Standard-cell like floorplan I/O Buffers I/O Buffers Rows of logic modules Routing channels I/O Buffers

Xilinx 4000 Interconnect Architecture 12 Quad 8 Single 4 Double 3 Long CLB 2 Direct Connect 12 4 4 8 4 8 4 2 3 Long Quad Long Global Clock Long Double Single Global Clock Carry Chain Direct Connect

RAM-based FPGA Xilinx XC4000ex

A Low-Energy FPGA (UC Berkeley) Array Size: 8x8 (2 x 4 LUT) Power Supply: 1.5V & 0.8V Configuration: Mapped as RAM Toggle Frequency: 125MHz Area: 3mm x 3mm

Larger Granularity FPGAs PADDI-2 (UC Berkeley) 1-mm 2-metal CMOS tech 1.2 x 1.2 mm 2 600k transistors 208-pin PGA fclock = 50 MHz P av = 3.6 W @ 5V Basic Module: Datapath

Design at a crossroad System-on on-a-chip Multi- Spectral Imager 500 k Gates FPGA RAM + 1 Gbit DRAM Preprocessing 64 SIMD Processor Array + SRAM Image Conditioning 100 GOPS Analog µc system +2 Gbit DRAM Recognition Embedded applications where cost, performance, and energy are the real issues! DSP and control intensive Mixed-mode Combines programmable and application-specific modules Software plays crucial role

Addressing the Design Complexity Issue Architecture Reuse Reuse comes in generations Generation Reuse element Status 1 st Standard cells We ll e s tablis he d 2 nd IP blo c ks Being introduced 3 rd Architecture Eme rging 4 th IC Early re s e arc h Source: Theo Claasen (Philips) DAC 00

Architecture ReUse Silicon System Platform Flexible architecture for hardware and software Specific (programmable) components Network architecture Software modules Rules and guidelines for design of HW and SW Has been successful in PC s Dominance of a few players who specify and control architecture Application-domain specific (difference in constraints) Speed (compute power) Dissipation Costs Real / non-real time data

Platform-Based Design Only the consumer gets freedom of choice; designers need freedom from choice (Orfali,, et al, 1996, p.522) A platform is a restriction on the space of possible implementation choices, providing a well-defined abstraction of the underlying technology for the application developer New platforms will be defined at the architecture-micro-architecture boundary They will be component-based, and will provide a range of choices from structured-custom to fully programmable implementations Key to such approaches is the representation of communication in the platform model Source:R.Newton

Berkeley Pleiades Processor Interface FPGA Reconfigurable Data-path ARM8 Core 0.25um 6-level metal CMOS 5.2mm x 6.7mm 1.2 Million transistors 40 MHz at 1V 2 extra supplies: 0.4V, 1.5V 1.5~2 mw power dissipation

Heterogeneous Programmable Platforms FPGA Fabric Embedded PowerPc Embedded memories Hardwired multipliers High-speed I/O Xilinx Vertex-II Pro

Summary Digital CMOS Design is kicking and healthy Some major challenges down the road caused by Deep Sub-micron Super GHz design Power consumption!!!! Reliability making it work Some new circuit solutions are bound to emerge Who can afford design in the years to come? Some major design methodology change in the making!