LSI Design Flow Development for Advanced Technology

Similar documents
Statistical Static Timing Analysis Technology

High-Speed Interconnect Technology for Servers

A 0.9 V Low-power 16-bit DSP Based on a Top-down Design Methodology

ECE 546 Introduction

Ultra-high-speed Interconnect Technology for Processor Communication

Semiconductor Technology Academic Research Center An RTL-to-GDS2 Design Methodology for Advanced System LSI

Variation-Aware Design for Nanometer Generation LSI

LSI and Circuit Technologies of the SX-9

Summary of Fujitsu SoC Technology and Related Business

LSI and Circuit Technologies for the SX-8 Supercomputer

PROCESS-VOLTAGE-TEMPERATURE (PVT) VARIATIONS AND STATIC TIMING ANALYSIS

Lecture #2 Solving the Interconnect Problems in VLSI

EE 434 ASIC and Digital Systems. Prof. Dae Hyun Kim School of Electrical Engineering and Computer Science Washington State University.

UNIT-II LOW POWER VLSI DESIGN APPROACHES

UNIT-III POWER ESTIMATION AND ANALYSIS

BASICS: TECHNOLOGIES. EEC 116, B. Baas

Policy-Based RTL Design

White Paper Stratix III Programmable Power

NanoFabrics: : Spatial Computing Using Molecular Electronics

POWER GATING. Power-gating parameters

Amber Path FX SPICE Accurate Statistical Timing for 40nm and Below Traditional Sign-Off Wastes 20% of the Timing Margin at 40nm

INF3430 Clock and Synchronization

Design and Implement of Low Power Consumption SRAM Based on Single Port Sense Amplifier in 65 nm

TECHNOLOGY scaling, aided by innovative circuit techniques,

Low Power System-On-Chip-Design Chapter 12: Physical Libraries

Tiago Reimann Cliff Sze Ricardo Reis. Gate Sizing and Threshold Voltage Assignment for High Performance Microprocessor Designs

Timing analysis can be done right after synthesis. But it can only be accurately done when layout is available

Microcontroller Systems. ELET 3232 Topic 13: Load Analysis

Low Power Design Methods: Design Flows and Kits

Lecture 9: Clocking for High Performance Processors

Research in Support of the Die / Package Interface

CMOS VLSI IC Design. A decent understanding of all tasks required to design and fabricate a chip takes years of experience

Dr. Ralf Sommer. Munich, March 8th, 2006 COM BTS DAT DF AMF. Presenter Dept Titel presentation Date Page 1

Disseny físic. Disseny en Standard Cells. Enric Pastor Rosa M. Badia Ramon Canal DM Tardor DM, Tardor

Lecture 1. Tinoosh Mohsenin

Lecture 11: Clocking

MTCMOS Post-Mask Performance Enhancement

Microcircuit Electrical Issues

Reducing Transistor Variability For High Performance Low Power Chips

Fixing Antenna Problem by Dynamic Diode Dropping and Jumper Insertion

A Survey of the Low Power Design Techniques at the Circuit Level

Low Transistor Variability The Key to Energy Efficient ICs

The Need for Gate-Level CDC

2009 Spring CS211 Digital Systems & Lab 1 CHAPTER 3: TECHNOLOGY (PART 2)

Ruixing Yang

Low-Power VLSI. Seong-Ook Jung VLSI SYSTEM LAB, YONSEI University School of Electrical & Electronic Engineering

LOW-POWER SOFTWARE-DEFINED RADIO DESIGN USING FPGAS

CHAPTER 3 NEW SLEEPY- PASS GATE

EDA Challenges for Low Power Design. Anand Iyer, Cadence Design Systems

Leakage Power Minimization in Deep-Submicron CMOS circuits

All Digital on Chip Process Sensor Using Ratioed Inverter Based Ring Oscillator

28nm and below: New Frontiers and Innovations in Design for Manufacturing. Vito Dai, Ph.D. Sr. Member of Technical Staff, DFM

Electronic Design Automation at Transistor Level by Ricardo Reis. Preamble

Circuit Design of 2-Input Reconfigurable Dynamic. Logic Based on Double Gate MOSFETs. with Whole Set of 16 Functions

Managing Cross-talk Noise

! Review: MOS IV Curves and Switch Model. ! MOS Device Layout. ! Inverter Layout. ! Gate Layout and Stick Diagrams. ! Design Rules. !

DATASHEET CADENCE QRC EXTRACTION

EE 434 ASIC & Digital Systems

CS 6135 VLSI Physical Design Automation Fall 2003

VLSI Physical Design Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur

Chapter 1 Introduction

ECEN689: Special Topics in High-Speed Links Circuits and Systems Spring 2012

On Chip Active Decoupling Capacitors for Supply Noise Reduction for Power Gating and Dynamic Dual Vdd Circuits in Digital VLSI

Jack Keil Wolf Lecture. ESE 570: Digital Integrated Circuits and VLSI Fundamentals. Lecture Outline. MOSFET N-Type, P-Type.

Trends and Challenges in VLSI Technology Scaling Towards 100nm

VLSI Design Verification and Test Delay Faults II CMPE 646

Downsizing Technology for General-Purpose Inverters

! Review: MOS IV Curves and Switch Model. ! MOS Device Layout. ! Inverter Layout. ! Gate Layout and Stick Diagrams. ! Design Rules. !

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

Chapter 3 Chip Planning

BiCMOS Circuit Design

BICMOS Technology and Fabrication

DESIGN OF MULTIPLYING DELAY LOCKED LOOP FOR DIFFERENT MULTIPLYING FACTORS

Timing Issues in FPGA Synchronous Circuit Design

ESE 570: Digital Integrated Circuits and VLSI Fundamentals

SURVEY AND EVALUATION OF LOW-POWER FULL-ADDER CELLS

unit: mm 3159-QFP64E unit: mm 3190-SQFP64

A Novel Low-Power Scan Design Technique Using Supply Gating

Low-Power Digital CMOS Design: A Survey

Lineup for Compact Cameras from

Guaranteeing Silicon Performance with FPGA Timing Models

Learning Outcomes. Spiral 2 8. Digital Design Overview LAYOUT

Nanowire-Based Programmable Architectures

Interconnect-Power Dissipation in a Microprocessor

Low Power Design Part I Introduction and VHDL design. Ricardo Santos LSCAD/FACOM/UFMS

Introduction to VLSI ASIC Design and Technology

AN EFFICIENT APPROACH TO MINIMIZE POWER AND AREA IN CARRY SELECT ADDER USING BINARY TO EXCESS ONE CONVERTER

Signal Integrity Management in an SoC Physical Design Flow

Novel Low-Overhead Operand Isolation Techniques for Low-Power Datapath Synthesis

1 Digital EE141 Integrated Circuits 2nd Introduction

Process and Environmental Variation Impacts on ASIC Timing

Noise Constraint Driven Placement for Mixed Signal Designs. William Kao and Wenkung Chu October 20, 2003 CAS IEEE SCV Meeting

Introduction. Digital Integrated Circuits A Design Perspective. Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic. July 30, 2002

3084 IEEE TRANSACTIONS ON NUCLEAR SCIENCE, VOL. 60, NO. 4, AUGUST 2013

LC7574NE, 7574NW. 1/2 Duty VFD Driver for Frequency Display

BCT4222B BCT4222B. High-Speed DPDT Analog Switch. General Description. Features. Connection Diagram. Applications

Design Technology Challenges in the Sub-100 Nanometer Era

Static Power and the Importance of Realistic Junction Temperature Analysis

Advanced Digital Design

A Review of Clock Gating Techniques in Low Power Applications

Transcription:

LSI Design Flow Development for Advanced Technology Atsushi Tsuchiya LSIs that adopt advanced technologies, as represented by imaging LSIs, now contain 30 million or more logic gates and the scale is beginning to approach the level of 100 million gates. As compared with the 90 nm process generation, this is three to ten times the number of gates. With advanced technologies such as 40 nm and 28 nm, the process characteristics become remarkably complex and wiring resistance rapidly increases, which unavoidably means that more sign-off corners and more accurate sign-off and layout tools are required. In addition, mask design rules, which are process requirements, are constantly increasing in complexity and the 28 nm process requires twice as many mask design rules as the 90 nm one. To address these changing needs in the areas of design and technology, Fujitsu Semiconductor is developing new features for the LSI design environment called Reference Design Flow for introduction into LSI design. This has reduced the turnaround time (TAT) of LSI development adopting advanced technologies. This paper describes the characteristics and effects of Reference Design Flow newly enhanced for advanced technologies. 1. Introduction Imaging LSIs are increasingly designed into system-on-a-chip systems and the increasing scale of LSIs and accompanying shift to advanced technologies have become remarkable. Meanwhile, time-to-market of LSIs is in a trend toward further reduction so as to beat the global competition. In order to satisfy conflicting requirements of scale increase of LSIs and reduction of LSI design periods at the same time, Fujitsu Semiconductor has developed an LSI design environment called Reference Design Flow 1) and applied it to the design of microcontrollers, application-specific integrated circuits (ASICs), and application-specific standard products (ASSPs). Of the various parts of the Reference Design Flow, this paper puts special focus on layout design and describes issues including measures for handling LSIs, measures for addressing increase of sign-off corners, improvement of layout tool accuracy, and measures for accommodating new mask design rules. These are essential for developing the LSI design environment for advanced processes such as 40 nm and 28 nm. This paper also presents solutions to those issues and their effects. 2. Measures for handling LSIs LSIs that adopt advanced technologies now contain 30 million or more logic gates and the scale is beginning to approach the level of 100 million gates. This means that, as compared with the 90 nm process generation, the number of gates has increased by three to ten times. The International Technology Roadmap for Semiconductors (ITRS) has shown a roadmap indicating that the trend toward LSIs is expected to continue in the future 2) as shown in Figure 1. For that reason, measures for handling LSIs are the highest priority issue in developing the LSI design environment. There are several techniques for designing LSIs in a short turnaround time (TAT), and a representative one is hierarchical design. Application of hierarchical design allows the scale of the circuit as the layout design object to be arbitrarily controlled and layout design can be completed within a TAT that meets the schedule and with the memory included in the computer used. However, there are some layout processes to which hierarchical design cannot be applied. Design planning is one such process. Figure 2 shows an outline of the processing flow of design planning. Design 110 FUJITSU Sci. Tech. J., Vol. 49, No. 1, pp. 110 116 (January 2013)

Rate of increase of LSI circuit scale normalized to 2009 (times) 12 11 10 9 8 7 6 5 4 3 2 1 0 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 Year Copyright 2009 International Technology Roadmap for Semiconductors Figure 1 Trend of increasing LSI scale. planning is a process carried out in the first half of layout design and includes the following steps. IO placement block placement Macro placement and cell placement Power routing block pin placement Signal routing partitioning As shown in the figure, hierarchical partitioning is performed in the last step of design planning, which means that design planning itself must be processed by flat design. For that reason, completing design planning in a short TAT while maintaining accuracy is an issue in the development of the LSI design environment. Reference Design Flow resolves this issue by introducing three new features, which are described below. 1) Introduction of netlist reduction feature As the first measure, the netlist reduction feature has been introduced. The following outlines the feature. The layout tool itself deletes unnecessary circuit information from the memory according to the content of the process of design planning The relevant design planning is carried out After the process is completed, the deleted circuit information is automatically restored so that the next design planning step can be started This feature reduces the scale of the circuit handled by the layout tool itself, thus allowing the processing time and amount of memory used to be reduced. 2) Introduction of automatic macro placement feature With LSIs, the number of macros including memory has reached a few thousand, crossing the limit of what is possible with manual macro placement. To address this issue, an automatic macro placement feature has been introduced as the second measure. With this feature, results very close to manual macro placement with minimum dead space can be generated in short periods of time. As an example, a macro placement that took two days when done manually has been completed in six hours. 3) Introduction of high-speed processing engine for cell placement, signal routing and timing optimization In design planning, a lot of time is spent for LSI feasibility verification including: FUJITSU Sci. Tech. J., Vol. 49, No. 1 (January 2013) 111

Flat design Design planning IO placement block placement Macro placement & cell placement Power routing block pin placement block 1 block 2 Flat design Layout design Signal routing partitioning partitioning Placement & routing design Model 1 Model 2 block 1 = + + block 2 design Figure 2 Outline of processing flow of design planning. Routability closure at the target chip size and routing layer Timing closure under the target sign-off condition In this verification process, the steps of cell placement, signal routing, and timing optimization are repeated. For that reason, as the third measure, a new high-speed processing engine has been introduced for these three steps, thereby reducing the LSI feasibility verification test time. By applying design planning including the features described above, design planning of LSIs integrating 110 million logic gates and 10 000 macros can be processed in 48 hours with 77 GB of memory. In designing LSIs, TAT reduction by ensuring support for multi-cpu processing is also very effective. Reference Design Flow provides positive support for multi-cpu processing. Concerning layout design, almost all processes support multi-cpu processing and multi-cpu processing under a job scheduler such as Load Sharing Facility (LSF) is also supported. Figure 3 shows the TAT reduction effects by using multi-cpu processing. It shows results of a layout with one CPU and four CPUs of a block on a scale of four million gates as the layout object. While the reduction effects vary depending on the process step, TAT reductions of 18.8% to 67.4% can be achieved. 3. Measures for addressing increase of sign-off corners An LSI sign-off condition is composed of a combination of operation guaranteed ranges of process, temperature and voltage. A sign-off corner refers to an arbitrary corner point of the combination. For example, a sign-off condition of process slow, high temperature and lower limit voltage is defined as the worst corner and a sign-off condition of process fast, low temperature and upper limit voltage as the best corner. With advanced technologies, process characteristics have increased in complexity and a characteristic 112 FUJITSU Sci. Tech. J., Vol. 49, No. 1 (January 2013)

(after signal routing) 52.5% reduction Signal routing 67.4% reduction (after clock generation) 18.8% reduction Clock generation 61.2% reduction (before clock generation) 19.5% reduction Initial placement 35.1% reduction Figure 3 TAT reduction effects of multi-cpu processing. 0 100 200 300 400 500 600 700 800 TAT (min) may emerge in which the worst delay is generated in a low temperature condition something that was unthinkable with the conventional processes. Similarly, a characteristic exists in which leakage power is inverted depending on the combination of V th (transistor threshold voltage) and temperature. To cover all of these advanced process characteristics, the number of sign-off corners unavoidably increases in advanced technologies as compared with the conventional technologies. Naturally, increased sign-off corners means there are more corners to be processed with a layout. To focus on optimizing setup timing, multiple corners must be considered. In addition, the power worst corner separate from the corner for setup timing optimization must be considered as well for leakage power optimization. Dealing with multiple corners, which may differ depending on the layout purpose, with high accuracy and a short TAT is an issue in the development of the LSI design environment. Reference Design Flow resolves this issue by introducing two new features, which are described below. 1) Introduction of Multi-Corner Multi-Mode (MCMM) feature As the first measure, the MCMM feature has now been introduced to layout design in addition to timing ECO design 1), which has been previously supported. The following outlines the MCMM feature. Multiple corners (multiple modes can be specified as required) are specified for the layout tool Layout design is implemented while giving consideration to multiple corners specified at the same time Traditionally, there was no way to handle this other than serially processing as many setup timing optimization steps as the number of corners, which had a risk of causing a local solution. By introducing this new feature, multiple corners can be collectively processed and the process can be completed with a short TAT without rework. Furthermore, there was a limit of only one corner considered for clock generation. For this reason, a clock skew occurred at an unconsidered corner and hindered convergence of layout design in some cases. After introducing this feature, it has become possible to give consideration to multiple corners at the same time and the problem of clock skew at an unconsidered corner has been eliminated. Introducing this feature has also allowed FUJITSU Sci. Tech. J., Vol. 49, No. 1 (January 2013) 113

simultaneous consideration of multiple corners required for setup timing calculation and leakage power calculation. This has significantly improved the accuracy of leakage power optimization, and thus helped disseminate this optimization method. The benefits of introducing the MCMM feature are as described above, but there are also drawbacks. Generally, the TAT increases in proportion to the number of corners considered by the layout tool and measures to address this issue are also required in the development of the LSI design environment. 2) Introduction of auto scenario note1) reduction feature As the second measure, the auto scenario reduction feature has been introduced. An outline is as follows. The layout tool itself calculates the timing of the specified scenario immediately before the start of layout design Unnecessary scenarios are automatically deleted from the memory according to the content of the process such as setup optimization, clock generation and leakage power optimization The processing time is reduced by keeping on the memory only the scenarios necessary for the note1) A combination of an arbitrary corner and arbitrary mode is called a scenario. A mode corresponds to the timing constraints describing the operation of an LSI. relevant process After the process is completed, the deleted scenarios are automatically restored and the next layout step is started. By introducing this feature, layout design can be completed within a TAT that meets the schedule even if multiple corners required by advanced technologies are specified for the layout tool. Figure 4 shows the TAT reduction effects of the auto scenario reduction feature. It shows the results of a layout of a block on a scale of four million gates as the layout object before and after this feature has been introduced. While the reduction effects vary depending on the process step, TAT reductions of 21.4% to 28.3% can be achieved. 4. Improvement of layout tool accuracy Along with the advancement of technologies, the impact of the following factors on delay analysis, which were conventionally negligible, has increased and new verification technologies and sign-off conditions are now required. 1) Increase of crosstalk delay 2) Increase of on-chip variation 3) Increase of variation of wiring resistance and wiring capacitance due to wiring density 4) Increase of wiring resistivity 1200 1000 28.3% reduction 800 TAT (min) 600 24.1% reduction Before introduction 400 21.4% reduction After introduction 200 0 Before introduction After introduction (before clock generation) Before After introduction introduction (after clock generation) (after signal routing) Figure 4 TAT reduction effects of auto scenario reduction feature. 114 FUJITSU Sci. Tech. J., Vol. 49, No. 1 (January 2013)

For Reference Design Flow, we have quickly launched verification technologies and put the sign-off conditions in place. Measures for the sign-off tool have been taken. For the layout tool, however, one problem was that there was no other choice but to address the error factors in delay analysis mentioned above by taking measures in the form of design margins for reasons including: Inadequate functionality of the layout tool Inability to be used in practical applications in terms of TAT even if adequate functionality is provided Nevertheless, with advanced technologies such as 40 nm and 28 nm, the increased design complexity and process variation made convergence of calculation in layout design impossible with design margin, and there was an urgent need to improve the accuracy of the layout tool. Accordingly, for Reference Design Flow, we have launched new features, accelerated enhancement and positively introduced a new version of the layout tool so as to accommodate advanced technologies, and thereby improved the accuracy of the layout tool. Specifically, the following new features and libraries have been introduced into the layout design environment. 1) Introduction of a delay calculation engine that takes crosstalk into consideration 2) Introduction of an on-chip variation (OCV) calculation engine 3) Introduction of an engine for wiring resistance and wiring capacitance extraction that takes wiring density into consideration 4) Introduction of Composite Current Source (CCS) libraries note2) Introducing the new features and libraries mentioned above has improved the accuracy of layout tools and improved the correlation between the sign-off and layout tools. Further, the design margins, which were expected in the conventional LSI design environment, have been significantly reduced. note2) Non-linear delay model (NLDM) libraries were common in the past but they have a limitation in that they cannot handle the impact of wiring resistance with high accuracy. For that reason, CCS libraries, which are current source model-based, are the mainstream for advanced technologies. Figure 5 shows delay calculation errors between sign-off and layout tools in 28 nm technology. In 28 nm technology, in which correlation is difficult, delay calculation errors are within the range of ±10 ps with most of the timing paths. This points to the high accuracy of the layout tool that almost eliminates the need for design margins. 5. Measures for accommodating new mask design rules Due to the miniaturization of processes, measures for new mask design rules, which did not exist in the conventional technologies, are essential to advanced technologies. For example, the 28 nm process requires twice as many mask design rules as the 90 nm one. Meanwhile, when advanced technologies are launched, specifications of technology files note3) and features of layout tools are often unable to meet the requirements of new mask design rules. The LSI design environment must be developed so that the LSI design schedule can be met even in such a situation. With Reference Design Flow, we have resolved this issue by independently developing routing technology files tuned to satisfy new mask design rules within the scope of the existing technology file specifications and layout tool features. In the 40 nm and 28 nm process generations, there are new mask design rules relating to routing as well as placement but we have successfully satisfied the new rules by making use of workarounds. Even for a fabless company, techniques for tuning the LSI design environment including technology files are essential as differentiating techniques. 6. Conclusion This paper has described issues such as measures for handling LSIs, measures for addressing increase of sign-off corners, improvement of layout tool accuracy, and measures for accommodating new mask design rules. These are essential to the development of the LSI design environment for advanced technologies. To resolve these issues, we have developed and introduced note3) A technology file is a file representing mask design rules for each layout or sign-off tool. Multiple technology files exist such as those for placement and routing and for wiring resistance and wiring capacitance extraction. FUJITSU Sci. Tech. J., Vol. 49, No. 1 (January 2013) 115

120 000 ±10 ps 100 000 Number of timing paths 80 000 60 000 40 000 20 000 0 100 90 80 70 60 50 40 30 20 10 0 10 20 30 40 50 60 70 80 90 100 Delay value error (ps) Figure 5 Delay calculation errors between sign-off and layout tools. new features including the netlist reduction feature, automatic macro placement feature, high-speed processing engine for cell placement, signal routing and timing optimization, MCMM feature, auto scenario reduction feature, delay calculation engine that takes crosstalk into consideration, OCV calculation engine, an engine for wiring resistance and wiring capacitance extraction that takes wiring density into consideration, CCS libraries, and independent development of routing technology files. We have thereby managed to reduce TAT of LSI development adopting advanced technologies such as 40 nm and 28 nm processes. For this LSI design environment, we also plan to conduct development in future so as to accommodate the 20 nm process. References 1) H. Ikeda: Reference Design Flow for 90 nm CMOS ASICs. (in Japanese), FUJITSU, Vol. 55, No. 3, pp. 215 220 (2004). 2) ITRS: International Technology Roadmap for Semiconductors 2009 Update, System Drivers. ITRS, pp. 6 7 (2009). Atsushi Tsuchiya Fujitsu Semiconductor Ltd. Mr. Tsuchiya is currently engaged in development and support of physical design environment. 116 FUJITSU Sci. Tech. J., Vol. 49, No. 1 (January 2013)