DDR4 memory interface: Solving PCB design challenges

Similar documents
Microcircuit Electrical Issues

Touchstone v2.0 SI/PI S- Parameter Models for Simultaneous Switching Noise (SSN) Analysis of DDR4 Memory Interface Applications.

High Speed Digital Systems Require Advanced Probing Techniques for Logic Analyzer Debug

DL-150 The Ten Habits of Highly Successful Designers. or Design for Speed: A Designer s Survival Guide to Signal Integrity

Modeling System Signal Integrity Uncertainty Considerations

TITLE. Image. Topic: Topic: Hee-Soo o LEE, Keysight Technologies Cindy Cui, Keysight Technologies

ULTRASCALE DDR4 DE-EMPHASIS AND CTLE FEATURE OPTIMIZATION WITH STATISTICAL ENGINE FOR BER SPECIFICATION

Effective Routing of Multiple Loads

Signal Technologies 1

PCB Routing Guidelines for Signal Integrity and Power Integrity

8. QDR II SRAM Board Design Guidelines

Asian IBIS Summit, Tokyo, Japan

MICTOR. High-Speed Stacking Connector

Logic Analyzer Probing Techniques for High-Speed Digital Systems

Z-Dok High-Performance Docking Connector

High Speed Characterization Report

H19- Reliable Serial Backplane Data Transmission at 10 Gb/s. January 30, 2002 Slide 1 of 24

Demystifying Vias in High-Speed PCB Design

DP Array DPAM/DPAF Final Inch Designs in Serial ATA Generation 1 Applications 10mm Stack Height. REVISION DATE: January 11, 2005

Low Jitter, Low Emission Timing Solutions For High Speed Digital Systems. A Design Methodology

High Speed Characterization Report

Where Did My Signal Go?

Signal/Power Integrity Analysis of High-Speed Memory Module with Meshed Reference Plane 1

How to anticipate Signal Integrity Issues: Improve my Channel Simulation by using Electromagnetic based model

Relationship Between Signal Integrity and EMC

Intro. to PDN Planning PCB Stackup Technology Series

High Speed Characterization Report

A Co-design Methodology of Signal Integrity and Power Integrity

Keysight Technologies Signal Integrity Tips and Techniques Using TDR, VNA and Modeling

High Speed Characterization Report

Impedance Matching: Terminations

High-speed Serial Interface

High Speed Characterization Report

Effect of Power Noise on Multi-Gigabit Serial Links

Minimizing Coupling of Power Supply Noise Between Digital and RF Circuit Blocks in Mixed Signal Systems

LVDS Owner s Manual. A General Design Guide for National s Low Voltage Differential Signaling (LVDS) Products. Moving Info with LVDS

Validation Report Comparison of Eye Patterns Generated By Synopsys HSPICE and the Agilent PLTS

High Speed Characterization Report

High Speed Characterization Report

To learn statistical bit-error-rate (BER) simulation, BER link noise budgeting and using ADS to model high speed I/O link circuits

DL-150 The Ten Habits of Highly Successful Designers. or Design for Speed: A Designer s Survival Guide to Signal Integrity

Taking the Mystery out of Signal Integrity

EMC problems from Common Mode Noise on High Speed Differential Signals

High Speed Characterization Report

RiseUp RU8-DP-DV Series 19mm Stack Height Final Inch Designs in Serial ATA Generation 1 Applications. Revision Date: March 18, 2005

CROSSTALK DUE TO PERIODIC PLANE CUTOUTS. Jason R. Miller, Gustavo Blando, Istvan Novak Sun Microsystems

Lambert Simonovich 5/28/2012

EQCD High Speed Characterization Summary

Q2 QMS-DP/QFS-DP Series 11 mm Stack Height Final Inch Designs in Serial ATA Generation 1 Applications. Revision Date: February 22, 2005

ECE 497 JS Lecture - 22 Timing & Signaling

LVDS Flow Through Evaluation Boards. LVDS47/48EVK Revision 1.0

Using IBIS Models for Timing Analysis

A 5-Gb/s 156-mW Transceiver with FFE/Analog Equalizer in 90-nm CMOS Technology Wang Xinghua a, Wang Zhengchen b, Gui Xiaoyan c,

Differential Signaling is the Opiate of the Masses

100 Gb/s: The High Speed Connectivity Race is On

HOW SMALL PCB DESIGN TEAMS CAN SOLVE HIGH-SPEED DESIGN CHALLENGES WITH DESIGN RULE CHECKING MENTOR GRAPHICS

Power integrity is more than decoupling capacitors The Power Integrity Ecosystem. Keysight HSD Seminar Mastering SI & PI Design

Engineering the Power Delivery Network

Overcoming Obstacles to Closing Timing for DDR and Beyond. John Ellis Sr. Staff R&D Engineer Synopsys, Inc.

3D IC-Package-Board Co-analysis using 3D EM Simulation for Mobile Applications

ECEN720: High-Speed Links Circuits and Systems Spring 2017

AN ABSTRACT OF THE THESIS OF

High Speed Characterization Report

Matched Length Matched Delay

Effect of slots in reference planes on signal propagation in single and differential t-lines

QPairs QTE-DP/QSE-DP Final Inch Designs in Serial ATA Generation 1 Applications 5mm Stack Height. REVISION DATE: January 12, 2005

The Facts about the Input Impedance of Power and Ground Planes

ABSTRACT. As data frequency increases beyond several Gbps range, low power chip to chip

Advanced Transmission Lines. Transmission Line 1

Signal Integrity Design of TSV-Based 3D IC

Guide to CMP-28/32 Simbeor Kit

Texas Instruments DisplayPort Design Guide

Cyclone III Simultaneous Switching Noise (SSN) Design Guidelines

High Speed Digital Systems Require Advanced Probing Techniques for Logic Analyzer Debug

DesignCon 2003 High-Performance System Design Conference (HP3-5)

EE290C Spring Lecture 2: High-Speed Link Overview and Environment. Elad Alon Dept. of EECS

Data Mining 12-Port S- Parameters

5Gbps Serial Link Transmitter with Pre-emphasis

AN-1106 Custom Instrumentation Amplifier Design Author: Craig Cary Date: January 16, 2017

A Bottom-Up Approach to on-chip Signal Integrity

Adjusting Signal Timing (Part 1)

10 GIGABIT ETHERNET CONSORTIUM

Section VI. PCB Layout Guidelines

CAPLESS REGULATORS DEALING WITH LOAD TRANSIENT

Differential Pair Routing

Interfacing Single-Ended PECL to Differential PECL and Differential PECL to Single-Ended PECL

Interfacing Virtex-6 FPGAs with 3.3V I/O Standards Author: Austin Tavares

How Long is Too Long? A Via Stub Electrical Performance Study

How Return Loss Gets its Ripples

NOT RECOMMENDED FOR NEW DESIGNS. Features. Applications. Markets

Lecture #2 Solving the Interconnect Problems in VLSI

EUP V/12V Synchronous Buck PWM Controller DESCRIPTION FEATURES APPLICATIONS. Typical Application Circuit. 1

Bill Ham Martin Ogbuokiri. This clause specifies the electrical performance requirements for shielded and unshielded cables.

DesignCon Design of Gb/s Interconnect for High-bandwidth FPGAs. Sherri Azgomi, Altera Corporation

IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS 1

IEEE Std 802.3ap (Amendment to IEEE Std )

TITLE. Capturing (LP)DDR4 Interface PSIJ and RJ Performance. Image. Topic: Topic: John Ellis, Synopsys, Inc. Topic: malesuada blandit euismod.

Note Using the PXIe-5785 in a manner not described in this document might impair the protection the PXIe-5785 provides.

Eye Diagrams. EE290C Spring Most Basic Link BER. What About That Wire. Why Wouldn t You Get What You Sent?

To learn S-parameters, eye diagram, ISI, modulation techniques and their simulations in MATLAB and Cadence.

Transcription:

DDR4 memory interface: Solving PCB design challenges Chang Fei Yee - July 23, 2014 Introduction DDR SDRAM technology has reached its 4th generation. The DDR4 SDRAM interface achieves a maximum data rate of 3.6Gbps per bit (i.e., clock rate of 1.8GHz). There are four key challenges in designing the placement and routing of DDR4 SDRAM interface with multi-gigabit transmission. The major challenges include the routing topology and termination scheme for nets with multiple receivers, routing technique to minimize crosstalk, method to mitigate the impedance discontinuity due to imperfect vias, and the method to maximize the timing margin of the data transmission. These four challenges and the ways to overcome them are discussed in the subsequent section of this paper. The study is done by analyzing signal integrity using Mentor Graphics Hyperlynx with the import of IBIS models of Kintex UltraScale FPGA from Xilinx and DDR4 SDRAM from Micron. Routing Technique for Multiple Receivers With the growing size of data storage, more and more memory ICs are required to be accessed by the processor or memory controller. In the worst case condition, there could be as many as eight memory ICs (i.e., fan out) connected to the processor. The conventional way of routing (i.e., tree topology) creates the issue of trace stub which degrades the signal integrity of the transmission channel. The effect of trace stub is explained in (1), where resonant frequency or the bandwidth of the transmission line is inversely proportional to the stub length. In conventional tree topology, the trace stub is lengthened with the increase in number of receivers. f o = resonant frequency (Hz) c = speed of light (1.18x10 10 inches/sec) stub_length in inches D k = dielectric constant

The eye diagram analysis is conducted on an address signal net in tree topology with two SDRAMs driven by an FPGA at 2.4Gbps, shown in Figure 1. A 50 ohm resistor that serves as a parallel termination, is pulled up to 0.6V (i.e., half of the power rail) to amplify the dynamic range of the signal. The total trace length from FPGA to each SDRAM is 2 inch, where a stub length of 1 inch is seen by each SDRAM. Referring to the specification of JEDEC and Micron, with bit rate of 2.4Gbps or unit interval (UI) of 416ps, setup time and hold time with respect to clock edge must be at least 62ps and 87ps respectively. The eye diagram at address pin of the memory is shown in Figure 2. The setup time is computed between the orange circle (i.e., when left half of eye opening crosses 700mV or logic 1 threshold) and the red vertical line (i.e., clock edge at the center of the eye opening). On the other hand, the hold time is computed between red vertical line and white circle (i.e., when right half of eye opening crosses 700mV or logic 1 threshold). This eye diagram indicates the setup time of 50ps, which violates the minimum timing requirement of the memory. This timing violation causes the issue of metastability where the receiver IC misinterprets the signal and results miscommunication between driver and receiver. Fig1. Simulation model of eye diagram for tree topology with 2 receivers Fig2. Eye diagram at memory with tree topology

The lower channel bandwidth due to trace stub attenuates the high frequency components that compose the rising and falling edges of the signal, and shrinks the eye opening at the memory. In order to overcome this problem, the conventional tree topology shall be replaced by the fly-by routing topology, shown in Figure 3. The first memory (i.e., U5) is connected to the driver (i.e., U4) with a 2 inch trace, while the second memory (i.e., U6) is cascaded to the first memory with a 1 inch trace. A parallel termination 50 ohm resistor is added at the end of the trace. Fig3. Simulation model of eye diagram for fly-by topology with 2 receivers The eye diagrams at memory U5 and U6 are illustrated in Figure 4A and Figure 4B respectively. The setup time and hold time in both figures are ~150ps respectively. The memory U6 does not experience stub effect at all with fly-by topology. A large eye opening is observed at memory U5 due to the fact that the long stub experienced by it, is terminated with a 50 ohm resistor. A properly terminated stub minimizes the reflection and attenuation of the signal.

Fig4A. Eye diagram at memory U5 with fly-by topology Fig4B. Eye diagram at memory U6 with fly-by topology The case study is further conducted with more receivers cascaded (i.e., 4 memory ICs in total) to the net, shown in Figure 5. The longest stub (i.e., 3 inch) is seen by the first memory, while shorter stub is experienced by the subsequent cascaded memory ICs. A parallel termination 50 ohm resistor is appended at the end of the channel (i.e., the last memory IC). The setup time and hold time of minimum 140ps at each receiver summarized in Table I, meet the timing requirement of JEDEC and Micron. By applying fly-by routing technique and properly terminating an opened long stub, attenuation of signal at the receiving end is alleviated.

Fig5. Simulation model of eye diagram for fly-by topology with 4 receivers Table I. Summary of setup and hold time with fly-by topology Memory IC Setup time Hold time (ps) (ps) 1st ~150 ~150 2nd ~150 ~150 3rd ~145 ~145 4th ~140 ~140

Routing method to alleviate crosstalk Routing Method to Alleviate Crosstalk Crosstalk effect due to capacitive and inductive coupling from a signal to another becomes more severe at higher frequency and edge rate. At 2.4Gbps for DDR4 technology, the edge rate could be as high as 10V/ns. In order to minimize the coupling effect from the aggressor to the victim at high edge rate, the spacing between two adjacent signal traces shall be at least three times the trace width. However, the large trace spacing is tough to be implemented on the PCB with small dimension used in mobile computing device. SDRAM applies source synchronous clocking scheme (i.e., non-interleaved routing), where all the signals in each single signaling group (e.g. address, data, control and command respectively) propagate in one direction simultaneously. In order to alleviate the crosstalk for traces with spacing of less than two times the copper width, the traces shall be laid out on inner PCB layer (i.e., stripline) instead of outer layer (i.e., microstrip). In order to avoid misinformation at the receiver in the source synchronous clocking scheme, far end crosstalk (FEXT) must be minimal. In this section, the FEXT is studied by analyzing s-parameter and transient response at the receiver on stripline and microstrip with different trace length. The simulation model of FEXT with coupled traces separated by one and a half times copper width is illustrated in Figure 6. Port 1 and 2 are the transmitting and receiving port respectively of the aggressor line. Meanwhile, Port 3 and 4 are the transmitting and receiving port respectively of the victim line. The plot of S-parameter (i.e., magnitude of S41 in db versus frequency) that represents FEXT is shown in Figure 7. The S41 refers to the ratio of induced noise at the victim's receiver to the injected signal at the aggressor's transmitter. A more severe FEXT is indicated by the waveform with magnitude closer to 0dB. With reference to Figure 7, FEXT is minimal (i.e., less than -12dB) for coupled pairs on stripline compared to microstrip. The FEXT becomes more significant when the coupled length increases. Fig6. Simulation model of s-parameter for FEXT

Fig7. S-parameter plot for FEXT The FEXT is further studied by analyzing the transient response at the victim's receiver. The simulation model and result of transient response are shown in Figure 8 and 9 respectively. Referring to Figure 9, for 1 inch coupled length, the injected signal at aggressor (i.e., red curve) swings between 200mV and 1000mV. Meanwhile, the noise spurs with amplitude ~80mVpp (i.e., blue curve) and ~150mVpp (i.e., green curve) are induced at victim's receiver on stripline and microstrip respectively. The magnitude of noise induction with different coupled length on stripline and microstrip is summarized in Table II. The amplitude of noise induction increases with longer coupled pairs, and FEXT is much more severe on microstrip compared to stripline. Fig8. Simulation model of transient response for FEXT

Fig9. Transient response plot for FEXT Table II. Summary of noise induction with different coupled length Coupled length (inch) on Stripline (mvpp) on Microstrip (mvpp) 1 ~80 ~150 2 ~110 ~200 3 ~120 ~220 4 ~190 ~400 Technique to Minimize Via Stub Effect The larger memory size of DDR4 SDRAM requires more bit width for data and address groups and increases the quantity of net to be laid out on PCB. Due to the limited space on PCB, different groups of signals are routed on different PCB layers. The transition of signal from one layer to another experiences a via stub effect. The bandwidth of the channel gets lower when the via stub becomes longer. The effect is explained in (2). f o = resonant frequency (Hz) c = speed of light (1.18x1010 inches/sec) via_stub_length in inches

D keff = effective dielectric constant The effect of via stub is analyzed with the simulation of insertion loss. With reference to the plot of insertion loss shown in Figure 10, the bandwidth decreases with the longer via stub. With via stub length of 120mil, 140mil and 160mil, the attenuation of signal is as much as -65dB minimum at 4.8GHz, 5.4GHz and 6.2GHz respectively. With much shorter via stub (i.e., less than 60mil), the insertion loss is lower than -5dB for frequency up to 9GHz. In order to keep the via stub short, if the driver and receiver ICs are placed on top layer of PCB, the traces shall be routed on layers closest to the bottom of PCB. Alternatively, micro via technique can be considered. Fig10. Insertion loss plot for varying via stub length Maximum Skew of Trace Length to Meet Timing Margin In order to be compliant with the specification of JEDEC, the maximum skew among all the signals shall be less than +/-2.5% of the clock period driven by the memory controller. All the signals of SDRAM are directly or indirectly referenced to the clock. For example, if the normal FR4 material with dielectric constant of 4 is used on PCB, at differential clock rate of 1.2GHz (i.e., 833ps clock period), the maximum skew shall be +/-125mil among all the signals. This number is computed with the equation of the propagation speed of signal on PCB, shown in (3).

S = propagation speed of signal on PCB c = speed of light (1.18x1010 inches/sec) D k = dielectric constant Conclusion DDR4 SDRAM operates at high data rate (i.e., from 1.6Gbps to 3.2Gbps) and the memory interface must be designed in a stringent way to comply with the specification set by JEDEC. All the major challenges discussed in this paper have to be overcome for excellent signal integrity to guarantee minimum bit error rate in the multi-gigabit transmission. REFERENCES [1] Romi Mayder, Raymond Anderson, Nilesh Kamdar, "Touchstone v2.0 SI/PI S-Parameter Models for Simultaneous Switching Noise (SSN) Analysis of DDR4 Memory Interface Applications", DesignCon 2014 [2] DRAM Memory In High-Speed Digital Designs by Micron Technology [3] June Feng, Bipin Dhavale, Janani Chandrasekhar, Yuri Tretiakov and Dan Oh, "System Level Signal and Power Integrity Analysis for 3200Mbps DDR4 Interface", Electronic Components & Technology Conference 2013, pp1081-1086 [4] Jun-Hee Shin, Seok -Il Kim, Young-Man Ahn, You-Keun Han, Seung- Jin Seo, "Methodology on power estimation of memory modules with Pseudo-open drain and Center-tab termination type termination schemes", 2011 [5] DDR4: Designing for Power and Performance, Altera, 2013 [6] POD12-1.2 V PSEUDO OPEN DRAIN INTERFACE, JESD8-24, 2011 [7] DDR4 SDRAM, JESD79-4, 2012 [8] LogiCORE IP UltraScale Architecture-Based FPGAs Memory Interface Solutions v4.2, Xilinx, 2013 [9] Using DDR4 in Networking Subsystems, Micron, 2014 [10] Why migrate to DDR4, David Wang, 2013 [11] Eric Bogatin, Signal and Power Integrity Simplified 2nd Edition, Prentice Hall, 2009 [12] Douglas Brooks, Signal Integrity Issues and Printed Circuit Board Design, Prentice Hall,

2008 Also see: Addressing the challenges of transition to DDR4 How to verify and debug next-generation memory