Clock Skew Verification in the Presence of IR-Drop in the Power Distribution Network
|
|
- Maximillian Knight
- 6 years ago
- Views:
Transcription
1 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 19, NO. 6, JUNE Clock Skew Verification in the Presence of IR-Drop in the Power Distribution Network Resve Saleh, Member, IEEE, Syed Zakir Hussain, Member, IEEE, Steffen Rochel, Member, IEEE, and David Overhauser, Member, IEEE Abstract Clocks are perhaps the most important circuits in high-speed digital systems. The design of clock circuitry and the quality of clock signals directly impact the performance of a very large scale integrated chip. Clock skew verification requires high accuracy and is typically performed using circuit simulators. However, in high-performance deep-submicrometer digital circuits, clocks are running at higher frequencies and are driving more gates than ever, thus presenting a higher current load on the power distribution network with the potential for substantial power grid voltage (IR)-drop. This IR-drop affects the clock timing and must be taken into account in the verification process. Since IR-drop is a full-chip phenomenon, the use of standard circuit simulation on both the clock circuitry and the power-grid is not practical. In this paper, we present a new methodology for the verification of clock delay and skew. An iterative technique is presented for clock simulation in the presence of full-chip dynamic IR-drop. The effect of IR-drop on the timing of clock signals is quantified on a small example, and demonstrated on a large chip. Index Terms Circuit simulation, CMOS integrated circuits, integrated circuit interconnections, matrix decomposition, power distribution, relaxation methods, very large scale integration. Fig. 1. Simple Buffered Clock Tree. I. INTRODUCTION THE CLOCK is the fastest and most active signal in a digital integrated circuit. It spans the entire chip and is the most critical control signal in the design [1]. In the early days of very large scale integration technology, clock circuitry was comprised of a few buffers in a simple tree structure, as shown in Fig. 1. A single clock tree drove the entire chip and was relatively easy to design. The clock skew, defined as the maximum difference between the arrival times of the clock signal to the latches, was easily controlled and it was possible to obtain nearly zero-skew clocks through an automatic clock synthesis and routing process [2]. Minimization of the clock skew is one of the key performance goals of clock design [3]. Today, the clock circuit is expected to drive thousands of latches and the design goal is to ensure that the clock signals all reach the latch inputs within a specified time bound. In order to reduce clock delay and skew, the clock Manuscript received January 31, This paper was recommended by Associate Editor E. Charbon. R. Saleh is with Simplex Solutions, Inc., Sunnyvale, CA USA ( res@simplex.com). S. Z. Hussain is with Simplex Solutions, Inc., Sunnyvale, CA USA ( szh@simplex.com). S. Rochel is with Simplex Solutions, Inc., Sunnyvale, CA USA ( steffen@simplex.com). D. Overhauser is with Simplex Solutions, Inc., Sunnyvale, CA USA ( ovee@simplex.com). Publisher Item Identifier S (00) Fig. 2. Interconnect delay dominates gate delay in DSM. is composed of large buffers driving long lines over the entire chip. As a result, the clock consumes large amounts of power, often exceeding 25% of the total chip power [4]. With the advent of deep-submicrometer (DSM) technology, designing high-speed clock networks is becoming increasingly complex. Typically, DSM refers to line widths of 0.35 m and below. Mainstream industry has reached and surpassed this feature size, with 0.25 and 0.18 m already in production. As the line width decreases, the number of devices continues to grow according to Moore s Law. However, it is not the increase in device count alone that is making chip design difficult. Rather, it is the fact that interconnect effects are now dominating the performance of the chip. Parasitic effects such as interconnect resistance and three-dimensional (3-D) capacitance have greatly increased the design complexity, and made clock design a considerable challenge. Fig. 2 illustrates the delay effects observed in DSM: gate delay is decreasing while interconnect delay is increasing with a crossover at roughly m [5]. This type of graph is /00$ IEEE
2 636 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 19, NO. 6, JUNE 2000 Fig. 3. The cause of IR-drop effects. cited in virtually every paper on timing in DSM. In practice, the curves apply to long nets in the design, and clock nets tend to be in this category. The main reasons for the shift from gate dominance to interconnect dominance is the increase in resistance and coupling capacitances. The resistance of the lines continues to increase as the line width decreases and this introduces a significant resistance-capacitance (RC) delay component. Coupling capacitances arise due to 3-D effects of tall, thin metal lines that are closely spaced together. Unfortunately, this makes it difficult to estimate the interconnect loads in advance. Most clock synthesis tools rely on accurate loading information and assume that gate delay is dominant, as shown in Fig. 2. Therefore, it is no longer possible to use these clock synthesis tools to generate zero-skew circuits. Another key issue adding to the complexity of clock design is the IR drop in the power supply [6], [7]. The causes of IR drop are the resistance in the power distribution system, as shown in Fig. 3, and increases in the current flowing through the power grid. In the past, resistance in the power grid could be safely ignored. But with the narrow metal lines used in DSM, resistance is a major factor in the power distribution system. Often, wider lines are used in power busses to reduce resistance, but the speed of today s clock designs require very large buffers which draw large currents. When these currents flow from the supply to the drivers, any resistance encountered in the power bus causes the voltage to drop. Therefore, the far-end inverter in Fig. 3 experiences a lower supply voltage than the first inverter. Ironically, the clock circuit itself is one of the major causes of IR drop. Of course, large bus drivers, memory decoder drivers and input output buffers can also lead to significant IR drop during the operation of the chip. In this paper, we focus on the effect of IR drop on clock timing. The IR drop varies dynamically at each buffer as it is related to the switching activity of the clock signal. It is clear that the current delivered by the buffers will vary depending on the level of the IR drop. This fluctuation of the supply voltage causes variations in the delay, skew, and slew rates. These effects can only be captured using detailed transient analysis. For simulation purposes, the clock circuit under analysis must incorporate all the effects shown in Fig. 4. The nonlinear transistors, RC interconnect, coupling capacitances and variations must be represented to obtain correct timing information. The coupling capacitances can be grounded for the clock simulation since most of the adjacent nets are stable during the rapid clock transition. The circuit model of Fig. 4 is used in our clock verification methodology. Note that each buffer has a different value of that varies as a function of time. II. EFFECT OF IR-DROP ON TIMING Before describing the verification techniques, it is important to understand and quantify the effects of IR drop on timing using a representative design. Since IR drop effectively reduces the supply voltage, the buffers connected to the power grid become weaker and the propagation delay through the clock increases. In this section, we present a quantitative view of exactly how much variation can be expected. For this purpose, two small clock circuits, presented in Fig. 5, were simulated to observe the impact of dynamic IR-drop. The circuits were deliberately kept small enough so that SPICE simulations could be performed to generate the results. The circuit of Fig. 5(a) contains a five-stage clock configured as a tree and connected to the power grid. The clock tree consists of 23 inverters connected together with distributed RC interconnect. The circuit of Fig. 5(b) is an H-tree [9] configuration of the clock which attempts to evenly distribute the clock to all parts of a chip. The power grids, shown as dotted lines in Fig. 5, each consist of a mesh of nodes, resistors and capacitors lumped to ground. Two 1.8-V sources are connected to the power grid at nodes and. This asymmetric connection is used to create a variable dynamic IR drop in the power grid. In Fig. 6, simulation results of the circuit in Fig. 5(a) using a commercial SPICE program are presented. A clock input ramp with a 0.1-ns rise time is applied at node CLK. The waveforms at the output of all five stages are shown along with an IR-drop waveform from the power grid. The simulation results show the transient behavior of the power grid voltage with a maximum drop associated with the switching of stages four and five. The highest IR-drop value of 45 mv or 2.5%, is observed at the power grid tap point of inverter G14. Similar results were obtained for Fig. 5(b) but are not shown here. An instantaneous IR-drop plot for the whole power grid is shown in Fig. 7 at the timepoint ps where the most significant voltage drop occurs, based on the results of Fig. 6. The voltage drop is shaded from very light to very dark as the voltage drop transitions from a lower value to a higher value. The contour plot shows a transition from a very small IR drop in the top corners to a noticeable drop at the positions of buffers G12, G13, G14, and G15. This is expected since the voltage supplies are connected in the top corners and the largest effective resistance occurs from the supplies to these four buffers. In order to assess the effect of IR drop on timing, the power grid was removed and an ideal voltage source was attached to all 23 buffers. The voltage source waveform was a time-varying function that was similar to the IR-drop waveform of Fig. 6, except that the level of IR drop was adjusted to be 0%, 5%, 10%, 15%, and 20% of in five successive runs. A scatter plot of the percentage increase in delay for each case is shown in Fig. 8(a). The H-tree of Fig. 5(b) was simulated using the same experiments to obtain Fig. 8(b). The results show that a given percentage of variation, or IR drop, translates directly to the same percentage of delay variation (note that this is a change in the delay as opposed to a change in skew). For example, a 10% voltage drop causes at most a 10% increase in delay. More accurately, a 10% IR drop causes an increase in delay of about 5% to 10%. In fact, our results show that an % IR drop causes
3 SALEH et al.: CLOCK SKEW VERIFICATION IN THE PRESENCE OF IR-DROP 637 Fig. 4. Buffers and RC interconnect model for clocks. Fig. 5. (a) Clock circuits connected to the power-grid. (a) Simple Clock Tree. (b) Simple H-tree. (b) a delay change of to % and this serves as a useful rule-ofthumb for quantifying the impact of IR drop on timing. To support our claim, we examined the normalized sensitivity of delay to changes in. In general, the normalized sensitivity of a variable to a variable is given by the standard equation In our case, is the switching delay,, and is the power supply voltage,. The percentage change in the delay is simply the normalized sensitivity multiplied by the percentage change in (1) (2) Now, consider the delay associated with the inverter in Stage 1 of Fig. 4. If the input switches low, then the output of the inverter will be charged to a high value. A simplified expression for the delay of the gate switching from low-to-high is given by where is the total capacitance at the output, is the -channel charging current and the resistance is set to zero. Assuming short channel devices, we expect that the transistor would be velocity saturated for most of the transition. A simple model for the transistor in the saturation region is given in [8] where is the channel width, is the carrier saturation velocity, is the oxide capacitance, is the gate-to-source (3) (4)
4 638 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 19, NO. 6, JUNE 2000 Fig. 6. Circuit simulation results with dynamic IR-drop. Equation (6) is plotted as a function of in Fig. 9 with V and V. The sensitivity values lie in the range of approximately 0.9 to 0.5 for ranging from 2.5 V to 4 V. This is expected since there is an inverse relationship between and delay. It is also consistent with the experimental results of Fig. 8. To see this, consider the nominal value of V in our examples. The sensitivity at this value is approximately This implies that a 10% decrease in would produce a 7.5% increase in delay. As the voltage drops, the delay becomes more sensitive to changes in, which further supports the expermental results that a 10% decrease in produces a 5% 10% increase in the delay. Based on this analysis, we believe that our rule-of-thumb can be used for quantifying the impact of IR drop on timing. Note that the sensitivity of delay to IR drop gets even worse as the power supply is reduced further. Fig. 7. IR drop at t =200ps for power grid of Fig. 5(a). voltage, is the drain-to-source voltage and is the device threshold voltage. We can compute the current in saturation by setting to obtain [8] where is the critical electric field and is the channel length. After setting, if we apply (5) to (3), we can compute the normalized sensitivity from (1) to obtain (5) (6) III. CLOCK VERIFICATION METHODOLOGY Because of the importance of clock circuits, designers have been verifying clocks in detail for years. However, a manual approach was used until recently. As DSM technology became prevalent, the interconnect issues and IR-drop problems made manual clock verification obsolete. Designers began to pursue new clock topologies and power routing schemes to avoid these interconnect issues. For example, an H-tree [9] has a central buffer that drives the entire clock tree. It is usually implemented as a custom buffered tree (i.e., multiple buffers distributed throughout the tree) to distribute the clock more evenly throughout the chip. However, skew is more difficult to manage. An alternative to this is the clock grid [10], which reduces local clock skew, but is not area efficient. A third alternative is a hybrid of the H-tree and the clock grid [11], where the advantages of the two basic approaches are exploited, while minimizing the disadvantages. To mitigate potential IR-drop problems, designers use wide metal lines in the topmost metal layer to reduce the resistance. They will often add or remove straps (metal wires) to re-route current and minimize IR drop [12]. Other methods include the use of decoupling capacitors and ball-grid arrays [12]. It is apparent that the complexity of the verification increases significantly when using any of these advanced design techniques.
5 SALEH et al.: CLOCK SKEW VERIFICATION IN THE PRESENCE OF IR-DROP 639 Fig. 8. (a) (b) Scatter plot of %Delay Change versus%ir drop Change for two clock structures (a) Clock tree of Fig. 5(a) (b) Simple H-tree from Fig. 5(b). Fig. 9. Sensitivity of Delay to V Variations. The manual process is both time consuming and error prone on simple clock trees, but completely impractical on the advanced structures. In fact, the volume of data is too large, and the clock tracing is too complex for a manual approach. Furthermore, the results cannot be patched together since the IR drop must be analyzed across the whole chip. A more reliable approach is to take the final layout and extract the transistor netlist, interconnect and power grid. The next step is to automatically trace the clock from its source node to the latches in the design. Once extracted, the clock circuit can be simulated at the transistor level with its interconnect represented as reduced-order models [13]. If the supply voltage were held constant, a best-case clock skew can be determined for each receiver. During the simulation, if the tap current information of the buffers is captured for a power grid IR-drop analysis, the IR-drop analysis can be folded back into the clock simulation to obtain a worst-case clock skew. The two clock simulations place a bound on the clock skew in the presence of IR drop. This is the basic strategy used for clock verification described in the rest of this section. A. Parasitic Extraction The first problem in clock verification is to perform the parasitic extraction. While this is not the subject of this paper, it is important to describe the extraction process and its potential issues. Typically, only resistances and capacitances are extracted for interconnects. In the immediate future, inductance will have to be considered in the clock signals [14], [15] as the speeds continue to increase and as the interconnect technology moves from aluminum to copper [5]. In this paper, we consider only RC effects, although the analysis can be equally applied to interconnect with inductive effects. The emphasis in signal parasitic extraction is on the 3-D capacitance [16]. The interconnect between clock buffers is extracted by taking into account the surrounding interconnect. Both the self-capacitance and the coupling capacitances must be extracted using methods that account for 3-D effects of neighboring lines. During the clock analysis, the coupled capacitances would be grounded since the switching of the clock occurs so rapidly that other signals look like ac ground. In addition to capacitance, the resistances of each metal segment must be extracted using one dimensional and two-dimensional techniques. It is important to include via resistance in the interconnect extraction. Via capacitance is typically small and therefore negligible (although it is necessary to include their effects if there are a large number of vias along a line). The extraction of the power grid is focused more on resistance extraction since it is the primary source of voltage drop along the power grid. Resistance extraction is performed in the same manner as described above for signals. Extraction of the individual grounded capacitances associated with the power grid can be done in a specialized manner, or in the same way as the interconnect extraction. Either way, the entire full-chip power grid must be extracted for IR-drop analysis. The power grid for both and should be extracted for a complete analysis. Typically, the IR drop in a supply grid is worse than the IR-drop in the ground grid due to the presence of substrate capacitance. In this paper, only the power grid is analyzed. However, the
6 640 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 19, NO. 6, JUNE 2000 Fig. 10. Simulation of nonlinear devices with constant tap voltages. techniques presented are equally applicable to IR drop in both supply and ground grids. Finally, the actual devices, specifically the MOS transistors, must be extracted. A clock tracing algorithm is required to extract the clock circuit from the transistor netlist to improve the efficiency of the simulation. The tracing algorithm starts at the root node of the clock and searches until latches or flip-flop circuits are encountered. The transistors and interconnect along these paths are tagged as being part of the clock circuit. The loading effects of the final gates driven by the clock buffers can be modeled as constant capacitances. This assumption is made because the size of clock buffers is considerably larger than the devices in the driven gates and latches. Moreover, the linear capacitance in the wiring interconnect shields the nonlinear loading capacitances during the period of fastest signal transitions. The main issue in this extraction step is the size of the resulting data. Storage of the data must be done efficiently, typically in a binary format, and in a data file organization that allows rapid access by downstream tools. A typical power grid and clock for a million-transistor circuit consists of several million metal segments, and thousands of transistors [10]. We have performed the extraction of many designs containing over 5 million transistors in a five-metal layer process and produced approximately million resistors and 100 K 200 K transistors. Conventional circuit or timing simulation techniques are unable to handle the volume of data required for dynamic IR-drop simulation. Therefore, alternative methods must be pursued to verify clock circuits in the presence of IR drop. B. Iterative IR-drop Simulation Relaxation methods have been used in the past to decompose large problems into smaller ones before applying an iterative method to obtain a solution [17], [18]. Linear, nonlinear and waveform relaxation methods have all been successfully applied to MOS circuits. We also make use of a relaxation-based approach, except that the partitioning and iteration counts are different from the previous approaches, and specialized methods are used to solve each partition. In previous relaxation-based circuit simulators, the power supply variations were ignored and the circuit was partitioned into subcircuits composed of source-drain connected MOS transistors. These subcircuits were all nonlinear and could easily be solved using standard direct methods. The characteristics of our problem are different and so a different circuit partitioning is required. Specifically, the circuit is partitioned into a very large linear portion and a relatively small nonlinear portion. Given this partitioning, a simulator must be designed with a focus on solving an extremely large sparse linear problem that is generated by the power-grid. A second simulator is required for the nonlinear circuit to provide tap currents for nonlinear devices that are connected to the power grid. In the relaxation process, the power grid and clock are solved separately and their results are synchronized at regular intervals during dynamic simulation. First, the partitioning approach is illustrated in more detail. In Fig. 10, a two-stage inverter circuit that represents the clock is connected to a series of resistors that represent the power grid. For this example, the results of partitioning the problem into the nonlinear and linear portions are shown in Figs. 11 and 12, respectively. While solving the nonlinear circuit of Fig. 11, the tap voltages are applied at the connections of the -channel transistors. The resulting tap currents of these devices, along with their capacitive loadings, are used in the solution of the linear circuit in Fig. 12. Given this partitioning, the success of this approach relies on two factors: the ability to build specialized simulators for each partition and the ability to converge to the correct solution in a few iterations. 1) Convergence: The next issue to consider is the convergence property. Linear relaxation is the easiest to understand. To simplify the problem for illustrative purposes, one can treat the nonlinear devices of Fig. 11 as linear devices by linearizing the transistors. It is easy to prove that linear relaxation would converge to the correct solution if all nodes had grounded capacitors and the timestep was sufficiently small [18]. The speed of convergence would depend on the tightness of the coupling between the two blocks of Figs. 11 and 12. Since the source nodes of the p-channel transistors are connected to the resistive power grid, the coupling tends to be fairly tight, which leads to a large number of iterations to reach convergence. In the case of nonlinear relaxation, sometimes referred to as timepoint iteration, each iteration of the nonlinear circuit would involve linearization using Newton s method. The requirements for convergence are nearly identical to linear relaxation, with the additional requirement that the initial guess must be close enough to the final solution [18]. Since the final solution to these types of problems are typically around,itis easy to satisfy this condition: the initial guess is always. Of
7 SALEH et al.: CLOCK SKEW VERIFICATION IN THE PRESENCE OF IR-DROP 641 Fig. 11. Inverter chain with a distributed power grid. Fig. 12. Power-grid with piecewise-constant tap currents and device capacitances. course, if this guess is quite far from the final solution, convergence may be difficult to achieve. But then, the circuit is unlikely to work anyway because the power supply is not close to. So timepoint or nonlinear relaxation is a viable approach in this respect, but also suffers from the issue of tight coupling between the two partitions. Finally, if a waveform relaxation (WR) method is used to compute the solution, then, in addition to the requirements for linear relaxation, the window sizes of the iteration must be small enough to achieve convergence [18]. If the window sizes were equal to the timepoints used in nonlinear relaxation, the two algorithms would be identical. However, the use of a larger window size equal to the clock period is more advantageous. Consider the results of two successive WR iterations. In the first iteration, the clock is simulated with zero IR-drop on the power grid. This configuration is depicted in Fig. 11, where both and are set to voltage. The resulting solution of the power grid of Fig. 12 generates the worst-case voltage drops. On the second iteration, all devices in the clock of Fig. 11 encounter large IR-drop voltages at their tap points and therefore draw less current. The next iteration of the power grid of Fig. 12 generates best-case voltage drops. Additional iterations will improve the accuracy, but if the worst-case results are within the specifications of the design, further iterations will be unnecessary. In fact, rather than iterating to convergence, a weighted average of the first two iterations produces a relatively accurate result. From the standpoint of verification, the two-step WR-based method is the most appropriate approach, and avoids any convergence issues. 2) Solution to Linear and Nonlinear Partitions: The partitioning and analysis approach described above has several advantages. By keeping the linear and nonlinear solvers separate, one can design two separate tools and optimize their individual performances. The speed of simulation is greatly increased, as the synchronization between the tools takes place at the end of each simulation period. The worst-case IR-drop obtained after the first iteration provides a good measure of the quality of the power grid and the clock design. The only issue that remains is the solution of each partition. For the nonlinear problem represented by the clock circuit, there are a variety of published approaches that could be used such as EMU [19], SPECS2 [20], isplice3 [21], etc. These methods all use a form of timing simulation and are therefore times faster than SPICE. Furthermore, they can all handle the capacity of a typical clock circuit. For interconnect associated with clock signals, a reduction method [13] is needed to generate a lower order model for simulation. If a clock grid structure is used a reduction method and a direct solver must be embedded in the timing simulator because the clock grids tend to generate very large and sparse matrices. One important requirement for the simulation is that it must be accurate to within a few percent of SPICE to produce meaningful results for clock skew. The linear problem generated by the power grid can be solved in a number of ways. Typically, the matrix is symmetric and positive definite [22], and is extremely large and sparse. Therefore, the choices for the solution can be linear relaxation, a pre-conditioned conjugate gradient, or a direct method based on a Cholesky decomposition [22]. While any one of these methods is applicable, large problems dictate the use of a conjugate gradient method with incomplete Cholesky preconditioning, while smaller problems with low fillins can be solved by an efficient implementation of a Cholesky decomposition [7]. These methods are typically times faster than SPICE for large problems. In fact, SPICE cannot even read in the data associated with the power grids of chips containing a million transistors, let alone solve the problem. IV. RESULTS We now present results using the two-step iterative technique described in the previous section. The first example circuit was shown in Fig. 5(a), a simple clock tree connected to a power grid. In Fig. 13, the IR-drop computed using waveform iterations is compared with the exact solution using SPICE simulation. The first iteration results in an overestimate of the IR drop, whereas the second iteration underestimates the value. Further iterations converge toward the exact solution which is indicated by the
8 642 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 19, NO. 6, JUNE 2000 Fig. 13. Convergence of IR-drop waveform iterations compared to SPICE simulation. solid line. In practice, the iterations converge if the IR-drop is within few percent of the supply voltage. However, this technique can become unstable if the IR-drop is large. This is most likely due to the tight coupling and the fact that the window size is relatively large (we set the window size equal to one clock cycle). Note that the first two iterations bound the exact solution. Instead of continuing with iterations, the final value of the IR-drop can be estimated by taking a weighted average of the first two iterations. This approach produces surprisingly accurate results, as shown in Fig. 13, and once the IR drop is accurate, the skew can be computed accurately. As described earlier, for verification purposes, the best approach is to use a two-step iteration followed by a weighted average of the results. Typical weighting factors are 10% of the first iteration and 90% of the second iteration, but these are adjustable parameters. The second example circuit is a Sparc Core containing 100 K transistors and a powergrid with 190 K resistors. This circuit is used to illustrate the convergence behavior of clock-powergrid iterations. Table I contains the results of the iterations at the output of the fifth stage of the clock tree. It provides the maximum delay and skew values along with the differences between two successive iterations. In this case, convergence occurs when the delta delay and delta skew values are less than a picosecond. The results demonstrate that the first two iterations tend to bound the correct solution. The subsequent iterations converge quickly to the final solution. In this case, we see that the actual solution is closer to the second iteration but it lies between the two iterations. A third example circuit that contained transistors and RC components in a clock grid was simulated and compared to SPICE. The power grid contained resisive elements. The run time for the clock simulation was 1600 s in our nonlinear simulation tool and over 100 h with a commercial SPICE tool. The power grid simulation required less than one 1 h in our linear simulation tool but SPICE could not read the TABLE I RESULTS FOR STAGE 5RISING CLOCK EDGE entire power grid into memory so a runtime comparison could not be performed. The fourth and final example shown in Fig. 14 illustrates the waveform results for a large chip using the two-step simulation technique. The clock network of this industrial example consists of transistors. The power grid itself is modeled by a network of resistors and capacitors. SPICE was unable to complete the simulation of the power grid or the clock circuit. Our approach required approximately 55 min for the two-step simulations. There are three sets of waveforms in this plot of voltage versus time. Each set shows the actual waveforms from different settings of the IR drop. The skew for each set of waveforms is defined as the width of the waveform envelope measured at the 50% transition point, as illustrated for the waveforms labeled (b) in Fig. 14. The delay here is defined as the time required to reach the midpoint of the skew value, as illustrated in Fig. 14(b). The fastest waveforms, labeled (a), are the results of the simulation without IR drop, which is the same as the first iteration of our method. The results of a second iteration followed by averaging is labeled (b). Note that both the delay and skew of (b) increase relative to (a) because the proper IR drop has been taken into account. In this case, there is a 30% change in the
9 SALEH et al.: CLOCK SKEW VERIFICATION IN THE PRESENCE OF IR-DROP 643 Fig. 14. Simulation results for large chip (a) without IR-drop, (b) dynamic IR-drop, (c) fixed IR-drop. skew value and a 0.2% change in the delay value. Clearly, an error of 30% in the skew calculation cannot be tolerated by the designer. The curves denoted by 14(c) deserve a separate discussion. Often, a designer uses a single worst-case IR drop for all buffers during a SPICE simulation to try to bound the total delay and skew. Our contention is that the results of this approach can be very misleading. For illustrative purposes, we used a single worst case IR-drop value on all buffers and re-ran the transient simulation as shown in Fig. 14(c). The skew value is almost the same as the wrong result of 14(a) and the delay value increased by about 0.5%. While the approach does provide an upper bound on the delay, it masks the 30% change in the skew value which is the more critical metric. Therefore, using a single worst-case static IR drop is misleading and should be avoided. V. CONCLUSION Verification of clocks in the presence of full-chip IR drop is a difficult problem due to conflicting requirements of speed, accuracy and problem size. High-performance clock signals require the highest accuracy because their skews are in the range of picoseconds. However, circuit simulators do not have the speed and memory capacity to handle full-chip power-grid simulation. In this paper, it is demonstrated that the effect of IR drop cannot be ignored in deep submicrometer design. For example, a 10% drop in the supply voltage can lead to a 5% 10% change in the clock timing, and it is only getting worse as the supply voltage is scaled. It has a more pronounced effect on the clock skew, as illustrated by an industrial example which exhibited a 30% change in the skew due to IR drop. A new two-step waveform-based algorithm with averaging was shown to be an effective technique for delay and skew verification of a clock design in the presence of IR drop. Furthermore, the use of a single worst case IR drop for clock skew should be avoided at all costs as it can produce misleading skew information. ACKNOWLEDGMENT The authors would like to thank D. Divekar for assistance with the simulation results in this paper. They would also like to thank the reviewers for providing excellent feedback to improve the overall quality of the paper. REFERENCES [1] E. G. Friedman, Ed., Clock Distribution Networks in VLSI Circuits and Systems. Piscataway, NJ: IEEE Press, [2] A. Kahng and C. W. Tsao, Planar-DME: A single-layer zero-skew clock tree router, IEEE Trans. Computer-Aided Design, vol. 15, Jan [3] G. Tellez and M. Sarrafzadeh, Minimal buffer insertion in clock trees with skew and slew rate contraints, IEEE Trans. Computer-Aided Design, vol. 16, Apr [4] M. K. Gowan, L. L. Biro, and D. B. Jackson, Power considerations in the design of the alpha microprocessor, in Proc. Design Automation Conf., June 1998, pp [5] Technology Roadmap for Semiconductors, Semiconductor Industry Association, San Jose, CA, [6] G. Steele, D. Overhauser, S. Rochel, and S. Z. Hussain, Full-chip verification methods for DSM power distribution systems, in Proc. Design Automation Conf., June 1998, pp [7] A. Dharchoudhury, R. Panda, D. Blaauw, and R. Vaidyanathan, Design and analysis of power distribution networks in PowerPC microprocessors, in Proc. Design Automation Conf., June 1998, pp [8] K. Toh, P. Ko, and R. Meyer, An empirical model for short-channel MOS devices, IEEE J. Solid-State Circuits, vol. 23, pp , Aug [9] A. Vittal and M. Marek-Sadowska, Low-power buffered clocktree design, IEEE Trans. Computer-Aided Design, vol. 16, pp , Sept [10] M. P. Desai, R. Cvijetic, and J. Jensen, Sizing of clock distribution networks for high performance CPU chips, in Proc. 33rd ACM/IEEE Design Automation Conf., June 1996, pp [11] K. M. Carrig, N. T. Gargiulo, R. P. Gregor, D. R. Menard, and H. E. Reindel, A new direction in ASIC high-performance clock methodology, in Proc. Custom Integrated Circuits Conf., May 1998, pp [12] R. Saleh, M. Benoit, and P. McCrorie, Power distribution planning, in Proc. Design, Automation and Test in Europe Conf., Paris, France, Feb. 1998, pp [13] A. Odabasioglu, M. Celik, and L. T. Pileggi, PRIMA: Passive reducedorder interconnect macromodeling algorithm, IEEE Trans. Computer- Aided Design, vol. 17, pp , Aug [14] P. Restle, A. Ruehli, and S. Walker, Dealing with Inductance in highspeed chip design, in Proc. Design Automation Conf., June 1999, pp
10 644 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 19, NO. 6, JUNE 2000 [15] Y. Massoud, S. Majors, T. Bustami, and J. White, Layout techniques for minimizing on-chip interconnect self-inductance, in Proc. Design Automation Conf., June 1998, pp [16] N. Arora, K. Raol, R. Schumann, and L. Richardson, Modeling and extraction of interconnect capacitances for multilayer VLSI circuits, IEEE Trans. Computer-Aided Design, vol. 15, pp , Jan [17] R. Saleh and A. R. Newton, The exploitation of latency and multirate behavior using nonlinear relaxation for circuit simulation, IEEE Trans. Computer-Aided Design, Dec [18] J. White and A. Sangiovanni-Vincentelli, Relaxation-Based Techniques for the Simulation of MOS VLSI Circuits. Norwell, MA: Kluwer Academic, [19] B. Ackland and N. Weste, Functional verification in an interactive symbolic IC design environment, in Proc. 2nd Caltech Conf. VLSI, Jan. 1981, pp [20] C. Visveswariah and R. Rohrer, SPECS2: An integrated circuit timing simulatorw, in Proc. ICCAD, Nov. 1987, pp [21] R. A. Saleh, S.-J. Jou, and A. R. Newton, Mixed-Mode Simulation and Analog Multilevel Simulation. Norwell, MA: Kluwer Academic, [22] G. Golub and C. Van Loan, Matrix Computations. Baltimore, MD: Johns Hopkins Univ. Press, Resve Saleh (S 78 M 79) received the B.S. degree in electrical engineering from Carleton University, Ottawa, ON, Canada, and the M.S. and Ph.D. degrees in electrical engineering and computer science from the University of California, Berkeley. In 1995, he co-founded Simplex Solutions, Inc., Sunnyvale, CA, to address deep submicrometer-integrated circuit verification and currently serves as Chairman. Prior to starting Simplex, he spent ten years as a Professor in the Department of Electrical and Computer Engineering and the Department of Computer Science at the University of Illinois, Urbana. He has also worked for Toshiba Corporation in Japan, Tektronix in Beaverton, OR, and Mitel Corporation in Ottawa, Canada. He began his career as a Software Engineer with Bell-Northern Research in Ottawa, Canada. He was granted a patent for nonlinear frequency domain analysis in He has written two books on analog and mixed-mode simulation and published over 50 conference papers and journal articles. Dr. Saleh has served as technical program chair, conference chair, and general chair for the Custom Integrated Circuits Conference in 1993, 1994, and 1995, respectively. He has served on the technical program committees of the Design Automation Conference, International Conference on Computer Design, and, most recently, the International Symposium on Quality in Electronic Design. From , he held the position of chairman of the IEEE Standards Coordinating Committee 30 on Analog Hardware Description Languages (AHDL).He also received an inventor recognition award from the Semiconductor Research Corporation in He is currently an Associate Editor for the IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS. He received a Presidential Young Investigator Award in 1990 from the National Science Foundation. Syed Zakir Hussain (M 97) received the B.Sc. degree with honors in mechanical engineering from North Western Frontier Province (NWFP) University of Engineering and Technology, Peshawar, Pakistan, in He received the M.S. degree in mechanical engineering in 1992 and the Ph.D. degree in electrical engineering in 1995, from Duke University, Durham, NC. From 1987 to 1988, he worked as a design engineer at National Engineering Services of Pakistan (NESPAK). He is currently a Staff Engineer with Simplex Solutions, Inc., San Jose, CA. His research interests include design and development of simulation tools in the area of VLSI design verification. Steffen Rochel (M 95) received the Ph.D. degree in electrical engineering from the University of Technology at Ilmenau, Ilmenau, Germany, in In 1992, he joined Anacad in Germany, where he was engaged in research and development of analog and mixed-signal behavioral modeling and simulation techniques. Since 1996, he has been with Simplex Solutions, Inc., San Jose, CA, working on the interconnect verification and analysis products. His research interests include circuit simulation, reliability assessment, and timing and power analysis. He has served on the Custom Integrated Circuits Committee since David Overhauser (S 86 M 89) received the B.S. degree in math and computer science from Purdue University, West Lafayette, IN, in 1983 and the M.S. and Ph.D. degrees in electrical engineering from the University of Illinois, Urbana, in 1985 and 1989, respectively. From 1989 to 1995, he was an Assistant Professor in Electrical and Computer Engineering at Duke University, Durham, NC, where he founded the Design Automation Technology Center. He has published two books and over 20 technical papers in the area of timing simulation, macromodeling, mixed-signal simulation, and design verification. He is a founder and is currently Vice President at Simplex Solutions, Inc., San Jose, CA. Dr. Overhauser has served on the technical program committees of the Custom Integrated Circuits Conference and the International Symposium for Quality Electronic Design.
Analysis and Reduction of On-Chip Inductance Effects in Power Supply Grids
Analysis and Reduction of On-Chip Inductance Effects in Power Supply Grids Woo Hyung Lee Sanjay Pant David Blaauw Department of Electrical Engineering and Computer Science {leewh, spant, blaauw}@umich.edu
More informationModeling the Effect of Wire Resistance in Deep Submicron Coupled Interconnects for Accurate Crosstalk Based Net Sorting
Modeling the Effect of Wire Resistance in Deep Submicron Coupled Interconnects for Accurate Crosstalk Based Net Sorting C. Guardiani, C. Forzan, B. Franzini, D. Pandini Adanced Research, Central R&D, DAIS,
More informationFast Placement Optimization of Power Supply Pads
Fast Placement Optimization of Power Supply Pads Yu Zhong Martin D. F. Wong Dept. of Electrical and Computer Engineering Dept. of Electrical and Computer Engineering Univ. of Illinois at Urbana-Champaign
More informationImpact of Low-Impedance Substrate on Power Supply Integrity
Impact of Low-Impedance Substrate on Power Supply Integrity Rajendran Panda and Savithri Sundareswaran Motorola, Austin David Blaauw University of Michigan, Ann Arbor Editor s note: Although it is tempting
More informationAS very large-scale integration (VLSI) circuits continue to
IEEE TRANSACTIONS ON ELECTRON DEVICES, VOL. 49, NO. 11, NOVEMBER 2002 2001 A Power-Optimal Repeater Insertion Methodology for Global Interconnects in Nanometer Designs Kaustav Banerjee, Member, IEEE, Amit
More informationInductance 101: Analysis and Design Issues
Inductance 101: Analysis and Design Issues Kaushik Gala, David Blaauw, Junfeng Wang, Vladimir Zolotov, Min Zhao Motorola Inc., Austin TX 78729 kaushik.gala@motorola.com Abstract With operating frequencies
More informationAppendix. RF Transient Simulator. Page 1
Appendix RF Transient Simulator Page 1 RF Transient/Convolution Simulation This simulator can be used to solve problems associated with circuit simulation, when the signal and waveforms involved are modulated
More informationA SIGNAL DRIVEN LARGE MOS-CAPACITOR CIRCUIT SIMULATOR
A SIGNAL DRIVEN LARGE MOS-CAPACITOR CIRCUIT SIMULATOR Janusz A. Starzyk and Ying-Wei Jan Electrical Engineering and Computer Science, Ohio University, Athens Ohio, 45701 A designated contact person Prof.
More informationLecture 11: Clocking
High Speed CMOS VLSI Design Lecture 11: Clocking (c) 1997 David Harris 1.0 Introduction We have seen that generating and distributing clocks with little skew is essential to high speed circuit design.
More informationPower Distribution Paths in 3-D ICs
Power Distribution Paths in 3-D ICs Vasilis F. Pavlidis Giovanni De Micheli LSI-EPFL 1015-Lausanne, Switzerland {vasileios.pavlidis, giovanni.demicheli}@epfl.ch ABSTRACT Distributing power and ground to
More informationEffects of Inductance on the Propagation Delay and Repeater Insertion in VLSI Circuits
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 8, NO. 2, APRIL 2000 195 Effects of Inductance on the Propagation Delay Repeater Insertion in VLSI Circuits Yehea I. Ismail Eby G.
More informationRECENT technology trends have lead to an increase in
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 39, NO. 9, SEPTEMBER 2004 1581 Noise Analysis Methodology for Partially Depleted SOI Circuits Mini Nanua and David Blaauw Abstract In partially depleted silicon-on-insulator
More informationFDTD SPICE Analysis of High-Speed Cells in Silicon Integrated Circuits
FDTD Analysis of High-Speed Cells in Silicon Integrated Circuits Neven Orhanovic and Norio Matsui Applied Simulation Technology Gateway Place, Suite 8 San Jose, CA 9 {neven, matsui}@apsimtech.com Abstract
More informationDesign and Analysis of Power Distribution Networks in PowerPC Microprocessors
Design and Analysis of Power Distribution Networks in PowerPC Microprocessors Abhijit Dharchoudhury, Rajendran Panda, David Blaauw, Ravi Vaidyanathan Advanced Tools Group, Advanced System Technologies
More informationCROSS-COUPLING capacitance and inductance have. Performance Optimization of Critical Nets Through Active Shielding
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, VOL. 51, NO. 12, DECEMBER 2004 2417 Performance Optimization of Critical Nets Through Active Shielding Himanshu Kaul, Student Member, IEEE,
More informationIT has been extensively pointed out that with shrinking
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 18, NO. 5, MAY 1999 557 A Modeling Technique for CMOS Gates Alexander Chatzigeorgiou, Student Member, IEEE, Spiridon
More informationAccurate In Situ Measurement of Peak Noise and Delay Change Induced by Interconnect Coupling
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 36, NO. 10, OCTOBER 2001 1587 Accurate In Situ Measurement of Peak Noise and Delay Change Induced by Interconnect Coupling Takashi Sato, Member, IEEE, Dennis
More informationUNIT-III POWER ESTIMATION AND ANALYSIS
UNIT-III POWER ESTIMATION AND ANALYSIS In VLSI design implementation simulation software operating at various levels of design abstraction. In general simulation at a lower-level design abstraction offers
More informationDAT175: Topics in Electronic System Design
DAT175: Topics in Electronic System Design Analog Readout Circuitry for Hearing Aid in STM90nm 21 February 2010 Remzi Yagiz Mungan v1.10 1. Introduction In this project, the aim is to design an adjustable
More informationWorst Case RLC Noise with Timing Window Constraints
Worst Case RLC Noise with Timing Window Constraints Jun Chen Electrical Engineering Department University of California, Los Angeles jchen@ee.ucla.edu Lei He Electrical Engineering Department University
More informationStatic Timing Analysis Taking Crosstalk into Account 1
Static Timing Analysis Taking Crosstalk into Account 1 Matthias Ringe IBM Deutschland Entwicklung GmbH, Schönaicher Str. 220 71032 Böblingen; Germany ringe@de.ibm.com Thomas Lindenkreuz Robert Bosch GmbH,
More informationLecture #2 Solving the Interconnect Problems in VLSI
Lecture #2 Solving the Interconnect Problems in VLSI C.P. Ravikumar IIT Madras - C.P. Ravikumar 1 Interconnect Problems Interconnect delay has become more important than gate delays after 130nm technology
More informationFull-Chip Verification of UDSM Designs
R. Saleh Simplex Solutions 521 Almanor Avenue Sunnyvale, CA 94086 (408) 617-6100 res@simplex.com Full-Chip Verification of UDSM Designs D. Overhauser Simplex Solutions 521 Almanor Avenue Sunnyvale, CA
More informationChapter 4. Problems. 1 Chapter 4 Problem Set
1 Chapter 4 Problem Set Chapter 4 Problems 1. [M, None, 4.x] Figure 0.1 shows a clock-distribution network. Each segment of the clock network (between the nodes) is 5 mm long, 3 µm wide, and is implemented
More informationOn Chip Active Decoupling Capacitors for Supply Noise Reduction for Power Gating and Dynamic Dual Vdd Circuits in Digital VLSI
ELEN 689 606 Techniques for Layout Synthesis and Simulation in EDA Project Report On Chip Active Decoupling Capacitors for Supply Noise Reduction for Power Gating and Dynamic Dual Vdd Circuits in Digital
More informationClocktree RLC Extraction with Efficient Inductance Modeling
Clocktree RLC Extraction with Efficient Inductance Modeling Norman Chang, Shen Lin, Lei He*, O. Sam Nakagawa, and Weize Xie Hewlett-Packard Laboratories, Palo Alto, CA, USA *University of Wisconsin, Madison,
More informationOn-Chip Inductance Modeling
On-Chip Inductance Modeling David Blaauw Kaushik Gala ladimir Zolotov Rajendran Panda Junfeng Wang Motorola Inc., Austin TX 78729 ABSTRACT With operating frequencies approaching the gigahertz range, inductance
More informationMicrocircuit Electrical Issues
Microcircuit Electrical Issues Distortion The frequency at which transmitted power has dropped to 50 percent of the injected power is called the "3 db" point and is used to define the bandwidth of the
More informationActive Decap Design Considerations for Optimal Supply Noise Reduction
Active Decap Design Considerations for Optimal Supply Noise Reduction Xiongfei Meng and Resve Saleh Dept. of ECE, University of British Columbia, 356 Main Mall, Vancouver, BC, V6T Z4, Canada E-mail: {xmeng,
More informationFigure 1. Inductance
Tools for On-Chip Interconnect Inductance Extraction Jerry Tallinger OEA International Inc. 155 East Main Ave., Ste. 110 Morgan Hill, CA 95037 jerry@oea.com Haris Basit OEA International Inc. 155 East
More informationInstantaneous Loop. Ideal Phase Locked Loop. Gain ICs
Instantaneous Loop Ideal Phase Locked Loop Gain ICs PHASE COORDINATING An exciting breakthrough in phase tracking, phase coordinating, has been developed by Instantaneous Technologies. Instantaneous Technologies
More information43.2. Figure 1. Interconnect analysis using linear simulation and superposition
43.2 Driver Modeling and Alignment for Worst-Case Delay Noise Supamas Sirichotiyakul, David Blaauw, Chanhee Oh, Rafi Levy*, Vladimir Zolotov, Jingyan Zuo Motorola Inc. Austin, TX, *Motorola Semiconductor
More informationWITH increasing clock frequency and decreasing supply
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 16, NO. 11, NOVEMBER 2008 1581 Layout of Decoupling Capacitors in IP Blocks for 90-nm CMOS Xiongfei Meng, Student Member, IEEE, Resve
More informationCS 6135 VLSI Physical Design Automation Fall 2003
CS 6135 VLSI Physical Design Automation Fall 2003 1 Course Information Class time: R789 Location: EECS 224 Instructor: Ting-Chi Wang ( ) EECS 643, (03) 5742963 tcwang@cs.nthu.edu.tw Office hours: M56R5
More informationFEASIBILITY OF OPTICAL CLOCK DISTRIBUTION FOR FUTURE CMOS TECHNOLOGY NODES
6 Vol.11(1) March 1 FEASIBILITY OF OPTICAL CLOCK DISTRIBUTION FOR FUTURE CMOS TECHNOLOGY NODES P.J. Venter 1 and M. du Plessis 1 and Carl and Emily Fuchs Institute for Microelectronics, Dept. of Electrical,
More informationPreface to Third Edition Deep Submicron Digital IC Design p. 1 Introduction p. 1 Brief History of IC Industry p. 3 Review of Digital Logic Gate
Preface to Third Edition p. xiii Deep Submicron Digital IC Design p. 1 Introduction p. 1 Brief History of IC Industry p. 3 Review of Digital Logic Gate Design p. 6 Basic Logic Functions p. 6 Implementation
More informationIT HAS become well accepted that interconnect delay
442 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 7, NO. 4, DECEMBER 1999 Figures of Merit to Characterize the Importance of On-Chip Inductance Yehea I. Ismail, Eby G. Friedman,
More informationStability and Dynamic Performance of Current-Sharing Control for Paralleled Voltage Regulator Modules
172 IEEE TRANSACTIONS ON POWER ELECTRONICS, VOL. 17, NO. 2, MARCH 2002 Stability Dynamic Performance of Current-Sharing Control for Paralleled Voltage Regulator Modules Yuri Panov Milan M. Jovanović, Fellow,
More informationOn the Interaction of Power Distribution Network with Substrate
On the Interaction of Power Distribution Network with Rajendran Panda, Savithri Sundareswaran, David Blaauw Rajendran.Panda@motorola.com, Savithri_Sundareswaran-A12801@email.mot.com, David.Blaauw@motorola.com
More informationChapter 3 Novel Digital-to-Analog Converter with Gamma Correction for On-Panel Data Driver
Chapter 3 Novel Digital-to-Analog Converter with Gamma Correction for On-Panel Data Driver 3.1 INTRODUCTION As last chapter description, we know that there is a nonlinearity relationship between luminance
More informationDesign and implementation of LDPC decoder using time domain-ams processing
2015; 1(7): 271-276 ISSN Print: 2394-7500 ISSN Online: 2394-5869 Impact Factor: 5.2 IJAR 2015; 1(7): 271-276 www.allresearchjournal.com Received: 31-04-2015 Accepted: 01-06-2015 Shirisha S M Tech VLSI
More informationCHAPTER 6 DIGITAL CIRCUIT DESIGN USING SINGLE ELECTRON TRANSISTOR LOGIC
94 CHAPTER 6 DIGITAL CIRCUIT DESIGN USING SINGLE ELECTRON TRANSISTOR LOGIC 6.1 INTRODUCTION The semiconductor digital circuits began with the Resistor Diode Logic (RDL) which was smaller in size, faster
More informationOn-Chip Inductance Modeling and Analysis
On-Chip Inductance Modeling and Analysis Kaushik Gala, ladimir Zolotov, Rajendran Panda, Brian Young, Junfeng Wang, David Blaauw Motorola Inc., Austin TX 78729 kaushik.gala@motorola.com Abstract With operating
More informationAccurate and Efficient Macromodel of Submicron Digital Standard Cells
Accurate and Efficient Macromodel of Submicron Digital Standard Cells Cristiano Forzan, Bruno Franzini and Carlo Guardiani SGS-THOMSON Microelectronics, via C. Olivetti, 2, 241 Agrate Brianza (MI), ITALY
More informationPROCESS-VOLTAGE-TEMPERATURE (PVT) VARIATIONS AND STATIC TIMING ANALYSIS
PROCESS-VOLTAGE-TEMPERATURE (PVT) VARIATIONS AND STATIC TIMING ANALYSIS The major design challenges of ASIC design consist of microscopic issues and macroscopic issues [1]. The microscopic issues are ultra-high
More informationA Comparative Study of Π and Split R-Π Model for the CMOS Driver Receiver Pair for Low Energy On-Chip Interconnects
International Journal of Scientific and Research Publications, Volume 3, Issue 9, September 2013 1 A Comparative Study of Π and Split R-Π Model for the CMOS Driver Receiver Pair for Low Energy On-Chip
More informationTHE TREND toward implementing systems with low
724 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 30, NO. 7, JULY 1995 Design of a 100-MHz 10-mW 3-V Sample-and-Hold Amplifier in Digital Bipolar Technology Behzad Razavi, Member, IEEE Abstract This paper
More informationOptimization of Power Dissipation and Skew Sensitivity in Clock Buffer Synthesis
Optimization of Power Dissipation and Skew Sensitivity in Clock Buffer Synthesis Jae W. Chung, De-Yu Kao, Chung-Kuan Cheng, and Ting-Ting Lin Department of Computer Science and Engineering Mail Code 0114
More informationLogic Families. Describes Process used to implement devices Input and output structure of the device. Four general categories.
Logic Families Characterizing Digital ICs Digital ICs characterized several ways Circuit Complexity Gives measure of number of transistors or gates Within single package Four general categories SSI - Small
More informationTiming Analysis of Discontinuous RC Interconnect Lines
8 TAEHOON KIM et al : TIMING ANALYSIS OF DISCONTINUOUS RC INTERCONNECT LINES Timing Analysis of Discontinuous RC Interconnect Lines Taehoon Kim, Youngdoo Song, and Yungseon Eo Abstract In this paper, discontinuous
More informationA Survey of the Low Power Design Techniques at the Circuit Level
A Survey of the Low Power Design Techniques at the Circuit Level Hari Krishna B Assistant Professor, Department of Electronics and Communication Engineering, Vagdevi Engineering College, Warangal, India
More informationTHE GROWTH of the portable electronics industry has
IEEE POWER ELECTRONICS LETTERS 1 A Constant-Frequency Method for Improving Light-Load Efficiency in Synchronous Buck Converters Michael D. Mulligan, Bill Broach, and Thomas H. Lee Abstract The low-voltage
More informationOutput Waveform Evaluation of Basic Pass Transistor Structure*
Output Waveform Evaluation of Basic Pass Transistor Structure* S. Nikolaidis, H. Pournara, and A. Chatzigeorgiou Department of Physics, Aristotle University of Thessaloniki Department of Applied Informatics,
More informationTECHNOLOGY scaling, aided by innovative circuit techniques,
122 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 14, NO. 2, FEBRUARY 2006 Energy Optimization of Pipelined Digital Systems Using Circuit Sizing and Supply Scaling Hoang Q. Dao,
More informationSubstrate Coupling in RF Analog/Mixed Signal IC Design: A Review
Substrate Coupling in RF Analog/Mixed Signal IC Design: A Review Ashish C Vora, Graduate Student, Rochester Institute of Technology, Rochester, NY, USA. Abstract : Digital switching noise coupled into
More informationVLSI Timing Simulation with Selective Dynamic Regionization
VLSI Timing Simulation with Selective Dynamic ization Meng-Lin Yu Bryan D. Ackland AT&T Bell Laboratories Holmdel, NJ 07733 Abstract Accurate timing simulations are crucial to the design of MOS VLSI circuits,
More informationDriver Modeling and Alignment for Worst-Case Delay Noise
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 11, NO. 2, APRIL 2003 157 Driver Modeling and Alignment for Worst-Case Delay Noise David Blaauw, Member, IEEE, Supamas Sirichotiyakul,
More informationA Novel Low-Power Scan Design Technique Using Supply Gating
A Novel Low-Power Scan Design Technique Using Supply Gating S. Bhunia, H. Mahmoodi, S. Mukhopadhyay, D. Ghosh, and K. Roy School of Electrical and Computer Engineering, Purdue University, West Lafayette,
More informationA New Model for Thermal Channel Noise of Deep-Submicron MOSFETS and its Application in RF-CMOS Design
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 36, NO. 5, MAY 2001 831 A New Model for Thermal Channel Noise of Deep-Submicron MOSFETS and its Application in RF-CMOS Design Gerhard Knoblinger, Member, IEEE,
More informationTFA: A Threshold-Based Filtering Algorithm for Propagation Delay and Output Slew Calculation of High-Speed VLSI Interconnects
TFA: A Threshold-Based Filtering Algorithm for Propagation Delay and Output Slew Calculation of High-Speed VLSI Interconnects S. Abbaspour, A.H. Ajami *, M. Pedram, and E. Tuncer * Dept. of EE Systems,
More informationNoise Aware Decoupling Capacitors for Multi-Voltage Power Distribution Systems
Noise Aware Decoupling Capacitors for Multi-Voltage Power Distribution Systems Mikhail Popovich and Eby G. Friedman Department of Electrical and Computer Engineering University of Rochester, Rochester,
More informationNovel Low-Overhead Operand Isolation Techniques for Low-Power Datapath Synthesis
Novel Low-Overhead Operand Isolation Techniques for Low-Power Datapath Synthesis N. Banerjee, A. Raychowdhury, S. Bhunia, H. Mahmoodi, and K. Roy School of Electrical and Computer Engineering, Purdue University,
More informationDesign Considerations for CMOS Digital Circuits with Improved Hot-Carrier Reliability
1014 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 31, NO. 7, JULY 1996 Design Considerations for CMOS Digital Circuits with Improved Hot-Carrier Reliability Yusuf Leblebici, Member, IEEE Abstract The hot-carrier
More informationWITH the rapid evolution of liquid crystal display (LCD)
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 43, NO. 2, FEBRUARY 2008 371 A 10-Bit LCD Column Driver With Piecewise Linear Digital-to-Analog Converters Chih-Wen Lu, Member, IEEE, and Lung-Chien Huang Abstract
More informationAS THE semiconductor process is scaled down, the thickness
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 52, NO. 7, JULY 2005 361 A New Schmitt Trigger Circuit in a 0.13-m 1/2.5-V CMOS Process to Receive 3.3-V Input Signals Shih-Lun Chen,
More informationEE434 ASIC & Digital Systems. Partha Pande School of EECS Washington State University
EE434 ASIC & Digital Systems Partha Pande School of EECS Washington State University pande@eecs.wsu.edu Lecture 11 Physical Design Issues Interconnect Scaling Effects Dense multilayer metal increases coupling
More informationTesting Power Sources for Stability
Keywords Venable, frequency response analyzer, oscillator, power source, stability testing, feedback loop, error amplifier compensation, impedance, output voltage, transfer function, gain crossover, bode
More informationSeparation and Extraction of Short-Circuit Power Consumption in Digital CMOS VLSI Circuits
Separation and Extraction of Short-Circuit Power Consumption in Digital CMOS VLSI Circuits Atila Alvandpour, Per Larsson-Edefors, and Christer Svensson Div of Electronic Devices, Dept of Physics, Linköping
More informationTHE increased complexity of analog and mixed-signal IC s
134 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 34, NO. 2, FEBRUARY 1999 An Integrated Low-Voltage Class AB CMOS OTA Ramesh Harjani, Member, IEEE, Randy Heineke, Member, IEEE, and Feng Wang, Member, IEEE
More informationEngineering the Power Delivery Network
C HAPTER 1 Engineering the Power Delivery Network 1.1 What Is the Power Delivery Network (PDN) and Why Should I Care? The power delivery network consists of all the interconnects in the power supply path
More informationICCAD 2014 Contest Incremental Timing-driven Placement: Timing Modeling and File Formats v1.1 April 14 th, 2014
ICCAD 2014 Contest Incremental Timing-driven Placement: Timing Modeling and File Formats v1.1 April 14 th, 2014 http://cad contest.ee.ncu.edu.tw/cad-contest-at-iccad2014/problem b/ 1 Introduction This
More informationPROCESS and environment parameter variations in scaled
1078 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 53, NO. 10, OCTOBER 2006 Reversed Temperature-Dependent Propagation Delay Characteristics in Nanometer CMOS Circuits Ranjith Kumar
More informationLayout-Aware Pattern Generation for Maximizing Supply Noise Effects on Critical Paths
Layout-Aware Pattern Generation for Maximizing Supply Noise Effects on Critical Paths Junxia Ma, Jeremy Lee and Mohammad Tehranipoor ECE Department, University of Connecticut, CT, 06269 {junxia, jslee,
More informationA Multiplexer-Based Digital Passive Linear Counter (PLINCO)
A Multiplexer-Based Digital Passive Linear Counter (PLINCO) Skyler Weaver, Benjamin Hershberg, Pavan Kumar Hanumolu, and Un-Ku Moon School of EECS, Oregon State University, 48 Kelley Engineering Center,
More informationEECS150 - Digital Design Lecture 19 CMOS Implementation Technologies. Recap and Outline
EECS150 - Digital Design Lecture 19 CMOS Implementation Technologies Oct. 31, 2013 Prof. Ronald Fearing Electrical Engineering and Computer Sciences University of California, Berkeley (slides courtesy
More informationECEN 720 High-Speed Links: Circuits and Systems
1 ECEN 720 High-Speed Links: Circuits and Systems Lab4 Receiver Circuits Objective To learn fundamentals of receiver circuits. Introduction Receivers are used to recover the data stream transmitted by
More informationUNIT-II LOW POWER VLSI DESIGN APPROACHES
UNIT-II LOW POWER VLSI DESIGN APPROACHES Low power Design through Voltage Scaling: The switching power dissipation in CMOS digital integrated circuits is a strong function of the power supply voltage.
More informationLow-Power VLSI. Seong-Ook Jung VLSI SYSTEM LAB, YONSEI University School of Electrical & Electronic Engineering
Low-Power VLSI Seong-Ook Jung 2013. 5. 27. sjung@yonsei.ac.kr VLSI SYSTEM LAB, YONSEI University School of Electrical & Electronic Engineering Contents 1. Introduction 2. Power classification & Power performance
More informationDIGITAL INTEGRATED CIRCUITS A DESIGN PERSPECTIVE 2 N D E D I T I O N
DIGITAL INTEGRATED CIRCUITS A DESIGN PERSPECTIVE 2 N D E D I T I O N Jan M. Rabaey, Anantha Chandrakasan, and Borivoje Nikolic CONTENTS PART I: THE FABRICS Chapter 1: Introduction (32 pages) 1.1 A Historical
More informationCS250 VLSI Systems Design. Lecture 3: Physical Realities: Beneath the Digital Abstraction, Part 1: Timing
CS250 VLSI Systems Design Lecture 3: Physical Realities: Beneath the Digital Abstraction, Part 1: Timing Fall 2010 Krste Asanovic, John Wawrzynek with John Lazzaro and Yunsup Lee (TA) What do Computer
More informationArea and Energy-Efficient Crosstalk Avoidance Codes for On-Chip Buses
Area and Energy-Efficient Crosstalk Avoidance Codes for On-Chip Buses Srinivasa R. Sridhara, Arshad Ahmed, and Naresh R. Shanbhag Coordinated Science Laboratory/ECE Department University of Illinois at
More informationLSI Design Flow Development for Advanced Technology
LSI Design Flow Development for Advanced Technology Atsushi Tsuchiya LSIs that adopt advanced technologies, as represented by imaging LSIs, now contain 30 million or more logic gates and the scale is beginning
More informationLow-Power Digital CMOS Design: A Survey
Low-Power Digital CMOS Design: A Survey Krister Landernäs June 4, 2005 Department of Computer Science and Electronics, Mälardalen University Abstract The aim of this document is to provide the reader with
More informationIntroduction. Timing Verification
Timing Verification Sungho Kang Yonsei University YONSEI UNIVERSITY Outline Introduction Timing Simulation Static Timing Verification PITA Conclusion 2 1 Introduction Introduction Variations in component
More informationDesignCon Full Chip Signal and Power Integrity with Silicon Substrate Effect. Norio Matsui Dileep Divekar Neven Orhanovic
DesignCon 2004 Chip-Level Physical Design Full Chip Signal and Power Integrity with Silicon Substrate Effect Norio Matsui Dileep Divekar Neven Orhanovic Applied Simulation Technology, Inc. 408-436-9070
More informationAn Active Decoupling Capacitance Circuit for Inductive Noise Suppression in Power Supply Networks
An Active Decoupling Capacitance Circuit for Inductive Noise Suppression in Power Supply Networks Sanjay Pant, David Blaauw University of Michigan, Ann Arbor, MI Abstract The placement of on-die decoupling
More informationALTHOUGH zero-if and low-if architectures have been
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 40, NO. 6, JUNE 2005 1249 A 110-MHz 84-dB CMOS Programmable Gain Amplifier With Integrated RSSI Function Chun-Pang Wu and Hen-Wai Tsao Abstract This paper describes
More informationAn Efficient Model for Frequency-Dependent On-Chip Inductance
An Efficient Model for Frequency-Dependent On-Chip Inductance Min Xu ECE Department University of Wisconsin-Madison Madison, WI 53706 mxu@cae.wisc.edu Lei He ECE Department University of Wisconsin-Madison
More informationVariable-Segment & Variable-Driver Parallel Regeneration Techniques for RLC VLSI Interconnects
Variable-Segment & Variable-Driver Parallel Regeneration Techniques for RLC VLSI Interconnects Falah R. Awwad Concordia University ECE Dept., Montreal, Quebec, H3H 1M8 Canada phone: (514) 802-6305 Email:
More informationNOWADAYS, multistage amplifiers are growing in demand
1690 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, VOL. 51, NO. 9, SEPTEMBER 2004 Advances in Active-Feedback Frequency Compensation With Power Optimization and Transient Improvement Hoi
More informationGate Delay Estimation in STA under Dynamic Power Supply Noise
Gate Delay Estimation in STA under Dynamic Power Supply Noise Takaaki Okumura *, Fumihiro Minami *, Kenji Shimazaki *, Kimihiko Kuwada *, Masanori Hashimoto ** * Development Depatment-, Semiconductor Technology
More informationUnscrambling the power losses in switching boost converters
Page 1 of 7 August 18, 2006 Unscrambling the power losses in switching boost converters learn how to effectively balance your use of buck and boost converters and improve the efficiency of your power
More informationECEN 720 High-Speed Links Circuits and Systems
1 ECEN 720 High-Speed Links Circuits and Systems Lab4 Receiver Circuits Objective To learn fundamentals of receiver circuits. Introduction Receivers are used to recover the data stream transmitted by transmitters.
More informationThe Evolution of Waveform Relaxation for Circuit and Electromagnetic Solvers
The Evolution of Waveform Relaxation for Circuit and Electromagnetic Solvers Albert Ruehli, Missouri S&T EMC Laboratory, University of Science & Technology, Rolla, MO with contributions by Giulio Antonini,
More informationLecture 4&5 CMOS Circuits
Lecture 4&5 CMOS Circuits Xuan Silvia Zhang Washington University in St. Louis http://classes.engineering.wustl.edu/ese566/ Worst-Case V OL 2 3 Outline Combinational Logic (Delay Analysis) Sequential Circuits
More informationECE 484 VLSI Digital Circuits Fall Lecture 02: Design Metrics
ECE 484 VLSI Digital Circuits Fall 2016 Lecture 02: Design Metrics Dr. George L. Engel Adapted from slides provided by Mary Jane Irwin (PSU) [Adapted from Rabaey s Digital Integrated Circuits, 2002, J.
More informationPulse propagation for the detection of small delay defects
Pulse propagation for the detection of small delay defects M. Favalli DI - Univ. of Ferrara C. Metra DEIS - Univ. of Bologna Abstract This paper addresses the problems related to resistive opens and bridging
More informationEFFECTING POWER CONSUMPTION REDUCTION IN DIGITAL CMOS CIRCUITS BY A HYBRID LOGIC SYNTHESIS TECHNIQUE
EFFECTING POWER CONSUMPTION REDUCTION IN DIGITAL CMOS CIRCUITS BY A HYBRID LOGIC SYNTHESIS TECHNIQUE PBALASUBRAMANIAN Dr RCHINNADURAI MRLAKSHMI NARAYANA Department of Electronics and Communication Engineering
More informationLayout-Oriented Synthesis of High Performance Analog Circuits
-Oriented Synthesis of High Performance Analog Circuits Mohamed Dessouky, Marie-Minerve Louërat Université Paris VI (55/65) Laboratoire LIP6-ASIM 4 Place Jussieu. 75252 Paris Cedex 05. France Mohamed.Dessouky@lip6.fr
More informationA design of 16-bit adiabatic Microprocessor core
194 A design of 16-bit adiabatic Microprocessor core Youngjoon Shin, Hanseung Lee, Yong Moon, and Chanho Lee Abstract A 16-bit adiabatic low-power Microprocessor core is designed. The processor consists
More information