Design and Analysis of a Portable High-Speed Clock Generator

Similar documents
PHASE-LOCKED loops (PLLs) are widely used in many

ALTHOUGH zero-if and low-if architectures have been

MULTIPHASE clocks are useful in many applications.

Fast-lock all-digital DLL and digitally-controlled phase shifter for DDR controller applications

IN RECENT years, the phase-locked loop (PLL) has been a

CHAPTER 2 LITERATURE SURVEY

/$ IEEE

AS THE semiconductor process is scaled down, the thickness

A PROCESS AND TEMPERATURE COMPENSATED RING OSCILLATOR

A Low-Power and Portable Spread Spectrum Clock Generator for SoC Applications

A Novel Low Power Digitally Controlled Oscillator with Improved linear Operating Range

Delay-Locked Loop Using 4 Cell Delay Line with Extended Inverters

Design of Low Noise 16-bit CMOS Digitally Controlled Oscillator

THIS paper deals with the generation of multi-phase clocks,

A Monotonic and Low-Power Digitally Controlled Oscillator Using Standard Cells for SoC Applications

DESIGN OF MULTIPLYING DELAY LOCKED LOOP FOR DIFFERENT MULTIPLYING FACTORS

Acounter-basedall-digital spread-spectrum clock generatorwithhighemi reductionin65nmcmos

A Robust Oscillator for Embedded System without External Crystal

AS THE DATA rate demanded by multimedia system

ECE1352. Term Paper Low Voltage Phase-Locked Loop Design Technique

A Wide-Range Delay-Locked Loop With a Fixed Latency of One Clock Cycle

Low Power CMOS Digitally Controlled Oscillator Manoj Kumar #1, Sandeep K. Arya #2, Sujata Pandey* 3 and Timsi #4

Design of Phase Locked Loop as a Frequency Synthesizer Muttappa 1 Akalpita L Kulkarni 2

DESIGN AND VERIFICATION OF ANALOG PHASE LOCKED LOOP CIRCUIT

A fast lock-in all-digital phase-locked loop in 40-nm CMOS technology

A10-Gb/slow-power adaptive continuous-time linear equalizer using asynchronous under-sampling histogram

A Digital Clock Multiplier for Globally Asynchronous Locally Synchronous Designs

DESIGN FOR LOW-POWER USING MULTI-PHASE AND MULTI- FREQUENCY CLOCKING

Design of Sub-10-Picoseconds On-Chip Time Measurement Circuit

REFERENCES. [1] P. J. van Wijnen, H. R. Claessen, and E. A. Wolsheimer, A new straightforward

CLOCKING is an essential part in a digital system. The

A Symbol-Rate Timing Synchronization Method for Low Power Wireless OFDM Systems Jui-Yuan Yu, Ching-Che Chung, and Chen-Yi Lee

Taheri: A 4-4.8GHz Adaptive Bandwidth, Adaptive Jitter Phase Locked Loop

Case5:08-cv PSG Document Filed09/17/13 Page1 of 11 EXHIBIT

Fractional- N PLL with 90 Phase Shift Lock and Active Switched- Capacitor Loop Filter

Analysis and Design of a 1GHz PLL for Fast Phase and Frequency Acquisition

Research Article A Low-Power Digitally Controlled Oscillator for All Digital Phase-Locked Loops

A CMOS Clock and Data Recovery Circuit with a Half-Rate Three-State Phase Detector

RECENT advances in integrated circuit (IC) technology

Biju Viswanath Rajagopal P C Ramya Nair S R Jobin Cyriac. QuEST Global

A Frequency Synthesis of All Digital Phase Locked Loop

Multiple Reference Clock Generator

A design of 16-bit adiabatic Microprocessor core

Low-Power CMOS VLSI Design

Lecture 11: Clocking

WITH the rapid evolution of liquid crystal display (LCD)

IN MANY industrial applications, ac machines are preferable

DOUBLE DATA RATE (DDR) technology is one solution

An Ultra-Low-Power 15-bit Digitally Controlled Oscillator with High Resolution

SERIALIZED data transmission systems are usually

A Random and Systematic Jitter Suppressed DLL-Based Clock Generator with Effective Negative Feedback Loop

A wide-range all-digital duty-cycle corrector with output clock phase alignment in 65 nm CMOS technology

INF3430 Clock and Synchronization

An Ultra-Low-Power 15-bit Digitally Controlled Oscillator with High Resolution

Highly Reliable Frequency Multiplier with DLL-Based Clock Generator for System-On-Chip

Optimization of Digitally Controlled Oscillator with Low Power

A 2.2GHZ-2.9V CHARGE PUMP PHASE LOCKED LOOP DESIGN AND ANALYSIS

HIGH PERFORMANCE VOLTAGE CONTROLLED OSCILLATOR (VCO) USING 65NM VLSI TECHNOLOGY

NOVEL OSCILLATORS IN SUBTHRESHOLD REGIME

Implementation of High Performance Carry Save Adder Using Domino Logic

ESD-Transient Detection Circuit with Equivalent Capacitance-Coupling Detection Mechanism and High Efficiency of Layout Area in a 65nm CMOS Technology

NEW WIRELESS applications are emerging where

THE reference spur for a phase-locked loop (PLL) is generated

Lecture 1. Tinoosh Mohsenin

A High-Throughput Memory-Based VLC Decoder with Codeword Boundary Prediction

A 2.4 GHz to 3.86 GHz digitally controlled oscillator with 18.5 khz frequency resolution using single PMOS varactor

Design of a 3.3-V 1-GHz CMOS Phase Locked Loop with a Two-Stage Self-Feedback Ring Oscillator

A Low-Power 12 Transistor Full Adder Design using 3 Transistor XOR Gates

THE phase-locked loop (PLL) is a very popular circuit component

A 10-Gb/s Multiphase Clock and Data Recovery Circuit with a Rotational Bang-Bang Phase Detector

A New network multiplier using modified high order encoder and optimized hybrid adder in CMOS technology

FFT Analysis, Simulation of Computational Model and Netlist Model of Digital Phase Locked Loop

Dedication. To Mum and Dad

Low Power Glitch Free Delay Lines

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Design and Performance of a Phase Angle Control Method Based on Digital Phase-locked Loop

Analysis of phase Locked Loop using Ring Voltage Controlled Oscillator

Digital Pulse-Frequency/Pulse-Amplitude Modulator for Improving Efficiency of SMPS Operating Under Light Loads

Mixed-Signal Simulation of Digitally Controlled Switching Converters

DFT for Testing High-Performance Pipelined Circuits with Slow-Speed Testers

An All-digital Delay-locked Loop using a Lock-in Pre-search Algorithm for High-speed DRAMs

Accomplishment and Timing Presentation: Clock Generation of CMOS in VLSI

All Digital on Chip Process Sensor Using Ratioed Inverter Based Ring Oscillator

GENERALLY speaking, to decrease the size and weight of

MODELING THE PHASE STEP RESPONSE OF BANG-BANG DIGITAL PLLS

Phase Locked Loop Design for Fast Phase and Frequency Acquisition

A Clock and Data Recovery Circuit With Programmable Multi-Level Phase Detector Characteristics and a Built-in Jitter Monitor

Lecture 160 Examples of CDR Circuits in CMOS (09/04/03) Page 160-1

CHAPTER 6 PHASE LOCKED LOOP ARCHITECTURE FOR ADC

RESISTOR-STRING digital-to analog converters (DACs)

Experimental Results for Low-Jitter Wide-Band Dual Cascaded Phase Locked Loop System

Power Efficient Digital LDO Regulator with Transient Response Boost Technique K.K.Sree Janani 1, M.Balasubramani 2

Design of an Efficient Phase Frequency Detector for a Digital Phase Locked Loop

ISSCC 2004 / SESSION 15 / WIRELESS CONSUMER ICs / 15.7

I. INTRODUCTION. Architecture of PLL-based integer-n frequency synthesizer. TABLE I DIVISION RATIO AND FREQUENCY OF ALL CHANNELS, N =16, P =16

IEEE TRANSACTIONS ON POWER ELECTRONICS, VOL. 14, NO. 3, MAY A Sliding Mode Current Control Scheme for PWM Brushless DC Motor Drives

WITH the aid of wave-length division multiplexing technique,

A Phase-Locked Loop with Embedded Analog-to-Digital Converter for Digital Control

Designing of Charge Pump for Fast-Locking and Low-Power PLL

Noise Analysis of Phase Locked Loops

Transcription:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: ANALOG AND DIGITAL SIGNAL PROCESSING, VOL. 48, NO. 4, APRIL 2001 367 Design and Analysis of a Portable High-Speed Clock Generator Terng-Yin Hsu, Chung-Cheng Wang, and Chen-Yi Lee Abstract A new portable clock generator with full pull-in range and fast acquisition is presented in this paper, where it can be developed at hardware description language (HDL) to reduce design cycle as well as improve system-level integration simulation. In the proposed design, frequency tracking is performed by the Prune-and-Search algorithm, and the digital-controlled ring oscillator is constructed by CMOS standard cells. In order to reduce propagation delay of the loop divider, a novel structure is developed to provide a constant delay at any divider setting. In addition, input jitter can be isolated to avoid coupling by digital processing. Hence, the generated clock output becomes more clean and robust. Based on the proposed methodology, a test chip has been designed and verified on 0.6- m CMOS process with frequency range of (360 800) MHz at 3.3 V and peak-to-peak jitter of less than 60 ps at 800 MHz/3.3 V. Index Terms ADPLL, clock generator, frequency synthesizer, full pull-in range, HDL, low jitter. I. INTRODUCTION WITH THE increasing performance and decreasing cost of digital VLSI chips, many more requirements and constraints, such as I/O characteristics, off-chip signaling, PCB complexity, and process migration, must be considered in system-level designs. For advances and improvements of digital VLSI process and design technology, all-digital approaches have the capability to meet the above requirements of computer and communication applications in recent years. In modern digital ASICs, an on-chip high-speed clock synthesizer is important and its design issues can be identified as: 1) wide operating frequency and voltage ranges; 2) low jitter characteristics; and 3) short system turnaround time. Hence, the characteristics of a clock generator must not be limited for a target process technology and should be flexible enough to meet different system specs, such as various operating frequencies, process changes, and integrated simulations, etc. In this paper, a portable solution for a frequency synthesizer-based clock generator with both large pull-in range and fast acquisition is presented. The portable design is an important issue in current digital VLSI because of foundry independence and reusability. For synchronous VLSI designs, very often a phase-locked loop (PLL) circuit is exploited to provide an on-chip high-speed clock. Generally speaking, the key characteristics of these PLL-based frequency synthesizers are: 1) pull-in range and 2) Manuscript received January 2000; revised March 2001. This work was supported by the National Science Council of Taiwan, R.O.C., under Grant NSC89-2215-E-009-053. This paper was recommended by Associate Editor S. Sriram. The authors are with the Department of Electronics Engineering, National Chiao Tung University, Hsinchu 300, Taiwan, R.O.C. (e-mail: cylee@cc.nctu.edu.tw). Publisher Item Identifier S 1057-7130(01)05230-2. output jitter. Many PLL/DPLL-based designs were developed in the past [1] [4] which were mainly based on analog LF and current-controlled/csa ring oscillators to improve the performance of the PLL. Intensive circuit simulation and layout design to overcome process, voltage, and temperature (PVT) variations should be taken into account before an acceptable performance is reached. This results in a long design cycle as the design moves to different foundry processes or design specifications are changed, even though the remaining digital parts are easily portable. Recently, an all-digital PLL (ADPLL) has been proposed [5], where the oscillator is controlled by digital commands. The rest of the controller parts can be designed at the HDL level. However, intensive simulation still has to be conducted to ensure that target frequency band remains achievable under PVT variations. Then, based on circuit simulation results, the corresponding oscillator s behavioral model can be described at the HDL level for ongoing design procedure. Specific transistor sizing and layout design of the oscillator are required as the design specification or process changes. Thus, efforts at physical design level remain unsolved. In order to cope with such an issue, a novel approach is developed to make use of the advantages of digital VLSI for a true portable and cost-effective ADPLL-based clock generator solution in this paper. It is different from conventional approaches in the tracking and locking mechanism. In order to operate at a higher speed, a matrix structure for a digital-controlled ring oscillator based on standard CMOS inverters is proposed to ensure that the clock generator can be up to several hundred megahertz. This approach also belongs to IP-based designs and has been verified on silicon using an in-house 0.6- m SPTM CMOS cell library through synthesis tools. According to measured results, it can generate any number of frequencies with the output range from 360 MHz to 800 MHz under 1-ps resolution with an output jitter of ps and ps @ 800 MHz/3.3 V. The supplied-voltage range is V. The rest of this paper is organized as follows. The algorithm for the clock generator is first addressed in Section II. The proposed architecture for portable design is presented in Section III. Circuit designs and analysis of the cell-based ring oscillator are given in Section IV. Implementation and measurement results of the proposed solution are then described and discussed in Section V. II. THE PROPOSED ALGORITHM The objective of an all-digital frequency synthesizer-based clock generator is to improve turnaround time as well as integration simulation for system-level IC designs. In this way, a 1057 7130/01$10.00 2001 IEEE

368 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: ANALOG AND DIGITAL SIGNAL PROCESSING, VOL. 48, NO. 4, APRIL 2001 Fig. 1. A target frequency and possible sites of a locking window versus input jitter. portable (IP-based) clock generator can be obtained and simply realized by standard cells. A. Frequency-Search Algorithm In all digital designs, the frequency-search process is like a searching problem in a database. It is assumed that all available frequencies are a set of and a target frequency is within that set. In order to improve searching efficiency, it only uses the Prune-and-Search algorithm [6] to find a possible solution of a target frequency. In order to trade off both the miss probability and the efficiency of the searching process, a searching window for frequency error must be created to dynamically control available searching range in the Prune-and-Search algorithm. Hence, the searching range is not only dependent of the algorithm, but is also determined by frequency error. In other words, if the frequency error is greater than the searching window, the searching range must be increased to make the searching process more efficient and to ensure that frequency acquisition can be completed. When frequency error is small or the output frequency is approaching a target frequency, the searching range must be limited to make sure that the searching process does not miss its target. Note that a large searching range implies higher searching efficiency, but the miss rate of the frequency search may increase. It is assumed that a default clock is generated by the clock generator and is equal to. The target clock is denoted by within and the reference clock is where and are programmable constants. If the difference between and is detected, then the status, fast/slow, of a default clock is known. In the next searching cycle, the new frequency of the default clock is changed to and the same searching procedure will be performed again. So the target frequency is equal to Finally, frequency-search schemes are classified into two parts: a coarse search to reduce searching complexity and a fine search to improve searching stability. Both of them belong (1) to the Prune-and-Search algorithm. The worst-case time complexity of a frequency-search algorithm is equal to where is the total number of steps in a coarse search, is the complexity of the last step, and and is the complexity of Prune-and-Search algorithm for fine and coarse searches, respectively. The overall time complexity for the condition of within is also equal to (2). So the lock-in time is equal to, where is a constant for frequency-search loss. B. Pull-In Algorithm Different from conventional approaches, both the phase detector and loop filter are not included in our proposal. To ensure that a target frequency becomes more accurate and robust in applications, a novel scheme is developed to improve both stability and accuracy of the output frequency, where it induces a locking window to meet design requirements. If the difference between and is smaller than the locking window, a target frequency is found. In noisy environments, such as high input jitter, the site of a locking window will be dynamically shifted, which will influence the searching process to determine correct status, as shown in Fig. 1. In order to cope with this imperfection, it is necessary to ensure that the size of a locking window can tolerate reference variations. Generally speaking, the size of the locking window is proportional to the jitter range. It implies that we need a large locking window in a high-jitter environment to maintain stability; however, accuracy of the target frequency will become degraded. A possible solution to trading off jitter and accuracy is to control the searching procedure to ensure that the target frequency is located at the center of the locking window. (2)

HSU et al.: DESIGN AND ANALYSIS OF A PORTABLE HIGH-SPEED CLOCK GENERATOR 369 Fig. 2. Block diagram of the clock generator based on double-ring oscillators. C. Propagation-Delay Issue If the target frequency is much higher than the reference clock, the propagation delay of loop divider must be considered carefully to minimize frequency errors. The default clock is rewritten as where,, and are constants, and is a frequency error. If, is equal to. From (3), it can be found that a long wordlength counter is now replaced by three short wordlengths (, and ) counters. As a result, the propagation delay can be reduced to an acceptable range through decomposition. (3) III. THE PROPOSED ARCHITECTURE A block diagram of a double-ring oscillator-based clock generator is shown in Fig. 2. Its structure is similar to a traditional frequency synthesizer, where an extra path is added to generate the target clock. However, a completely different searching mechanism is developed here. Four major functional modules are allocated in the design: (1) clock pair for frequency and output generation; (2) a frequency-search unit used to search a target frequency and determine estimation status; (3) a clock controller used to adjust control timing; (4) an offset estimation unit used to minimize mismatch in the clock pair. The divider N is used not only to slow down the reference frequency but also to generate control signals for search processes as shown in Fig. 2. Overall operations are classified into two processes: 1) an active process for cyclically searching a target frequency and 2) preset process for restoring commands. Due to physical constraints of setup time and hold time, the decisions of the controlled status must be made carefully. Hence, a divider M is used to slow down the target frequency as well as to enlarge the frequency error. Then all of them can fit physical constraints, and the threshold (the size of a locking window) can be accurately determined without timing violation too. In order to compensate for temperature and supply voltage variations, the searching procedure is activated continuously (recycled) whether a target frequency is found or not. So, in Fig. 2, ring oscillator #2 is cyclically restarted to search locked commands of a target frequency, and ring oscillator #1 is continuous to the output of a target clock based on searching results Fig. 3. Controller design of the proposed portable clock generator. (a) The controller state diagram. (b) Structure of the frequency search. (c) Structure of the offset estimation and clock controller.

370 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: ANALOG AND DIGITAL SIGNAL PROCESSING, VOL. 48, NO. 4, APRIL 2001 Fig. 4. Frequency allocation of the digital-controlled ring oscillator. from ring oscillator #2. On the other hand, ring oscillator #2, which is operated at a burst mode to keep the searching procedure accurate and to find searching commands, cannot generate a continuous output for target clock. As a result, the output of clock generators can become more stable during the frequency search. The state transition diagram of the frequency search is shown in Fig. 3(a) and its structure is shown in Fig. 3(b). The ring oscillator #1 will only change its frequency when the setting of a target frequency is determined and found by ring oscillator #2. Because there are two equivalent ring oscillators in the proposed architecture, an offset estimation is essential to measure mismatch between two oscillators, caused by foundry fabrication. In other words, to overcome the mismatch problem from foundry fabrication, an offset calibration is performed to reduce frequency errors between two equivalent oscillators before beginning the search for a target frequency. Then the locked commands are equal to summations of an offset and searching commands (ideally, searching commands are equal to locked commands) as shown in Fig. 3(c). If the temperature offset is taken into account, the offset calibration must be conducted to ensure successful locked status. Other issues, such as compensating for process variations and preventing oscillator frequency discontinuity, are handled during frequency band allocation. As shown in Fig. 4, all subbands of ring oscillators are overlapped. The degree of the overlapping, which can be determined at the HDL level, is dependent on process parameters. For example, if the process variation is about 10%, the degree of the extended region is set to twice this variation (20%). Although this frequency-allocation scheme will increase missing probability (lock-in time), it can overcome process variations. Hence, an overall bandwidth of ring oscillators is always continuous and a wide frequency band can be achieved also. IV. CIRCUIT DESIGNS A. Digital-Controlled Ring Oscillator A cell-based digital-controlled oscillator is shown in Fig. 5. It is a matrix structure based on CMOS inverters (inverter matrix), whose characteristics are determined by both the combination and number of inverters. Note that the dynamic control range is Fig. 5. Structure of the cell-based digital-controlled ring oscillator. determined by the combination of inverter matrix, and the frequency resolution is dependent of scales of the parallel-inverter structure (delay bank). Both the inverter matrix and delay bank are allocated to perform coarse and fine searches, and the resolution of the clock generator is determined by the minimum scale of the delay bank. By means of operating different numbers of inverters in parallel-inverter structures (delay banks), the proposed ring oscillator can generate different frequencies. The delay bank in Fig. 5 is analyzed by an RC model [7], whose delay scale for the mth delay path can be expressed by where is the output resistance, is the gate capacitance, and are the MOSFET width and channel length, respectively, is one of delay path, and is the total number of paths in the delay bank. Hence, the delay matrix owns paths to pass signals with different delays. By properly assigning and in (4), a linear delay model can be obtained as shown in Fig. 6. To ensure the proposed architecture can generate any target frequency, both temperature and voltage variations have to be taken into account. Equation (4) also implies that DCO resolution is proportional to a factor of inverter size.by SPICE circuit simulation, the proposed DCO with delay bank,, can provide 1-ps resolution for an in-house 0.6- m (4)

HSU et al.: DESIGN AND ANALYSIS OF A PORTABLE HIGH-SPEED CLOCK GENERATOR 371 Fig. 6. Equivalent structure of the proposed digital-controlled ring oscillator. Fig. 7. Structure of the loop divider (Jk + C) and its timing. SPTM CMOS cell library. From [7], it is shown that the output frequency of inverter chains is dependent of and of inverters. However, inverter delays are functions of temperature, supply voltage, loading capacitance, etc. For simplification without losing generality, the proposed ring oscillator is equivalent to the inverter-chain structure [8] as shown in Fig. 6, and it is assumed that and for. In addition, a certain frequency error,, is allowed in the design specifications. Hence, the output frequency is allocated within, and the size of a locked window must be less than frequency error to produce target frequency. The output frequency of the cell-based digital controlled oscillator is then modeled as and is the total number of cascaded inverters, is the model of inverter bank and represents the delay of the th path in delay paths in (4). Based on (4) and (6), we can build a corresponding HDL model of the DCO to (5) (6)

372 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: ANALOG AND DIGITAL SIGNAL PROCESSING, VOL. 48, NO. 4, APRIL 2001 Fig. 8. Measured results with L =4, M =34, N =8, external clock at 36 MHz 6 163 ps, and test output 153 MHz 6 38 ps (DCO output at 612 MHz). perform system-level integration simulation. Note that (6) must satisfy the following inequality of frequency errors: In (7), for any given and, can always be found to satisfy the above constraints. So, the bandwidth of digital-controlled ring oscillators is proportional to, and its resolution is determined by. Both the and parameters are controlled by frequency-tracking module to search proper assignments (solutions) based on coarse and fine searches. B. Loop Divider The propagation delay of the loop divider must be reduced to enhance frequency search accuracy and to achieve higher clock rates. This is because the multiplication factor is proportional to the total length of the loop divider. Here this delay-reduction process is done in the time domain. Based on (3), a large (7) is now replaced by, three short wordlength counters instead of a long wordlength counter. For implementation, the combination of (3) is easy to be realized and its structure is shown in Fig. 7. A counter is activated after counting which are not autorestarted after counting C. Hence, the overall propagation delay is equal to count, instead of. For examples, if is 8000, can be and will be 20. Generally, the range of is to prevent timing violation of Masker in Fig. 7. In addition, this scheme does not need any special circuit to achieve small propagation delay. By properly selecting, and, the propagation delay can always be reduced to an acceptable level. V. IMPLEMENTATION AND DISCUSSION A prototyping chip for a cell-based clock generator has been designed and fabricated in 0.6- m CMOS SPTM process. First, the controller is described by Verilog-HDL and digital-controlled ring oscillators are designed at gate-level with timing information from an in-house standard cell library. A behavioral model of the oscillator is then generated to complete the whole simulation with other blocks. Then source codes are

HSU et al.: DESIGN AND ANALYSIS OF A PORTABLE HIGH-SPEED CLOCK GENERATOR 373 Fig. 9. (b) Harmonic characteristic. Frequency characteristics with L =4, M =34, N =8, external clock at 26 MHz, and test output 110.5 MHz (DCO output at 442 MHz). (a) Spectrum. synthesized to generate gate-level netlists and schematics for further simulations and verifications. These gate-level netlists are verified with original codes to check their behaviors and timing. In addition, area and timing of whole design must be carefully considered to meet requirements at this step. After timing and functions have been verified correctly, automatic placement and routing (APR) tool is exploited to complete physical layout. Measured results show that the operating frequency of this test chip ranges from 360 to 800 MHz at 3.3 V. The working supply voltage range is V. Fig. 8 shows measured results with, 1,, external clock at 36 MHz 163 ps, and test output at 153 MHz 38 ps (after output divider; DCO at 612 MHz). The spectrum and harmonic 1 Because the speed limitation of PADs in 0.6-m CMOS SPTM process is 200 MHz, the output frequency of clock generator must be lowered for testing. characteristic at 110 MHz for test output (DCO at 440 MHz) are shown in Fig. 9(a) and (b). Because the proposed DCO and PADs are operated at full swing, its spectrum contains some high-frequency components. The histogram of maximum output frequency at 800 MHz/3.3 V(, 200 MHz output test) is shown in Fig. 10, whose jitter is 60 ps. A chip microphoto of the proposed design is shown in Fig. 11 and its summary is listed in Table I. Note that power dissipation is mainly from the DCO, which consumes more than 85% of the total power consumption. Simulation shows that more power dissipation can be saved if minimum-size inverters (which are not available in our current cell library version) are exploited for the DCO. From this research, it is found that the portable clock generator can be divided into two parts, namely the controller and the DCO. A frequency detection and acquisition scheme is performed in the controller, which is synchronized to the input ref-

374 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: ANALOG AND DIGITAL SIGNAL PROCESSING, VOL. 48, NO. 4, APRIL 2001 Fig. 10. Measured results of the clock generator with maximum output frequency at 800 MHz (200 MHz for output test). rate is higher than the DCO maximum frequency, only the DCO cell is needed to redesign as in [5]. Then an HDL behavioral description model can be set up and combined with a controller IP for HDL-level simulation. Thus, the design cycle for an on-chip high-speed clock generator, which is needed in many ASIC designs, can be reduced quite a bit. Moreover, an HDL simulation model of the portable clock generator can be combined with other system modules for system-level integrated simulation, making it very suitable for system-on-a-chip design. VI. CONCLUSION Fig. 11. Chip microphoto of the 360 800-MHz clock generator. erence clock. As a result, this controller can be described by a synthesizable HDL code and becomes a soft-ip. The oscillation frequency is mainly achieved by the DCO whose maximum frequency is determined by a target cell library. If the target clock To exploit advances of digital VLSI, a portable high-speed on-chip clock generator with large pull-in range and low output jitter has been presented in this paper. Based on an in-house 0.6 m CMOS SPTM standard cell library, a test chip has been designed, fabricated, and measured. Results show that the output frequency of the proposed solution can be up to 800 MHz, which is sufficient for many ASIC designs fabricated by 0.6- m CMOS process. In addition, the design cycle can be reduced significantly and system-turnaround time can be improved during technology migration. As a result, the completely portable, IP-based, and fully integrated clock generator solution becomes available for digital VLSI system designs.

HSU et al.: DESIGN AND ANALYSIS OF A PORTABLE HIGH-SPEED CLOCK GENERATOR 375 TABLE I CHIP SUMMARY OF THE ON-CHIP CLOCK GENERATOR BASED ON THE DIGITAL FREQUENCY SYNTHESIZER [5] J. Dunning, G. Garcia, J. Lundberg, and E. Nuckolls, An all-digital phase-locked loop with 50-cycle lock time suitable for high-performance microprocessors, IEEE J. Solid-State Circuits, vol. 30, pp. 412 422, Apr. 1995. [6] G. Brassard and P. Bratley, Algorithmics: Theory and Practice. Englewood Cliffs, NJ: Prentice-Hall, 1988. [7] S.-M. Kang and Y. Leblebici, CMOS Digital Integrated Circuits: Analysis & Design. New York: McGraw-Hill, 1996. [8] T. Y. Hsu, B. J. Shieh, and C. Y. Lee, An all digital phase-locked loop (ADPLL) based clock recovery circuit, IEEE J. Solid-State Circuits, vol. 34, pp. 1063 1073, Aug 1999. Terng-Yin Hsu received the B.S. and M.S. degrees from Feng Chia University, Taichung, Taiwan, in 1993 and 1995, respectively, and the Ph.D. degree from the National Chiao Tung University, Hsinchu, Taiwan, in 1999, all in electronic engineering. In 2000, he joined the Department of Electronics Engineering, Ming-Hsing Institute of Technology, where he is currently Assistant Professor. His research interests mainly include VLSI architectures, CDMA power control, wireless LAN communications, high-speed networking, analog-like digital circuits, system-on-chip design technology, and related ASIC designs. ACKNOWLEDGMENT The authors would like to thank members of the SI2 Lab of the Electrical Engineering Department of the National Chiao Tung University for many fruitful suggestions on physical implementations. The MPC support from NSC/CIC is acknowledged as well. Chung-Cheng Wang was born in Taipei City, Taiwan, R.O.C., on January 6, 1976. He received the B.S. and M.S. degrees from National Chiao Tung University, Hsinchu, Taiwan, in 1998 and 2000, respectively. His research interests include IC design flow, cellbased and fully custom VLSI design, phase-locked loops, and clock generators. REFERENCES [1] I. Novof, J. Austin, R. Chmela, T. Frank, R. Kelkar, K. Short, D. Strayer, M. Styduhar, and S. Wyatt, Fully-integrated CMOS phased-locked loop with 15 to 240 MHz locking range and 650 ps jitter, in IEEE Int. Solid- State Circuits Conf., Session 6, Feb 1995, pp. 112 113. [2] J. G. Maneatis, Low-jitter and process-independent DLL and PLL based on self-biased techniques, in IEEE Int. Solid-State Circuits Conf. Session 8, Feb 1996, pp. 130 131. [3] Z. X. Zhang, H. Du, and M. S. Lee, A 360 MHz 3 V CMOS PLL with 1 V peak-to-peak power supply noise tolerance, in IEEE Int. Solid-State Circuits Conf. Session 8, Feb 1996, pp. 134 135. [4] V. R. von Kaenel, A high-speed, low-power clock generator for a microprocessor, IEEE J. Solid-State Circuits, vol. 33, pp. 1634 1639, Nov. 1998. Chen-Yi Lee received the B.S. degree from National Chiao Tung University, Hsinchu, Taiwan, in 1982, and the M.S. and Ph.D. degrees from Katholieke University Leuven (KUL), Belgium, in 1986 and 1990, respectively, all in electrical engineering. From 1986 to 1990, he was with IMEC/VSDM working in the area of architecture synthesis for DSP. In February 1991, he joined the faculty of the Electronics Engineering Department, National Chiao Tung University, where he is currently a Professor. His research interests mainly include VLSI algorithms and architectures for high-throughput DSP applications. He is also active in various aspects of high-speed networking, system-on-chip design technology, very low bit rate coding, and multimedia signal processing.