A HIGH SPEED DYNAMIC RIPPLE CARRY ADDER

Similar documents
Design of Robust and power Efficient 8-Bit Ripple Carry Adder using Different Logic Styles

COMPARATIVE ANALYSIS OF 32 BIT CARRY LOOK AHEAD ADDER USING HIGH SPEED CONSTANT DELAY LOGIC

Domino CMOS Implementation of Power Optimized and High Performance CLA adder

High-Performance of Domino Logic Circuit for Wide Fan-In Gates Using Mentor Graphics Tools

4-BIT RCA FOR LOW POWER APPLICATIONS

Performance Evaluation of Adders using LP-HS Logic in CMOS Technologies

Implementation of High Performance Carry Save Adder Using Domino Logic

Design of Multipliers Using Low Power High Speed Logic in CMOS Technologies

Low Power 32-bit Improved Carry Select Adder based on MTCMOS Technique

Low Power and High Performance ALU using Dual Mode Transmission Gate Diffusion Input (DMTGDI)

A Low-Power 12 Transistor Full Adder Design using 3 Transistor XOR Gates

Design of 32-bit Carry Select Adder with Reduced Area

Design of Low Power High Speed Hybrid Full Adder

A Low-Power High-speed Pipelined Accumulator Design Using CMOS Logic for DSP Applications

Study and Analysis of Full Adder in Different Sub-Micron Technologies with an Area Efficient Layout of 4-Bit Ripple Carry Adder

DESIGN OF LOW POWER MULTIPLIER USING COMPOUND CONSTANT DELAY LOGIC STYLE

COMPREHENSIVE ANALYSIS OF ENHANCED CARRY-LOOK AHEAD ADDER USING DIFFERENT LOGIC STYLES

Leakage Power Reduction in 5-Bit Full Adder using Keeper & Footer Transistor

International Journal of Advance Engineering and Research Development

Wide Fan-In Gates for Combinational Circuits Using CCD

A Novel Approach for High Speed and Low Power 4-Bit Multiplier

Performance Comparison of High-Speed Adders Using 180nm Technology

FPGA Implementation of Area Efficient and Delay Optimized 32-Bit SQRT CSLA with First Addition Logic

POWER DISSAPATION CHARACTERISTICS IN VARIOUS ADDERS

ISSN:

Investigation on Performance of high speed CMOS Full adder Circuits

An Efficient SQRT Architecture of Carry Select Adder Design by HA and Common Boolean Logic PinnikaVenkateswarlu 1, Ragutla Kalpana 2

DESIGN OF PARALLEL MULTIPLIERS USING HIGH SPEED ADDER

A NOVEL 4-Bit ARITHMETIC LOGIC UNIT DESIGN FOR POWER AND AREA OPTIMIZATION

Design & Simulation of Half Adder Circuit Using AVL technique based on CMOS Technology

Performance Analysis of High Speed Low Power Carry Look-Ahead Adder Using Different Logic Styles

Design and Implementation of Complex Multiplier Using Compressors

Dynamic Logic. Domino logic P-E logic NORA logic 2-phase logic Multiple O/P domino logic Cascode logic 11/28/2012 1

Power-Area trade-off for Different CMOS Design Technologies

A Low Power High Speed Adders using MTCMOS Technique

Implementation of Carry Select Adder using CMOS Full Adder

ISSN: [Kumar* et al., 6(5): May, 2017] Impact Factor: 4.116

AREA AND DELAY EFFICIENT DESIGN FOR PARALLEL PREFIX FINITE FIELD MULTIPLIER

Design of 16-bit Heterogeneous Adder Architectures Using Different Homogeneous Adders

SINGLE CYCLE TREE 64 BIT BINARY COMPARATOR WITH CONSTANT DELAY LOGIC

DESIGN OF CARRY SELECT ADDER WITH REDUCED AREA AND POWER

[Vivekanand*, 4.(12): December, 2015] ISSN: (I2OR), Publication Impact Factor: 3.785

DESIGN AND IMPLEMENTATION OF AREA EFFICIENT, LOW-POWER AND HIGH SPEED 128-BIT REGULAR SQUARE ROOT CARRY SELECT ADDER

International Journal of Scientific & Engineering Research, Volume 7, Issue 3, March-2016 ISSN

Estimating the Maximum Propagation Delay of 4-bit Ripple Carry Adder Using Reduced Input Transitions

An Optimized Design of High-Speed and Energy- Efficient Carry Skip Adder with Variable Latency Extension

Unique Journal of Engineering and Advanced Sciences Available online: Research Article

A Literature Survey on Low PDP Adder Circuits

A NEW APPROACH TO DESIGN LOW POWER CMOS FLASH A/D CONVERTER

DESIGN OF LOW POWER HIGH SPEED ERROR TOLERANT ADDERS USING FPGA

Design and Analysis of CMOS based Low Power Carry Select Full Adder

Design and Analysis of CMOS Based DADDA Multiplier

Double Stage Domino Technique: Low- Power High-Speed Noise-tolerant Domino Circuit for Wide Fan-In Gates

CHAPTER 3 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED ADDER TOPOLOGIES

Design and Implementation of High Speed Area Efficient Carry Select Adder Using Spanning Tree Adder Technique

International Journal of Scientific & Engineering Research, Volume 4, Issue 5, MAY-2013 ISSN

Design and Implementation of High Speed Carry Select Adder

Low Power &High Speed Domino XOR Cell

Design and Simulation of Novel Full Adder Cells using Modified GDI Cell

High Speed, Low power and Area Efficient Processor Design Using Square Root Carry Select Adder

Design of a Power Optimal Reversible FIR Filter ASIC Speech Signal Processing

the cascading of two stages in CMOS domino logic[7,8]. The operating period of a cell when its input clock and output are low is called the precharge

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

A Novel Low Power, High Speed 14 Transistor CMOS Full Adder Cell with 50% Improvement in Threshold Loss Problem

International Journal of Advance Research in Computer Science and Management Studies

Index terms: Analog to digital converter, Flash ADC, Pseudo NMOS logic, Pseudo Dynamic CMOS logic multi threshold voltage CMOS inverters.

High Speed NP-CMOS and Multi-Output Dynamic Full Adder Cells

A Novel Low power and Area Efficient Carry- Lookahead Adder Using MOD-GDI Technique

Figure.1. Schematic of 4-bit CLA JCHPS Special Issue 9: June Page 101

Gdi Technique Based Carry Look Ahead Adder Design

Design and Implementation of High Speed Carry Select Adder Korrapatti Mohammed Ghouse 1 K.Bala. 2

Pardeep Kumar, Susmita Mishra, Amrita Singh

Error Tolerant Adder

International Journal of Advanced Research in Biology Engineering Science and Technology (IJARBEST)

Design and Performance Analysis of High Speed Low Power 1 bit Full Adder

Design of High Performance Arithmetic and Logic Circuits in DSM Technology

DESIGN OF EXTENDED 4-BIT FULL ADDER CIRCUIT USING HYBRID-CMOS LOGIC

Low power high speed hybrid CMOS Full Adder By using sub-micron technology

DESIGN AND ANALYSIS OF LOW POWER ADDERS USING SUBTHRESHOLD ADIABATIC LOGIC S.Soundarya 1, MS.S.Anusooya 2, V.Jean Shilpa 3 1

International Journal of Scientific & Engineering Research, Volume 4, Issue 8, August ISSN

Noise Tolerance Dynamic CMOS Logic Design with Current Mirror Circuit

2 Assoc Prof, Dept of ECE, George Institute of Engineering & Technology, Markapur, AP, India,

Modified Booth Encoding Multiplier for both Signed and Unsigned Radix Based Multi-Modulus Multiplier

A Low Power Array Multiplier Design using Modified Gate Diffusion Input (GDI)

A Novel Designing Approach for Low Power Carry Select Adder M. Vidhya 1, R. Muthammal 2 1 PG Student, 2 Associate Professor,

Anitha R 1, Alekhya Nelapati 2, Lincy Jesima W 3, V. Bagyaveereswaran 4, IEEE member, VIT University, Vellore

Design and Analysis of Row Bypass Multiplier using various logic Full Adders

Design of a Power Optimal Reversible FIR Filter for Speech Signal Processing

International Journal of Engineering Research-Online A Peer Reviewed International Journal Articles available online

Novel Buffer Design for Low Power and Less Delay in 45nm and 90nm Technology

A Highly Efficient Carry Select Adder

Power Efficient and Noise Immune Domino Logic for Wide Fan in Gates

LOW POWER NOVEL HYBRID ADDERS FOR DATAPATH CIRCUITS IN DSP PROCESSOR

II. Previous Work. III. New 8T Adder Design

Design of a Low Voltage low Power Double tail comparator in 180nm cmos Technology

Implementation of Efficient 5:3 & 7:3 Compressors for High Speed and Low-Power Operations

IJMIE Volume 2, Issue 3 ISSN:

International Journal of Advance Engineering and Research Development

Design of Low Power Carry Look-Ahead Adder Using Single Phase Clocked Quasi-Static Adiabatic Logic

Improved Performance and Simplistic Design of CSLA with Optimised Blocks

Transcription:

A HIGH SPEED DYNAMIC RIPPLE CARRY ADDER Y. Anil Kumar 1, M. Satyanarayana 2 1 Student, Department of ECE, MVGR College of Engineering, India. 2 Associate Professor, Department of ECE, MVGR College of Engineering, India. Abstract Adder, which is one of the basic building blocks of a processor affect the performance of the processor. There are many adder architectures each of them have their own advantage. Ripple Carry Adder (RCA) architecture occupies the minimum area among the other architectures with lesser power dissipation. RCA experiences more delay due to its carry propagation in critical path; apart from the delay it also experiences glitches. Constant delay (CD) logic solves both the delay problems and glitch related problems., due to its pre-evaluated characteristics delivers high speed but due its bulkier nature it is used only in the critical path. In this paper two new techniques are presented which modifies the conventional timing block (requires ten transistors) in and two new timing blocks one with eight transistors and other with nine transistors are developed. The with the two new timing block is used in critical path of RCA to achieve higher speed performance with lesser area compared to conventional. The with 9-transistor timing block achieves 70% and 39% delay reduction compared to Static and Domino logics. It also achieves 21% and 5% reduction in power dissipation and delay. The 8-transistor version also achieves reduction of delay by 65% and 29% compared to Static and dynamic logic. The two versions of timing blocks have their own advantages where 9-transistor version provides high speed and 8- transistor version provides lesser power dissipation. Simulations are carried out in 130 nm at 1V power supply using mentor graphics tools. Key Words: Critical Path, Feed Through Logic, Constant Delay logic, Pre-evaluated logic, and Timing block. --------------------------------------------------------------------***---------------------------------------------------------------------- 1. INTRODUCTION Addition is very crucial to perform fundamental arithmetic operations. It is used extensively in many VLSI designs and is by so far the most frequently used operation in generalpurpose system and in application-specific processors. Also, because the operations of subtraction, multiplication, division and address calculation usually rely on the operation of addition, addition is often seen as an indispensable part of the arithmetic unit. It is dubbed the heart of any microprocessor, DSP architecture, and data processing system. The carry propagation from each bit to its higher position results in a substantial delay. So the adder which lies in the critical delay path effectively determines the system s overall speed. An efficient adder builds an efficient system. This leads to increasing popularity of smaller and more durable mobile computing and communication systems. There are many adder architectures namely the Ripple Carry Adder (RCA), the Carry Look-Ahead Adder (CLA), the Carry Skip Adder (CSK), the Carry Select Adder (CSL), the Carry Save Adder (CSA) and the Conditional Sum Adder (COS). Each architecture has its own advantages. Among all the adder architectures, the RCA occupies the smallest area and offers good performance for random data input. But the delay depends on length of carry propagation path. As the number of inputs increases delay increases linearly. For an n-bit RCA the delay is nt, where T is the delay of a full adder block. The overall performance of an RCA depends on design on Full adder block. 2. FULL ADDER BLOCK There are many full adder (FA) architectures, where the conventional CMOS adder uses 32 transistors, the highest among the adders and least number of transistors required to design a full adder are six. But the CMOS logic and dynamic logic provides less power dissipation. But the dynamic logic suffers from cascading problem. Domino logic overcomes the cascading problem with an extra inverter [3]. The cascading problem in dynamic logic and an extra inverter overhead is compensated by NORA and ZIPPER logic, but NORA logic suffers from charge leakage and ZIPPER logic needs non overlapping clocks that creates area overhead[1-2]. Some other versions of full adders include Complementary pass transistor logic full adder, Transmission gate full adder, 17-transistor full adder, 14- transistor full adder, 10- transistor full adder [5] etc., In this paper a high speed dynamic logic is proposed which is derived from (Constant Delay) [8]. Before discussing about, Feed Through Logic (FTL) [4] should be understood. FTL is shown in fig. 1 overcomes the area over head in domino logic and cascading problem in dynamic logic. By removing the footer transistor and placing a pre-discharge transistor parallel to output node, the cascading problem is solved without an extra inverter. But drawback of FTL is higher power dissipation than dynamic and domino logic. This is due to the short circuit path from VDD to Ground when M1 and NMOS pull down network conducts simultaneously. overcomes the short circuit problem in FTL with the help of an additional timing block. This timing block prevents the pull up transistors and Volume: 09 Issue: 08 August-2015, Available @ http://www.ijret.org 438

NMOS pull down logic to conduct simultaneously. The inverter at the output is to eliminate the noise. CD LOGIC can implement both inverting logic and non inverting logic. Fig. 2 shows buffer implemented using constant delay logic. In this paper six types of full adder configurations are compared with each other, where A, B and CIN are the inputs and COUT and SUM are the outputs. Fig. 3(a) and 3(b) shows the sum generation units, where 3(b) consumes less power but provides more delay compared to CMOS sum generation. Figure 4(a) (d) shows the carry generation units of Conventional CMOS logic, Domino logic, 10- Transistor Full adder and respectively. Out of the 4 types of carry generation units the 10-Transistor FA requires least number of transistors to generate carry. It generates the inverted version of carry, so to use it in RCA the COUT signal should be given to an inverter input. The noise in COUT signal is removed by the inverter, to remove the noise in SUM signal two inverters or a buffer should be added to the SUM signal output. But this addition of inverters makes it increase its size and power dissipation. When 10-T FA is used in RCA, by connecting the sum output of the last stage to a buffer, the noise at the output is reduced. CMOS logic and Domino logic carry generation unit consumes the lesser power. provides lesser delay above all the carry generation units because of its pre-evaluation concept. In this paper technique with optimized timing block is utilized to design an 8-bit RCA. Importance of CD logic and methods to overcome the drawbacks of are briefly explained in the next section. Fig -3 Sum generation units Fig -4 Carry generation units Fig -1 feed through logic (FTL) Fig -2 Constant delay logic buffer 3. PROPOSED TIMING BLOCKS OF CD LOGIC. in D-Q mode shows high speed performance due to its pre-evaluation nature. But when compared to domino logic(without keeper), requires extra 11 transistors and extra 13 compared to dynamic logic. In the extra number of transistors is mainly due to timing block (TB), so optimizing the timing block reduces the area overhead and power consumption. The timing block should be optimized in such a way that the delay should not be increased. Two optimized designs of timing blocks are proposed in this paper which performs the same logical function of the original timing block. 8-T Timing Block In first design the number of transistors in the timing block is reduced by two. Fig. 5 shows the original timing block from where the transistors M1 and M3 are removed to get the same operation performed. Fig. 6 shows the first modification in timing block i.e., 8-T timing block. The delay gets increased if this timing block is used because Volume: 09 Issue: 08 August-2015, Available @ http://www.ijret.org 439

transistor M3 has clock as an input i.e., if CLK= 1 then TOUT should be pulled to zero. Fig -5 Timing Block of As M3 is absent in this circuit TOUT is made zero through M2 and M1 using delay inverted clock signal. So TOUT is inverted version of clock, i.e., when CLK= 1 TOUT= 0, but with some delay. This delay makes the pre-charging slower, which increases the delay. To reduce the delay, the W/L ratio of pre-charge transistors should be increased. To avoid this 9-T timing block is developed. Fig -6 8-T timing block 9-T Timing block In the second design one transistor is reduced as shown in fig. 7, but the 8-T timing block has some limitations for low leakage paths, and 9-T timing works well with the low leakage paths but dissipates more power at high leakage paths. The drawback in 8-T timing block is overcome by adding transistor M3. So now, charging the TOUT node is faster compared to that in 8-T timing block. But here the disadvantage is more power dissipation due to extra transistor. Fig -7 9-T timing block 4. IMPLEMENTATION OF 8-BIT RIPPLE CARRY ADDER In RCA the carry chain constitute the critical path, so speeding up the carry helps in improving the speed of the RCA. Ripple Carry adder is subjected to a glitching problem. Due to delay in previous stage carry signals the glitches occur. In delay from the previous stage is to be considered seriously as the evaluation time for this logic type is very less. Evaluation time is said to be the window width. The window width is equal to the three inverter delay. Generally in dynamic logic the evaluation time is the whole evaluation period, but in it is just a part of the evaluation period, so the delay should be less than the window width to prevent false logic evaluation. If the glitches from the previous state last for the window width period then false evaluation takes place. To eliminate this problem the clock signal should be delayed such that the window width period can be delayed and the glitches can be avoided. Figure 8 shows the arrangements of inverters to generate appropriate clock signals for each stage. CLK1, CLK2 and CLK3are delayed clock signals for different stages of RCA. This arrangement is only for dynamic logic, as the static logic doesn t operate on clock signals. In this paper an 8-bit RCA is simulated using six different full adder blocks. The sum generation unit same for all adder blocks. The simulated results in Table I are of the RCA circuit that used CMOS sum generation unit. Even though 12-T adder provides lesser transistors and noise less output, it creates more delay which is due to the inverters INV1 and INV2 in fig 3(b). Due to the extra delay the clock arrangement also should be changed. It requires three extra delay clock signals as shown in fig 9, CLK for stage1, CLK1 for stage2, CLK2 for stage3, CLK3 for stage4, CLK4 for stage5, CLK5 for stage6, CLK6 for stage6 and stage7. Other full adder (FA) with 28 transistors with sizing strongly in favour of Cout computation [6] can also be used. A more energy-efficient pass-transistor FA design [7] can also be used in the full adder design Volume: 09 Issue: 08 August-2015, Available @ http://www.ijret.org 440

Among the compared RCA architectures even though the 10-Transistor FA RCA requires the least number of transistors the delay is almost equal to the CMOS 32- Transistor FA RCA and power dissipation is very high. Considering low leakage conditions Domino logic carry generation circuit is simulated without a keeper transistor and that reduced 8 transistors. If the keeper transistor is included the power dissipation would be equal to that of CMOS 32-T FA RCA. occupies more area than CMOS logic but it has the better power delay product. 8-T TB even though provides more delay than CD logic it has second least power delay product. 9-TB CD logic is the fastest among the compared logics and having the third least power delay product. Another logic, the Current Comparison Domino (CCD) Logic [9] consumes very less power which is not discussed in this paper but not faster than. Ripple carry adder occupies the least area among all the other adder architectures, but has more delay when number of inputs increases. But with modified in critical path of RCA makes it suitable for 8-bit applications. Instead of going to CLA it is better to opt RCA which occupies less area with increasing speed. Fig -8 8-bit Ripple carry adder Figure 9 Clock signals arrangement for RCA with 12-T sum unit Table -1 Comparison Of Rca Architectures With Different Logics Logic type S7 Delay COUT DELAY Number of Power Dissipation Power Delay transistors product (10-18 ) Conventional 32-T FA 2.113 ns 2.07 ns 276 20.5962 n Watts 43.519 Domino Logic 1.04 ns 587.718 ps 160 14.238 n Watts 14.807 CD Logic 677.780 ps 550.134 ps 300 51.191 n Watts 34.656 10-T FA 2.512 n s 2.527 ns 96 59.889 u Watts 150441 8-T TB 734.65 ps 612.456 ns 284 32.8427 n Watts 24.131 9-T TB 643.247 ps 575.767 ps 292 42.3167 n Watts 27.220 Volume: 09 Issue: 08 August-2015, Available @ http://www.ijret.org 441

Normalized values IJRET: International Journal of Research in Engineering and Technology eissn: 2319-1163 pissn: 2321-7308 4 3 2 1 0 CMOS logic Fig -10 Power and Delay characteristics of different types of 8-bit RCA architectures 5. CONCLUSION Power and Delay characteristics Domino logic Logic types 8-T TB 9-T TB Power Delay product Delay In this paper different RCA architectures are analyzed. Even though the 10-transistor FA occupies less area it failed to reduce the power and delay. CMOS logic despite of its bulky nature provides lesser power dissipation, but the delay increased by 200% when compared to. Two new techniques are employed in to reduce the power dissipation and delay. The two techniques 8T- Timing block and 9-T timing block used succeeded in reducing the power dissipation compared to its predecessor but the 8- T timing failed to reduce the delay. But the 8-T TB has lesser delay compared to CMOS logic, Domino Logic and 10-T FA RCA. Fig. 10 shows the Power- Delay product and Delay characteristics graph. 8-T TB CD logic has the better Power-Delay product among the other versions. The 8-transistor version also achieves reduction of delay by 65% and 29% compared to Static and dynamic logic. But the 8-T TB version has 14% more delay compared to 9-T TB version. Even though 8-T TB version dissipates less power than 9-T TB version, as the main aim is to reduce the critical path 9-T TB version is considered. Domino logic has the lesser power delay product but the delay is 53.4% more compared to and 61.7% more than 9-TB. So the domino logic dissipated less power and provides more speed. As the VLSI domain mainly opts for high speed and low power designs, 9-TB is suitable in replacing critical path circuits and the remaining circuit can be designed with domino logic or some other low power logic. [4]. V. Navarro-Botello, J. A. Montiel-Nelson, and S. Nooshabadi, Analysis of high-performance fast feed through logic families in CMOS, IEEE Trans. Circuits Syst. II, xp. Briefs, vol. 54, no. 6, pp. 489 493, Jun. 2007. [5]. Kiat-Seng Yeo and Kaushik Roy Low-Voltage, Low- Power VLSI Subsystems, Tata McGraw-Hill Edition 2009. [6]. N. Weste and D. Harris, CMOS VLSI Design: A Circuits and Systems Perspective, 4th ed. Reading, MA: Addison Wesley, Mar. 2010. [7]. M. Aguirre-Hernandez and M. Linares-Aranda, CMOS full-adders for energy-efficient arithmetic applications, IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 19, no. 99, pp. 1 5, Apr. 2010. [8]. Pierce Chuang, David Li and Manoj Sachdev Constant Delay Logic Style IEEE Transactions On VLSI Systems, Vol. 21, No. 3, pp. 554-565, March 2013. [9]. Ali Peiravi, Mohammad Asyaei, Current- Comparison-Based Domino: New Low-Leakage High- Speed Domino Circuit for Wide Fan-In Gates, IEEE Transactions VLSI Systems, Vol. 21, No. 5,pp. 934-943, May 2013. BIOGRAPHIES Yerninti Anil Kumar received B.tech degree in Electronics and Communication Engineering from LENDI College of Engg. and Tech. Pursuing M.tech (VLSI) in MVGR college of Engineering. Dr. Moturi Satyanarayana received B.tech in Electronics and Communication Engineering from Nagarjuna University. M.tech in Radar and Microwave Engineering from Andhra University, and PhD in Antenna Arrays from Andhra University. Member of IETE, SEMCE, ISTE and ISOI. REFERNCES [1]. N. Goncalves and H. De Man, NORA: A racefree dynamic CMOS technique for pipelined logic structures, IEEE J. Solid-State Circuits, vol. 18, no. 3, pp. 261 266, Jun. 1983. [2]. C. Lee and E. Szeto, Zipper CMOS, IEEE Circuits Syst. Mag., vol. 2,no. 3, pp. 10 16, May 1986. [3]. Sung-mo (Steve) Kang and Yusuf Leblebici. CMOS Digital Integrated Circuits Analysis And Design, 3 rd edition, WCB McGraw Hill, 2003. Volume: 09 Issue: 08 August-2015, Available @ http://www.ijret.org 442