Design of 32-bit Carry Select Adder with Reduced Area

Similar documents
FPGA Implementation of Area Efficient and Delay Optimized 32-Bit SQRT CSLA with First Addition Logic

Efficient Carry Select Adder Using VLSI Techniques With Advantages of Area, Delay And Power

A VLSI Implementation of Fast Addition Using an Efficient CSLAs Architecture

A VLSI Implementation of Fast Addition Using an Efficient CSLAs Architecture

Design and Implementation of Carry Select Adder Using Binary to Excess-One Converter

DESIGN AND IMPLEMENTATION OF 64- BIT CARRY SELECT ADDER IN FPGA

Design and Implementation of High Speed Carry Select Adder

Index Terms: Low Power, CSLA, Area Efficient, BEC.

DESIGN AND IMPLEMENTATION OF AREA EFFICIENT, LOW-POWER AND HIGH SPEED 128-BIT REGULAR SQUARE ROOT CARRY SELECT ADDER

Low Power and Area EfficientALU Design

AN EFFICIENT APPROACH TO MINIMIZE POWER AND AREA IN CARRY SELECT ADDER USING BINARY TO EXCESS ONE CONVERTER

2 Assoc Prof, Dept of ECE, George Institute of Engineering & Technology, Markapur, AP, India,

International Journal of Modern Trends in Engineering and Research

Design and Implementation of High Speed Carry Select Adder Korrapatti Mohammed Ghouse 1 K.Bala. 2

LowPowerConditionalSumAdderusingModifiedRippleCarryAdder

Implementation of Cmos Adder for Area & Energy Efficient Arithmetic Applications

Implementation of 256-bit High Speed and Area Efficient Carry Select Adder

PUBLICATIONS OF PROBLEMS & APPLICATION IN ENGINEERING RESEARCH - PAPER CSEA2012 ISSN: ; e-issn:

II. LITERATURE REVIEW

Design of High Speed Hybrid Sqrt Carry Select Adder

FPGA Implementation of Area-Delay and Power Efficient Carry Select Adder

128 BIT MODIFIED SQUARE ROOT CARRY SELECT ADDER

Design of 16-bit Heterogeneous Adder Architectures Using Different Homogeneous Adders

SQRT CSLA with Less Delay and Reduced Area Using FPGA

International Journal of Scientific & Engineering Research, Volume 7, Issue 3, March-2016 ISSN

NOVEL HIGH SPEED IMPLEMENTATION OF 32 BIT MULTIPLIER USING CSLA and CLAA

Design and Analysis of CMOS based Low Power Carry Select Full Adder

An Efficent Real Time Analysis of Carry Select Adder

DESIGN OF CARRY SELECT ADDER WITH REDUCED AREA AND POWER

An Efficient Implementation of Downsampler and Upsampler Application to Multirate Filters

A Novel Designing Approach for Low Power Carry Select Adder M. Vidhya 1, R. Muthammal 2 1 PG Student, 2 Associate Professor,

International Journal of Advance Engineering and Research Development

Analysis of Low Power, Area- Efficient and High Speed Multiplier using Fast Adder

Optimized area-delay and power efficient carry select adder

A Hierarchical Design of High Performance Carry Select Adder Using Reversible Logic

Efficient Implementation on Carry Select Adder Using Sum and Carry Generation Unit

A Novel High-Speed, Higher-Order 128 bit Adders for Digital Signal Processing Applications Using Advanced EDA Tools

VLSI IMPLEMENTATION OF AREA, DELAYANDPOWER EFFICIENT MULTISTAGE SQRT-CSLA ARCHITECTURE DESIGN

A Highly Efficient Carry Select Adder

Implementation of 32-Bit Unsigned Multiplier Using CLAA and CSLA

LOW POWER AND AREA- EFFICIENT HALF ADDER BASED CARRY SELECT ADDER DESIGN USING COMMON BOOLEAN LOGIC FOR PROCESSING ELEMENT

Design and Implementation of Efficient Carry Select Adder using Novel Logic Algorithm

Area Power and Delay Efficient Carry Select Adder (CSLA) Using Bit Excess Technique

An Efficient SQRT Architecture of Carry Select Adder Design by HA and Common Boolean Logic PinnikaVenkateswarlu 1, Ragutla Kalpana 2

IMPLEMENTATION OF UNSIGNED MULTIPLIER USING MODIFIED CSLA

IJCAES. ISSN: Volume III, Special Issue, August 2013 I. INTRODUCTION

An Efficient Low Power and High Speed carry select adder using D-Flip Flop

An Design of Radix-4 Modified Booth Encoded Multiplier and Optimised Carry Select Adder Design for Efficient Area and Delay

An Efficient Carry Select Adder with Reduced Area and Low Power Consumption

Design and Analysis of Improved Sparse Channel Adder with Optimization of Energy Delay

Efficient Optimization of Carry Select Adder

Design of Delay-Power Efficient Carry Select Adder using 3-T XOR Gate

Available online at ScienceDirect. Procedia Computer Science 89 (2016 )

Implementation of 32-Bit Carry Select Adder using Brent-Kung Adder

AREA DELAY POWER EFFICIENT CARRY SELECT ADDER ON RECONFIGURABLE HARDWARE

Design of Area-Delay-Power Efficient Carry Select Adder Using Cadence Tool

National Conference on Emerging Trends in Information, Digital & Embedded Systems(NC e-tides-2016)

Improved Performance and Simplistic Design of CSLA with Optimised Blocks

I. INTRODUCTION VANAPARLA ASHOK 1, CH.LAVANYA 2. KEYWORDS Low Area, Carry, Adder, Half-sum, Half-carry.

IMPLEMENTATION OF AREA EFFICIENT AND LOW POWER CARRY SELECT ADDER USING BEC-1 CONVERTER

Area Efficient Carry Select Adder with Half-Sum and Half-Carry Method

An Efficient Higher Order And High Speed Kogge-Stone Based CSLA Using Common Boolean Logic

Area and Delay Efficient Carry Select Adder using Carry Prediction Approach

An Efficient Carry Select Adder A Review

LOW POWER AND AREA EFFICIENT PARALLEL FIR DIGITAL FILTER STRUCTURE USING MODIFIED SQRT CARRY SELECT ADDER

Design and Implementation of 128-bit SQRT-CSLA using Area-delaypower efficient CSLA

LOW POWER HIGH SPEED MODIFIED SQRT CSLA DESIGN USING D-LATCH & BK ADDER

High Speed Non Linear Carry Select Adder Used In Wallace Tree Multiplier and In Radix-4 Booth Recorded Multiplier

AREA AND POWER EFFICIENT CARRY SELECT ADDER USING BRENT KUNG ARCHITECTURE

Design of high speed hybrid carry select adder

Design and Implementation of High Speed Area Efficient Carry Select Adder Using Spanning Tree Adder Technique

International Research Journal of Engineering and Technology (IRJET) e-issn:

A MODIFIED STRUCTURE OF CARRY SELECT ADDER USING CNTFET TECHNOLOGY Karunakaran.P* 1, Dr.Sundarajan.M 2

Comparative Analysis of Various Adders using VHDL

HDL Implementation of New Performance Improved CSLA Gate Level Architecture

An Optimized Implementation of CSLA and CLLA for 32-bit Unsigned Multiplier Using Verilog

CHAPTER 3 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED ADDER TOPOLOGIES

AN EFFICIENT CARRY SELECT ADDER WITH LESS DELAY AND REDUCED AREA USING FPGA QUARTUS II VERILOG DESIGN

High Speed, Low power and Area Efficient Processor Design Using Square Root Carry Select Adder

Implementation of High Speed Multiplier with CSLA using Verilog

Efficient FIR Filter Design Using Modified Carry Select Adder & Wallace Tree Multiplier

Australian Journal of Basic and Applied Sciences. Optimized Embedded Adders for Digital Signal Processing Applications

An Optimized Design of High-Speed and Energy- Efficient Carry Skip Adder with Variable Latency Extension

Design and Implementation of Complex Multiplier Using Compressors

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

FPGA Realization of Hybrid Carry Select-cum- Section-Carry Based Carry Lookahead Adders

A HIGH SPEED DYNAMIC RIPPLE CARRY ADDER

International Journal of Engineering Research-Online A Peer Reviewed International Journal Articles available online

Implementation and Analysis of High Speed and Area Efficient Carry Select Adder

Comparison among Different Adders

Novel Architecture of High Speed Parallel MAC using Carry Select Adder

Low Power 32-bit Improved Carry Select Adder based on MTCMOS Technique

High Speed Binary Counters Based on Wallace Tree Multiplier in VHDL

Design and Analysis of BEC and GDI Technique Using Carry Select Adder

Design and Implementation of High Speed Carry Select Adder

Reduced Area Carry Select Adder with Low Power Consumptions

DESIGN OF HIGH SPEED AND ENERGY EFFICIENT CARRY SKIP ADDER

Design of an optimized multiplier based on approximation logic

A Novel Approach For Designing A Low Power Parallel Prefix Adders

Design of Fastest Multiplier Using Area Delay Power Efficient Carry-Select Adder

Transcription:

Design of 32-bit Carry Select Adder with Reduced Area Yamini Devi Ykuntam M.V.Nageswara Rao G.R.Locharla ABSTRACT Addition is the heart of arithmetic unit and the arithmetic unit is often the work horse of a computational. So adders play a key role in designing an arithmetic unit and also many digital integrated s. Carry Select Adder () is one of the fastest adders used in many data processors and in digital s to perform arithmetic operations. But is area-consuming because it consists of dual ripple carry adder (RCA) in the structure. To reduce the area of, a with Binary to Excess-1 Converter is already designed which reduces the area of adder. But there are other techniques to design a to reduce its area. One of such technique is using an add one technique. This paper proposes the design of square root (SQRT ) using add one with significant reduction in area. The proposed design is synthesized using Leonardo Spectrum to get area (number of gates) and delay (ns). The performance in terms of area and delay are evaluated for square root using add one and are compared with existing SQRT and SQRT using Binary to Excess-1 Converter (BEC). The results analysis shows that the proposed SQRT using add one is better than the existing SQRT and SQRT using BEC. Keywords SQRT, ASIC, BEC, add one, ISIM, Leonardo Spectrum 1. INTRODUCTION VLSI IC s are those s which contain more than 10 5 transistors and these s can be used as general purpose IC s such as microprocessors, memories, DSPs and also as Application Specific IC s (ASICs). In VLSI technology, the main design entity is area which measures the cost and power consumption of an IC. Reduced area and high speed data path logic systems are the main areas of research in VLSI system [1]. High-speed addition and multiplication has always been a fundamental requirement of high-performance processors and systems. The speed of addition and multiplication operations depends on the speed of the adder on which the operations are performed. There are many types of digital adders are available for designing a digital s and arithmetic units in a processor. The performance of digital adders is limited by the speed of addition. The reason behind this limitation is the time taken to propagate the carry. The sum for each bit in an elementary adder is generated sequentially only after the previous bit position has been summed and a carry propagated into the next position [2]. There are many ways to design an adder. The Ripple Carry Adder (RCA) structure is chain of Full adders which is easy to design, but takes longer time to perform addition operation due to the delay in propagation of carry from one adder to another. The delay due to carry propagation in RCA is proportional to the number of input bits (N) to RCA. For large values of N, the delay of the RCA also increases. To overcome delay problem, a new adder structure is designed called Carry Look-Ahead Adder (CLA). CLA is designed using two s namely propagate and generate. As the number of input bits increases, the size of propagate and generate s also increases which causes increase in area and also introduces delay again. So CLA avoids the delay problem for less number of input bits, but not suitable for large size input. The provides a compromise between RCA and CLA. The is used in many digital systems to improve the carry propagation delay by independently performing addition operation using two RCAs by assuming C in for one RCA as 0 and for other as 1 [3], [4]. And the final sum is chosen in between the two results of RCA with the help of a multiplexer (mux) to which the control signal is carry out of previous adder stage. Due to the use of two RCAs, the area of increases. To reduce the area, the RCA with C in =1 are replaced.the SQRT has been chosen for comparison with the proposed design as it has a more balanced delay, and requires lower power and area [5-6]. The 8-bit, 16-bit, 32-bit SQRT using add one is proposed to minimize the area and power. This paper in brief is structured as follows-section 2 describes about conventional SQRT and its delay, area evaluations. Section 3 describes about existing SQRT and its area, delay evaluation. Section 4 describes about proposed SQRT and its area, delay evaluation. Section 5 explains the synthesis results of the proposed adder structure and its comparison with the other two adder structure. Finally this paper ends with conclusion. 2. SQUARE ROOT CARRY SELECT ADDER 2.1 Structure of SQRT The is used in many digital systems design to overcome the problem of carry propagation delay by independently performing addition operation by considering carry inputs (C in ) as 1 and 0. Figure 1. shows a 32-bit SQRT. The SQRT is divided into m= 2m carry select stages (CSS), where m is number of input bits. The 32 bit SQRT consists of 7 CSS. The CSS consists of two ripple carry adders one with carry in 0 and other with carry in 1. It also consists of a multiplexer which is used to select the sum and carry values from the two RCAs by using the control signal to it. The control signal to multiplexer is nothing but the carry out of the previous CSS. If the control signal is 1 then sum and carry out of RCA with C in =1 is selected by the multiplexer and if control signal is 0 then sum and carry out of RCA with C in =0 is selected by the multiplexer. 47

with Cin=0 6:4 RCA with with Cin=1 with Cin = 1 with Cin = 1 6:4 RCA with Cin = 1 Cin = 1 6 6 0 C 6 C 3 C OUT Sum 31:24 sum 16:11 sum 10:7 sum 6:4 sum 3:2 sum 1:0 Fig 1. 32-bit SQRT 2.2 Delay and area evaluation of basic modules in SQRT Basic modules in are Ripple Carry Adder (RCA) and multiplexer (mux). RCA is composed of full adder (FA) and half adder (HA). The full adder consists of two AND gates, one OR gate and two OR gates. In turn the OR gate is formed by two inverters, two AND gates and one OR gate, totally OR gate is composed of 5 gates. So FA consists of 13 gates. The half adder consists of one AND gate and one OR gate, so HA consists of totally 6 gates. The mux used is 2:1 mux is used. It consists of one inverter, two AND gates and one OR gate. Therefore 2:1 mux consists of 4 gates. While evaluating the delay and area, it is considered that all gates to be made up of AND, OR, and Inverter (AOI).The delay of each gate equal to 1 unit and also area of each equal to 1 unit. The maximum delay can be calculated by adding up the number of gates in the longest path. The area evaluation is done by counting the total number of AOI gates present in each logic. Table 1. Delay and area count of basic modules of Basic modules Delay Area OR 3 5 2:1 Mux 3 4 Full adder 6 13 Half adder 3 6 2.3 Delay and area evaluation of SQRT As shown in fig.1, group2 consists of a 2-bit RCA with C in =0, a 2-bit RCA with C in =1 and also a 6:3 Mux. The 2-bit RCA with C in =0 consists of one FA and one HA where as 2-bit RCA with C in =1 consists of two FAs. Based on the area count of table-1, the total number of gates present in group2 is: Gate count = 57 (FA + HA + ) FA = 39(3*13) HA = 6(1*6) Mux = 12(3*4) delay. In this adder, dual RCAs are used which occupies more area which in turn increases the power consumption. So a new adder is proposed in [7] is SQRT with Binary to Excess-1 Converter (BEC). 3. EISTING STRUCTURE 3.1 SQRT To reduce the area and power consumption of the regular, RCA with C in =1 is replaced (Binary to Excess-1 Converter) [7] as shown in fig 2. An n-bit RCA, can be replaced with a n+1-bit BEC. Figure 3. explains the basic function of the by using the 4-bit BEC together with the mux. A set of four bits (4-bit input) and the other set of 4- bits (4-bit BEC output) were given as input to the 8:4 multiplexer. Depending on the control signal C in, either the BEC output or the 4-bit input. The advantage of the BEC logic in SQRT is that, as the number of input bits is increased the requirement of area is progressively decreased. Figure 2. shows the structure of a 32-bit SQRT with BEC. 3.2. Delay and area evaluation of SQRT As shown in fig. 2, group2 consists of a 2 bit RCA with C in =0, a 3-bit BEC and a 6:3 Mux. The 2-bit RCA with C in =0 consists of a FA and HA. Based on the area count of table-1, the total number of gates present in group2 is: 48

6:4 RCA 9-bit BEC 7-bit BEC 5-bit BEC 4-bit BEC 3-bit BEC 6 0 C6 C3 C3 Cout sum31:24 sum16:11 sum10:7 sum6:4 sum3:2 sum1:0 Fig 2. 32-bit SQRT 4-bit BEC 4-Bit Input 8:4 4-Bit output Fig 3. 4-bit BEC with 8:4 Gate count =43(FA+HA++BEC) FA = 13(1*13) HA = 6(1*6) AND=1 NOT=1 OR=10(2*5) Mux = 12(3*4) delay. But delay is increased in this architecture. To reduce this delay penalty, a new architecture of SQRT using add one is proposed. Cin 4. PROPOSED STRUCTURE 4.1. SQRT This adder uses add one instead of RCA with C in =l. The main principle used in this adder is, if the results of RCA with C in = 0 is known, the result of RCA with Cin=l can be found by adding one to the result for Cin=0. Thus, an add one can replace the ripple-carry adder for Cin=l in a. With an efficient design of an add one, the area of SQRT can be further reduced when compared with SQRT. Complement scheme is used fordesigning add one. Complement scheme states that, adding one is just inverting each S 0 bit starting from the least significant bit until the first zero is found. The 32-bit SQRT is designed using an add one instead of a RCA with Cin=1 as shown in fig. 4.This architecture also consists of seven carry select stages (CSS). First CSS consists of only adders and remaining stages consist of adders, add one, first zero and multiplexers. The adders used in this architecture are mirror adders and eliminates inverters in the carry out path so delay in carry path is avoided [8], [9]. A multiplexer based add one is proposed. A multiplexer is needed for each bit to choose in between sum and complement of sum. The control signal of the multiplexer is from the first zero. The first zero is NMOS and PMOS chains. This generates 0 at the kth node if no zero is founded until kth bit from the least significant bit; otherwise, it generates 1. If the control signal is 0, the multiplexer chooses sum otherwise, it chooses the inverted sum. The least significant bit does not need a multiplexer since S 1 0 is always the opposite of S 0 0. This saves a few transistors for each 49

6:4 RCA 3:2 RCA 6 0 C6 C3 C out Sum 31:24 sum 15:11 sum 10:7 sum 6:4 sum 3:2 sum 1:0 Fig 4. 32-bit SQRT A B S1 A B 4.2. Delay and area evaluation of SQRT As shown in fig. 4, in group2 has2-bit RCA which consists of two FAs, first zero, inverters, multiplexers and a two input. The total number of gates present in group2 is: Gate count =37(FA+ Mux+ two input + inverters+ first zero ) O S2 Fig 5. Replacing two multiplexers by one multiplexer with a NAND. The carry out for a can be chosen between the carry out for the RCA or the carry out for the add-one. Finally, the multiplexers is placed in the bottom to choose between the results for Cin=0 and the results for Cin=l. A twoinput two select is used to replace the two multiplexers with one multiplexer and a NAND can replace the two multiplexers as shown in fig. 5. Since the sum of the most significant bit for C in =0 is used to get the carryout of a, the proposed CSA delay is longer than the CSA using dual RCAs. Therefore, by replacing the last FA by two-level ORs to get the sum faster, the delay time can be reduced. O S 1 S2 FA=18(9*2) Mux=8(4*2) = mux +NAND gate=4+1=5 Inverters=2 =4 delay. Table 2. Comparison of area of three adder structures (theoretical calculated results) Group Basic SQRT area SQRT area SQRT area Group 1 26 26 19 Group 2 57 43 35 Group 3 87 66 54 Group 4 117 89 71 Group 5 147 112 86 total 434 336 265 50

By comparing the area of proposed SQRT with the basic and SQRT in table 2, it is clear that the proposed adder structure has 169 less gates when compared with basic SQRT structure and 71 less gates when compared with SQRT. For further evaluation of the performance in terms of area and delay, the three adder structures must be designed using VHDL, than simulated and synthesized for ASIC implementation. 5. SILATED RESULTS AND COMPARISON The adders design proposed in this paper has been developed using VHDL and all the simulations are carried outusing ISim simulator. The designs are synthesized in Leonardo Spectrum to get the area (number of gates) and delay (ns). The area and delay required for the traditional and SQRT logic are calculated and compared with the proposed SQRT. The area and delay of three synthesized adders in Leonardo Spectrum for word size length 8bit, 16bit and 32bit is illustrated in Table-3. Table 3. Comparison of area and delay in between the three adders Word size Adder Area (number Delay(ns) (bits) of gates) 8 Basic SQRT 321 2.85 SQRT 286 3.44 SQRT 266 2.94 16 Basic SQRT 796 4.63 SQRT 622 5.07 SQRT 589 4.85 32 Basic SQRT 1506 6.56 SQRT 1293 7.36 SQRT 1245 6.81 6. CONCLUSION The area and delay of 8-bit, 16-bit, 32-bit and 64-bitbasic SQRT, SQRT logic are evaluated and compared with the proposed SQRT logic. It is clear from table-3 that, the proposed adder takes less delay and area when compared with SQRT logic. It is also observed that in the proposed adder the reduction in area is very high with insignificant penalty in the delay when compared with traditional SQRT. As the input length is progressed, the area is decreased in the same proportion, but in the same proportion the delay penalty is not increased. Since the area in the proposed adder is very less, it is obvious that, the power consumption is also very less. Therefore this adder can be preferred for low power applications. 7. REFERENCES [1] O. J. Bedrij, Carry-select adder, IRE Trans. Electron. Comput.,pp.340 344, 1962. [2] Y. Kim and L.-S. Kim, 64-bit carry-select adder with reduced area, Electron. Lett., vol. 37, no. 10, pp. 614 615, May 2001. [3] B. Ramkumar, H.M. Kittur, and P. M. Kannan, ASIC implementation of modified faster carry save adder, Eur. J. Sci. Res., vol. 42, no. 1, pp. 53 58, 2010. [4] T. Y. Ceiang and M. J. Hsiao, Carry-select adder using single ripple carry adder, Electron. Lett., vol. 34, no. 22, pp. 2101 2103, Oct. 1998. [5] J. M. Rabaey, Digtal Integrated Circuits A Design Perspective. Upper Saddle River, NJ: Prentice-Hall, 2001. [6] Y. He, C. H. Chang, and J. Gu, An area efficient 64-bit square root carry-select adder for low power applications, in Proc. IEEE Int. Symp. Circuits Syst., 2005, vol. 4, pp. 4082 4085. [7] B. Ramkumar and Harish M Kittur, Low-Power and Area-Efficient Carry Select Adder IEEE transactions on very large scale integration (VLSI) systems, vol. 20, no. 2, February 2012] [8] N. Weste and K. Eshragian, Principles of CMOS VLSI Designs: A System Perspective, 2nd ed., Addison- Wesley, 1985-1993. [9] Morinaka, H., Makino, H., Nakase, Y. et. al, "A 64 bit Carry Look-ahead CMOS adder using Modified Carry Select". Cz/stoin Integrated Circuit Conference, 1995, pages 585-588 51