A Novel High-Speed, Higher-Order 128 bit Adders for Digital Signal Processing Applications Using Advanced EDA Tools

Similar documents
Design and Implementation of High Speed Carry Select Adder

Design and Implementation of High Speed Carry Select Adder Korrapatti Mohammed Ghouse 1 K.Bala. 2

Implementation of 256-bit High Speed and Area Efficient Carry Select Adder

AN EFFICIENT APPROACH TO MINIMIZE POWER AND AREA IN CARRY SELECT ADDER USING BINARY TO EXCESS ONE CONVERTER

SQRT CSLA with Less Delay and Reduced Area Using FPGA

An Efficent Real Time Analysis of Carry Select Adder

2 Assoc Prof, Dept of ECE, George Institute of Engineering & Technology, Markapur, AP, India,

DESIGN AND IMPLEMENTATION OF AREA EFFICIENT, LOW-POWER AND HIGH SPEED 128-BIT REGULAR SQUARE ROOT CARRY SELECT ADDER

Implementation of 32-Bit Carry Select Adder using Brent-Kung Adder

International Journal of Scientific & Engineering Research, Volume 7, Issue 3, March-2016 ISSN

FPGA Implementation of Area Efficient and Delay Optimized 32-Bit SQRT CSLA with First Addition Logic

Design and Implementation of Carry Select Adder Using Binary to Excess-One Converter

Design of 32-bit Carry Select Adder with Reduced Area

128 BIT MODIFIED SQUARE ROOT CARRY SELECT ADDER

An Efficient Carry Select Adder with Reduced Area and Low Power Consumption

A VLSI Implementation of Fast Addition Using an Efficient CSLAs Architecture

Efficient Carry Select Adder Using VLSI Techniques With Advantages of Area, Delay And Power

DESIGN AND IMPLEMENTATION OF 64- BIT CARRY SELECT ADDER IN FPGA

Index Terms: Low Power, CSLA, Area Efficient, BEC.

International Journal of Modern Trends in Engineering and Research

A VLSI Implementation of Fast Addition Using an Efficient CSLAs Architecture

International Journal of Advance Engineering and Research Development

PUBLICATIONS OF PROBLEMS & APPLICATION IN ENGINEERING RESEARCH - PAPER CSEA2012 ISSN: ; e-issn:

Implementation of 64 Bit KoggeStone Carry Select Adder with BEC for Efficient Area

FPGA Implementation of Area-Delay and Power Efficient Carry Select Adder

Low Power and Area EfficientALU Design

Area Power and Delay Efficient Carry Select Adder (CSLA) Using Bit Excess Technique

An Efficient Higher Order And High Speed Kogge-Stone Based CSLA Using Common Boolean Logic

Design of High Speed Hybrid Sqrt Carry Select Adder

A Novel Designing Approach for Low Power Carry Select Adder M. Vidhya 1, R. Muthammal 2 1 PG Student, 2 Associate Professor,

Design and Implementation of Efficient Carry Select Adder using Novel Logic Algorithm

High Speed Non Linear Carry Select Adder Used In Wallace Tree Multiplier and In Radix-4 Booth Recorded Multiplier

II. LITERATURE REVIEW

Available online at ScienceDirect. Procedia Computer Science 89 (2016 )

IMPLEMENTATION OF AREA EFFICIENT AND LOW POWER CARRY SELECT ADDER USING BEC-1 CONVERTER

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

An Efficient SQRT Architecture of Carry Select Adder Design by HA and Common Boolean Logic PinnikaVenkateswarlu 1, Ragutla Kalpana 2

A Highly Efficient Carry Select Adder

JDT EFFECTIVE METHOD FOR IMPLEMENTATION OF WALLACE TREE MULTIPLIER USING FAST ADDERS

Design and Analysis of Improved Sparse Channel Adder with Optimization of Energy Delay

NOVEL HIGH SPEED IMPLEMENTATION OF 32 BIT MULTIPLIER USING CSLA and CLAA

LOW POWER AND AREA- EFFICIENT HALF ADDER BASED CARRY SELECT ADDER DESIGN USING COMMON BOOLEAN LOGIC FOR PROCESSING ELEMENT

Implementation of 32-Bit Unsigned Multiplier Using CLAA and CSLA

A Novel Approach For Designing A Low Power Parallel Prefix Adders

An Optimized Implementation of CSLA and CLLA for 32-bit Unsigned Multiplier Using Verilog

An Efficient Implementation of Downsampler and Upsampler Application to Multirate Filters

Design and Implementation of High Speed Area Efficient Carry Select Adder Using Spanning Tree Adder Technique

Implementation of Cmos Adder for Area & Energy Efficient Arithmetic Applications

An Design of Radix-4 Modified Booth Encoded Multiplier and Optimised Carry Select Adder Design for Efficient Area and Delay

IJCAES. ISSN: Volume III, Special Issue, August 2013 I. INTRODUCTION

Improved Performance and Simplistic Design of CSLA with Optimised Blocks

IMPLEMENTATION OF UNSIGNED MULTIPLIER USING MODIFIED CSLA

LOW POWER HIGH SPEED MODIFIED SQRT CSLA DESIGN USING D-LATCH & BK ADDER

Area and Delay Efficient Carry Select Adder using Carry Prediction Approach

Efficient Implementation on Carry Select Adder Using Sum and Carry Generation Unit

Design and Implementation of 128-bit SQRT-CSLA using Area-delaypower efficient CSLA

LowPowerConditionalSumAdderusingModifiedRippleCarryAdder

DESIGN OF CARRY SELECT ADDER WITH REDUCED AREA AND POWER

A NOVEL IMPLEMENTATION OF HIGH SPEED MULTIPLIER USING BRENT KUNG CARRY SELECT ADDER K. Golda Hepzibha 1 and Subha 2

Design Of 64-Bit Parallel Prefix VLSI Adder For High Speed Arithmetic Circuits

Design of Delay-Power Efficient Carry Select Adder using 3-T XOR Gate

AREA DELAY POWER EFFICIENT CARRY SELECT ADDER ON RECONFIGURABLE HARDWARE

I. INTRODUCTION VANAPARLA ASHOK 1, CH.LAVANYA 2. KEYWORDS Low Area, Carry, Adder, Half-sum, Half-carry.

Design of 16-bit Heterogeneous Adder Architectures Using Different Homogeneous Adders

Optimized area-delay and power efficient carry select adder

Design and Analysis of CMOS based Low Power Carry Select Full Adder

A Hierarchical Design of High Performance Carry Select Adder Using Reversible Logic

Survey of VLSI Adders

AREA AND POWER EFFICIENT CARRY SELECT ADDER USING BRENT KUNG ARCHITECTURE

Comparative Analysis of Various Adders using VHDL

VLSI IMPLEMENTATION OF AREA, DELAYANDPOWER EFFICIENT MULTISTAGE SQRT-CSLA ARCHITECTURE DESIGN

Multiplier and Accumulator Using Csla

An Efficient Low Power and High Speed carry select adder using D-Flip Flop

Analysis of Low Power, Area- Efficient and High Speed Multiplier using Fast Adder

Efficient Implementation of Parallel Prefix Adders Using Verilog HDL

National Conference on Emerging Trends in Information, Digital & Embedded Systems(NC e-tides-2016)

Area Efficient Carry Select Adder with Half-Sum and Half-Carry Method

Analysis of Parallel Prefix Adders

Reduced Area Carry Select Adder with Low Power Consumptions

AREA-EFFICIENCY AND POWER-DELAY PRODUCT MINIMIZATION IN 64-BIT CARRY SELECT ADDER Gurpreet kaur 1, Loveleen Kaur 2,Navdeep Kaur 3 1,3

Design and Implementation of Hybrid Parallel Prefix Adder

FPGA Realization of Hybrid Carry Select-cum- Section-Carry Based Carry Lookahead Adders

International Research Journal of Engineering and Technology (IRJET) e-issn:

Design and Estimation of delay, power and area for Parallel prefix adders

Design of Area-Delay-Power Efficient Carry Select Adder Using Cadence Tool

Efficient Optimization of Carry Select Adder

High Speed, Low power and Area Efficient Processor Design Using Square Root Carry Select Adder

Design and Implementation of Complex Multiplier Using Compressors

SIGNED PIPELINED MULTIPLIER USING HIGH SPEED COMPRESSORS

Implementation and Analysis of High Speed and Area Efficient Carry Select Adder

HDL Implementation of New Performance Improved CSLA Gate Level Architecture

Efficient FIR Filter Design Using Modified Carry Select Adder & Wallace Tree Multiplier

Design and Implementation of High Speed Carry Select Adder

Design of a Power Optimal Reversible FIR Filter ASIC Speech Signal Processing

PROMINENT SPEED ARITHMETIC UNIT ARCHITECTURE FOR PROFICIENT ALU

LOW POWER AND AREA EFFICIENT PARALLEL FIR DIGITAL FILTER STRUCTURE USING MODIFIED SQRT CARRY SELECT ADDER

Implementation of High Speed Multiplier with CSLA using Verilog

High Speed Binary Counters Based on Wallace Tree Multiplier in VHDL

Design and Comparative Analysis of Conventional Adders and Parallel Prefix Adders K. Madhavi 1, Kuppam N Chandrasekar 2

CHAPTER 3 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED ADDER TOPOLOGIES

Transcription:

A Novel High-Speed, Higher-Order 128 bit Adders for Digital Signal Processing Applications Using Advanced EDA Tools K.Sravya [1] M.Tech, VLSID Shri Vishnu Engineering College for Women, Bhimavaram, West Godavari District, India. sravyakopparthi@gmail.com Abstract : In digital adders, the speed of addition is limited by the time required to transmit a carry through the adder. Carry Select Adder (CSLA) is one of the fastest adders used in many dataprocessing processors to perform fast arithmetic functions. Now days we use the very High Speed IC Designs In that designs we required very High Speed Structures. Parallel prefix adder is the most flexible and widely used for binary addition. Parallel Prefix adders are best suited for VLSI implementation. Numbers of parallel prefix adder structures have been proposed over the past years intended to optimize area, fan-out, logic depth and inter connect count. This paper presents a new approach to redesign the basic operators used in parallel prefix architectures. Based on this modification 8, 16, 32, 64 and 128-bit Kogge-Stone Parallel Prefix Adders architectures have been developed and compared with the regular and Modified CSLA architecture. The modified CSLA design has reduced area and Power as compared with the regular CSLA but delay is increased. The Proposed Kogge-Stone adder has less delay when compared with regular and modified CSLA. This work estimates the performance of the proposed designs in terms of delay, area are implemented in Xilinx ISE. Keywords High Speed VLSI; CSLA, BEC, Parallel Prefix Adder; dot operators Parallel Prefix Adder; 1. INTRODUCTION Area and power reduction in data path logic systems are the main area of research in VLSI system design. High-speed addition and multiplication has always been a fundamental requirement of highperformance processors and systems. In digital adders, the speed of addition is limited by the time required to propagate a carry through the adder. The sum for each bit position in an elementary adder is generated sequentially only after the previous bit position has been summed and a carry propagated K.Murthy Raju [2] Department of Electronics & Communication Shri Vishnu Engineering College for Women, Bhimavaram, West Godavari District, India. venkateswara.106@gmail.com into the next position. The major speed limitation in any adder is in the production of carries and many authors have considered the addition problem. The CSLA is used in many computational systems to moderate the problem of carry propagation delay by independently generating multiple carries and then select a carry to generate the sum. However, the CSLA is not area efficient because it uses multiple pairs of Ripple Carry Adders (RCA) to generate partial sum and carry by considering carry input and then the final sum and carry are selected by the multiplexers (mux). To overcome above problem, n-bit binary to excess-1 code converters (BEC) is used to improve the speed of addition. This logic can be implemented with any type of adder to further improve the speed. Using Binary to Excess-1 Converter (BEC) instead of RCA in the regular CSLA to achieve lower area and power consumption. The main advantage of this BEC logic comes from the lesser number of logic gates than the Full Adder (FA) structure. In electronics, a carry-select adder is a particular way to implement an adder, which is a logic element that computes the -bit sum of two -bit numbers. The carry-select adder is simple but rather fast, having a gate level depth of. The regular carry-select adder generally consists of two ripple carry adders and a multiplexer. Adding two n-bit numbers with a carry-select adder is done with two adders (therefore two ripple carry adders) in order to perform the calculation twice, one time with the assumption of the carry being zero and the other assuming one. After the two results are calculated, the correct sum, as well as the correct carry, is then selected with the multiplexer once the correct carry is known. The number of bits in each carry select block can be uniform, or variable. In the uniform case, the optimal delay occurs for a block

size of. When variable, the block size should have a delay, from addition inputs A and B to the carry out, equal to that of the multiplexer chain leading into it, so that the carry out is calculated just in time. The delay is derived from uniform sizing, where the ideal number of full-adder elements per block is equal to the square root of the number of bits being added, since that will yield an equal number of MUX delays. 1.3 Variable-sized adder A 16-bit carry-select adder with variable size can be similarly created. Here we show an adder with block sizes of 2-2-3-4-5. This break-up is ideal when the full-adder delay is equal to the MUX delay, which is unlikely. The total delay is two full adder delays, and four mux delays. 1.1 Basic building block Figure 1: Basic carry Select adder. Above is the basic building block of a carryselect adder, where the block size is 4. Two 4-bit ripple carry adders are multiplexed together, where the resulting carry and sum bits are selected by the carry-in. Since one ripple carry adder assumes a carry-in of 0, and the other assumes a carry-in of 1, selecting which adder had the correct assumption via the actual carry-in yields the desired result. 1.2 Uniform-sized adder A 16-bit carry-select adder with a uniform block size of 4 can be created with three of these blocks and a 4- bit ripple carry adder. Since carry-in is known at the beginning of computation, a carry select block is not needed for the first four bits. The delay of this adder will be four full adder delays, plus three MUX delays. Figure 2: Regular Fixed Size CSLA Figure 3: Variable Sized CSLA. 2. Basic Structure of Regular 16-Bit CSLA A 16-bit carry select has two types of block size namely uniform block size and variable block size. A 16-bit carry select adder with a uniform block size has the delay of four full adder delays and three MUX delays. While a 16-bit carry select adder with variable block size has the delay of two full adder delays, and four mux delays. Therefore we use 16-bit carry select adder with variable block size. Ripplecarry adders are the simplest and most compact full adders, but their performance is limited by a carry that must ripple from the least-significant to the most-significant bit. A carry-select adder achieves speeds 40% to 90% faster by performing additions in parallel and reducing the maximum carry path. A carry-select adder is divided into sectors, each of which, except for the least significant performs two additions in parallel, one assuming a carry-in of zero, the other a carry-in of one within the sector, there are two 4-bit ripple- carry adders receiving the same data inputs but different Cin. The upper adder has a carry-in of zero, the lower adder a carry-in of one. The actual Cin from the preceding sector selects one of the two adders. If the carry-in is zero, the sum and carry-out of the upper adder are selected. If the carry-in is one, the sum and carry-out of the lower adder are selected. Logically, the result is not different if a single ripple-carry adder were used.

First the coding for full adder and different multiplexers of 6:3, 8:4, 10:5, and 12:6 was done. Then 2, 3, 4, 5-bit ripple carry adder was done by calling the full adder. The regular 128-bit CSLA was created by calling the ripple carry adders and all multiplexers based on circuit. from mux) depends on s3 and mux and partial c3 (input to mux) and mux, respectively. The sum2 depends on c1and mux. For the remaining group s the arrival time of mux selection input is always greater than the arrival time of data inputs from the BEC s. Thus, the delay of the remaining groups depends on the arrival time of mux selection input and the mux delay. Figure 4: Regular 16-bit SQRT CSLA 3. Basic Structure of Modified 16-Bit CSLA It is similar to regular 16-bit SQRT CSLA. Only change is that in basic blocks having two ripplecarry adders, one ripple carry adder fed with a constant 1 carry-in is replaced by BEC. The area estimation of each group is calculated. Based on the consideration of delay values, the arrival time of selection input C1 [time (T) =7] of 6:3 mux is earlier than the s3 [t =9] and c3 [t =7] and later than the s2 [t =4]. Thus, the sum3 and final c3 (output from mux) are depending on s3 and mux and partial c3 (input to mux) and mux, respectively. The sum2 depends on c1 and mux. For the remaining parts the arrival time of mux selection input is always greater than the arrival time of data inputs from the BEC s. Thus, the delay of the remaining MUX depends on the arrival time of mux selection input and the mux delay. First the coding for full adder and multiplexers of 6:3, 8:4, 10:5, and 12:6 was done. The BEC program was design by using NOT, XOR and AND gates. Then 2, 3, 4, 5-bit ripple carry adder was done by calling the full adder. The modified 16- bit CSLA was created by calling the ripple carry adders, BEC and all multiplexers based upon the circuit. Finally, modified 128-bit CSLA was created by calling the ripple carry adders, BEC and all multiplexers based on circuit. The arrival time of selection input of 6:3 mux is earlier. Thus, the sum3 and final c3 (output Figure 5: Modified 16-bit SQRT CSLA. 4. PROPOSED METHOD The binary adder is the critical element in most digital circuit designs including digital signal processors (DSP) and microprocessor data path units. As such, extensive research continues to be focused on improving the power delay performance of the adder. In VLSI implementations, parallel-prefix adders are known to have the best performance. Reconfigurable logic such as Field Programmable Gate Arrays (FPGAs) has been gaining in popularity in recent years because it offers improved performance in terms of speed and power over DSPbased and microprocessor-based solutions for many practical designs involving mobile DSP and telecommunications applications and a significant reduction in development time and cost over Application Specific Integrated Circuit (ASIC) designs. The power advantage is especially important with the growing popularity of mobile and portable electronics, which make extensive use of DSP functions. However, because of the structure of the configurable logic and routing resources in FPGAs, parallel-prefix adders will have a different performance. In particular, most modern FPGAs employ a fast-carry chain which optimizes the carry path for the simple Ripple Carry Adder (RCA).

In this paper, the practical issues involved in designing and implementing tree-based adders on FPGA s. An efficient testing strategy for evaluating the performance of these adders is discussed. 5. KOGGE-STONE ADDER The arrangement of the prefix network specifies the type of the Parallel Prefix Adder.The Kogge-Stone tree achieves both log 2 N stages and fan-out of 2 at each stage. This comes at the cost of long wires that must be routed between stages. The tree also contains more PG cells; while this may not impact the area if the adder layout is on a regular grid, it will increase power consumption. Despite these cost, Kogge-Stone adder is generally used for wide adders because it shows the lowest delay among other structures. The complete functioning of Kogge Stone Adder can be easily comprehended by analyzing it in terms of three distinct parts Computation of Generate and Propagate Signals Calculation of Carries in Parallel Generating of Sum Signals Figure 6: Steps in parallel prefix adder 1. Pre processing This step involves computation of generate and propagate signals corresponding too each pair of bits in A and B. These signals are given by the logic equations below: p i = A i xor B i g i = A i and B i 2. Carry look ahead network This block differentiates Kogge Stone Adder from other adders and is the main force behind its high performance. This step involves computation of carries corresponding to each bit. It uses group propagate and generate as intermediate signals which are given by the logic equations below: P i:j = P i:k+1 and P k:j G i:j = G i:k+1 or (P i:k+1 and G k:j ) 3. Post processing This is the final step and is common to all adders of this family (carry look ahead). It involves computation of sum bits. Sum bits are computed by the logic given below: `S i = p i xor C i-1 Figure 7: 16 bit Kogge-Stone Parallel Prefix Adder. 6. IMPLEMENTATION RESULTS The design proposed in this paper has been developed using Verilog-HDL, Simulation Done by using Model Sim 6.5 e and synthesized in Xilinx ISE 9.1i. The Regular CSLA, Modified CSLA and Kogge Stone Adders Simulation and Synthesis Wave forms show in below figures. The 128 bit Modified Carry Select adder shown figure 10 and 13.It compare to regular CSLA modified CSLA architecture is take less amount of delay it is approximately reduced by 30% of regular method design, But if we go for the Proposed Design (Kogge-Stone Adder) Delay is Less Than 50% of Modified CSLA, Less Than 65% of Regular CSLA and nearly 85% less Delay as Compare Ripple Carry Adder, The Delay Results and Graphs are shown in Table 1 and Fig 15 Figure 8: Regular 128-bit CSLA

Figure 9: Modified 128-bit CSLA Figure 12: 128 bit Modified CSLA Figure 13: 128 bit Kogge-Stone Adder Figure 10: 128 bit Kogge-Stone Adder Delay (ns) RCA CSLA Regula r CSLA Modifie d Kogge Stone 16 Bit 23.540 18.420 16.562 16.481 32 Bit 44.366 25.536 21.411 21.027 64 Bit 83.726 39.588 25.950 27.167 128 Bit 162.446 56.375 40.562 27.977 Table 1. Comparisons of Adders Figure 11: 128 bit Regular CSLA

Figure 14: Comparing Graph for All Adders 8. CONCLUSION A simple modification is maid to regular CSLA to reduce the Delay and area of CSLA architecture. The reduced number of gates of this work offers the great advantage in the reduction of area and also the total power. The compared results show that the modified CSLA has less delay and it requires less area consumption, but this CSLA not suitable for Digital Signal Processing Applications because DSP applications require Very Fast computations design. Therefore Kogge-Stone Parallel Prefix Adder is proposed, it has nearly High Speed Computation as Compared to Modified CSLA. The speed of the proposed adder is almost double the existing adders. In This Paper all the Adders RCA,CSLA Regular, CSLA Modified, Kogge-Stone with 16 Bit, 32 Bit, 64 Bit and 128 Bit Sizes are developed. With Respect of Results and Tables Finally it can be concluded that For Higher Adder Applications Proposed Adder is Best Choice because it has 500 % high Speed compare CSLA Designs. As the functional verification decides the quality of the silicon, we spend 60% of the design cycle time only for the verification/simulation. In order to avoid the delay and meet the TTM, we use the latest verification methodologies. [3] Ramkumar, B., Kittur, H.M. and Kannan,P. M.,(2010 ), ASIC implementation of modified faster carry save adder, Eur. J. Sci. Res., vol. 42, no. 1,pp.53 58. [4] Kogge P, Stone H, A parallel algorithm for the efficient solution of a general class Recurrence relations, IEEE Trans. Computers, vol.c-22, No.8, pp 786-793, Aug.1973. [5] P. M. Kogge and H. S. Stone, A parallel algorithm for the efficient solution of a general class of recurrence equations, IEEE Trans. Computers, vol. 22, no. 8, pp. 786 793, August 1973. [6] J. Liu, S. Zhou, H. Zhu, and C.-K. Cheng, An algorithmic approach for generic parallel adders, in ICCAD, November 2003, pp. 734 730. [7] R. Zimmermann, Non-heuristic optimization and synthesis of parallel-prefix adders, in International Workshop on Logic and Architecture Synthesis, December 1996, pp. 123 132. [8] T. Matsunaga and Y. Matsunaga, Timingconstrained area minimization algorithm for parallel prefix adders, IEICE Transactions on Fundamentals, vol. E90-A, no. 12, pp. 2770 2777, December 2007. [9] T. Matsunaga, S. Kimura, and Y. Matsunaga, Power-conscious syntheses of parallel prefix adders under bitwise timing constraints, in Proc. the Workshop on Synthesis And System Integration of Mixed Information technologies(sasimi), Sapporo, Japan, October 2007, pp. 7 14. [10] Kim,Y. and Kim,L.-S.,(May2001), 64-bit carry-select adder with reduced area, Electron Lett., vol. 37, no. 10, pp. 614 615. [11] E. Abu-Shama and M. Bayoumi, A New cell for low power adders, in Proc.Int. Midwest Symp. Circuits and Systems, 1995, pp. 1014 1017 [12] Verilog HDL- Digital Design and Synthesis, by Samith Palnitkar REFERENCES [1] Bedrij, O. J., (1962), Carry-select adder, IRE Trans. Electron. Comput., pp.340 344. [2] Ceiang,T. Y. and Hsiao,M. J., Carry-select adder using single ripple carry adder, Electron. Lett., vol. 34, no. 22, pp. 2101 2103