A NOVEL IMPLEMENTATION OF HIGH SPEED MULTIPLIER USING BRENT KUNG CARRY SELECT ADDER K. Golda Hepzibha 1 and Subha 2

Similar documents
Implementation of 32-Bit Carry Select Adder using Brent-Kung Adder

AREA AND POWER EFFICIENT CARRY SELECT ADDER USING BRENT KUNG ARCHITECTURE

An Efficient Higher Order And High Speed Kogge-Stone Based CSLA Using Common Boolean Logic

LOW POWER HIGH SPEED MODIFIED SQRT CSLA DESIGN USING D-LATCH & BK ADDER

Design and Implementation of a delay and area efficient 32x32bit Vedic Multiplier using Brent Kung Adder

An Optimized Implementation of CSLA and CLLA for 32-bit Unsigned Multiplier Using Verilog

Design and Implementation of Wallace Tree Multiplier Using Kogge Stone Adder and Brent Kung Adder

Design and Implementation of High Speed Area Efficient Carry Select Adder Using Spanning Tree Adder Technique

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

A Novel High-Speed, Higher-Order 128 bit Adders for Digital Signal Processing Applications Using Advanced EDA Tools

High Speed, Low power and Area Efficient Processor Design Using Square Root Carry Select Adder

Volume 1, Issue V, June 2013

National Conference on Emerging Trends in Information, Digital & Embedded Systems(NC e-tides-2016)

Design and Implementation of High Speed Carry Select Adder Korrapatti Mohammed Ghouse 1 K.Bala. 2

Efficient FIR Filter Design Using Modified Carry Select Adder & Wallace Tree Multiplier

High Speed Vedic Multiplier Designs Using Novel Carry Select Adder

JDT EFFECTIVE METHOD FOR IMPLEMENTATION OF WALLACE TREE MULTIPLIER USING FAST ADDERS

VLSI IMPLEMENTATION OF AREA, DELAYANDPOWER EFFICIENT MULTISTAGE SQRT-CSLA ARCHITECTURE DESIGN

A Novel Approach For Designing A Low Power Parallel Prefix Adders

A Highly Efficient Carry Select Adder

High Performance Vedic Multiplier Using Han- Carlson Adder

PROMINENT SPEED ARITHMETIC UNIT ARCHITECTURE FOR PROFICIENT ALU

Implementation of 32-Bit Unsigned Multiplier Using CLAA and CSLA

ISSN Vol.02, Issue.11, December-2014, Pages:

Review Paper on an Efficient Processing by Linear Convolution using Vedic Mathematics

Design Of 64-Bit Parallel Prefix VLSI Adder For High Speed Arithmetic Circuits

FPGA Implementation of Wallace Tree Multiplier using CSLA / CLA

Design and Implementation of High Speed Carry Select Adder

Area Power and Delay Efficient Carry Select Adder (CSLA) Using Bit Excess Technique

Design and Analysis of Improved Sparse Channel Adder with Optimization of Energy Delay

Implementation of 256-bit High Speed and Area Efficient Carry Select Adder

Design and Implementation of High Radix Booth Multiplier using Koggestone Adder and Carry Select Adder

Research Journal of Pharmaceutical, Biological and Chemical Sciences

AN EFFICIENT APPROACH TO MINIMIZE POWER AND AREA IN CARRY SELECT ADDER USING BINARY TO EXCESS ONE CONVERTER

2 Assoc Prof, Dept of ECE, George Institute of Engineering & Technology, Markapur, AP, India,

Design and Comparative Analysis of Conventional Adders and Parallel Prefix Adders K. Madhavi 1, Kuppam N Chandrasekar 2

DESIGN AND IMPLEMENTATION OF AREA EFFICIENT, LOW-POWER AND HIGH SPEED 128-BIT REGULAR SQUARE ROOT CARRY SELECT ADDER

Adder (electronics) - Wikipedia, the free encyclopedia

FPGA Implementation of Area-Delay and Power Efficient Carry Select Adder

Reduced Area Carry Select Adder with Low Power Consumptions

International Journal of Modern Trends in Engineering and Research

PUBLICATIONS OF PROBLEMS & APPLICATION IN ENGINEERING RESEARCH - PAPER CSEA2012 ISSN: ; e-issn:

Comparative Analysis of Various Adders using VHDL

IMPLEMENTATION OF UNSIGNED MULTIPLIER USING MODIFIED CSLA

AN NOVEL VLSI ARCHITECTURE FOR URDHVA TIRYAKBHYAM VEDIC MULTIPLIER USING EFFICIENT CARRY SELECT ADDER

An Efficient Low Power and High Speed carry select adder using D-Flip Flop

Design and Implementation of Hybrid Parallel Prefix Adder

Modelling Of Adders Using CMOS GDI For Vedic Multipliers

Design of High Speed Hybrid Sqrt Carry Select Adder

An Efficient SQRT Architecture of Carry Select Adder Design by HA and Common Boolean Logic PinnikaVenkateswarlu 1, Ragutla Kalpana 2

Design of Efficient 32-Bit Parallel PrefixBrentKung Adder

Analysis Parameter of Discrete Hartley Transform using Kogge-stone Adder

CLAA, CSLA and PPA based Shift and Add Multiplier for General Purpose Processor

128 BIT MODIFIED SQUARE ROOT CARRY SELECT ADDER

NOVEL HIGH SPEED IMPLEMENTATION OF 32 BIT MULTIPLIER USING CSLA and CLAA

DESIGN OF LOW POWER MULTIPLIERS

CHAPTER 3 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED ADDER TOPOLOGIES

AREA-EFFICIENCY AND POWER-DELAY PRODUCT MINIMIZATION IN 64-BIT CARRY SELECT ADDER Gurpreet kaur 1, Loveleen Kaur 2,Navdeep Kaur 3 1,3

Efficient Implementation of Parallel Prefix Adders Using Verilog HDL

Design and Implementation of Efficient Carry Select Adder using Novel Logic Algorithm

High Speed Non Linear Carry Select Adder Used In Wallace Tree Multiplier and In Radix-4 Booth Recorded Multiplier

Area and Delay Efficient Carry Select Adder using Carry Prediction Approach

[Krishna, 2(9): September, 2013] ISSN: Impact Factor: INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

Design and Implementation of Carry Select Adder Using Binary to Excess-One Converter

Performance Boosting Components of Vedic DSP Processor

A Novel Designing Approach for Low Power Carry Select Adder M. Vidhya 1, R. Muthammal 2 1 PG Student, 2 Associate Professor,

DESIGN AND IMPLEMENTATION OF 64- BIT CARRY SELECT ADDER IN FPGA

International Journal of Scientific & Engineering Research, Volume 7, Issue 3, March-2016 ISSN

Improved Performance and Simplistic Design of CSLA with Optimised Blocks

FPGA Implementation of Area Efficient and Delay Optimized 32-Bit SQRT CSLA with First Addition Logic

SQRT CSLA with Less Delay and Reduced Area Using FPGA

Implementation and Performance Evaluation of Prefix Adders uing FPGAs

I. INTRODUCTION II. RELATED WORK. Page 171

FPGA IMPLEMENTATION OF 32-BIT WAVE-PIPELINED SPARSE- TREE ADDER

DESIGN OF HIGH SPEED 32 BIT UNSIGNED MULTIPLIER USING CLAA AND CSLA

Index terms: Gate Diffusion Input (GDI), Complementary Metal Oxide Semiconductor (CMOS), Digital Signal Processing (DSP).

Design, Implementation and performance analysis of 8-bit Vedic Multiplier

A Compact Design of 8X8 Bit Vedic Multiplier Using Reversible Logic Based Compressor

Pipelined Linear Convolution Based On Hierarchical Overlay UT Multiplier

LOW POWER AND AREA- EFFICIENT HALF ADDER BASED CARRY SELECT ADDER DESIGN USING COMMON BOOLEAN LOGIC FOR PROCESSING ELEMENT

Design of Fastest Multiplier Using Area Delay Power Efficient Carry-Select Adder

AREA AND DELAY EFFICIENT DESIGN FOR PARALLEL PREFIX FINITE FIELD MULTIPLIER

Performance Analysis of a 64-bit signed Multiplier with a Carry Select Adder Using VHDL

International Journal Of Scientific Research And Education Volume 3 Issue 6 Pages June-2015 ISSN (e): Website:

Analysis of Low Power, Area- Efficient and High Speed Multiplier using Fast Adder

DESIGN OF CARRY SELECT ADDER WITH REDUCED AREA AND POWER

DESIGN AND IMPLEMENTATION OF HIGH SPEED MULTIPLIER USING VEDIC MATHEMATICS

Implementation of 64 Bit KoggeStone Carry Select Adder with BEC for Efficient Area

Design of 16-bit Heterogeneous Adder Architectures Using Different Homogeneous Adders

EFFICIENT DESIGN AND IMPLEMENTATION OF ADDERS WITH REVERSIBLE LOGIC

A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast Arithmetic Circuits

ADVANCES in NATURAL and APPLIED SCIENCES

Design of 32-bit Carry Select Adder with Reduced Area

Performance analysis of different 8-bit full adders

FPGA Implementation of an Intigrated Vedic Multiplier using Verilog

An Efficient Implementation of Downsampler and Upsampler Application to Multirate Filters

Analysis of Parallel Prefix Adders

Faster and Low Power Twin Precision Multiplier

Group 10 Group 9 Group 8 Group 7 Group 6 Group 5 Group 4 Group 3 Group 2 Group 1 Group 0 GG5 PG5 GG4 PG4. Block 3 Block 2 Block 1 Block 0

An Efficent Real Time Analysis of Carry Select Adder

Transcription:

A NOVEL IMPLEMENTATION OF HIGH SPEED MULTIPLIER USING BRENT KUNG CARRY SELECT ADDER K. Golda Hepzibha 1 and Subha 2 ECE Department, Sri Manakula Vinayagar Engineering College, Puducherry, India E-mails: 1 goldahepzibha@gmail.com; 2 subhakarthikeyan38@yahoo.com ABSTRACT Multiplication is one of the most significant operations in every computational system and a multiplier forms the core of systems such as digital signal processing, image processing and microprocessor. is an important element which contributes to the major power consumption in any system. Hence a fast energy efficient multiplier is always needed in electronics industry for fast computation of results. Recent applications of multipliers with various data lengths are always required in VLSI from processors to application specific integrated circuits (ASICs). In this work the designs of two different array multipliers is presented, one by using Ripple Carry Adder(RCA) based Carry Select Adder (CSLA) and Binary to Excess-1 Converter (BEC) based RCA CSLA for addition of partial product terms and the results are compared with the proposed multiplier using Brent Kung (BK) CSLA in partial product lines. The designs are synthesized using Quartus II Software. This design will meet the challenging task in modern VLSI design with respect to area and delay. Index Terms Ripple Carry Adder, Carry Select Adder, Binary to Excess-1 Converter, Brent Kung Adder. I. INTRODUCTION In most digital signal processing (DSP) systems, a multiplier is one of the key hardware blocks. plays an important role in DSP applications like digital filtering, digital communications and spectral analysis. Power dissipation becomes one of the primary design constraints in DSP applications of portable, battery-operated systems. Since multipliers include complex circuits and must typically operate at a high system clock rate, the delay of a multiplier must be reduced in order to satisfy the overall design. The simplest way to perform a multiplication is by using a single two input adder. For M and N bits wide inputs, the multiplication includes M cycles, using an N-bit adder. Multi-plication by shift and add algorithm adds together M partial products. The partial product term is generated by multiplying the multiplicand with the multiplier bit which, essentially is an AND operation and by shifting the result based on the multiplier s position. Similar to the familiar long hand decimal multiplication, binary multiplication involves the addition of shifted bits of the multiplicand based on the value and position of each of the multiplier bits. Thus, performing binary multiplication is much simpler than decimal multi-plication. The value of the binary digit can either be 0 or 1, thus, depending on the value of the multiplier bit, the partial products can either be a copy of the multiplicand, or 0- In digital logic, this is simply an AND function. A faster way to implement multiplication is to resort to an approach similar to computing a manual multiplication. The entire partial product terms are generated simultaneously and organized in an array. The multioperand addition is performed to compute the final product. The resulting structure is called an array multiplier and based on three functions: partial product generation, partial product accumulation and final addition. The details of existing Architectures are discussed in section II and the implementtation of proposed system is described in section III. The performance comparison and simulation result is presented and discussed in section 107 IV and section V and VI gives the conclusion and limitation respectively. II. EXISTING MULTIPLIER ARCHITECTURES A. using Ripple Carry Adder Based Carry Select Adder The design of Carry select adder using Ripple Carry Adder is the conventional approach available but delay and area consumption is more. This CSLA generally consists of three RCAs and a Multiplexer. Addition of two n-bit numbers using a carry select adder is done with RCAs. One RCA is used for computing the summation of first 4 bits. In order to perform the calculation twice, once with the assumption of the carry being zero (Cin=0) and the other assuming one (Cin=1). Fig.1.shows the block diagram of RCA Based CSLA. The 8x8 comprises of 4, 4x4 multiplier sub blocks. Here, the multiplicands are having the bit size of (n=8) whereas, the result is of 16 bit in size. The input is broken in to smaller groups of size of n/2 = 4, for both inputs, that is a and b. These newly formed groups of 4 bits are given as input to 4x4 multiplier blocks and the result produced 8 bits, which are the output produced from 4x4 multiplier block are sent for addition to an RCA based CSLA. Fig.1. 8-bit Ripple Carry Adder based CSLA

Fig.2. shows the block diagram of 8 bit multiplier using RCA Based CSLA. Fig.3. shows the simulation result of using Ripple Carry adder based CSLA. Fig.-4: Binary to Excess-1 Converter Fig.-2: 8-bit RCA based CSLA Fig.-5: 8-bit Binary to Excess-1 Converter based CSLA The above obtained BEC based RCA CSLA is used for addition of the 8 bits that are coming from the 4x4 multiplier sub blocks. There are three adders needed for the partial product addition and final addition of bits. Finally the 16 bits give the multiplication of the two 8 bits input. Fig.6 shows the block diagram of 8-bit BEC Based RCA CSLA. Fig.-3: Simulation result of RCA Based CSLA B. Using Binary to Excess-1 Converter Based Ripple Carry Adder Carry Select Adder The BEC based CSLA involves less number of logic resources than the conventional CSLA. The RCA for computing Cin =1 is replaced by BEC unit. BEC is used to add 1 to the input numbers. Less number of logic gates is used to design BEC and thus consumes less area than RCA. Hence area is reduced. Fig.4. shows the diagram of 4-bit Binary to Excess- 1 Converter. Fig.5. shows the block diagram of Binary to Excess-1 based RCA CSLA. Fig.-6. 8-bit BEC Based RCA CSLA 108

Fig.-7: Simulation result of BEC Based RCA CSLA III. PROPOSED MULTIPLIER ARCHITECTURES A. Parallel Prefix Adder The parallel prefix adders [1] are more flexible and are used to speed up the binary additions. Parallel prefix adder architecture [2] is obtained from Carry Look Ahead (CLA) structure. In order to increase the speed of arithmetic operation tree like structure is used [3]. Parallel prefix adders are a group of fastest adders and these are used for high performance of arithmetic circuits in industries. Parallel prefix adders composes of three stages [4] involves three stages 1. Pre-Processing Stage 2. Carry Generation Network 3. Post-Processing Stage 1) Pre-processing stage: In this stage, generate and propagate signals are computed to each pair of inputs A and B. These signals are represented by the logic equations 1 and 2. Pi=Ai xor Bi (1) Gi=Ai and Bi (2) 2) Carry generation network: This stage computes the carries corresponding to each bit. Execution of these operations is carried out in parallel [4]. After the carries in parallel are computed, they are segmented into smaller pieces. Carry propagate and generate are used as intermediate signals which are represented by the logic equations 3 and 4. CPi:j=Pi:k+1 and Pk:j (3) CGi:j=Gi:k+1or (Pi:k+1 and Gk:j (4) The operations involved in Fig.8 are given as: CP0=Pi and Pj (5) CG0= (Pi and Gj) or Gi (6) 3) Post processing stage: This is the final step to compute the summation of input bits. It is common for all adders and the sum bits are represented by logic equations 7&8. Ci-1=(Pi and Cin) or Gi (7) Si=Pi xor Ci-1 (8) A. Brent -Kung Adder Brent- Kung adder [5] is a very well- known logarithmic adder architecture giving an optimal number of stages from input to all outputs but with asymmetric loading on all intermediate stages. It is one of the parallel prefix adders. Parallel prefix adders are a unique class of adders that are based on the use of generate and propagate signals. The cost and wiring complexity is less in Brent-Kung adders. But the gate level depth of Brent Kung adders is [6] 0 (log2 (n)), so the speed is lower. The block diagram of 4-bit Brent Kung adder is shown in Fig.9. Fig.-9: Block Diagram of Brent Kung Adder B. Using Brent-Kung Adder based Carry Select Adder Brent-Kung Adder [7] has reduced delay when compared to Ripple Carry Adder. So Brent-Kung Adder based CSLA is designed just by using Brent- Kung Adder. Here the Ripple Carry Adders are replaced by Brent-Kung adder so that delay is reduced. Fig.10. shows the block diagram of 8-bit Brent- Kung based CSLA. Fig.-8: Carry Network Fig.-10: 8-bit Brent-Kung Adder based CSLA 109

Thus the 8 bit multiplier uses the Brent Kung adder for the intermediate addition and final addition. Thus as the Brent Kung Adder based CSLA has reduced delay, the multiplier using BK CSLA also has improved speed. Fig.11 shows the block diagram of 8-bit Brent-Kung based CSLA. Fig.12 shows the simulation result of 8-bit Brent-Kung based CSLA. The result analysis shows that 8-bit BK based CSLA has reduced delay than all other multiplier architectures with a compromise of area. The graphical representation of comparison of delay of different multipliers is shown in Fig.13. It is evident that BK based CSLA has reduced delay than all other multipliers. The graphical representation of comparison of no. of transistors in different multipliers is shown in Fig.14. Table-2 shows the comparison of no. of transistors in different types of multipliers. It is evident that BK based CSLA has increased transistor count than all other multipliers Fig.-11: 8-bit Brent Kung Adder based CSLA Fig.13.Comparison of delay in different multipliers Fig.-12: Simulation result of 8-bit Brent Kung Adder based CSLA IV. SIMULATION RESULTS AND COMPARISON Various adders and multipliers were designed in Quartus II software. Area and delay of various multipliers like 8-bit Ripple Carry adder based CSLA, 8-bit Binary to Excess-1 based CSLA and 8-bit Brent-Kung based CSLA has been calculated. Table. I shows the comparison of delay in different types of multiplier. TABLE - I: COMPARISON OF DELAY IN DIFFERENT MULTIPLIERS TYPE OF RCA Based BEC Based BK Based MULTIPLIER CSLA RCA CSLA CSLA DELAY 55.200ns 50.200ns 45.400ns Fig.14. Comparison of no of transistors in different multipliers Table- 2: comparison of transistor count in different multipliers UNITS RCA Based CSLA BEC Based RCA CSLA BK Based CSL Multiplie RCA 9 1368 6 912 r - - MUX 3 222 3 222 3 222 OR 3 18 3 18 3 18 AND 3 18 3 18 3 18 BEC - - 3 150 - - BK - - - - 9 1746 4bMULT 4 3328 4 3328 4 3328 TOTAL 22 4954 22 4648 22 5332 V. CONCLUSION The various adders and multipliers were simulated using Quartus II software and the results show that RCA Based CSLA has delay of 55.200 ns, BEC Based RCA CSLA has 110

50.200 ns, BK Based CSLA has 45.400 ns. Thus BK based CSLA is found to have reduced delay by 17.75 % compared to RCA Based CSLA, 9.56% compared to BEC Based RCA CSLA. The no. of transistors in RCA Based CSLA is 4954, BEC Based RCA CSLA is 4648, BK Based CSLA is 5332. Thus BK based CSLA has improved speed compared to other multipliers with a small compromise in transistor count. VI.LIMITATION VII. BK based CSLA has reduced delay compared to all other multipliers but the only limitation is that the area is slightly increased due to increased number of transistor count. Thus BK based CSLA has improved speed compared to the other multipliers with a small compromise in transistor count. ACKNOWLEDGMET I would like to thank Mrs. C.P Subha, Associate Professor, ECE Department who had been guiding me throughout the project and supporting me in giving technical ideas about the paper and motivating me to complete the work efficiently and successfully. REFERENCES [1] M. Snir, Depth-size trade-offs for parallel prefix computation. Journal of Algorithms 7(2): 185 201 (1986). [2] David Jeff Jackson and Sidney Joel Hannah, Modelling and Comparison of Adder Designs with Verilog HDL,25th South-eastern Symposium onsystem Theory, Pp.406-410, March (1993). [3] Belle W.Y.Wei and Clark D.Thompson, Area- Time Optimal Adder Design.nIEEE transactions on Computers 39: 666-675 (1990). [4] Choi, Y., Parallel Prefix Adder Design. Proc. 17th IEEE Symposium on Computer Arithmetic, Pp. 90-98, 27th June (2005). [5]Rabaey,J.M., Digital Integrated Circuits-A Design Perspective. New Jersey, Prentice-Hall, (2001) [6] Brent and H. Kung, A Regular Layout for Parallel Adders. IEEE Transaction on Computers C-31(3): 260-264 (1982). [7] Adilakshmi Silveru and M. Bharathi, Design of Kogge-Stone and Brent Kung Adders using Degenerate Pass Transistor Logic. International Journal of Emerging Science and Engineering. 1(4): 2319-6378 (2013). [8] Raminder Preet Pal Singh, Praveen Kumar, and Balwinder Singh, Performance Analysis of 32-Bit Array with a Carry Save Adder and with a Carry Look Ahead Adder. Letters of International Journalof Recent Trends in Engineering 2(6): 83-89 (2009). [9] Ramesh Pushpangdan, Vineeth Sukumaran, Rono Innocent, Dinesh Sasikumar, Vaisak Sundar, High Speed Vedic for Digital Signal Processor. IETE Journal of Research 55(6): 282-286 ( 2009). [10] Alex Panato, Sandro Silva, Flavio Wagner, Marcelo Johann, Ricardo Reis, Sergio Bampi, Design of Very Deep Pipelined s for FPGAs. Proceedings of the Design, Automation and Test in Europe Conference and Exhibition Designers Forum IEEE (2004). [11] Himanshu Thapliyal and M.B. Srinivas, High Speed Efficient N X N Bit Parallel Hierarchical Overlay Architecture Based On Ancient Indian Vedic Mathematics.Transactions on Engineering, Computing and Technology V2: 225-228 (2004) 111