INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Similar documents
High Performance Vedic Multiplier Using Han- Carlson Adder

JDT EFFECTIVE METHOD FOR IMPLEMENTATION OF WALLACE TREE MULTIPLIER USING FAST ADDERS

Design and Comparative Analysis of Conventional Adders and Parallel Prefix Adders K. Madhavi 1, Kuppam N Chandrasekar 2

A Novel Approach For Designing A Low Power Parallel Prefix Adders

Design Of 64-Bit Parallel Prefix VLSI Adder For High Speed Arithmetic Circuits

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Analysis of Parallel Prefix Adders

Design and Estimation of delay, power and area for Parallel prefix adders

Design and implementation of Parallel Prefix Adders using FPGAs

Design and Characterization of Parallel Prefix Adders using FPGAs

Design and Implementation of Hybrid Parallel Prefix Adder

Parallel Prefix Han-Carlson Adder

Efficient Implementation of Parallel Prefix Adders Using Verilog HDL

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

ISSN:

An Efficient Design of Low Power Speculative Han-Carlson Adder Using Concurrent Subtraction

A Novel High-Speed, Higher-Order 128 bit Adders for Digital Signal Processing Applications Using Advanced EDA Tools

Implementation and Performance Evaluation of Prefix Adders uing FPGAs

Comparison among Different Adders

PROMINENT SPEED ARITHMETIC UNIT ARCHITECTURE FOR PROFICIENT ALU

Design of High Speed and Low Power Adder by using Prefix Tree Structure

Design and Implementation of High Speed Area Efficient Carry Select Adder Using Spanning Tree Adder Technique

64 Bit Pipelined Hybrid Sparse Kogge-Stone Adder Using Different Valance

Performance Enhancement of Han-Carlson Adder

Performance Analysis of Advanced Adders Under Changing Technologies

A New Parallel Prefix Adder Structure With Efficient Critical Delay Path And Gradded Bits Efficiency In CMOS 90nm Technology

Design of Efficient 32-Bit Parallel PrefixBrentKung Adder

A Taxonomy of Parallel Prefix Networks

Modelling Of Adders Using CMOS GDI For Vedic Multipliers

An Optimized Design of High-Speed and Energy- Efficient Carry Skip Adder with Variable Latency Extension

UNIVERSITY OF MASSACHUSETTS Dept. of Electrical & Computer Engineering. Digital Computer Arithmetic ECE 666

EE878 Special Topics in VLSI. Computer Arithmetic for Digital Signal Processing

An Efficient Higher Order And High Speed Kogge-Stone Based CSLA Using Common Boolean Logic

Design and Implementation of Wallace Tree Multiplier Using Kogge Stone Adder and Brent Kung Adder

DESIGN OF LOW POWER HIGH SPEED ERROR TOLERANT ADDERS USING FPGA

Area Delay Efficient Novel Adder By QCA Technology


FPGA Implementation of Multiplication and Accumulation Unit using Vedic Multiplier and Parallel Prefix adders in SPARTAN 3E

Power Efficient Weighted Modulo 2 n +1 Adder

FPGA Implementation of Wallace Tree Multiplier using CSLA / CLA

CHAPTER 3 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED ADDER TOPOLOGIES

Design of a Power Optimal Reversible FIR Filter ASIC Speech Signal Processing

Survey of VLSI Adders

Area Power and Delay Efficient Carry Select Adder (CSLA) Using Bit Excess Technique

Implementation of 32-Bit Carry Select Adder using Brent-Kung Adder

A NOVEL IMPLEMENTATION OF HIGH SPEED MULTIPLIER USING BRENT KUNG CARRY SELECT ADDER K. Golda Hepzibha 1 and Subha 2

Comparative Analysis of Various Adders using VHDL

Design of Efficient Han-Carlson-Adder

[Krishna, 2(9): September, 2013] ISSN: Impact Factor: INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

A Novel 128-Bit QCA Adder

Simulation study of brent kung adder using cadence tool

International Journal Of Scientific Research And Education Volume 3 Issue 6 Pages June-2015 ISSN (e): Website:

Binary Adder- Subtracter in QCA

ISSN Vol.03, Issue.07, September-2015, Pages:

DESIGN OF BINARY MULTIPLIER USING ADDERS

An Optimized Implementation of CSLA and CLLA for 32-bit Unsigned Multiplier Using Verilog

Design and Implementation of Complex Multiplier Using Compressors

Design and Implementation of a delay and area efficient 32x32bit Vedic Multiplier using Brent Kung Adder

Structural VHDL Implementation of Wallace Multiplier

HIGH PERFORMANCE BAUGH WOOLEY MULTIPLIER USING CARRY SKIP ADDER STRUCTURE

An Efficient SQRT Architecture of Carry Select Adder Design by HA and Common Boolean Logic PinnikaVenkateswarlu 1, Ragutla Kalpana 2

Design and Analysis of CMOS Based DADDA Multiplier

A Novel Design of High-Speed Carry Skip Adder Operating Under a Wide Range of Supply Voltages

Review Paper on an Efficient Processing by Linear Convolution using Vedic Mathematics

Reduced Swing Domino Techniques for Low Power and High Performance Arithmetic Circuits

FPGA Realization of Hybrid Carry Select-cum- Section-Carry Based Carry Lookahead Adders

Design of Roba Mutiplier Using Booth Signed Multiplier and Brent Kung Adder

Implementation Of Radix-10 Matrix Code Using High Speed Adder For Error Correction

Analysis Parameter of Discrete Hartley Transform using Kogge-stone Adder

Performance Comparison of VLSI Adders Using Logical Effort 1

LOW POWER HIGH SPEED MODIFIED SQRT CSLA DESIGN USING D-LATCH & BK ADDER

Comparison of Multiplier Design with Various Full Adders

PUBLICATIONS OF PROBLEMS & APPLICATION IN ENGINEERING RESEARCH - PAPER CSEA2012 ISSN: ; e-issn:

Design and Analysis of Row Bypass Multiplier using various logic Full Adders

Area and Delay Efficient Carry Select Adder using Carry Prediction Approach

CHAPTER 4 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED MULTIPLIER TOPOLOGIES

A Highly Efficient Carry Select Adder

High Speed, Low power and Area Efficient Processor Design Using Square Root Carry Select Adder

Optimized area-delay and power efficient carry select adder

Design of 16-bit Heterogeneous Adder Architectures Using Different Homogeneous Adders

Area Efficient Speculative Han-Carlson Adder

A CASE STUDY OF CARRY SKIP ADDER AND DESIGN OF FEED-FORWARD MECHANISM TO IMPROVE THE SPEED OF CARRY CHAIN

Adder (electronics) - Wikipedia, the free encyclopedia

Copyright. Vignesh Naganathan

Efficient Shift-Add Multiplier Design Using Parallel Prefix Adder

AREA AND POWER EFFICIENT CARRY SELECT ADDER USING BRENT KUNG ARCHITECTURE

AREA AND DELAY EFFICIENT DESIGN FOR PARALLEL PREFIX FINITE FIELD MULTIPLIER

Design and Implementation of High Speed Carry Select Adder Korrapatti Mohammed Ghouse 1 K.Bala. 2

Design and Implementation of Carry Select Adder Using Binary to Excess-One Converter

Design and Characterization of 16 Bit Multiplier Accumulator Based on Radix-2 Modified Booth Algorithm

Research Article Delay Efficient 32-Bit Carry-Skip Adder

AN ADVANCED VLSI ARCHITECTURE OF PARALLEL MULTIPLIER BASED ON HIGHER ORDER MODIFIED BOOTH ALGORITHM

FPGA Implementation of Area-Delay and Power Efficient Carry Select Adder

Index terms: Gate Diffusion Input (GDI), Complementary Metal Oxide Semiconductor (CMOS), Digital Signal Processing (DSP).

Performance Analysis of Multipliers in VLSI Design

DESIGN AND IMPLEMENTATION OF 128-BIT QUANTUM-DOT CELLULAR AUTOMATA ADDER

Design of an optimized multiplier based on approximation logic

EFFICIENT DESIGN AND IMPLEMENTATION OF ADDERS WITH REVERSIBLE LOGIC

Design and Implementation of High Speed Carry Select Adder

UNIVERSITY OF MASSACHUSETTS Dept. of Electrical & Computer Engineering. Digital Computer Arithmetic ECE 666

Transcription:

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK DESIGN OF A CARRY TREE ADDER VISHAL R. NAIK 1, SONIA KUWELKAR 2 1. Microelectronics (ETC dept.), Goa College of Engineering, Goa University, India. 2. Assistant Professor (ETC dept.), Goa College of Engineering, Goa University, India Accepted Date: 27/02/2014 ; Published Date: 01/05/2014 Abstract: The core of every microprocessor and digital signal processor is its data path. The heart of data-path and addressing units in turn are arithmetic units which include adders. Parallel-prefix adders offer a highly efficient solution to the binary addition problem and are well suited for VLSI implementations. This paper investigates four types of carry-tree adders the Brent-Kung, Kogge-Stone, the Slansky and Han-Carlson adder) and compares them to the simple Ripple Carry Adder (RCA). Due to the presence of a fast carry-chain, the RCA designs p exhibit better delay performance up to 128 bits. The carry-tree adders are expected to have a speed advantage over the RCA as bit widths approach 256. Keywords: Power-Delay Performance, Area, Carry Tree Adders \ Corresponding Author: MR. VISHAL R. NAIK Access Online On: www.ijpret.com How to Cite This Article: PAPER-QR CODE 413

INTRODUCTION Addition is one of the simplest and commonly used operations and is in most cases a speed determining factor for arithmetic operations. The addition of two binary numbers is the fundamental arithmetic operation in microprocessors, digital signal processors, and dataprocessing application-specific integrated circuits. Several algorithms have been presented for high speed parallel addition, and there is generally a trade off between speed and area. Hence, binary adders are crucial building blocks in very large-scale integrated circuits. For the optimization of speed in adders, the most important factor is carry generation. For the implementation of a fast adder, the generated carry should be driven to the output as fast as possible, thereby reducing the worst path delay which determines the ultimate speed of the digital structure. In the design for timing optimization, a network can be optimized either at circuit or logic level. Logic level optimization is done by manipulating boolean equations, whereas circuit level optimization can be carried out by manipulating transistor sizes and circuit topologies. In the optimization for area, care should be taken in the design of the building blocks of the structure, which determine the area occupied by the architecture and, finally, also effect the speed. However, because of the structure of the configurable logic and routing resources in FPGAs, parallel-prefix adders will have a different performance than VLSI implementations. In particular, most modern FPGAs employ a fast-carry chain which optimizes the carry path for the simple Ripple Carry Adder (RCA)[4]. In this paper, the practical issues involved in designing and implementing tree-based adders on FPGAs are described. Several tree- based adder structures are implemented and characterized on a FPGA and compared with the Ripple Carry Adder (RCA). Finally, some conclusions and suggestions for improving FPGA designs to enable better tree-based adder performance are given. I. Carry tree adder designs 2.2) Brent Kung adder architecture[3] In order to approach the structure known as the Brent Kung Structure, which uses the logarithmic concept, the entire architecture is easily understood by dividing the system into three separate stages: 1. Generate/Propagate Generation 414

2. The Dot ( ) Operation 3. Sum generation Generate/Propagate Generation If the inputs to the adder are given by the signals A and B, then the generate and propagate signals are obtained according to the following equations. G = A.B (2.1) P = A B (2.2) The Dot ( ) Operation The most important building block in the Brent Kung Structure is the dot ( ) operator. The basic inputs to this structure are the generate and propagate signals generated in the previous stage. The operator is a function that takes in two sets of inputs-- (g, p) and (g, p )-- and generates a set of outputs-- ( g + pg, pp ). These building blocks are used for the generation of the carry signals in the structure. For the generation of the carry signals, the carry for the kth bit from the carry look-ahead concept is given by Co,k = Gk +Pk(Gk-1 +Pk-1 +P k-1 ( +P1(G0 +P0 Ci,0))) (2.3) Using the dot operator explained above the Equation 2.3 can be written for the different carry signals as Co,0 = G0 +P Ci,0 = ( G0,P0) Co,1 = G1 + G0 P1 = ((G1, P1 ) (G0, P0)) C0,k = ((Gk,Pk) (Gk-1,Pk-1) (G0,P0) (2.4) where is a function defined in order to access all the tuples. The 8-bit Brent Kung Structure is shown in Figure 2.1. This figure shows all the carry signals generated at different stages in the structure. In the structure, two binary tree structure are represented -- the forward and the reverse trees. The forward binary tree alone is not sufficient for the generation of all the carry 415

signals. It can only generate the signals shown as Co,0,Co,1,Co,3 and Co,7. The remaining carry signals are generated by the reverse binary tree. Fig.2.1. 8-bit Brent-Kung adder tructure[8] Sum generation The final stage in this architecture is the sum generation stage. The sum is given by S= A B C (2.5 ) where A and B are the input signals, and C is the carry signal. The carry is obtained from the dot operator stage discussed earlier, and the exclusive of A and B is actually the propagate signal itself. Hence the sum S can finally be represented and realized as S = P C (2.6 ) 2.2) Kogge-Stone adder architecture The Kogge-Stone structure has a more optimal implementation, as its fan-out is greatly reduced to 2 at the expense of larger "o" (circle) cells. It is obtained by copying the of the most significant bit position. Figure 2.2 shows this carry tree for 4-bit operands. The 8-bit Kogge- Stone structure is shown in Figure 2.2. Recursive division of the blocks can construct full adder using such a tree for the implementation. The number of "o" cells required for the implementation is n(log 2 n-1)+1 and the delay is log 2 n,where n is the adders width. It is expected that Kogge-Stone adder should consume more resources than Any other adder. [5] 416

Fig.2.2. 8-bit Kogge-Stone adder structure.[9] 2.3) J. Sklansky-conditional adder architecture Fig 2.3 shows the J.Sklansky conditional sum addition of two binary coded numbers. We represent by τ 0, τ 1, τ 2, τ 3, τ 4, the successive time intervals during which conditional sums and carries are computed. During τ 0 two sum and carry pairs for each column are computed. One pair under the assumption (assumption A 0 ) that the carry brought to each column is 0, and the other pair under the assumption (assumption B 0 ) that the carry brought to each column is 1. An exception to this is at column 0, where only A 0 is assumed, since the carry brought to that column is known to be 0. 417

Fig.2.3. Conditional sum addition. [7] The first row belonging to τ 0 in Fig.2.3 contains the conditional sum under the assumption A 0, the second row has the conditional carries under the same assumption, and the third and the fourth rows contain the conditional sums and carries under assumption B o. In particular, consider column 0. Here we have, using the well-known Boolean relations for sum and carry bits, S 0 0 = X 0 Y 0 = 0 1=1 C 1 0 = X 0. Y 0 = 0.1=0 S 1 0 = X 1 Y 1 = 1 0=1 C 2 0 = X 1. Y 1 = 1.0=0 S 1 1 = S 1 0 1 = 0 C 2 1 = X 0 V Y 0 = 1 V 0= 1 In a similar manner we find the other entries in the τ 0 columns. During τ 1 conditional sums and carries for pairs of columns ( namely, column numbers 0 and 1, 2 and 3,, and 14 and 15)are computed simultaneously, under the assumption (A 1 ) that carry brought to each pair of columns is 0, and (B 1 ) that this carry is 1. The rows belonging to τ 1 are arranged in a manner similar to those of τ 0. 418

Continuing this process. The conditional sums and final carries are computed for tetrads of columns during τ 3, and for the entire sixteen column group during τ 4 the sums and final carry produced are no longer conditional but are equal to the sum bits and the final carry for the two summands, Xi and Yi. The 8-bit J. Sklansky-conditional adder structure is shown in Figure 2.4 Fig.2.4.8-bit J.Sklansky adder structure. [8] 2.4) Han - Carlson adder architecture. Like Brent and Kung s scheme, Han and Carlson also proposed a scheme to reduce the complexity of prefix tree[18]. This scheme is different from Kogge-Stone scheme in the sense that these performs carry-merge operations on even bits and generate/propagate operation on odd bits. At the end, these odd bits recombine with even bits carry signals to produce the true carry bits. This adder has five stages in which the middle three stages are resembles with the Kogge-Stone structure. The advantage of this adder is its shorter span wires than the Kogge-Stone adder and thus there is a reduction in complexity at the cost of an additional stage for carry-merge path. [10] Fig.2.5. 16-bit Han-Carlson adder structure. 419

III) Related work It is noted that delay models and cost analysis for adder designs developed for VLSI technology do not map directly to FPGA designs. This study focuses on carry-tree adders implemented on a Xilinx Spartan 3 FPGA. The distinctive contributions of this paper are two-fold. First, we consider tree-based adders and a hybrid form which combines a tree structure with a ripple-carry design. The adders which are chosen for analysing are Brent-kung adder, Kogge-Stone adder, J.Slansky adder and Han-Carlson adder. Second, this paper considers the practical issues involved in testing the adders and provides actual measurement data to compare with simulation results. IV.RESULTS AND DISCUSSION The various adder designs are simulated using ModelSim simulator. Simulation results for Ripple carry adder, Brent-Kung adder, Kogge-Stone adder and Sklansky adder are as follows 4.1) Ripple carry adder Description- a=8bit input (00111011); b=8bit input (11001010) sum =8bit output (11110101); c=0; cout=1 Fig.4.1. Simulation result of Ripple carry adder 420

4.2) Brent-Kung adder Fig.4.2. Simulation results of Brent-Kung adder Descriptiona=8bit input (00101111); b=8bit input (01000011) sum =8bit output (01110010); c=0; cout=0 4.3) Kogge-Stone adder Descriptiona=8bit input (00111011); b=8bit input (11001010) sum =8bit output (11110101); c=0; cout=1 421

Fig.4.3. Simulation result of kogge-stone adder 4.4) J. Sklansky adder Fig.4.4. Simulation result of 8-bit J.Slansky adder Descriptiona=8bit input (10101100); b=8bit input (01100110) sum =8bit output (00010010); cout=1 422

Fig.4.1, fig.4.2, fig.4.3 and fig.4.4 shows simulation results of carry tree adders which are implemented in generic. All the codes are written in VHDL language and simulated in ModelSim simulator successfully. Further work: These designs are implementing on FPGA and finding path delay, cell area and power on Xilinx FPGA. Because the adder is often the critical element which determines to a large part the cycle time and power dissipation for many digital signal processing and cryptographical implementations, it would be worthwhile for future FPGA designs to include an optimized carry path to enable tree-based adder designs to be optimized for place and routing. This would improve their performance similar to what is found for the RCA. We plan to explore possible FPGA architectures that could implement a fast-tree chain and investigate the possible tradeoffs involved. REFERENCES 1. Nurdiani Zamhari, Peter Voon, Kuryati Kipli, Kho Lee Chin, Maimun Huja Husin,"Comparison of Parallel Prefix Adder (PPA)", Proceedings of the World Congress on Engineering 2012 Vol II 2. M.Venkata Subbaiah, M. Ravi Kishore," Design and Characterization of Parallel Prefix Adders" National Conference On Electrical Sciences -2012 (NCES-12) 3. R. P. Brent and H. T. Kung, A regular layout for parallel adders, IEEE Trans. Comput., vol. C- 31, pp. 260-264, 1982. 4. Neil H.E.Weste,David Harris and Ayan Banerjee,CMOS VLSI Design(pearson education Asia, third edition, 2006.) 5. P. M. Kogge and H. S. Stone, A Parallel Algorithm for the Efficient Solution of a General Class of Recurrence Equations, IEEE Trans. on Computers, Vol. C-22, No 8, August 1973. 6. Harris, A Taxonomy of Parallel Prefix Networks, IEEE, 2003 7. J. Sklansky, Conditional-Sum Addition Logic, IRE transactions on computers, 1960. 8. Parallel prefix adders, KostasVitoroulis, 2006. Presented to Dr. A. J. Al-Khalili. Concordia University. 9. Wikimedia Commons 423

10. Han, Carlson, Fast Area-Efficient VLSI Adders, IEEE, 1987. 11. K. Vitoroulis and A. J. Al-Khalili, Performance of Parallel Prefix Adders Implemented with FPGA technology, IEEE Northeast Workshop on Circuits and Systems, pp. 498-501, Aug. 2007. 12. G.Jyoshna, P.Murali Krishna, B.Doss, Parallel-Prefix Adder Architecture With Efficient Timing-Area Characteristic, UACEE International journal of 424