PROMINENT SPEED ARITHMETIC UNIT ARCHITECTURE FOR PROFICIENT ALU

Similar documents
Design and Implementation of a delay and area efficient 32x32bit Vedic Multiplier using Brent Kung Adder

Design and Implementation of High Speed Area Efficient Carry Select Adder Using Spanning Tree Adder Technique

Research Journal of Pharmaceutical, Biological and Chemical Sciences

Modelling Of Adders Using CMOS GDI For Vedic Multipliers

Fpga Implementation Of High Speed Vedic Multipliers

DESIGN OF A HIGH SPEED MULTIPLIER BY USING ANCIENT VEDIC MATHEMATICS APPROACH FOR DIGITAL ARITHMETIC

High Performance Vedic Multiplier Using Han- Carlson Adder

Performance Analysis of Advanced Adders Under Changing Technologies

Optimum Analysis of ALU Processor by using UT Technique

A Novel Approach For Designing A Low Power Parallel Prefix Adders

Design and Implementation of Wallace Tree Multiplier Using Kogge Stone Adder and Brent Kung Adder

Simulation study of brent kung adder using cadence tool

DESIGN AND ANALYSIS OF VEDIC MULTIPLIER USING MICROWIND

Design and Implementation of Hybrid Parallel Prefix Adder

Design of High Speed and Low Power Adder by using Prefix Tree Structure

Design, Implementation and performance analysis of 8-bit Vedic Multiplier

Efficient Implementation of Parallel Prefix Adders Using Verilog HDL

DESIGN OF BINARY MULTIPLIER USING ADDERS

I. INTRODUCTION II. RELATED WORK. Page 171

Comparative Analysis of 16 X 16 Bit Vedic and Booth Multipliers

Review Paper on an Efficient Processing by Linear Convolution using Vedic Mathematics

A NOVEL IMPLEMENTATION OF HIGH SPEED MULTIPLIER USING BRENT KUNG CARRY SELECT ADDER K. Golda Hepzibha 1 and Subha 2

Performance Boosting Components of Vedic DSP Processor

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

Implementation of 32-Bit Carry Select Adder using Brent-Kung Adder

Design and Comparative Analysis of Conventional Adders and Parallel Prefix Adders K. Madhavi 1, Kuppam N Chandrasekar 2

JDT EFFECTIVE METHOD FOR IMPLEMENTATION OF WALLACE TREE MULTIPLIER USING FAST ADDERS

Research Article Design of a Novel Optimized MAC Unit using Modified Fault Tolerant Vedic Multiplier

Design of Efficient 64 Bit Mac Unit Using Vedic Multiplier

Implementation and Performance Evaluation of Prefix Adders uing FPGAs

IMPLEMENTATION OF AREA EFFICIENT MULTIPLIER AND ADDER ARCHITECTURE IN DIGITAL FIR FILTER

Analysis of Parallel Prefix Adders

HIGHLY RELIABLE LOW POWER MAC UNIT USING VEDIC MULTIPLIER

Design of 64 bit High Speed Vedic Multiplier

Implementation Of Radix-10 Matrix Code Using High Speed Adder For Error Correction

FPGA Implementation of Multiplication and Accumulation Unit using Vedic Multiplier and Parallel Prefix adders in SPARTAN 3E

International Journal Of Scientific Research And Education Volume 3 Issue 6 Pages June-2015 ISSN (e): Website:

High Speed 16- Bit Vedic Multiplier Using Modified Carry Select Adder

Design of 4x4 Parity Preserving Reversible Vedic Multiplier

AN NOVEL VLSI ARCHITECTURE FOR URDHVA TIRYAKBHYAM VEDIC MULTIPLIER USING EFFICIENT CARRY SELECT ADDER

VLSI IMPLEMENTATION OF ARITHMETIC OPERATION

A Novel High-Speed, Higher-Order 128 bit Adders for Digital Signal Processing Applications Using Advanced EDA Tools

FPGA Implementation of Wallace Tree Multiplier using CSLA / CLA

LOW POWER HIGH SPEED MODIFIED SQRT CSLA DESIGN USING D-LATCH & BK ADDER

OPTIMIZED MODEM DESIGN FOR SDR APPLICATIONS

PIPELINED VEDIC MULTIPLIER

Comparison among Different Adders

PERFORMANCE COMPARISION OF CONVENTIONAL MULTIPLIER WITH VEDIC MULTIPLIER USING ISE SIMULATOR

Area Delay Efficient Novel Adder By QCA Technology

Design and Implementation of Parallel Micro-programmed FIR Filter Using Efficient Multipliers on FPGA

2. URDHAVA TIRYAKBHYAM METHOD

FPGA Implementation of Low Power and High Speed Vedic Multiplier using Vedic Mathematics.

Delay Comparison of 4 by 4 Vedic Multiplier based on Different Adder Architectures using VHDL

Design and Implementation Radix-8 High Performance Multiplier Using High Speed Compressors

Design and FPGA Implementation of 4x4 Vedic Multiplier using Different Architectures

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

ADVANCES in NATURAL and APPLIED SCIENCES

A High Speed Wallace Tree Multiplier Using Modified Booth Algorithm for Fast Arithmetic Circuits

VHDL based Design of Convolutional Encoder using Vedic Mathematics and Viterbi Decoder using Parallel Processing

Design and Estimation of delay, power and area for Parallel prefix adders

Design and Implementation of Modified High Speed Vedic Multiplier Using Modified Kogge Stone ADD ER

Compressors Based High Speed 8 Bit Multipliers Using Urdhava Tiryakbhyam Method

A Survey on Design of Pipelined Single Precision Floating Point Multiplier Based On Vedic Mathematic Technique

Design and Simulation of 16x16 Hybrid Multiplier based on Modified Booth algorithm and Wallace tree Structure

International Journal of Modern Engineering and Research Technology

CHAPTER 5 IMPLEMENTATION OF MULTIPLIERS USING VEDIC MATHEMATICS

Keywords Multiplier, Vedic multiplier, Vedic Mathematics, Urdhava Triyagbhyam.

II. VEDIC MATHEMATICS

Hardware Implementation of 16*16 bit Multiplier and Square using Vedic Mathematics

Design of A Vedic Multiplier Using Area Efficient Bec Adder

Low Power VLSI Design of a modified Brent Kung adder based Multiply Accumulate Unit for Reverb Engines

Adder (electronics) - Wikipedia, the free encyclopedia

Optimized high performance multiplier using Vedic mathematics

Comparative Analysis of Various Adders using VHDL

International Journal of Advance Engineering and Research Development

ISSN Vol.03,Issue.02, February-2014, Pages:

FPGA Implementation of an Intigrated Vedic Multiplier using Verilog

Design of 32 Bit Vedic Multiplier using Carry Look Ahead Adder

Parallel Prefix Han-Carlson Adder

Pipelined Linear Convolution Based On Hierarchical Overlay UT Multiplier

Implementation and Analysis of Power, Area and Delay of Array, Urdhva, Nikhilam Vedic Multipliers

Implementation of 256-bit High Speed and Area Efficient Carry Select Adder

FPGA Implementation of Complex Multiplier Using Urdhva Tiryakbham Sutra of Vedic Mathematics

An Optimized Implementation of CSLA and CLLA for 32-bit Unsigned Multiplier Using Verilog

Techniques to Optimize 32 Bit Wallace Tree Multiplier

DESIGN OF HIGH SPEED MULTIPLIERS USING NIKHIALM SUTRA ALGORITHM

Performance Analysis of 4 Bit & 8 Bit Vedic Multiplier for Signal Processing

DESIGN OF LOW POWER HIGH SPEED ERROR TOLERANT ADDERS USING FPGA

Design and Implementation of Complex Multiplier Using Compressors

Design Of 64-Bit Parallel Prefix VLSI Adder For High Speed Arithmetic Circuits

Design and Simulation of Convolution Using Booth Encoded Wallace Tree Multiplier

CHAPTER 3 ANALYSIS OF LOW POWER, AREA EFFICIENT AND HIGH SPEED ADDER TOPOLOGIES

ASIC Implementation of High Speed Area Efficient Arithmetic Unit using GDI based Vedic Multiplier

Implementation and Performance Analysis of a Vedic Multiplier Using Tanner EDA Tool

Design and Implementation of High Speed Carry Select Adder Korrapatti Mohammed Ghouse 1 K.Bala. 2

Index terms: Gate Diffusion Input (GDI), Complementary Metal Oxide Semiconductor (CMOS), Digital Signal Processing (DSP).

Anitha R 1, Alekhya Nelapati 2, Lincy Jesima W 3, V. Bagyaveereswaran 4, IEEE member, VIT University, Vellore

ISSN:

DESIGN AND IMPLEMENTATION OF 64- BIT CARRY SELECT ADDER IN FPGA

CO JOINING OF COMPRESSOR ADDER WITH 8x8 BIT VEDIC MULTIPLIER FOR HIGH SPEED

Transcription:

PROMINENT SPEED ARITHMETIC UNIT ARCHITECTURE FOR PROFICIENT ALU R. Rashvenee, D. Roshini Keerthana, T. Ravi and P. Umarani Department of Electronics and Communication Engineering, Sathyabama University, Chennai, India E-Mail: rashvenee.ravichandran@gmail.com ABSTRACT ALU is one of the most important unit of processor. The computing efficiency of the processor depends on the competency of the ALU. ALU unit performs the arithmetic and logical operations. The adder and multiplier are the main computational units of the arithmetic unit. The performance factors such as delay, power and area. Parallel prefix adders have better delay performance; it involves the execution of the operation in parallel. Brent Kung adder is the most area and power efficient parallel prefix adder. In this paper we proposed high speed Brent Kung adder which consists of Urdhava Tiryakbhyam sutra based Vedic multiplier. In the conventional multiplier speed is restricted by the adders used for partial products. The proposed multiplier is used in the arithmetic unit of an ALU shows better performance in terms of delay. The proposed arithmetic architecture is designed, evaluated and implemented in Xilinx FPGA. Keywords: PPA, vedic multiplier, delay, brent kung adder, RCA, brent kung adder. 1. INTRODUCTION VLSI is the process of integrating thousands of transistors into a single chip. VLSI design is mainly used to minimize the interconnecting fabrics area. The main factors which limit the development of smaller and more complex IC chip are the IC fabrication technology, designer productivity and the cost. Depending on the application, different performance aspects become important. An ALU is an integral part of a processor. It performs all the arithmetic and logical operations. Fast and accurate operation depends on the performance of multiplier [2]. Hence improving the performance of the multiplier improves the efficiency of the ALU. In order to increase the performance of a multiplier we need to make the adder more efficient [3]. Binary addition is the most fundamental and important arithmetic function where the adder should be fast and efficient in terms of area, power and speed. The major problem for binary addition is the propagation delay in the carry chain. As the number of input operands increase the carry propagation delay increases. Its seen that than conventional adders such as ripple carry adder, carry select adder, etc., the parallel prefix adders are more efficient as their structure consists of pre-processing, carry look ahead and post processing sections[4]. The Booth, Wallace, Array multipliers have all been implemented in an ALU, but the multiplier implemented by using the Vedic sutras have been showing high performance when implied in an ALU. 2. EXISTING ADDERS AND MULTIPLIERS There is a wide range of adders and multipliers are used inthe ALU design. The adder is the most used part of the ALU. A. Ripple carry adder Among all the arithmetic operations, multiplication can perform through repeated addition), subtraction (by negating one operand) or division (repeated subtraction).half Adders can be used to add two one bit binary numbers. It is also possible to create a logical circuit using multiple full adders to add N-bit binary numbers [10]. Each full adder inputs a Cin, which is the Cout of the previous adder. This kind of adder is a Ripple Carry Adder, since each carry bit "ripples" to the next full adder. The first (and only the first) full adder may be replaced by a half adder [8]. The block diagram of 4-bit Ripple Carry Adder is shown in Figure-1. Figure-1. Ripple carry adder. B. Vedic multiplication Vedic mathematics is used to solve the complex calculations involve in usual mathematics. Owing to its simple strategy, which mainly includes natural viewpoints of human being, it leads to a straightforward process. It consents to incorporate the arithmetic rules along with high speed and easy implementation, thereby viable for a range of applications based on computing [4, 9]. The Vedic multiplier is based on the algorithm named as Urdhva Tiryakbhyam sutra shown in Figure-2. Traditionally, this Sutra has been employed to multiply two given numbers in a decimal number system. However, we put forward the multiplication of two binary numbers using analogous technique with a motive to make this efficient technique compatible for digital hardware systems. It is a universal multiplication formula that can be used in for all multiplications. It literally means Vertically and Crosswise. The algorithm is viable for multiplication of any two numbers exhibiting bit length 9013

equal to n. Besides this, it introduces a parallel execution of partial products and sums. The clock frequency is not an obstacle in accounting the calculations. This technique has proven advantageous in combating with large delays and complexity of conventional multipliers [1]. inputs A3 A2 and B1B0 and A1A0 and B3 B2. The Block diagram of 4X4 bit Vedic multiplier is shown in Figure-5. To get final product four 2X2 bit Vedic multiplier and three 4-bit Adders are required. Figure-2. Vedic multiplication algorithm. C. Multiplier architectures 2x2 vedic multiplier A 2X2 Vedic multiplier module is implemented using four AND gates and two half adders which is shown in Figure 3 and Figure-4 shows hardware realization of 2X2 Vedic multiplier. 4x4 vedic multiplier Figure-5. Block diagram of 4X4 vedic multiplier. 8x8 vedic multiplier 8X8 bit Vedic multiplier module as shown in Figure-6. It can be easily implemented by using four 4X4 bit Vedic multiplier modules as discussed in the previous section. The 4X4 Vedic multiplier is implemented using four 2X2 bit Vedic multiplier. The Figure-2 is a sample presentation for Vedic multiplication. Figure-3. Block diagram of 2x2 vedic multiplier. Figure-4. Hardware realization of 2x2 vedic multiplier. Figure-6. Block diagram of 8x8 vedic multiplier. 3. PROPOSED PARALLEL PREFIX ADDER The adders play a very crucial role in an ALU in the computation of partial products in the multiplier. This calculation contributes to major delay of a processor. In order to reduce the delay modern adder architectures are used which computes in parallel. It employs a 3-stage structure of the CLA adder. Figure-7 shows the process of parallel prefix adder. The 2X2 multiplier inputs are A1A0 and B1B0. The last block is 2X2 bit multiplier with inputs A3 A2 and B3 B2. The middle one shows two 2X2 bit multiplier with 9014

Pre-calculation of p i, g i for each stage Calculation of the carries. This part is parallelizable to reduce time Combine c i and p i of each stage to generate the sum bits s i Final Sum Figure-7. Process of parallel prefix adder. Computation of carry generation, propagation signals: Previous carry is calculated to the next bit is called propagate signal and generate is to generate the carry bit below are the signals: G i = A i.b i (1) Figure-8. 8-Bit brent kung adder. The black cells and grey cells are the main computational block in a parallel prefix adder and are shown in Figure-9. The black cells computes both generate and propagate while the grey cells compute propagate only. P i = A i B i (2) Calculation of all carry signals: G i:j = G i:k +P i:k. G k-1:j (3) Pi:j= Pi:k. Pk-1:j (4) Calculation of Final Sum: Si = Pi Gi-1:0 (5) A. Brent kung adder Brent-Kung has maximum logic depth, minimum area and avoid explosion of wires. The Brent-Kung adder does odd computation first and then even. It computes prefixes for 2-bit groups. These are used to find prefixes for 4-bit groups, which in turn are used to find prefixes for 8-bit groups, and so forth. The 8 bit adder is shown in Figure-8. The prefixes then fan back down to compute the carries-in to each bit. The tree requires 2log 2 n-1 stages. The fan-out is limited to 2 at each stage. It takes lesser area to be implemented when compared to other prefix adders such as Kogge Stone adder [8]. This will reduce the delay without compromising the power performance of the adder. But the method used in [2, 3] has not reduced delay to a great extent. The area consumption and power are the lowest in BKA than other adders [2, 5]. Figure-9. Black, grey and Buffer cells. 4. PROPOSED BRENT KUNG ADDER ARCHITECTURE The proposed architecture of 8 bit Brent Kung adder the buffers which have been used for the driving compatibility of the gates have been removed to reduce the delay of the adder and is shown in Figure-10. By removal of it, it has not shown any errors or glitches and thus when implemented on the multiplier reduces the delay for the calculation of the carry chain and partial products. As Brent Kung has the simplest structure among all parallel prefix adders our main concern was to reduce the delay. 9015

Figure-12. Simulation output of modified brent kung adder. Figure-10. Proposed 8-bit brent kung adder. Proposed multiplier architecture The Figure-11 shows the modified Brent Kung adder based 8X8 Vedic multiplier. In the proposed architecture delay is reduced and increases the overall performance as compared to the existing Brent Kung adder. Figure-13. Simulation output of Vedic Multiplier using modified Brent Kung Adder. Figure-11. 8X8 vedic multiplier using brent kung adder. 5. RESULTS AND DISCUSSIONS The proposed 8-bit architectures are developed in Xilinx platform. These are designed in verilog HDL, simulated in Xilinx ISim simulator and synthesized using Xilinx XST. These architecture are implemented and analyzed in Xilinx Spartan3 FPGA. The simulation outputs and RTL schematic of modified Adder, Multiplier and ALU are shown in Figure 12, 13, 14 and 15. Table-1 shows synthesize report. Figure-14. Simulation output of ALU using modified brent kung adder. 9016

Figure-15. RTL Schematic of modified 8-bit vedic multiplier. Table-1. Synthesize report. S. No. Parameter Delay (ns) No. of slices No. of LUTs 1 Ripple Carry Adder 18.771 9 15 2 Brent Kung Adder 15.178 10 18 3 4 Vedic Multiplier using RCA Vedic Multiplier using BKA 35.223 114 202 32.66 120 212 5 ALU using RCA 14.334 119 211 6 ALU using BKA 12.383 125 221 6. CONCLUSIONS The proposed Vedic multiplier is using unique addition tree structure, which gives better response in terms of speed in comparison to the conventional Vedic multiplier hardware. Vedic multiplier is using a modified Brent Kung adder which has less delay. The delay of the proposed ALU is reduced by 2ns. The results are concluded that the proposed adder architecture based multiplier and ALU are achieved better computation speed. This proposed ALU is suitable for high speed processor design. REFERENCES [1] Akshata R. Kanchan, Joshi. A., Prof. V.P. Gejji. 2015. Analysisand Implementation of Vedic Multiplier FPGA. International Journal for Technological Research in Engineering. 2(9): 2181-2183. [2] Deepak raj, Sahana, Prof. Praveen J, Prof. Raghavendra Rao R 2015. Design and Implementation of Different Types of Efficient Parallel Prefix Adders. 2015. International Journal of Innovative Research in Electrical, Electronics, Instrumentation and Control Engineering. 3(1). [3] Ravi. T. 2015. Design and performance analysis of ultra low power RISC processor using hybrid drowsy logic in CMOS technologies. International Journal of Applied Engineering Research (IJAER). 10(2): 4287-4296. 9017

[4] M. Manoranjani and T. Ravi. 2015. Multithreshold CMOS Sleep Stack and Logic Stack Technique for Digital Circuit Design. ARPN Journal of Engineering and Applied Sciences. 10(10): 4550-4556. [5] M. Prasanna Kumar, V. Siddharthan, Dr. Gopalakrishnan. K. 2015. Comparative Analysis of Brent Kung and Kogge Stone Parallel Prefix Adder for their Area, Delay and Power Consumption. Indian Journal of Applied Research. 5(10). [6] MeghaTalsania, Eugene John. 2009. Comparative Analysis of Parallel Prefix Adders. A Comparative Analysis of Parallel Prefix Adders 12 th Euromicro Conference on Digital System Design, Architectures, Methods and Tools. pp. 281-286. [7] Prof.Navneet Dubey and ShyamAkashe. 2014. Implementation of an Arithmetic Logic Using Area Efficient Carry Look Ahead Adder. International Journal of VLSI Design and Communication Systems, VLSICS. 5(6): 29-34. [8] R. Ramya and T. Ravi. 2015. Design of Cache Memory Mapping Techniques for Low Power Processor. ARPN Journal of Engineering and Applied Sciences. 10(11): 4783-4788. [9] Sudheer Kumar Yezerla, B Rajendra. 2014. Design and Estimation of delay, Power and area for Parallel Prefix Adders. Proceedings of 2014 RAECS UIET Panjab University Chandigarh, IEEE. [10] S. Ranjith, T. Ravi, P. Umarani, R. Arunya. 2014. Design of CNTFET based sequential circuits using fault tolerant reversible logic. International Journal of Applied Engineering Research. 9(24): 25789-25804. [11] Sunil M, Ankith R D, Manjunatha G D, Premanandha B S. Design and Implementation of Faster Parallel Prefix Kogge Stone Adder. International Journal of Electrical and Electronic Engineering and Telecommunications. 9018