AREA AND DELAY EFFICIENT DESIGN FOR PARALLEL PREFIX FINITE FIELD MULTIPLIER


1 CH. JAYA PRAKASH, 2 P. HAREESH, 3 SK. FARISHMA
1&2 Assistant Professor, Dept. of ECE, 3 M.Tech Student, Sir CR Reddy College of Engineering, West Godavari (Dt), Eluru, AP.

ABSTRACT: Multiplication is a basic building block of DSP processors, image processing and many other applications. Over the years, the computational complexity of the algorithms used in Digital Signal Processors (DSPs) has gradually increased. This calls for a parallel prefix multiplier that achieves high execution speed and meets the performance demands. The architecture of the proposed multiplier mainly consists of a parallel prefix adder. Adders are among the most significant components in digital signal processing and control systems, and the speed and accuracy of a processor or system depend on the performance of its adder. The parallel prefix adder operates in three stages: a pre-processing stage, a carry generation stage and a post-processing stage. This adder makes the proposed multiplier fast. In this research work, a new design of parallel prefix multiplier is proposed that uses a very fast parallel prefix adder. The experimental results show that the proposed system reduces the area by 29484 Kbytes and the delay by 64.825 ns.

Key words: Parallel prefix adder, proposed multiplier, gray cell, finite field multiplier

I. INTRODUCTION

According to Moore's Law, the number of transistors on a chip almost doubles every two years. Increasingly complicated designs can therefore be implemented on a chip, at the cost of higher power density and more heat in the circuits. Among security technologies, public-key cryptography is one of the most popular and significant. It can provide certain unique security services, such as key exchange and digital signatures. Two types of public-key cryptography are widely used for security purposes: (1) RSA and (2) elliptic curve (EC) cryptography.
An EC cryptosystem uses a shorter key than RSA to provide the same level of security. The elliptic curve used in an EC cryptosystem is defined over a finite field, so a low-power design of the finite field arithmetic results in an EC cryptosystem that consumes less power and is more suitable for wireless applications. For hardware implementation, the binary extension field, denoted GF(2^m), is very attractive because it offers carry-free arithmetic. There are various ways to represent field elements in GF(2^m), such as the polynomial basis (PB), the normal basis and the dual basis. The most widely used basis is the PB, because it is adopted as one of the basis choices by organizations that set standards for cryptography applications. Generalized PBs have been proposed for the efficient implementation of multipliers over GF(2^m). The choice of the irreducible polynomial P(x) affects the complexity of a finite field multiplier: irreducible polynomials with fewer non-zero terms can provide multipliers with lower complexity.

PB finite field multiplier architectures can be categorized into bit-serial, bit-parallel and digit-serial architectures. The bit-serial architecture is area efficient but too slow for many applications. The bit-parallel architecture is fast but expensive in terms of area. The digit-serial architecture is flexible, with moderate speed and a reasonable cost of implementation. Two low-energy digit-serial PB multipliers have been proposed in which a binary tree structure of XOR gates is used instead of a linear array of XOR gates for degree reduction, which reduces both power consumption and delay. Various digit-serial multipliers have been proposed, such as most-significant-digit and least-significant-digit designs, with modifications in architecture. A factoring technique is involved in the design of a digit-serial PB multiplier in GF(2^m).

II. EXISTING SYSTEM

A finite field is a set of finitely many elements on which the operations of addition and multiplication are defined. A binary extension field GF(2^m) is generated by a degree-m irreducible polynomial

$P(x) = x^m + p_{m-1} x^{m-1} + \cdots + p_2 x^2 + p_1 x + 1$

where each coefficient $p_i$ is either 0 or 1.

A CMOS-based design consists of a large number of standard cells and nets. Its dynamic power consumption can be expressed as

$P_{dynamic} = P_{switching} + P_{internal}$

$P_{switching}$ is the total switching power, obtained by summing over all nets (a net is a connection between the inputs and outputs of cells). Switching power is the power dissipated due to the charging and discharging of the output load capacitance of a cell. $P_{internal}$ is the total internal power, obtained by summing over all cells. The internal power of each cell is the power consumed within the cell because of the charging and discharging of its internal node capacitances and because of short-circuit current. The dynamic power $P_{dynamic}$ can be reduced by lowering $P_{switching}$ or $P_{internal}$. An effective method to reduce power consumption is factoring, which is applicable at both the architecture level and the gate level.

Fig. 1 Finite Field Multiplier

An architecture diagram for the digit-serial PB multiplier in GF(2^m) is shown in Fig. 1. There are three modules: the k x m multiplier, the constant multiplier and the field adder. The k x m multiplier has two operands: operand B of m bits and operand A_j of k bits. A_j changes in every clock cycle j, and therefore has higher switching activity than operand B.
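As a rough illustration of the polynomial-basis arithmetic and of the three modules of the digit-serial multiplier described above, the following Python sketch multiplies two elements of GF(2^m) one k-bit digit at a time. It is not the authors' hardware design: the most-significant-digit-first ordering, the function names and the example field GF(2^8) with P(x) = x^8 + x^4 + x^3 + x + 1 (written as 0x11B) are assumptions chosen for illustration.

```python
def gf_reduce(v, m, poly):
    """Reduce the binary polynomial v modulo the degree-m polynomial poly.

    poly includes the x^m term, e.g. 0x11B for x^8 + x^4 + x^3 + x + 1.
    """
    for bit in range(v.bit_length() - 1, m - 1, -1):
        if (v >> bit) & 1:
            v ^= poly << (bit - m)  # cancel the term x^bit
    return v


def km_multiply(a_digit, b, k):
    """k x m multiplier: carry-free product of a k-bit digit and an m-bit operand."""
    acc = 0
    for i in range(k):
        if (a_digit >> i) & 1:
            acc ^= b << i  # addition in GF(2) is XOR, so no carries appear
    return acc


def digit_serial_multiply(a, b, m, poly, k):
    """Multiply a and b in GF(2^m), processing one k-bit digit of a per step.

    Assumed ordering: most significant digit first (Horner's rule).
    """
    num_digits = -(-m // k)  # ceil(m / k)
    c = 0
    for j in range(num_digits - 1, -1, -1):
        a_j = (a >> (j * k)) & ((1 << k) - 1)        # digit A_j for this step
        c = gf_reduce(c << k, m, poly)                # constant multiplier: c * x^k
        c ^= gf_reduce(km_multiply(a_j, b, k), m, poly)  # field adder: bitwise XOR
    return c


if __name__ == "__main__":
    # Example in the assumed field GF(2^8) with digit size k = 4.
    print(hex(digit_serial_multiply(0x57, 0x83, 8, 0x11B, 4)))
```

In this model, k = 1 corresponds to a bit-serial multiplier and k = m to a single-step, bit-parallel one, which mirrors the speed and area trade-off between the architectures.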
The constant multiplier module realizes multiplication between a field element and the constant x^k. The field adder module implements finite field addition using m two-input XOR gates arranged as a one-layer network. Among these three modules, the k x m multiplier is the most complex. This multiplier is used for cryptography in security applications in communications.

III. PROPOSED SYSTEM

Research on binary arithmetic elements drives the development of devices. Field programmable gate arrays (FPGAs) have become popular in recent years because they improve the speed of microprocessor-based applications such as mobile DSP and telecommunication. An efficient parallel prefix adder is constructed in three stages: the pre-processing stage, the carry generation stage and the post-processing stage.

A. Pre-Processing Stage

In the pre-processing stage, propagate and generate signals are computed from each pair of input bits. The propagate signal is the XOR of the input bits and the generate signal is the AND of the input bits. The propagate ($P_i$) and generate ($G_i$) signals are given in equations 1 and 2.

$P_i = A_i \oplus B_i$ (1)

$G_i = A_i \cdot B_i$ (2)

B. Carry Generation Stage

In this stage a carry is generated for each bit, called the carry generate ($C_g$). The carry propagate ($C_p$) and carry generate are produced for further operations, but the final cell in each bit position gives the carry. The carry of each bit is then used to produce the sum of the next bit, and so on up to the last bit. The carry propagate and carry generate are given in equations 3 and 4, where $(G_1, P_1)$ is the more significant input pair of a cell and $(G_0, P_0)$ the less significant one.

$C_p = P_1 \cdot P_0$ (3)

$C_g = G_1 + (P_1 \cdot G_0)$ (4)

The carry propagate $C_p$ and carry generate $C_g$ of equations 3 and 4 together form the black cell. The carry generate alone, given in equation 5, forms the gray cell. This carry is used for the sum operation of the next bit.

$C_g = G_1 + (P_1 \cdot G_0)$ (5)

C. Post-Processing Stage

This is the final stage of an efficient parallel prefix adder. The carry of each bit is XORed with the propagate of the next bit, and the output is the sum, as shown in equation 6.

$S_i = P_i \oplus C_{i-1}$ (6)

The adder is used to add two sixteen-bit operands. The input bits first go through the pre-processing stage, which produces the propagates and generates. These go through the carry generation stage, which produces the carry generates and carry propagates. The carry of each bit then goes through the post-processing stage together with the propagate to give the final sum. The step-by-step process of the parallel prefix adder is shown in Fig 2.

Fig 2: Flow chart for PPA

In the efficient PPA, a black cell uses three gates and a gray cell uses two gates.
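The three stages can be sketched in software as follows. This is a minimal Python model under stated assumptions, not the authors' circuit: the source does not name the prefix network, so a Kogge-Stone style tree is assumed, the carry-in is taken to be 0, and the function names are hypothetical. A gray cell is used wherever the combined group already reaches bit 0, because only the generate output is needed there.

```python
def black_cell(g_hi, p_hi, g_lo, p_lo):
    """Black cell: three gates, returns (carry generate, carry propagate)."""
    return g_hi | (p_hi & g_lo), p_hi & p_lo


def gray_cell(g_hi, p_hi, g_lo):
    """Gray cell: two gates, returns the carry generate only."""
    return g_hi | (p_hi & g_lo)


def prefix_add(a, b, width=16):
    """Add two width-bit integers; returns (sum, carry_out)."""
    # Pre-processing stage: per-bit propagate and generate.
    p = [((a >> i) & 1) ^ ((b >> i) & 1) for i in range(width)]
    g = [((a >> i) & 1) & ((b >> i) & 1) for i in range(width)]

    # Carry generation stage: assumed Kogge-Stone style prefix tree.
    G, P = g[:], p[:]
    d = 1
    while d < width:
        new_G, new_P = G[:], P[:]
        for i in range(d, width):
            if i < 2 * d:
                # The group now spans bits 0..i, so the carry is final.
                new_G[i] = gray_cell(G[i], P[i], G[i - d])
            else:
                new_G[i], new_P[i] = black_cell(G[i], P[i], G[i - d], P[i - d])
        G, P = new_G, new_P
        d *= 2

    # Post-processing stage: sum bit = propagate XOR carry from the bit below.
    total = 0
    for i in range(width):
        carry_in = G[i - 1] if i > 0 else 0  # assumed carry-in of 0 at bit 0
        total |= (p[i] ^ carry_in) << i
    return total, G[width - 1]


if __name__ == "__main__":
    print(prefix_add(1234, 4321))
    print(prefix_add(0xFFFF, 0x0001))
```

Every level of the loop handles all bit positions independently, which is what lets hardware compute the carries in a number of levels that grows with the logarithm of the word length.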
The gray cell reduces the delay and memory because it uses only two gates. The proposed adder is designed with both black and gray cells. Using gray cells in the last stage of the proposed adder gives a large reduction in delay and in memory used.

In parallel prefix adders the operation is executed in parallel. This is done by segmenting the operation into smaller pieces that are computed in parallel. The output depends on the initial inputs. The Parallel Prefix Adder (PPA) is equivalent to the carry look-ahead adder (CLA). A carry look-ahead adder is a type of adder used in digital logic. The CLA is designed to overcome the latency introduced by the rippling effect of the carry bits in a ripple carry adder (RCA). A CLA improves speed by reducing the time needed to determine the carry bits. It calculates one or more carry bits before the sum, which reduces the wait time to calculate the result for the higher-order bits.

Fig. 3: Proposed Redundant PP Multiplier

The CLA uses the concept of generating (G) and propagating (P) carries. The PPA and the CLA differ in the way their carry generation block is implemented. The main advantage of the PP multiplier is that it reduces the number of logic levels by essentially generating the carries in parallel. The PP finite field multiplier is the fastest multiplier with a focus on design time, and is the choice for high-performance multipliers in industry.

IV. RESULTS

TABLE 1 Comparison Table of area and delay

Fig 4. Comparison graph of delay

Fig 5. Comparison graph of area

The results show that in the existing system the area occupied is 371744 Kbytes and the delay is 110.363 ns. In the proposed system the area occupied is 342260 Kbytes and the delay is 45.538 ns. Hence, the results show that the proposed system is more efficient than the existing system.

Fig 6. RTL Schematic

Fig 7. Technology Schematic

Fig 8. Output Waveform

V. CONCLUSION

In this project, a multiplier using a parallel prefix adder is proposed. The latency of the existing multiplier has been reduced. The conditional sum technique used in the adder is a good technique for energy efficiency. The reduction in energy of the carry tree is greater than the energy overhead of the conditional sum block, because the sum block is less complex than the carry tree. The adder structure implemented in this work has reduced delay, and the parallel prefix finite field multiplier also has better energy efficiency. The results show that the proposed architecture is more efficient than the conventional one in terms of memory used and speed.

CH JAYA PRAKASH completed his B.Tech in ECE from JNTU Hyderabad and M.Tech in VLSI System Design from JNTU Kakinada. He is working as Assistant Professor in the Dept. of ECE, Sir CR Reddy College of Engineering, Eluru.

HAREESH PANCHETI completed his B.Tech in ECE from JNTU Hyderabad and M.Tech in VLSI Design from SASTRA University. He is working as Assistant Professor in the Dept. of ECE, Sir CR Reddy College of Engineering, Eluru.

SK. FARISHMA completed her B.Tech at BVSR Engineering College, Chemakurthy, and is pursuing her M.Tech at Sir CR Reddy College of Engineering, Eluru. Her area of interest is VLSI.