Data Word Length Reduction for Low-Power DSP Software

Similar documents
Low-Power Multipliers with Data Wordlength Reduction

Low Power Approach for Fir Filter Using Modified Booth Multiprecision Multiplier

Design of Area and Power Efficient FIR Filter Using Truncated Multiplier Technique

Design and Performance Analysis of a Reconfigurable Fir Filter

An Efficient Reconfigurable Fir Filter based on Twin Precision Multiplier and Low Power Adder

JDT LOW POWER FIR FILTER ARCHITECTURE USING ACCUMULATOR BASED RADIX-2 MULTIPLIER

Modified Booth Encoding Multiplier for both Signed and Unsigned Radix Based Multi-Modulus Multiplier

Design and Implementation of 64-bit MAC Unit for DSP Applications using verilog HDL

FOR HIGH SPEED LOW POWER APPLICATIONS USING RADIX-4 MODIFIED BOOTH ENCODER

DESIGN OF MULTIPLE CONSTANT MULTIPLICATION ALGORITHM FOR FIR FILTER

Power-conscious High Level Synthesis Using Loop Folding

Implementation of Parallel Multiplier-Accumulator using Radix- 2 Modified Booth Algorithm and SPST

A New High Speed Low Power Performance of 8- Bit Parallel Multiplier-Accumulator Using Modified Radix-2 Booth Encoded Algorithm

LOW POWER & LOW VOLTAGE APPROXIMATION ADDERS IMPLEMENTATION FOR DIGITAL SIGNAL PROCESSING Raja Shekhar P* 1, G. Anad Babu 2

Mahendra Engineering College, Namakkal, Tamilnadu, India.

Design of Low Power Column bypass Multiplier using FPGA

DESIGN & IMPLEMENTATION OF FIXED WIDTH MODIFIED BOOTH MULTIPLIER

Low Power VLSI CMOS Design. An Image Processing Chip for RGB to HSI Conversion

Modified Booth Multiplier Based Low-Cost FIR Filter Design Shelja Jose, Shereena Mytheen

A New network multiplier using modified high order encoder and optimized hybrid adder in CMOS technology

ENHANCING SPEED AND REDUCING POWER OF SHIFT AND ADD MULTIPLIER

High performance Radix-16 Booth Partial Product Generator for 64-bit Binary Multipliers

COPY RIGHT. To Secure Your Paper As Per UGC Guidelines We Are Providing A Electronic Bar Code

Ajmer, Sikar Road Ajmer,Rajasthan,India. Ajmer, Sikar Road Ajmer,Rajasthan,India.

Methods for Reducing the Activity Switching Factor

AN EFFICIENT DESIGN OF ROBA MULTIPLIERS 1 BADDI. MOUNIKA, 2 V. RAMA RAO M.Tech, Assistant professor

VLSI Implementation of Digital Down Converter (DDC)

Low-Power CMOS VLSI Design

A Review on Different Multiplier Techniques

An Optimized Wallace Tree Multiplier using Parallel Prefix Han-Carlson Adder for DSP Processors

Innovative Approach Architecture Designed For Realizing Fixed Point Least Mean Square Adaptive Filter with Less Adaptation Delay

AN EFFICIENT MAC DESIGN IN DIGITAL FILTERS

Reduce Power Consumption for Digital Cmos Circuits Using Dvts Algoritham

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 1

Design and Characterization of 16 Bit Multiplier Accumulator Based on Radix-2 Modified Booth Algorithm

An Efficient Design of Parallel Pipelined FFT Architecture

A Novel Approach for High Speed and Low Power 4-Bit Multiplier

A Parallel Multiplier - Accumulator Based On Radix 4 Modified Booth Algorithms by Using Spurious Power Suppression Technique

REALIAZATION OF LOW POWER VLSI ARCHITECTURE FOR RECONFIGURABLE FIR FILTER USING DYNAMIC SWITCHING ACITIVITY OF MULTIPLIERS

High Performance Low-Power Signed Multiplier

SIGNED PIPELINED MULTIPLIER USING HIGH SPEED COMPRESSORS

Performance Analysis of Multipliers in VLSI Design

Design of Baugh Wooley Multiplier with Adaptive Hold Logic. M.Kavia, V.Meenakshi

Design of an optimized multiplier based on approximation logic

Design A Redundant Binary Multiplier Using Dual Logic Level Technique

AREA AND DELAY EFFICIENT DESIGN FOR PARALLEL PREFIX FINITE FIELD MULTIPLIER

ASIC Design and Implementation of SPST in FIR Filter

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

IJCSIET--International Journal of Computer Science information and Engg., Technologies ISSN

Comparison of Conventional Multiplier with Bypass Zero Multiplier

Low-Power Digital CMOS Design: A Survey

An Design of Radix-4 Modified Booth Encoded Multiplier and Optimised Carry Select Adder Design for Efficient Area and Delay

Low Power Design for Systems on a Chip. Tutorial Outline

ISSN Vol.07,Issue.08, July-2015, Pages:

DESIGN OF LOW POWER MULTIPLIER USING COMPOUND CONSTANT DELAY LOGIC STYLE

Design and Implementation of Truncated Multipliers for Precision Improvement and Its Application to a Filter Structure

Design of a High Speed FIR Filter on FPGA by Using DA-OBC Algorithm

Tirupur, Tamilnadu, India 1 2

VLSI Implementation of Reconfigurable Low Power Fir Filter Architecture

LOW POWER VLSI TECHNIQUES FOR PORTABLE DEVICES Sandeep Singh 1, Neeraj Gupta 2, Rashmi Gupta 2

A Fixed-Width Modified Baugh-Wooley Multiplier Using Verilog

Multiple Constant Multiplication for Digit-Serial Implementation of Low Power FIR Filters

Trade-Offs in Multiplier Block Algorithms for Low Power Digit-Serial FIR Filters

Globally Asynchronous Locally Synchronous (GALS) Microprogrammed Parallel FIR Filter

S.Nagaraj 1, R.Mallikarjuna Reddy 2

High Speed Binary Counters Based on Wallace Tree Multiplier in VHDL

Implementation and Performance Analysis of a Vedic Multiplier Using Tanner EDA Tool

The Design of a Low Power Asynchronous Multiplier

Comparison of Different Techniques to Design an Efficient FIR Digital Filter

A Survey of Optimization Techniques Targeting Low Power VLSI Circuits

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

AN EFFICIENT MULTI RESOLUTION FILTER BANK BASED ON DA BASED MULTIPLICATION

A New Configurable Full Adder For Low Power Applications

AUTOMATIC IMPLEMENTATION OF FIR FILTERS ON FIELD PROGRAMMABLE GATE ARRAYS

VLSI Design and FPGA Implementation of N Binary Multiplier Using N-1 Binary Multipliers

Implementation of FPGA based Design for Digital Signal Processing

A New Architecture for Signed Radix-2 m Pure Array Multipliers

REVIEW ARTICLE: EFFICIENT MULTIPLIER ARCHITECTURE IN VLSI DESIGN

Design and Implementation of Reconfigurable FIR Filter

EXPERIMENTS ON DESIGNING LOW POWER DECIMATION FILTER FOR MULTISTANDARD RECEIVER ON HETEROGENEOUS TARGETS

DESIGN OF FIR FILTER ARCHITECTURE USING VARIOUS EFFICIENT MULTIPLIERS Indumathi M #1, Vijaya Bala V #2

AN ADVANCED VLSI ARCHITECTURE OF PARALLEL MULTIPLIER BASED ON HIGHER ORDER MODIFIED BOOTH ALGORITHM

A Novel Approach of an Efficient Booth Encoder for Signal Processing Applications

Reconfigurable High Performance Baugh-Wooley Multiplier for DSP Applications

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) STUDY ON COMPARISON OF VARIOUS MULTIPLIERS

IN SEVERAL wireless hand-held systems, the finite-impulse

A Survey on Power Reduction Techniques in FIR Filter

Modified Partial Product Generator for Redundant Binary Multiplier with High Modularity and Carry-Free Addition

Keywords: Column bypassing multiplier, Modified booth algorithm, Spartan-3AN.

32-Bit CMOS Comparator Using a Zero Detector

High Speed Vedic Multiplier Designs Using Novel Carry Select Adder

An Optimized Implementation of CSLA and CLLA for 32-bit Unsigned Multiplier Using Verilog

Prototype Fabrication of Field-Programmable Digital Filter LSIs Using Multiple-Valued Current-Mode Logic Device Scaling and Future Prospects

Performance Comparison of Multipliers for Power-Speed Trade-off in VLSI Design

An Analysis for Power Minimization at Different Level of Abstraction to Optimize Digital Circuit

International Journal of Scientific & Engineering Research, Volume 4, Issue 5, May ISSN

Multiplier Design and Performance Estimation with Distributed Arithmetic Algorithm

DESIGN FOR LOW-POWER USING MULTI-PHASE AND MULTI- FREQUENCY CLOCKING

An Optimized Design for Parallel MAC based on Radix-4 MBA

Optimization of energy consumption in a NOC link by using novel data encoding technique

Transcription:

EE382C: LITERATURE SURVEY, APRIL 2, 2004 1 Data Word Length Reduction for Low-Power DSP Software Kyungtae Han Abstract The increasing demand for portable computing accelerates the study of minimizing power dissipation. Most power in processes is consumed by the switching activity of capacitance. The switching power can be reduced by not only minimizing hardware but also optimizing software. Key papers for low power techniques of hardware and software are surveyed. Power analysis and minimization techniques make it possible to optimize software power. Variable word lengths in arithmetic operations without any change of hardware can be a promising software power minimization technique. The objective of this work is to analyze and minimize power dissipation of digital signal processing (DSP) blocks at software level. Preliminary results of an array multiplier with reduced word lengths and plans are presented. This research will have contribution to prolong battery life in portable DSP applications. Index Terms Low-power, word length, switching activity, DSP, software power minimization I. Introduction Portable computing demands minimizing power dissipation due to limited power supply. Since power in CMOS circuits dissipates if they are switching, a major focus of low power design is to reduce the switching activity to the minimal level required to perform the computation [1]. The switching activity is reduced by modifying hardware, changing of the operation order and reducing data word length. One of the examples of data word length reduction is shown in Fig. 1. This figure shows 4 4 bit multiplication with 2-bit multiplier and 3-bit multiplicand. This multiplication has less power consumption compared to 4-bit multiplier and 4-bit multiplicand due to unused functional blocks reduce power dissipation.

K. Han: DATA WORD LENGTH REDUCTION FOR LOW-POWER DSP SOFTWARE 2 Fig. 1. Example of data word length reduction in multiply The objective of this research is to analyze and minimize power dissipation of digital signal processing blocks at software level with reducing data word length while keeping hardware architectures. One of the problems is how to analyze and measure power consumption for software level power reduction. After a brief summary of the power analysis in CMOS circuits, two techniques will be presented to reduce power dissipation at software level. II. Power Analysis There are three major sources of power dissipation in digital CMOS circuits which are summarized in the following equation: [1] P avg = P switching + P short circut + P leakage (1) The first term represents the switching component of power, the second term is due to the direct-path short circuit current conducting current directly from supply to ground, and the leakage power is primarily determined by fabrication technology considerations.

K. Han: DATA WORD LENGTH REDUCTION FOR LOW-POWER DSP SOFTWARE 3 The switching component of power is, P switching = αc L V 2 ddf clk (2) where α is the switching activity parameter, C L is the load capacitance, V dd is the operating voltage and f clk is the operating frequency. The switching power can be reduced through operation reduction, choice of number representation, exploitation of signal correlations, logic design, and physical design. The switching activity can be also reduced by optimizing the ordering of operations and by minimizing number of operations. αc L can also be viewed as the effective switching capacitance of the transistors nodes on charging and discharging. Therefore minimizing switching activities can effectively reduce power dissipation without impacting the circuit s operational performance [2]. III. Software Power Minimization Because of the increasing demand of software power analysis tool, Tiwari, Malik, and Wolfe [3] first systematically attempted to model software power cost. They formulated an instruction level power model for the microprocessor after measuring power of instruction sets. They made it possible to compare and evaluate programs in terms of their energy consumption for software power optimization. Lee, Tiwari, Malik and Fujita [4] developed power analysis and minimization techniques for embedded DSP software. They found that in typical DSP applications the multiplier in the multiply and accumulate (MAC) unit is usually a major source of power consumption. A micro-architectural power model for the multiplier was developed and analyzed for further power minimization. They observed the wide current variation of MAC instructions mainly according to the two values being multiplied in MAC unit. They also used the operand swapping technique for Booth multiplier [5]. The Booth

K. Han: DATA WORD LENGTH REDUCTION FOR LOW-POWER DSP SOFTWARE 4 multiplier does not treat two input symmetrically. Their experiment showed just swapping the operations in register A and B can reduce power for MAC instructions. They also used instruction packing, instruction scheduling, and memory bank assigning for low energy. IV. Minimizing Word Length for Low-Power Chandrakasan et al. [6] showed the number of bits affects all key parameters of a design, including speed, area, and power. Choi and Burleson [7] presented a general searchbased methodology for wordlength optimization and used switching power model for the power dissipation. Considering a voltage dropping factor and the area of computing element according to the wordlengths, they analyzed the switching power consumption assuming the power dissipation is proportional to the area of computing element. Erdogan and Arslan [8] showed low power multiplication schemes for finite impulse response (FIR) filter on DSP processors. They used data bus and coefficient bus for the filtering operation separately. They measured the switching activity of 8, 16, and 32 bit array multipliers for filter orders of 32, 64, and 128. They showed power reduction by decreasing in switching activity at coefficient inputs of the multiplier and both data and coefficient memory buses. Chen, Wang and Wu [2] presented low-power 2 s compelment multipliers by minimizing the switching activities of partial products using the radix-4 Booth algorithm [5]. They used the fact that switching activities of the unused functional blocks are minimized where input bits of unused functional blocks remain unaltered. They increased the probability that the partial products become zero by swapping input data. Wordlength can be also changed by reconfigurable multiplier for low-power. Kim and Papaefthymiou [9] proposed a reconfigurable pipelined multiplier architecture by adapting its structure to computational requirements over time. It can efficiently cope with variable

K. Han: DATA WORD LENGTH REDUCTION FOR LOW-POWER DSP SOFTWARE 5 100 80 Counts 60 40 20 Fig. 2. 0 0 1000 2000 3000 4000 Number of node swtiching Switching count of 16 16 bit array multiplier with 10,000 random input data-rate multimedia applications such as video processing. The multiplier structures can dynamically reconfigure to lower their power consumption based on zero inputs and inputrate variations. V. Preliminary Results and Plans Much research assumed fixed hardware architectures and full dynamic range for arithmetic. However, switching activities vary according to dynamic range. For example, 16 16 bit multiplier has lower power consumption if input data word length is smaller. Fig. 2 shows an example of the switching result for 16 16 bit multiplier. Switching activity of array multiplier is simulated using synchronous data flow (SDF) [10] since each component in the multiplier produces and consumes same number of token. Table I shows preliminary result for reduced data word length of array multiplier with fixed hardware. The switching activity decreases as input data word length is reduced.

K. Han: DATA WORD LENGTH REDUCTION FOR LOW-POWER DSP SOFTWARE 6 TABLE I Transition counts for array multiplier of 16 16 bit multiplier with 10,000 random data Input data Min Max Std Mean 4-bit 0 305 54 94 8-bit 0 1198 163 518 16-bit 383 3152 319 1863 Fig. 3. FIR filter for SDF graph Different architectures such as Booth Radix-4 multiplier and Wallace multiplier [11] will be also simulated and compared. The power can be reduced but precision has loss, if data word length is shorter. So tradeoffs between power consumption and precision will be researched. Power model for different input word length does not exist. So I will formulate the power model for arithmetic operations. FIR filter will be simulated for DSP application. The filter structure is shown in Fig. 3. On each firing, every block will consume and produce a signal token. Therefore the filter will use SDF graph [10].

K. Han: DATA WORD LENGTH REDUCTION FOR LOW-POWER DSP SOFTWARE 7 References [1] A. P. Chandrakasan and R. W. Brodersen, Minimizing power consumption in digital CMOS circuits, Proc. IEEE, vol. 83, pp. 498 523, Apr. 1995. [2] Oscal T.-C. Chen, Sandy Wang, and Yi-Wen Wu, Minimization of switching activities of partial products for designing low-power multipliers, IEEE Trans. VLSI Syst., vol. 11, pp. 418 433, June 2003. [3] V. Tiwari, S. Malik, and A. Wolfe, Power analysis of embedded software: A first step towards software power minimization, in Proc. IEEE Int. Conf. on Computer-Aided Design, San Jose, CA, Nov. 1994, pp. 429 435. [4] M. T. Lee, V. Tiwari, S. Malik, and M. Fujita, Power analysis and minimization techniques for embedded DSP software, IEEE Trans. VLSI Syst., vol. 5, pp. 123 135, Mar. 1997. [5] A.D. Booth, A signed binary multiplication technique, Quart. J. Mech. Appl. Math., vol. 4, pp. 236 240, 1951. [6] A. P. Chandrakasan, M. Potkonjak, J. Rabaey, and R. W. Brodersen, Optimizing power using transformations, IEEE Trans. Computer-Aided Design, vol. 14, pp. 12 31, Jan. 1995. [7] H. Choi and W. P. Burleson, Search-based wordlength optimization for VLSI/DSP synthesis, in Proc. IEEE Workshop on VLSI Signal Processing, Oct. 1994, vol. 7, pp. 198 207. [8] A. T. Erdogan and T. Arslan, Low power multiplication scheme for FIR filter implementation on single multiplier CMOS DSP processors, Electronics Letters, vol. 32, pp. 1959 1960, 1996. [9] S. Kim and M. Papaefthymiou, Reconfigurable low-energy multiplier for multimedia

K. Han: DATA WORD LENGTH REDUCTION FOR LOW-POWER DSP SOFTWARE 8 system design, in Proc. IEEE Workshop on VLSI, Apr. 2000, pp. 129 134. [10] B. L. Evans, Class Notes for EE382C: Embedded Software Systems, The University of Texas at Austin, 2003. [11] C. S. Wallace, A suggestion for a fast multiplier, IEEE Trans. Comput., vol. 13, pp. 14 17, 1964.