Subband coring for image noise reduction. Edward H. Adelson Internal Report, RCA David Sarnoff Research Center, Nov

Similar documents
MULTIRATE DIGITAL SIGNAL PROCESSING

Copyright S. K. Mitra

FINITE-duration impulse response (FIR) quadrature

Two-Dimensional Wavelets with Complementary Filter Banks

Fourier Transform. Any signal can be expressed as a linear combination of a bunch of sine gratings of different frequency Amplitude Phase

A DUAL TREE COMPLEX WAVELET TRANSFORM CONSTRUCTION AND ITS APPLICATION TO IMAGE DENOISING

Digital Signal Processing

McGraw-Hill Irwin DIGITAL SIGNAL PROCESSING. A Computer-Based Approach. Second Edition. Sanjit K. Mitra

Digital Image Processing

CoE4TN4 Image Processing. Chapter 4 Filtering in the Frequency Domain

Digital Signal Processing

Digital Image Processing

Wavelet Transform. From C. Valens article, A Really Friendly Guide to Wavelets, 1999

Computer Vision, Lecture 3

OPTIMIZED SHAPE ADAPTIVE WAVELETS WITH REDUCED COMPUTATIONAL COST

Achim J. Lilienthal Mobile Robotics and Olfaction Lab, AASS, Örebro University

ELEC Dr Reji Mathew Electrical Engineering UNSW

Frequency Domain Enhancement

Quantized Coefficient F.I.R. Filter for the Design of Filter Bank

Simulation of Frequency Response Masking Approach for FIR Filter design

Convolution Pyramids. Zeev Farbman, Raanan Fattal and Dani Lischinski SIGGRAPH Asia Conference (2011) Julian Steil. Prof. Dr.

Fong, WC; Chan, SC; Nallanathan, A; Ho, KL. Ieee Transactions On Image Processing, 2002, v. 11 n. 10, p

Broadband Microphone Arrays for Speech Acquisition

Continuously Variable Bandwidth Sharp FIR Filters with Low Complexity

EE482: Digital Signal Processing Applications

Computer Graphics (Fall 2011) Outline. CS 184 Guest Lecture: Sampling and Reconstruction Ravi Ramamoorthi

Wavelet Transform. From C. Valens article, A Really Friendly Guide to Wavelets, 1999

Filtering in the spatial domain (Spatial Filtering)

CS534 Introduction to Computer Vision. Linear Filters. Ahmed Elgammal Dept. of Computer Science Rutgers University

1.Discuss the frequency domain techniques of image enhancement in detail.

VU Signal and Image Processing. Torsten Möller + Hrvoje Bogunović + Raphael Sahann

Image acquisition. Midterm Review. Digitization, line of image. Digitization, whole image. Geometric transformations. Interpolation 10/26/2016

An Efficient and Flexible Structure for Decimation and Sample Rate Adaptation in Software Radio Receivers

Boundary filter optimization for segmentationbased subband coding

Practical Image and Video Processing Using MATLAB

Introduction to Wavelet Transform. Chapter 7 Instructor: Hossein Pourghassem

קורס גרפיקה ממוחשבת 2008 סמסטר ב' Image Processing 1 חלק מהשקפים מעובדים משקפים של פרדו דוראנד, טומס פנקהאוסר ודניאל כהן-אור

Enhanced DCT Interpolation for better 2D Image Up-sampling

Announcements. Image Processing. What s an image? Images as functions. Image processing. What s a digital image?

Signal Processing Toolbox

Fourier Transforms and the Frequency Domain

Experiment 6: Multirate Signal Processing

Time-skew error correction in two-channel time-interleaved ADCs based on a two-rate approach and polynomial impulse responses

Design and Simulation of Two Channel QMF Filter Bank using Equiripple Technique.

Image Processing. What is an image? קורס גרפיקה ממוחשבת 2008 סמסטר ב' Converting to digital form. Sampling and Reconstruction.

Prof. Vidya Manian Dept. of Electrical and Comptuer Engineering

Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach

A Robust Technique for Image Descreening Based on the Wavelet Transform

IN the past decades, various filter bank-related techniques

Midterm Review. Image Processing CSE 166 Lecture 10

Understanding Digital Signal Processing

Sampling Theory. CS5625 Lecture Steve Marschner. Cornell CS5625 Spring 2016 Lecture 7

Image Enhancement using Histogram Equalization and Spatial Filtering

Biosignal filtering and artifact rejection. Biosignal processing, S Autumn 2012

Filip Malmberg 1TD396 fall 2018 Today s lecture

Part One. Efficient Digital Filters COPYRIGHTED MATERIAL

Discrete-Time Signal Processing (DSP)

DIGITAL FILTER DESIGN WITH OPTIMAL ANALOG PERFORMANCE. Yutaka Yamamoto Λ and Masaaki Nagahara y,

International Journal of Advancedd Research in Biology, Ecology, Science and Technology (IJARBEST)

Sampling and Reconstruction

Design of Two-Channel Low-Delay FIR Filter Banks Using Constrained Optimization

Module 3 : Sampling and Reconstruction Problem Set 3

Almost Perfect Reconstruction Filter Bank for Non-redundant, Approximately Shift-Invariant, Complex Wavelet Transforms

DISCRETE FOURIER TRANSFORM AND FILTER DESIGN

II: (1999) 46 (8) ISSN

An Efficient Algorithm to Design Nearly Perfect- Reconstruction Two-Channel Quadrature Mirror Filter Banks

Outline. Design Procedure. Filter Design. Generation and Analysis of Random Processes

FILTER BANKS WITH IN BAND CONTROLLED ALIASING APPLIED TO DECOMPOSITION/RECONSTRUCTION OF ECG SIGNALS

8. Lecture. Image restoration: Fourier domain

AUTOMATIC IMPLEMENTATION OF FIR FILTERS ON FIELD PROGRAMMABLE GATE ARRAYS

IIR Ultra-Wideband Pulse Shaper Design

Proceedings of the 5th WSEAS Int. Conf. on SIGNAL, SPEECH and IMAGE PROCESSING, Corfu, Greece, August 17-19, 2005 (pp17-21)

Design of a Sharp Linear-Phase FIR Filter Using the α-scaled Sampling Kernel

Module 3: Video Sampling Lecture 18: Filtering operations in Camera and display devices. The Lecture Contains: Effect of Temporal Aperture:

Introduction to Wavelets Michael Phipps Vallary Bhopatkar

TDI2131 Digital Image Processing

Improving Virtual Sound Source Robustness using Multiresolution Spectral Analysis and Synthesis

Digital Image Processing. Digital Image Fundamentals II 12 th June, 2017

Optimal Design RRC Pulse Shape Polyphase FIR Decimation Filter for Multi-Standard Wireless Transceivers

Multirate Signal Processing

Digital Image Processing. Lecture 5 (Enhancement) Bu-Ali Sina University Computer Engineering Dep. Fall 2009

Image Filtering and Gaussian Pyramids

Sampling and Reconstruction

Filter Banks I. Prof. Dr. Gerald Schuller. Fraunhofer IDMT & Ilmenau University of Technology Ilmenau, Germany. Fraunhofer IDMT

(i) Understanding of the characteristics of linear-phase finite impulse response (FIR) filters

The Scientist and Engineer's Guide to Digital Signal Processing By Steven W. Smith, Ph.D.

Design of IIR Half-Band Filters with Arbitrary Flatness and Its Application to Filter Banks

Images and Filters. EE/CSE 576 Linda Shapiro

Using the DFT as a Filter: Correcting a Misconception by Richard G. Lyons

Image features: Histograms, Aliasing, Filters, Orientation and HOG. D.A. Forsyth

Denoising and Effective Contrast Enhancement for Dynamic Range Mapping

CG401 Advanced Signal Processing. Dr Stuart Lawson Room A330 Tel: January 2003

ISI-Free FIR Filterbank Transceivers for Frequency-Selective Channels

Optimized PR-QMF Based Codes For Multiuser. Communications. Kenneth Hetling, Gary Saulnier, and Pankaj Das. Rensselaer Polytechnic Institute

The Design of Compressive Sensing Filter

COMPLEX WAVELETS FOR SHIFT INVARIANT ANALYSIS AND FILTERING OF SIGNALS. Nick Kingsbury

NEITHER time- nor frequency-domain based methods

Digital Image Processing 3/e

INDEX Space & Signals Technologies LLC, All Rights Reserved.

(i) Understanding of the characteristics of linear-phase finite impulse response (FIR) filters

Transcription:

Subband coring for image noise reduction. dward H. Adelson Internal Report, RCA David Sarnoff Research Center, Nov. 26 1986. Let an image consisting of the array of pixels, (x,y), be denoted (the boldface indicating that it is a discrete function over (x,y)). Likewise, let a kernel with taps A(x,y) be denoted A. The image can be decomposed into a set of N filtered subimages, S i, by convolution with the set of N kernels, A i. Let * indicate convolution; then the decomposition is: = S 0 +... +S N-1 where S i = *A i We do not assume that the subimages have been decimated. Thus each subimage has the same number of pixels as the original image, and the total number of pixels in the decomposition is N times the number in the original image. The kernels would normally be chosen to produce a decomposition into a "useful" set of subimages. In the case of noise coring, this will usually mean that the kernels are selective for orientation and scale, i.e. that they select out limited patches in the spatial frequency domain. For example, we might choose A 1 to select for vertical energy, A 2 to select for horizontal energy, and A 3 to select for diagonal energy, as shown by the spectra in figure 1(a). A 0 contains the remaining lowpassed energy. The decomposition is shown as a flow diagram in figure 1(b).

A 0 A 1 A 2 A 3 Lowpass Vertical Horizontal Diagonal y y y y x x x x (a) *A 3 S 3 *A 2 S 2 *A 1 S 1 (b) *A 0 S 0 Figure 1: (a) the spatial-frequency spectra for a decomposition into lowpass, vertical, horizontal, and diagonal energy. (b) A flow diagram of the decomposition and reconstruction. The original image,, is decomposed into the 4 subimages, S 0,...S 3, by by filtering with the kernels A 0,...A 3. These subimages sum to reconstruct the original,. (Observe that it is sufficient, but not necessary, that the kernels themselves sum to the unit impulse, (x,y) ).

Suppose now that each kernel A i can be expressed as the convolution of a kernel B i with itself: A i = B i *B i. Then one can define a symmetrical subband transform as follows: Forward transform: Let T i = *B i. Then the transform is: =>> { T 0,...,T N-1 } Thus transform consists of the set of subband images, T i. Inverse transform: To reconstruct the image from the transform, apply the kernels B i once again to the corresponding images T i and sum the resulting images S i. T 0 +... + T N-1 * B N-1 = S 0 +... + S N-1 = The transform is illustrated in figure 2. The image is filtered through a bank of parallel filters, indicated by the convolving kernels B i. The resulting filtered images are refiltered by the same kernels, and then summed to reconstruct the original image.

Forward transform Inverse transform T 3 S 3 T 2 S 2 T 1 S 1 T 0 S 0 Figure 2: A symmetrical subband transform. Image is filtered by convolution with kernels B 0,...B 3, to form the transform images T 0...T 3. A second filtering leads to a set of subimages S 0...S 3 that can be summed to reconstruct the original image,. In a typical subband transform, one of the filters is lowpass, containing all of the DC component, while the others are bandpass, with no DC. We will assume that the kernel B 0 is lowpass, as are the associated images T 0 and S 0. The basis functions of the subband transform are the kernels themselves, taking centers at all positions in the image. That is, the transform can be considered to decompose the image into a sum of subimages, where the subimages themselves are sums of kernels repeated along the (x,y) grid: where = Σ i S i S i (x,y) = Σ u,v T i (u,v)b i (x-u,y-v)]

If the basis functions are selective for useful image information such as edges or lines, then the transform can be used to reduce visible noise by applying a static non-linear coring function. A typical coring function is shown in figure 3. It is assumed that the values of the coefficients to which it is applied cover the arbitrary range (-127 to 127). If a coefficient is near zero, its value is likely to represent noise and therefore it is attenuated. If a value is large, it is likely to represent legitimate image information and therefore it is left unchanged or slightly amplified. There is a smooth transition between the "coring" region and the "peaking" region. The precise shape of the best coring function will depend on the characteristics of the signal and the noise, but the coring function will have this general form. CORING FUNCTION 127 OUTPUT 0-127 -127 0 127 INPUT Figure 3: A typical coring function, which acts as a static non-linearity on each pixel value of a bandpass transform image. Values near zero are attenuated, while values far from zero are left unchanged or are amplified. Coring can be usefully applied to all of the bandpass images in the transform, i.e. all of the images except for the lowpass image T 0. The process

is shown in figure 4. Denote the coring functions c i (.), the cored transform images T i, and the resulting cored subimages S i. That is, T i (x,y) = c i ( T i (x,y) ) and S i = T i * B The final cored image is then = i [ S i ] T 3 T 3 S 3 T 2 c 3 T 2 S 2 T 1 c 2 T 1 S 1 c 1 T 0 Figure 4: Flow diagram for coring an image. The original,, is transformed to the images T 0,...T 3. The three bandpass images, T 1,...T 3, are then cored with the static non-linearities, c 1,..., c 3. Reconstruction proceeds normally. The final cored image,, should have less noise. Coring is more effective when carried out at multiple spatial frequency bands. The previous description applies to coring at a single spatial frequency range determined by the kernels B i. The lowpass transform image,

T 0, was not cored, and so all of its information (both signal and noise) is passed unchanged. This information can be further subdivided by applying a second subband transform with kernels tuned to lower frequencies, and coring can be applied to the resulting transformed images T i. The most efficient way to achieve this is hierarchically, as shown in figure 5. At the first level, the image is transformed into the images T 0, T 1,T 2, and T 3. Since T 0 is lowpass, it cannot be directly cored. However it can be used as input to a second coring stage, where all the kernels operate at lower frequencies. The second stage kernels are denoted B i, the transform images T i, the coring function c i, the cored transform images T i, and the cored subimages S i. The cored version of T 0, denoted T 0, is then combined with the cored transform images from the first stage, and the final cored image is reconstructed. This process can be repeated for several stages; each stage will remove noise in a lower frequency band than the previous stage. In practice there is usually little advantage to repeating the process at more than 3 frequency bands.

T 3 T 3 S 3 T 2 c 3 T 2 S 2 T 1 c 2 T 1 S 1 c 1 S 0 T 0 T 0 T 3 T 2 T 1 T 0 c 3 c 2 c 1 T 3 T 2 T 1 S 3 S 2 S 1 S 0 Figure 5: Two stage coring. The low-pass transform image T 0 undergoes a second stage of coring by a set of kernels B 0,...B 3. These kernels are tuned to lower spatial frequency than the first set, and so core out noise in a lower frequency band. We will now describe a simple concrete example of such a two-level coring system, using 2x2 Hadamard functions as the kernels. These are not

very good kernels because their poor filter selectivity; but they offer an example that is easily understood. The first-level kernels are as follows: 1 1 B 0 = 1/4 B 1 = 1/4 1 1-1 -1 1 1-1 1 B 2 = 1/4 B 3 = 1/4-1 1 1-1 -1 1 And for the second level, the same kernels are simply padded with zeroes to spread their taps to double the distance, thereby reducing their frequency tuning by one octave: 1 0 1 B 0 = 1/4 B 1 = 1/4 0 0 0 1 0 1-1 0-1 0 0 0 1 0 1-1 0 1 B 2 = 1/4 B 3 = 1/4 0 0 0-1 0 1 1 0-1 0 0 0-1 0 1 Note that at the second level the kernels are being applied to an image that has already been convolved with the lowpass kernel B 0. Thus the effective kernels at the second stage are actually given by the convolutions of the new kernels, B i, with B 0 :

B0 * B 0 = 1/16 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 B0 * B 1 = 1/16-1 -1-1 -1-1 -1-1 -1 1 1 1 1 1 1 1 1 B0 * B 2 = 1/16-1 -1 1 1-1 -1 1 1-1 -1 1 1-1 -1 1 1 B0 * B 3 = 1/16 1 1-1 -1 1 1-1 -1-1 -1 1 1-1 -1 1 1 The same process of padding and convolution can be repeated at further stages. Large effective 2-D kernels are thus generated, using the principles of hierarchical discrete correlation previously outlined by Burt (1981). More effective coring requires kernels with better tuning. A good set are the quadrature mirror filters (QMFs) that have been widely used in speech coding (Crochiere and Rabiner, 1983). A pair of high-pass and low-pass QMF kernels can be separably combined to form a set of four 2-D kernels (cf. Woods and ONeil, 1986). Note that QMF kernels are designed to be used with decimation, but that we do not actually decimate in our coring procedure. This is because we find that decimation introduces "jaggies" and other aliasing artifacts when combined with the non-linear operation of coring. Since we do not decimate we are effectively working at double the necessary linear sample density at the first stage, and at quadruple the necessary linear sample density at the second stage. This extra density leads to extra computations, but prevents aliasing artifacts. Here is an example of a 12-tap low-pass and high-pass QMF pair, taken from Johnson (1980): Lowpass: Highpass: -.003809.003809.018857 -.018857

-.002710.002710 -.084696.084696.088470 -.088470.484389 -.484389.484389.484389.088470.088470 -.084696 -.084696 -.002710 -.002710.018857.018857 -.003809 -.003809 The QMF kernels that have been published have an even number of taps, and the high-pass kernels are odd-symmetric. We have also derived kernels that are even symmetric, with an odd number of taps; these kernels are different from those that have been published, and tend to be more compact. These filters were derived numerically to satisfy the three constraints: (1) narrow spatial frequency tuning, (2) spatial compactness, and (3) spectral completeness (i.e. ability to reconstruct the original image accurately from the subimages). A 5-tap kernel pair is shown here: Lowpass: Highpass: -.0516 -.0516.2500 -.2500.6032.6032.2500 -.2500 -.0516 -.0516 A 7-tap kernel pair is shown here: Lowpass: Highpass: -.0052.0052 -.0516 -.0516.2552 -.2552

.6035.6035.2552 -.2552 -.0516 -.0516 -.0052.0052 By combining these two 1-D kernels separably, in all four pairings, one can create four 2-D kernels, selective for low-pass, vertical, horizontal, and diagonal energy. Because these kernels are spatially compact, they lead to coring that is well localized, without ringing artifacts. And because the kernels have compact spectra, they are good at selecting out oriented image information within each frequency band. Asymmetric transforms. We note that it is also possible to use asymmetric transforms, where the sampling functions are different from the basis functions. We previously assumed that the original set of kernels, A i, could be expressed as the convolution of B i with itself. This amounts to saying that the sampling functions are the same as their corresponding basis functions (up to a scaling), a fact that will hold for orthogonal basis sets. More generally we can assume that the A i are expressible as the convolutions of sampling/basis pairs that need not be equal. This occurs when the sampling and basis functions are related by the pseudo-inverse. The asymmetric transform is illustrated in figure 6. Let the sampling kernels be b i and the basis kernels be B i. Then we let T i = *b i. and the forward transform is: =>> { T 0,...,T N-1 } For the inverse transform, the kernels B i are applied to the corresponding images T i to give the subband images S i. These are summed to reconstruct the original image. That is,

T 0 +... + T N-1 * B N-1 = S 0 +... + S N-1 = Forward transform Inverse transform *b 3 T 3 S 3 T 2 S 2 *b 2 T 1 S 1 *b 1 *b 0 T 0 S 0 Figure 6: An asymmetric subband transform. The transforming kernels are b 0,...b 3, while the kernels used for reconstruction are B 0,...B 3. References: Burt, P J "Fast filter transforms for image processing," Comput Gr. Image Proc., 16, 20-51(1981). Crochiere, R, and Rabiner, L R, Multirate digital signal processing, nglewood Cliffs: Prentice-Hall (1983). Johnston, J D "A filter family designed for use in quadrature mirror filter banks," Proc. 1980 I Int. Conf. Acoust. Speech Signal Process, pp 291-294. (1980).

Woods, J W, and ONeil, S D, "Subband coding of images," I Trans. Acoustics, Speech and Sig. Proc., ASSP-34, 1278-1288 (1986).