Historical Document Preservation using Image Processing Technique

Similar documents
CoE4TN4 Image Processing. Chapter 3: Intensity Transformation and Spatial Filtering

Digital Image Processing

INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET)

Performance Analysis of Average and Median Filters for De noising Of Digital Images.

Image Processing for feature extraction

Chapter 6. [6]Preprocessing

An Efficient Noise Removing Technique Using Mdbut Filter in Images

I. INTRODUCTION II. EXISTING AND PROPOSED WORK

Image Enhancement using Histogram Equalization and Spatial Filtering

A Spatial Mean and Median Filter For Noise Removal in Digital Images

Removal of Gaussian noise on the image edges using the Prewitt operator and threshold function technical

CS534 Introduction to Computer Vision. Linear Filters. Ahmed Elgammal Dept. of Computer Science Rutgers University

Hybrid Binarization for Restoration of Degraded Historical Document

IMAGE ENHANCEMENT IN SPATIAL DOMAIN

Study And Analysis Of Enhancement And Edge Detection Method For Human Bone Fracture X-Ray Image

Image Denoising using Filters with Varying Window Sizes: A Study

Performance Comparison of Various Filters and Wavelet Transform for Image De-Noising

Non Linear Image Enhancement

Image Filtering. Median Filtering

Image Denoising Using Statistical and Non Statistical Method

Filtering in the spatial domain (Spatial Filtering)

APJIMTC, Jalandhar, India. Keywords---Median filter, mean filter, adaptive filter, salt & pepper noise, Gaussian noise.

Removal of Salt and Pepper Noise from Satellite Images

Chapter 3. Study and Analysis of Different Noise Reduction Filters

DIGITAL IMAGE DE-NOISING FILTERS A COMPREHENSIVE STUDY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

PERFORMANCE ANALYSIS OF LINEAR AND NON LINEAR FILTERS FOR IMAGE DE NOISING

Image Restoration and De-Blurring Using Various Algorithms Navdeep Kaur

Computing for Engineers in Python

Digital Image Processing 3/e

Processing and Enhancement of Palm Vein Image in Vein Pattern Recognition System

Table of contents. Vision industrielle 2002/2003. Local and semi-local smoothing. Linear noise filtering: example. Convolution: introduction

A Fast Median Filter Using Decision Based Switching Filter & DCT Compression

Achim J. Lilienthal Mobile Robotics and Olfaction Lab, AASS, Örebro University

Image De-noising Using Linear and Decision Based Median Filters

Image restoration and color image processing

Guided Image Filtering for Image Enhancement

FUZZY BASED MEDIAN FILTER FOR GRAY-SCALE IMAGES

Image acquisition. Midterm Review. Digitization, line of image. Digitization, whole image. Geometric transformations. Interpolation 10/26/2016

Image Deblurring. This chapter describes how to deblur an image using the toolbox deblurring functions.

Keywords Fuzzy Logic, ANN, Histogram Equalization, Spatial Averaging, High Boost filtering, MSE, RMSE, SNR, PSNR.

COMPARITIVE STUDY OF IMAGE DENOISING ALGORITHMS IN MEDICAL AND SATELLITE IMAGES

FILTER FIRST DETECT THE PRESENCE OF SALT & PEPPER NOISE WITH THE HELP OF ROAD

PHASE PRESERVING DENOISING AND BINARIZATION OF ANCIENT DOCUMENT IMAGE

Digital Image Processing. Lecture # 3 Image Enhancement

Prof. Vidya Manian Dept. of Electrical and Comptuer Engineering

A Comparative Review Paper for Noise Models and Image Restoration Techniques

Frequency Domain Enhancement

LAB MANUAL SUBJECT: IMAGE PROCESSING BE (COMPUTER) SEM VII

An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods

ABSTRACT I. INTRODUCTION

Image analysis. CS/CME/BIOPHYS/BMI 279 Fall 2015 Ron Dror

A Novel Approach for MRI Image De-noising and Resolution Enhancement

International Journal of Pharma and Bio Sciences PERFORMANCE ANALYSIS OF BONE IMAGES USING VARIOUS EDGE DETECTION ALGORITHMS AND DENOISING FILTERS

BASIC OPERATIONS IN IMAGE PROCESSING USING MATLAB

PARAMETRIC ANALYSIS OF IMAGE ENHANCEMENT TECHNIQUES

8.2 IMAGE PROCESSING VERSUS IMAGE ANALYSIS Image processing: The collection of routines and

A No Reference Image Blur Detection using CPBD Metric and Deblurring of Gaussian Blurred Images using Lucy-Richardson Algorithm

Image filtering, image operations. Jana Kosecka

SECTION I - CHAPTER 2 DIGITAL IMAGING PROCESSING CONCEPTS

Direction based Fuzzy filtering for Color Image Denoising

Image Filtering Josef Pelikán & Alexander Wilkie CGG MFF UK Praha

VLSI Implementation of Impulse Noise Suppression in Images

SYLLABUS CHAPTER - 2 : INTENSITY TRANSFORMATIONS. Some Basic Intensity Transformation Functions, Histogram Processing.

DARK CURRENT ELIMINATION IN CHARGED COUPLE DEVICES

4 STUDY OF DEBLURRING TECHNIQUES FOR RESTORED MOTION BLURRED IMAGES

Survey on Impulse Noise Suppression Techniques for Digital Images

Image Enhancement in spatial domain. Digital Image Processing GW Chapter 3 from Section (pag 110) Part 2: Filtering in spatial domain

Design of Novel Filter for the Removal of Gaussian Noise in Plasma Images

Image Enhancement in the Spatial Domain (Part 1)

Computer Vision. Howie Choset Introduction to Robotics

Image preprocessing in spatial domain

INSTITUTE OF AERONAUTICAL ENGINEERING Dundigal, Hyderabad

Image Enhancement. DD2423 Image Analysis and Computer Vision. Computational Vision and Active Perception School of Computer Science and Communication

Digital Image Processing

Performance Comparison of Mean, Median and Wiener Filter in MRI Image De-noising

Part I Feature Extraction (1) Image Enhancement. CSc I6716 Spring Local, meaningful, detectable parts of the image.

FPGA IMPLEMENTATION OF RSEPD TECHNIQUE BASED IMPULSE NOISE REMOVAL

Study of Various Image Enhancement Techniques-A Review

A Modified Non Linear Median Filter for the Removal of Medium Density Random Valued Impulse Noise

Computer Vision, Lecture 3

Noise and Restoration of Images

CSE 564: Visualization. Image Operations. Motivation. Provide the user (scientist, t doctor, ) with some means to: Global operations:

Literature Survey On Image Filtering Techniques Jesna Varghese M.Tech, CSE Department, Calicut University, India

Removal of High Density Salt and Pepper Noise through Modified Decision based Un Symmetric Trimmed Median Filter

Image Denoising Using Different Filters (A Comparison of Filters)

Stochastic Image Denoising using Minimum Mean Squared Error (Wiener) Filtering

Performance Evaluation of Edge Detection Techniques for Square Pixel and Hexagon Pixel images

Image processing for gesture recognition: from theory to practice. Michela Goffredo University Roma TRE

International Journal of Innovative Research in Engineering Science and Technology APRIL 2018 ISSN X

Computation Pre-Processing Techniques for Image Restoration

CSC 320 H1S CSC320 Exam Study Guide (Last updated: April 2, 2015) Winter 2015

Algorithm for Image Processing Using Improved Median Filter and Comparison of Mean, Median and Improved Median Filter

Image Denoising with Linear and Non-Linear Filters: A REVIEW

TDI2131 Digital Image Processing

Performance Analysis of Local Adaptive Real Oriented Dual Tree Wavelet Transform in Image Processing

Image Processing Lecture 4

IDENTIFICATION OF FISSION GAS VOIDS. Ryan Collette

Image binarization techniques for degraded document images: A review

Preparing Remote Sensing Data for Natural Resources Mapping (image enhancement, rectifications )

Transcription:

Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 2, Issue. 4, April 2013, pg.247 255 RESEARCH ARTICLE ISSN 2320 088X Historical Document Preservation using Image Processing Technique Prof. D.N. Satange 1, Ms. Swati S. Bobde 2, Ms. Snehal D. Chikate 3 1 Asst. Professor, Dept of computer Sci., Arts, Commerce & Science College, Amravati, India 2 Post Graduate Student, Dept of computer Sci., Arts, Commerce & Science College, Amravati, India 3 Post Graduate Student, Dept of computer Sci., Arts, Commerce & Science College, Amravati, India 1 dineshnsatange@rediffmail.com; 2 bobde.swati@gmail.com; 3 snehal.chikate@gmail.com Abstract In this paper we present recursive methods for the cleaning and the enhancing of historical documents. Most of the methods, used to clean and enhance documents or transform them to binary images, implement combinations of complicated image processing techniques which increase the computational cost and complexity. There is a problem of preserving historical documents from the degradation due to some bad storage conditions & poor contrast between foreground & background due to humidity. Digital image processing is an area characterized by the need for extensive experimental work to establish the viability of proposed solutions to a given problem. Our method simplifies the procedure by taking into account special characteristics of the document images. Moreover, the fact that the methods consists of iterated steps, makes it more flexible concerning the needs of the user. At the experimental results, comparison with other methods is provided and proves which one is best. Key Terms: - Image Enhancement techniques; Filters; Noise Removal techniques; handwritten Devnagari Document I. INTRODUCTION The objective of the research concerns to historical document preservation. Old documents are also prone to being attacked by pests and insects. For this, most documents that are very old enough to just suddenly crumble at slight touch are sealed on tight containers to prevent any insect from being able to get to it at any probable way. Connecting past and present is essential in order for one to find the right path towards future. It is for this reason. Why History is an important part of one's learning. In this manner, people are able to know how life came about and why they are now experiencing the independence they have. Tell-tales, written letters and pictures are some of the key items that most people hold on to in order to know about the past. These documents are the sole connection for one to better understand what indeed happened before. But it is amazing to see that after hundreds of years, such items remain as they are and enjoyed by everyone. The binarization of images is a long investigated field with remarkable accomplishments. Some of them have also been applied to documents or historical documents. One of the older methods in image binarization is Otsu s, based on the variance of pixel intensity. Bernsen calculates local thresholds using the neighbours. Niblack uses local mean and standard deviation. Sauvola presents a method specialized on document an image that applies two algorithms in order to calculate a different threshold for each pixel. The historical documents suffer from bad storage conditions and poor contrast between foreground and background due to humidity, paper deterioration and ink seeking. Moreover, the fragility of those documents does not allow access to many researchers while a legible digitized version is more accessible. Vast amounts of historical hand written texts are the property of state and country 2013, IJCSMC All Rights Reserved 247

libraries, where these texts will be converted to their digital form to preserve the information in secondary sources even if the primary sources such as ancient scrolls of text get degraded. Converting a scanned grey scale image into a binary image, while retaining the foreground (or regions of interest) and removing the background is an important step in many image analysis systems including document image processing. Original documents are often dirty due to smearing and smudging of text and aging. In this paper, we focus on documents where the foreground mainly comprises handwritten text. In some cases, the documents are of very poor quality due to seeping of ink from the other side of the page and general degradation of the paper and ink. The filtering algorithms are implemented on twenty five samples for various noise types. The simulation is performed on MATLAB R2007b version. In the next section comparison of all methods are analyzed in details. II. IMAGE SENSING & ACQUISION Fig. 1 single imaging, line sensor, array sensor [1] We will briefly discuss means for getting a picture into a computer CCD camera. Such a camera has, in place of the usual _lm, an array of photo sites; these are silicon electronic devices whose voltage output is proportional to the intensity of light falling on them. For a camera attached to a computer, information from the photo sites is then output to a suitable storage medium. Generally this is done on hardware, as being much faster and more efficient than software, using a frame-grabbing card. This allows a large number of images to be captured in a very short time in the order of one ten-thousandth of a second each. Fig. 1 shows three principle sensor arrangements used to transform illumination energy into digital images. The idea is simple: incoming energy is transformed into a voltage by the combination of input electrical power and sensor material that is responsive to the particular type of energy being detected. The output voltage waveform is the response of the sensors and a digital quantity is obtained from each sensor by digitizing the response. In this section we look at the principal modalities for image sensing and generation. III. INTENSITY TRANSFORMATION When you are working with gray-scale images, sometimes you want to modify the intensity values. For instance, you may want to reverse the black and the white intensities or you may want to make the darks darker and the lights lighter. An application of intensity transformations is to increase the contrast between certain intensity values so that you can pick out things in an image. For instance, the following two images show an image before and after an intensity transformation.[1] Types of Transformation Functions PhotographicNegative(using imcomplement) Logarithmictransformations(using c*log(1+f)) gamma transformation (using imadjust) [9] 2013, IJCSMC All Rights Reserved 248

Image Negatives Original Image Fig. 2 Original Image Negative Image Fig. 3 Negative Image Log Transformations Original Image Fig. 4 Original Image Log Image Fig. 5 Original Image 2013, IJCSMC All Rights Reserved 249

x 10 5 Histogram for Negative Image 3 x 10 5 Histogram for Log 2 2.5 1.5 2 1 1.5 1 0.5 0.5 0 0 0 50 100 150 200 250 (a) 5 x 10 Histogram for Gamma = 3 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 (b) 5 x 10 Histogram for Gamma = 1 2 2 1.5 1.5 1 1 0.5 0.5 0 0 0 50 100 150 200 250 0 50 100 150 200 250 (c) (d) Fig. 6 Histogram of (a) Negative Image, (b) LogI mage, (c)gamma=1, (d) Gamma=3 Gamma Transformations Image with Gamma=1 Fig. 7 Image with Gamma=1 Image with Gamma=3 Fig.8 Image with Gamma=3 IV. REMOVING NOISE FROM IMAGES Noise in documents is classified based on the criteria if it is dependent on the underlying content or independent of the underlying content. Stray marks, marginal noise, ink blobs and salt-and-pepper noise are independent of size. Such content-dependent noise is comparatively more difficult to model, mathematically non-linear and often multiplicative. If noise shows a consistent behavior in terms of these properties, it is called regular noise. In this paper we have made an attempt to study for two common types of noises like Gaussian, salt and pepper. The following example compares using an averaging filter and medfilt2 to remove salt and 2013, IJCSMC All Rights Reserved 250

pepper noise. This type of noise consists of random pixels' being set to black or white (the extremes of the data range). In both cases the size of the neighborhood used for filtering is 3-by-3.[3] Read in the image and display it. Fig. 9 Original Image Fig. 10 add noise in it Fig. 11 Filter the noisy image with a averaging filter Fig.12 now uses a median filter to filter the noisy image and display the results. Notice that medfilt2 does a better job of removing noise, with less blurring of edges. Gaussian Noise Gaussian noise also called Random Variation Impulsive Noise (RVIN) or normal noise T is a type of statistical noise in which the amplitude of the noise follows that of a Gaussian distribution. Gaussian Noise occurs as the probability density function of the normal distribution. Thus Gaussian Noise represents the frequency spectrum that has a bell shaped curve. Gaussian distribution noise can be expressed by: 2013, IJCSMC All Rights Reserved 251

Where: P(x) is the Gaussian distribution noise in image; µ and s is the mean and standard deviation respectively. Salt-and-pepper Noise Salt-and-pepper noise is also called as Fat-tail distributed or impulsive noise or spike noise. An image containing salt-and-pepper noise will have dark pixels in bright regions and bright pixels in dark regions. Salt and pepper noise is predominantly found in digital transmission and storage. It can be described as: I (t) = (1-e) S (t) + e N (t) (2) S (t) represents the amount of dark pixels in bright regions, N (t) represents bright pixels in dark regions and I(t) represents the overall salt-and-pepper noise in the given image and e={0,1},with a probability P. There is a clear 50% probability of the occurrence of either black or white pixels within the image giving rise to salt and pepper noise. To simulate the effects of some of the problems listed above, the toolbox provides the imnoise function, which you can use to add various types of noise to an image. The examples in this section use this function. V. IMAGE FILTERING ALGORITHMS In image processing, filters are mainly used to suppress either the high frequencies in the image, i.e. smoothing the image, or the low frequencies, i.e. enhancing or detecting edges in the image. Image restoration and enhancement techniques are described in both the spatial domain and frequency domain, i.e. Fourier transforms. Noise removal is easier in the spatial domain as compared to the frequency domain as the spatial domain noise removal requires very less processing time. Spatial processing is classified into point and mask processing. Point processing involves the transformation of individual pixels independently of other pixels in the image. These simple operations are typically used to correct for defects in image acquisition hardware, for example to compensate for under/over exposed images. On the other hand, in mask processing, the pixel with its neighborhood of pixels in a square or circle mask are involved in generating the pixel at (x, y) coordinates in the enhanced image. It is a more costly operation than simple point processing, but more powerful. The application of a mask to an input image produces an output image of the same size as the input. One of the most important requirements of noise removal algorithms is that they should provide satisfactory amount of noise removal and also help preserve the edges. For the stated conditions to be satisfied there are two types of filters with their significant advantages and disadvantages. The two types of filters are the linear and non-linear filters. The linear filters have the advantage of faster processing but the disadvantage of not preserving edges. Conversely the nonlinear filters have the advantage of preserving edges and the disadvantage of slower processing. [2] Median Filter It is important to perform noise removal during signal processing on an image or on a signal. One of the methods to perform this noise reduction is by performing neighborhood averaging. The neighborhood averaging can suppress isolated out-of-range noise, but the side effect is that it also blurs sudden changes such as sharp edges. The median filter is an effective method that can suppress isolated noise without blurring sharp edges. In Median Filtering, all the pixel values are first sorted into numerical order and then replaced with the middle pixel value.[4] [6] Let y represent a pixel location and w represent a neighborhood centered around location (m, n) in the image, then the working of median filter is given by y [m, n]=median{x[ i,j],( i, j) belongs to w} Since the pixel y[m,n] represents the location of the pixel y,m and n represents the x and y co-ordinates of y. W represents the neighborhood pixels surrounding the pixel position at (m, n).( i, j) belongs to the same neighborhood centered around (m, n).thus the median method will take the median of all the pixels within the range of ( i, j) represented by X[i,j]. [2] 2013, IJCSMC All Rights Reserved 252

Fig. 13 Result by Median Filter Wiener Filter The inverse filtering is a restoration technique for deconvolution, i.e., when the image is blurred by a known low pass filter, it is possible to recover the image by inverse filtering or generalized inverse filtering. However, inverse filtering is very sensitive to additive noise. The approach of reducing degradation at a time allows us to develop a restoration algorithm for each type of degradation and simply combine them. The Wiener filtering executes an optimal trade-off between inverse filtering and noise smoothing. It removes the additive noise and inverts the blurring simultaneously. The Wiener filtering is optimal in terms of the mean square error. In other words, it minimizes the overall mean square error in the process of inverse filtering and noise smoothing. The Wiener filtering is a linear estimation of the original image. The approach is based on a stochastic framework. The orthogonality principle implies that the Wiener filter in Fourier domain can be expressed as follows: Where Sxx(f1,f2),Sηη(f1,f2) are respectively power spectra of the original image and the additive noise, and is the blurring filter. It is easy to see that the Wiener filter has two separate part, an inverse filtering part and a noise smoothing part. It not only performs the deconvolution by inverse filtering (high pass filtering) but also removes the noise with a compression operation (low pass filtering). Fig. 14 Result by Wiener Filter Average Filter Mean filtering is a simple, intuitive and easy to implement method of smoothing images, and to reduce the amount of intensity variation between one pixel and the next. Average filtering replaces each pixel value in an image with the mean value of its neighbors, including itself. The simplest procedure would be to calculate the mask for all the pixels in the image. For all the pixels in the image which fall under this mask, it will be considered as the new pixel. This has the effect of eliminating pixel values which are unrepresentative of their surroundings. Average filter is also considered to be a convolution filter or a mean filter.[2] 2013, IJCSMC All Rights Reserved 253

Fig.15 Result by Average Filter Laplacian Filter Detecting edges within an image can be done by the laplacian filter. It denotes areas where the intensity changes rapidly, hence producing an image with all the edges. The Laplacian is often applied to an image that has first been smoothed with something approximating a Gaussian smoothing filter, in order to reduce its sensitivity to noise. The operator normally takes a single gray level image as input and produces another grey level image as output. As radius of interest on the image is increased, this method will prove to be more computationally expensive.[2] Prewitt Filter The Prewitt filter operator is used in image processing particularly within edge detection algorithms. Prewitt filter is a discrete differentiation operator computing an approximation of the gradient of the image intensity function. Fig. 16 Result by Prewitt Filter At each point in the image, the result of the Prewitt operator is either the corresponding gradient vector or the norm of this vector.[5] The Prewitt operator is based on convolving the image with a small, separable, and integer valued filter in horizontal and vertical direction and is therefore relatively inexpensive in terms of computations. On the other hand, the gradient approximation which it produces is relatively crude, in particular for high frequency variations in the image. The working of Prewitt filter consists of computing the root mean square root of two 3 cross 3 matrices. [2] VI. RESULT AND DISCUSSION A number of simulations and experiments have been conducted using Matlab R2010a version for evaluation of the filters on two different types of noise. In order to do this, we have collected 15samples of the handwritten Devnagari documents. To test the accuracy of the Filtering algorithms, below stated steps are followed: a) First an uncorrupted handwritten Devnagari document image is taken as input b) Next the document image is converted to rgb2gray image. c) Different noises are added to the handwritten Devnagari document image artificially with 10% noise density. d) The filtering algorithms are applied for reconstruction of handwritten Devnagari document images. 2013, IJCSMC All Rights Reserved 254

f) To test the performance of the filters for varying noise density, Gaussian noise with different variance is applied on the binary document image. [2] VII. CONCLUSION In this paper five filtering algorithms were applied on Salt & Pepper noise which would be developed in a Devnagari handwritten document during image capture, during transmission or that would have developed due to the progression of time leading to blurring and poor contrast of the written letters. Seven image performance techniques were chosen to evaluate the filters on different noise levels. From the experimental results it is seen that median, average and wiener filters perform better compared to Laplacian and Prewitt. It is also observed that median filter is better in removing salt and pepper noise. Acknowledgement We are very grateful to Amba Devi Trust s Library, Amravati, Maharshtra, India by providing the historical documents for experimental work REFERENCES [1] Digital ImageProcessingUsing MATLAB Second Edition,Rafael C. Gonzalez [2] International Journal of Computer Applications (0975 888) Volume 48 No.12, June 2012 [3] Mr. Salem Saleh Al-amri, Dr. N.V. Kalyankar and Dr. Khamitkar S.D, A comparative study of removal noise from remote sensing image Published by IJCSI International Journal of Computer Science Issues, Vol. 7, Issue. 1, No. 1, January 2010. [4] MasoudNosrati, RonakKarimi,Mehdi Hariri, Detecting circular shapes from areal images using median filter and CHT, Published in Global Jounal of Computer Science and Technology.Volume 12,January 2012 [5] Dr.G.Padmavathi, Dr.P.Subashini, Mr.M.Muthu Kumar and Suresh Kumar Thakur, Comparison of filters used for underwater Image-Preprocessing,IJCSNS International Journal of Computer Science and Network Security, VOL.10 No.1, January 2010 [6] MasoudNosrati, RonakKarimi,Mehdi Hariri, Detecting circular shapes from areal images using median filter and CHT, Published in Global Jounal of Computer Science and Technology.Volume 12,January 2012 2013, IJCSMC All Rights Reserved 255