Restoration of Degraded Historical Document Image 1

Similar documents
Image binarization techniques for degraded document images: A review

Hybrid Binarization for Restoration of Degraded Historical Document

An Analysis of Image Denoising and Restoration of Handwritten Degraded Document Images

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 05, 2016 ISSN (online):

International Journal of Innovative Research in Engineering Science and Technology APRIL 2018 ISSN X

Enhanced Binarization Technique And Recognising Characters From Historical Degraded Documents

An Improved Bernsen Algorithm Approaches For License Plate Recognition

A Review Paper on Image Processing based Algorithms for De-noising and Enhancement of Underwater Images

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

PHASE PRESERVING DENOISING AND BINARIZATION OF ANCIENT DOCUMENT IMAGE

Er. Varun Kumar 1, Ms.Navdeep Kaur 2, Er.Vikas 3. IJRASET 2015: All Rights are Reserved

Image Restoration and De-Blurring Using Various Algorithms Navdeep Kaur

NON UNIFORM BACKGROUND REMOVAL FOR PARTICLE ANALYSIS BASED ON MORPHOLOGICAL STRUCTURING ELEMENT:

Recovery of badly degraded Document images using Binarization Technique

Contrast adaptive binarization of low quality document images

FACE RECOGNITION USING NEURAL NETWORKS

Efficient 2-D Structuring Element for Noise Removal of Grayscale Images using Morphological Operations

Colored Rubber Stamp Removal from Document Images

Multispectral Image Restoration of Historical Document Images

RESEARCH PAPER FOR ARBITRARY ORIENTED TEAM TEXT DETECTION IN VIDEO IMAGES USING CONNECTED COMPONENT ANALYSIS

Document Recovery from Degraded Images

Locating the Query Block in a Source Document Image

License Plate Localisation based on Morphological Operations

Image Smoothening and Sharpening using Frequency Domain Filtering Technique

Implementing Morphological Operators for Edge Detection on 3D Biomedical Images

VEHICLE LICENSE PLATE DETECTION ALGORITHM BASED ON STATISTICAL CHARACTERISTICS IN HSI COLOR MODEL

ABSTRACT I. INTRODUCTION

A Study On Preprocessing A Mammogram Image Using Adaptive Median Filter

A Spatial Mean and Median Filter For Noise Removal in Digital Images

IDENTIFICATION OF FISSION GAS VOIDS. Ryan Collette

COMPARATIVE PERFORMANCE ANALYSIS OF HAND GESTURE RECOGNITION TECHNIQUES

An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi

ANALYSIS OF GABOR FILTER AND HOMOMORPHIC FILTER FOR REMOVING NOISES IN ULTRASOUND KIDNEY IMAGES

Image processing for gesture recognition: from theory to practice. Michela Goffredo University Roma TRE

A Study on Image Enhancement and Resolution through fused approach of Guided Filter and high-resolution Filter

Efficient Document Image Binarization for Degraded Document Images using MDBUTMF and BiTA

Robust Document Image Binarization Techniques

More image filtering , , Computational Photography Fall 2017, Lecture 4

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition

A Novel Multi-diagonal Matrix Filter for Binary Image Denoising

DENOISING DIGITAL IMAGE USING WAVELET TRANSFORM AND MEAN FILTERING

Quantitative Analysis of Local Adaptive Thresholding Techniques

A Novel Method for Enhancing Satellite & Land Survey Images Using Color Filter Array Interpolation Technique (CFA)

Guided Image Filtering for Image Enhancement

Detection of License Plates of Vehicles

Segmentation of Liver CT Images

Keyword: Morphological operation, template matching, license plate localization, character recognition.

Study and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for feature extraction

Extraction of Newspaper Headlines from Microfilm for Automatic Indexing

Performance Comparison of Various Filters and Wavelet Transform for Image De-Noising

An Improved Binarization Method for Degraded Document Seema Pardhi 1, Dr. G. U. Kharat 2

Image Measurement of Roller Chain Board Based on CCD Qingmin Liu 1,a, Zhikui Liu 1,b, Qionghong Lei 2,c and Kui Zhang 1,d

Restoration of Motion Blurred Document Images

Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts

Implementation of License Plate Recognition System in ARM Cortex A8 Board

MATHEMATICAL MORPHOLOGY AN APPROACH TO IMAGE PROCESSING AND ANALYSIS

Automatic Licenses Plate Recognition System

Color Image Segmentation Using K-Means Clustering and Otsu s Adaptive Thresholding

Volume 7, Issue 5, May 2017

A Review of Optical Character Recognition System for Recognition of Printed Text

Chapter 6. [6]Preprocessing

[More* et al., 5(8): August, 2016] ISSN: IC Value: 3.00 Impact Factor: 4.116

Linear Gaussian Method to Detect Blurry Digital Images using SIFT

Optical Character Recognition for Hindi

Image De-noising Using Linear and Decision Based Median Filters

Extraction and Recognition of Text From Digital English Comic Image Using Median Filter

C. Efficient Removal Of Impulse Noise In [7], a method used to remove the impulse noise (ERIN) is based on simple fuzzy impulse detection technique.

An Efficient Noise Removing Technique Using Mdbut Filter in Images

Lossless Huffman coding image compression implementation in spatial domain by using advanced enhancement techniques

Remove Noise and Reduce Blurry Effect From Degraded Document Images Using MATLAB Algorithm

An Effective Method for Removing Scratches and Restoring Low -Quality QR Code Images

Automatic Morphological Segmentation and Region Growing Method of Diagnosing Medical Images

Implementation of Block based Mean and Median Filter for Removal of Salt and Pepper Noise

An Efficient Method for Landscape Image Classification and Matching Based on MPEG-7 Descriptors

Segmentation of Blood Vessel in Retinal Images and Detection of Glaucoma using BWAREA and SVM

Keywords Fuzzy Logic, ANN, Histogram Equalization, Spatial Averaging, High Boost filtering, MSE, RMSE, SNR, PSNR.

A Review over Different Blur Detection Techniques in Image Processing

VLSI Implementation of Impulse Noise Suppression in Images

Performance Comparison of Mean, Median and Wiener Filter in MRI Image De-noising

An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods

International Journal of Advance Engineering and Research Development

` Jurnal Teknologi IDENTIFICATION OF MOST SUITABLE BINARISATION METHODS FOR ACEHNESE ANCIENT MANUSCRIPTS RESTORATION SOFTWARE USER GUIDE.

SECTION I - CHAPTER 2 DIGITAL IMAGING PROCESSING CONCEPTS

Gray Image Reconstruction

MAV-ID card processing using camera images

Removal of High Density Salt and Pepper Noise through Modified Decision based Un Symmetric Trimmed Median Filter

CoE4TN4 Image Processing. Chapter 3: Intensity Transformation and Spatial Filtering

Computer Vision. Howie Choset Introduction to Robotics

Noise Detection and Noise Removal Techniques in Medical Images

A Modified Non Linear Median Filter for the Removal of Medium Density Random Valued Impulse Noise

Vehicle License Plate Recognition System Using LoG Operator for Edge Detection and Radon Transform for Slant Correction

Binarization of Historical Document Images Using the Local Maximum and Minimum

A new seal verification for Chinese color seal

Blur Detection for Historical Document Images

AUTOMATED MALARIA PARASITE DETECTION BASED ON IMAGE PROCESSING PROJECT REFERENCE NO.: 38S1511

Main Subject Detection of Image by Cropping Specific Sharp Area

A Proficient Roi Segmentation with Denoising and Resolution Enhancement

A Scheme for Salt and Pepper oise Reduction and Its Application for OCR Systems

A Review on Image Enhancement Technique for Biomedical Images

Applications of Flash and No-Flash Image Pairs in Mobile Phone Photography

Transcription:

Restoration of Degraded Historical Document Image 1 B. Gangamma, 2 Srikanta Murthy K, 3 Arun Vikas Singh 1 Department of ISE, PESIT, Bangalore, Karnataka, India, 2 Professor and Head of the Department of CSE, PESSE, Bangalore, Karnataka, India 3 Departments of ECE, PESIT, Bangalore, Karnataka, India 1 ganga_h@pes.edu, 2 srikantamurthy@pes.edu, 3 arunvikas@pes.edu ABSTRACT Restoration plays a very important role in enhancing the degraded noisy images. To enhance the degraded image, the numerous algorithms have been designed. Since image processing algorithms are subjective, not all algorithms that developed will address all type of degradedness. To address specific type of problem the suitable algorithms need to be selected. In this paper a combination of spatial domain method along with set theory operations are used to enhance the historical image. The proposed method eliminates noise, uneven background and enhances the contrast of the script image. The result of the proposed method is compared with Mean and Gaussian filter. Performance of the proposed method proved to be better than these methods. The restored images will have clear uniform background and foreground with enhanced character appearance. Keywords: Bilateral Filter, contrast Enhancement, Denoising, Degraded Document, Historical, Spatial Domain. 1. INTRODUCTION It is very important to preserve the historical documents which reveal the information about the civilized past. These documents are available in various forms like stone and metal carvings, palm leaf manuscripts, paper manuscripts. These documents pose many degradation due to weather conditions, method of preservations etc. Stone carvings also referred to as epigraphically scripts, are the oldest among all forms of communication in olden days. It is found from the archeological department that the epigraphically scripts belonging to 3 rd century BC are also available and maintained. But not all available ones are in good condition. To preserve this information for longer period, one needs modern tools and techniques. Another oldest medium of writing and communicating in South Asia are palm leaf manuscripts. These are also the major sources for writing and painting in South East Asian countries including Thailand, Nepal, India, Barma, Indonesia etc.[uday Kumar et. al 2009]. Hence it is required to develop an automated system to decipher these inscriptions. The system takes the camera grabbed or scanned images of the inscriptions as an input and processes it before the character recognition is taken up. The images so captured have major problems like the broken letters, erased letters, distortion due to fossils settled and so on. The presence of unwanted marks engraved by the sculptor leads to wrong diagnosis of inscriptions. Hence this requires a lot of preprocessing before the character recognition is taken up. The need for efficient image restoration methods have grown with the massive production of digital images of all kinds, often taken in poor conditions. Even though good cameras are available, images may not be in a condition to directly use for the analysis. From the literature survey, it is observed that, there are several techniques like moving average filter, Gaussian filter for noise suppression are available. However, the techniques effectively suppress the noise but fail to preserve many useful details. In the past decades, many researchers shown considerable interest in using the wavelet transform as a powerful tool for recovering signal from noisy data and method is generally referred to as wavelet shrinkage techniques. In 1995, Donohue presented a soft thresholding method for denoising in 1-D signal using wavelets pyramidal filtering. Chang, Yu and Vetterli [2000] introduced an adaptive wavelet thresholding for image denoising and compression. Shijian Lu et. al.[2007] proposed technique which estimates document background surface using an iterative polynomial smoothing procedure. Various types of document degradations are then compensated by using the estimated document background surface intensity. Using L1- norm image gradient, the text stroke edge is detected from the compensated document image. Finally, the document text is segmented by a local threshold that is estimated based on the detected text stroke edges. Napa Sae-Bae and Somkait Udomhunsakul [2007] presented adaptive BSVD method to denoise the image. It is found that these techniques alone cannot improve the visibility of the degraded images. Ntogas et al. [2008] proposed binarization procedure consisted of five discrete steps in image processing, for different classes of document images. A refinement technique enhances further the image quality. The main contribution of this paper was to propose a simple and robust binarization procedure for pre-filtered historical manuscripts images, and simulation results are also presented. E. Badekas et. al. [2009] proposed a new method which the estimates the best parameter values for 792

each one of the document binarization techniques and also the estimation of the best document binarization result of all techniques. Yahia S. et. al. [2009] presented an enhanced system for degraded old document. The developed system was able to deal with degradations which occur due to shadows, non-uniform illumination, low contrast and noise. The developed system was able to separate the two regions of the document. Laurence Likforman-Sulem[2011] presented novel method for document enhancement which combines two recent powerful noise-reduction steps. The first step was based on the Total Variation framework. And second step was based on Non-local Means. Non Local Mean filter computational complexity depends on the size of the patch and window. The paper is organized as follows: Section 2 presents the brief introduction of bilateral filter. Mathematical morphology is explained in the section 3. The proposed method is detailed in section 4. Section 5 reports experimental results and section 6 provides concluding remarks. 2. BILATERAL FILTER We briefly review the bilateral filter in this section. Bilateral filter is a non linear filter in spatial domain, which does averaging without smoothing the edges[ming Zhang 2009]. The bilateral filter takes a weighted sum of the pixels in a local neighborhood; the weights depend on both the spatial distance and the intensity distance. Actually the bilateral filter has weights as a product of two Gaussian filter weights, one of which corresponds to average intensity in a spatial domain, and second weight corresponds to the intensity difference. Hence no smoothing occurs, when one of the weights is close to 0. It means, the product becomes negligible around the region, where intensity changes rapidly, which represents usually the sharp edges. As a result, the bilateral filter preserves sharp edges [Jinwook Kim et al 2009]. Mathematically, at a pixel location x, the output of a bilateral filter is calculated as follows, 3. MATHEMATICAL MORPHOLOGY From past five decades, many image-processing techniques have been developed and their applications in real world are tremendous. One among many techniques is the mathematical morphology, which is continuously receiving great attention. Since it provides a quantitative description of geometric structure and shape as well as a mathematical description of algebra, topology, probability, and integral geometry, it has been enormously useful in many image processing and analysis applications. Mathematical morphological operations are based on simple set theory concept. These operations are non linear operations and can be used to extract shape features such as edges, holes, corners, wedges, and cracks by operating with structuring elements of varied sizes and shapes. They are also used to break thin isthmus and fill the gaps between bridges. Morphological operations can be employed in enhancement, preprocessing, segmentation, feature extraction, pattern classification, etc. The two basic operations dilation and erosion are the fundamental building block of all other morphological operations. The morphological operations require two components, one is the input image and another one is the structuring element. Based on the type of problem structuring element will be selected. Since these operations are based on the simple set theory concept, the computational complexity will be less. Binary mathematical morphology uses binary image data and performs Boolean operations. Gray scale morphology uses gray scale image and uses min max theory to select the intensity of the output image. The detailed study of mathematical morphology is again not in the scope of our paper. 4. PROPOSED METHODOLOGY where and are parameters controlling the fall-off of weights in spatial and intensity domains, respectively. and are input and output images respectively. is a spatial neighborhood of pixel I(x), and C is the normalization constant: The proposed method is combines two powerful image processing techniques, one spatial filtering technique and gray scale mathematical morphology. The proposed method takes degraded historical document as input and performs series of the step as explained below in detail. The camera grabbed image is converted into gray scale image shown in Fig 1. Fig. 2 shows the binarized image of the degraded document image. Then bilateral filter 793

is applied to the input image. Result of the bilateral filter is shown in Fig. 3. The image is filtered without smoothing the edges. As mentioned in the section 2, bilateral filter filters the image without smoothing the edges. This preserves edges and filters out the noise. Mathematical reconstructions operations are done along with contrast enhancement. Gray scale opening is applied with disk structuring element and result image is added to the bilateral filtered image. reconstructed image. To eliminate background, we need to normalize the background intensity. Using Gaussian filter with 50 window size and 30 standard deviation, the reconstructed image is blurred and result(blurred) image is subtracted from filtered image. Again result image is added to filtered image and reconstructed image to get the enhanced image as in Fig 5. Fig 1: Input image. Fig 5: Reconstructed image using morphological operations Mathematical dilation is applied to enhance the image and shown in Fig 6. Fig 2: Binarized input image Fig 6: Final enhanced using using morphological dilation. Global thresholding method is applied to binarize the result image and is shown in fig 7. Fig 3: Filtered Image using Bilateral Filter Fig 7: Binarized image of Fig 6. 5. RESULTS AND DISCUSSION Fig 4: Binarized image of Fig 3 Filtered image is subjected to gray scale opening followed by closing. Resultant image is added to filtered image. Result is added back to filtered image to get The proposed method is used to enhance the palm script and epigraphically script images. Experimentation has been performed on the set of more than 200 images of various sizes. The camera grabbed images are used with size varying from 3000x4500 to 300x450. Length and width of the palm scripts are varying from 40 inch to 2 inch and from 794

4-5 inch to 2 inch respectively. The proposed method enhances the degraded image with uniform background. The enhanced document image can be used further to segment the document into lines, words and character for recognition purpose. So preprocessing stage plays very important role in pattern recognition. Properly binarized image increases the performance of the segmentation algorithm. From correctly segmented character, valid features can be extracted from feature extraction techniques. Classifier performance is entirely depends on the extracted features. So enhancement of the degraded documents is must in preserving them in digital format and/or hard copy, and understanding the document contents. Some of the experimental results are shown here. 5. CONCLUSION Historical documents are usually pose many degradations, due to weather condition, preservation and handling methods, etc. Image processing techniques are required to address these issues and to preserve of documents, as these provide valuable information about our culture. Simple and computationally efficient hybrid algorithm is proposed using a combination of bilateral filtering and set theory approach. Bilateral filter is an efficient in eliminating the noise without smoothing the edges. Mathematical morphology which is based on set theory approach uses simple operations which are computationally less complex. Morphological operations are very much useful in bridging gap between broken parts of the character. The proposed method is compared with average and Gaussian filter. Results show that the proposed method performs better than compared methods in enhancing the degraded document image. (a) (b) (c) (d) (e) (f) Fig (8) a): Result of mean filter, b)result of Gaussian filter c) Result of proposed method. Fig d), e) and f) are the binarized of Fig a), b) and c) (a) b) ( 795

(c ) d) Figure 9 (a): input image (b) Binarized input image (c) Enhanced image using proposed method and (d) Binarized image of the enhanced image. ( (a) (b) (c) 796

(d) Figure 10 (a): Input image (b) Binarized input image (c) Enhanced image using proposed method and (d) Binarized image of the enhanced image. (a) (b) (c) 797

(d) Figure 11 (a): Iinput image (b) Binarized input image (c) Enhanced image using proposed method and (d) Binarized image of the enhanced image. REFERENCES [1] D.L. Donoho, De-noising by soft-thresholding, IEEE Transaction on Information Theory, Vol.41 (3), May 1995, pp. 613-627. [2] D. Udaya Kumar, G.V.Sreekumar, U. A. Athvankar, Traditional writing system in Southern India, Palm leaf manuscripts, Design Thoughts July 2009. [3] Ming Zhang, Bilateral Filter In Image Processing, Master of Science in Electrical Engineering, Thesis, 2009. [4] S.G. Chang, Yu. Bin, M. Vetterli, Adaptive wavelet thresholding for image denoising and compression, IEEE Transactions on Image Processing Vol.9 (9), September 2000, pp. 1532-1546. [5] Napa Sae-Bae and Somkait Udomhunsakul, Adaptive Block Based Singular value Decomposition filtering, IEEE Conference on Computer graphics Image and visualization, CGIV 2007. [6] Nobuyuki Otsu, A threshold selection method from gray level histograms IEEE Trans. Systems Man and Cybernetics, Volume 9, Issue 1, Jan 1979 pp: 62-66. [7] Shijian Lu, Chew Lim Tan, Binarization of Badly Illuminated Document Images through Shading Estimation and Compensation, Ninth International Conference on Document Analysis and Recognition, ICDAR 2007, 978-0-7695-2822-9, pp 321-316. [8] Ntogas, Nikolaos Ventzas, Dimitrios, A Binarization Algorithm For Historical manuscripts, 12 th WSEAS International Conference on Communications, ISSN: 1790-5117, Greece, July 23-25, 2008, pp 41-51. [9] E. Badekas and N. Papamarkos, Estimation of Appropriate Prameter Values For Document Binarization Techniques, International Journal of Robotics and Automation, Vol. 24, No. 1, 2009, pp 66-78. [10] Yahia S. Halabi, Zaid SA, Modeling Adaptive Degraded Document Image Binarization and Optical Character System, European Journal of Scientific Research, ISSN 1450-216X Vol.28 No.1 2009, pp.14-32.\ [11] Jinwook Kim, Soojae Kim, Bilateral Filtered Shadow Maps, Proceeding ISVC '09 Proceedings of the 5th International Symposium on Advances in Visual Computing: Part II, ISBN: 978-3-642-10519-7, 2009, [12] Laureence Likeforman-Sulem, Jerome Drabon ans Elisa H. Banrney Smith, Enhancement of Histotrocail Printed Document Images By Combining and Non Local Means Filtering, Image and Vision Computing, Volume 29, Issue 5, April 2011, pp 351-363. 798