Hybrid Binarization for Restoration of Degraded Historical Document

Similar documents
Restoration of Degraded Historical Document Image 1

Historical Document Preservation using Image Processing Technique

PHASE PRESERVING DENOISING AND BINARIZATION OF ANCIENT DOCUMENT IMAGE

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 05, 2016 ISSN (online):

Image binarization techniques for degraded document images: A review

[More* et al., 5(8): August, 2016] ISSN: IC Value: 3.00 Impact Factor: 4.116

A Study On Preprocessing A Mammogram Image Using Adaptive Median Filter

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition

An Analysis of Image Denoising and Restoration of Handwritten Degraded Document Images

Chapter 6. [6]Preprocessing

Contrast adaptive binarization of low quality document images

Improving the Quality of Degraded Document Images

Multispectral Image Restoration of Historical Document Images

Enhanced Binarization Technique And Recognising Characters From Historical Degraded Documents

Performance Analysis of Local Adaptive Real Oriented Dual Tree Wavelet Transform in Image Processing

Computing for Engineers in Python

Quantitative Analysis of Local Adaptive Thresholding Techniques

Digital Enhancement of Palm Leaf Manuscript Images using Normalization Techniques

Implementation of Block based Mean and Median Filter for Removal of Salt and Pepper Noise

An Improved Binarization Method for Degraded Document Seema Pardhi 1, Dr. G. U. Kharat 2

Efficient Document Image Binarization for Degraded Document Images using MDBUTMF and BiTA


APPLICATION OF THRESHOLD TECHNIQUES FOR READABILITY IMPROVEMENT OF JAWI HISTORICAL MANUSCRIPT IMAGES

Study and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for feature extraction

APJIMTC, Jalandhar, India. Keywords---Median filter, mean filter, adaptive filter, salt & pepper noise, Gaussian noise.

Image Denoising Using Statistical and Non Statistical Method

` Jurnal Teknologi IDENTIFICATION OF MOST SUITABLE BINARISATION METHODS FOR ACEHNESE ANCIENT MANUSCRIPTS RESTORATION SOFTWARE USER GUIDE.

Noise Removal and Binarization of Scanned Document Images Using Clustering of Features

Fig 1 Complete Process of Image Binarization Through OCR 2016, IJARCSSE All Rights Reserved Page 213

IDENTIFICATION OF FISSION GAS VOIDS. Ryan Collette

GAUSSIAN DE-NOSING TECHNIQUES IN SPATIAL DOMAIN FOR GRAY SCALE MEDICAL IMAGES Nora Youssef, Abeer M.Mahmoud, El-Sayed M.El-Horbaty

Processing and Enhancement of Palm Vein Image in Vein Pattern Recognition System

Direction based Fuzzy filtering for Color Image Denoising

Recovery of badly degraded Document images using Binarization Technique

Robust Document Image Binarization Techniques

Color Image Segmentation Using K-Means Clustering and Otsu s Adaptive Thresholding

A Histogram based Algorithm for Denoising Images Corrupted with Impulse Noise

Samandeep Singh. Keywords Digital images, Salt and pepper noise, Median filter, Global median filter

Image Restoration and De-Blurring Using Various Algorithms Navdeep Kaur

FILTER FIRST DETECT THE PRESENCE OF SALT & PEPPER NOISE WITH THE HELP OF ROAD

Impulse Noise Removal and Detail-Preservation in Images and Videos Using Improved Non-Linear Filters 1

A Fast Median Filter Using Decision Based Switching Filter & DCT Compression

I. INTRODUCTION II. EXISTING AND PROPOSED WORK

IJRASET 2015: All Rights are Reserved

Image De-noising Using Linear and Decision Based Median Filters

ANALYSIS OF GABOR FILTER AND HOMOMORPHIC FILTER FOR REMOVING NOISES IN ULTRASOUND KIDNEY IMAGES

Analysis and Implementation of Mean, Maximum and Adaptive Median for Removing Gaussian Noise and Salt & Pepper Noise in Images

ABSTRACT I. INTRODUCTION

Er. Varun Kumar 1, Ms.Navdeep Kaur 2, Er.Vikas 3. IJRASET 2015: All Rights are Reserved

A Study on Image Enhancement and Resolution through fused approach of Guided Filter and high-resolution Filter

Extraction of Newspaper Headlines from Microfilm for Automatic Indexing

Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts

COMPARITIVE STUDY OF IMAGE DENOISING ALGORITHMS IN MEDICAL AND SATELLITE IMAGES

A Modified Non Linear Median Filter for the Removal of Medium Density Random Valued Impulse Noise

International Conference on Computer, Communication, Control and Information Technology (C 3 IT 2009) Paper Code: DSIP-024

Lossless Huffman coding image compression implementation in spatial domain by using advanced enhancement techniques

Study of Various Image Enhancement Techniques-A Review

INDIAN VEHICLE LICENSE PLATE EXTRACTION AND SEGMENTATION

An Automatic System for Detecting the Vehicle Registration Plate from Video in Foggy and Rainy Environments using Restoration Technique

Contents 1 Introduction Optical Character Recognition Systems Soft Computing Techniques for Optical Character Recognition Systems

A.P in Bhai Maha Singh College of Engineering, Shri Muktsar Sahib

Automated License Plate Recognition for Toll Booth Application

A Review of Optical Character Recognition System for Recognition of Printed Text

Keywords Fuzzy Logic, ANN, Histogram Equalization, Spatial Averaging, High Boost filtering, MSE, RMSE, SNR, PSNR.

Binarization of Historical Document Images Using the Local Maximum and Minimum

A Spatial Mean and Median Filter For Noise Removal in Digital Images

ISSN: [Khan* et al., 7(8): August, 2018] Impact Factor: 5.164

DENOISING DIGITAL IMAGE USING WAVELET TRANSFORM AND MEAN FILTERING

Image Denoising using Filters with Varying Window Sizes: A Study

International Journal of Innovations in Engineering and Technology (IJIET)

Binarization of Color Document Images via Luminance and Saturation Color Features

Design of Novel Filter for the Removal of Gaussian Noise in Plasma Images

Removal of High Density Salt and Pepper Noise through Modified Decision based Un Symmetric Trimmed Median Filter

An Efficient Noise Removing Technique Using Mdbut Filter in Images

ORIGINAL ARTICLE A COMPARATIVE STUDY OF QUALITY ANALYSIS ON VARIOUS IMAGE FORMATS

Noise Detection and Noise Removal Techniques in Medical Images

Remove Noise and Reduce Blurry Effect From Degraded Document Images Using MATLAB Algorithm

Colored Rubber Stamp Removal from Document Images

Design and Implementation of Gaussian, Impulse, and Mixed Noise Removal filtering techniques for MR Brain Imaging under Clustering Environment

Implementation of Barcode Localization Technique using Morphological Operations

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Contrast Enhancement Techniques using Histogram Equalization: A Survey

Performance Comparison of Various Filters and Wavelet Transform for Image De-Noising

An Improved Bernsen Algorithm Approaches For License Plate Recognition

C. Efficient Removal Of Impulse Noise In [7], a method used to remove the impulse noise (ERIN) is based on simple fuzzy impulse detection technique.

Design of Hybrid Filter for Denoising Images Using Fuzzy Network and Edge Detecting

An Enhancement of Images Using Recursive Adaptive Gamma Correction

PERFORMANCE ANALYSIS OF LINEAR AND NON LINEAR FILTERS FOR IMAGE DE NOISING

Keyword: Morphological operation, template matching, license plate localization, character recognition.

AN ITERATIVE UNSYMMETRICAL TRIMMED MIDPOINT-MEDIAN FILTER FOR REMOVAL OF HIGH DENSITY SALT AND PEPPER NOISE

A Review on Image Fusion Techniques

Implementation of global and local thresholding algorithms in image segmentation of coloured prints

An Efficient Gaussian Noise Removal Image Enhancement Technique for Gray Scale Images V. Murugan, R. Balasubramanian

International Journal of Scientific & Engineering Research, Volume 4, Issue 7, July ISSN

International Journal of Pharma and Bio Sciences PERFORMANCE ANALYSIS OF BONE IMAGES USING VARIOUS EDGE DETECTION ALGORITHMS AND DENOISING FILTERS

A Review Paper on Image Processing based Algorithms for De-noising and Enhancement of Underwater Images

INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND ROBOTICS ISSN

Robust Document Image Binarization Technique for Degraded Document Images

FUZZY BASED MEDIAN FILTER FOR GRAY-SCALE IMAGES

De-Noising Techniques for Bio-Medical Images

Transcription:

Hybrid Binarization for Restoration of Degraded Historical Document Rohini Umbare 1, M.D Mali 2, Sunita Sagat 3 P.G. Student, Department of E&TC Engineering, N.B. Navale Sinhgad College of Engineering, Solapur, Maharashtra, India 1 Assistant Professor, Department of E&TC Engineering, N.B. Navale Sinhgad College of Engineering, Solapur, Maharashtra, India 2 Assistant Professor, Department of E&TC Engineering, N.B. Navale Sinhgad College of Engineering, Solapur, Maharashtra, India 2 ABSTRACT: Historical Documents are the valuable source of information but they suffer degradation problems, such as ink seepage, uneven illumination, big variations, strains etc. for that binarization technique play very important role to remove the noise and improve the quality of documents. This paper focuses on degraded historical documents, which are in the form of machine printed and handwritten. The proposed hybrid binarization which is combination of local and global thresholding method consist of five stages such as noise elimination, foreground layer extraction, degradation of background layer estimation, thresholding, and vicinity analysis. Initially, a technique named global thresholding is applied to the whole image and local thresholding is applied to sub image. Therefore, a better adaptability is achieved for the algorithm where various kinds of noise exist in different areas of same image. Advantage of applying global thresholding, is that it avoids the computational time and cost of applying a local thresholding in the entire image. Hence it is indicated that this proposed method is effective in removing background noise and improving the quality of degraded images. KEYWORDS: Image Processing, Image Enhancement, Binarization Method, Thresholding Method. I. INTRODUCTION Degraded historical documents are preserved in academic libraries, institutions, historical museums for their extensive usage but The historical documents suffers degradation problem which is caused by a combination of factors such as temperature level, environmental conditions and low quality paper. The Electronic scanning is the approach for handling such documents to preserve the culture of heritage. But the resulting images have low contrast and corrupted by various artefacts so they are often difficult to read. The restoration and enhancement of degraded historical document images are considered a transformation process which concentrated to restore its original representation of document images As well as restoration and enhancement, which are desired to improve the quality of subsequent segmentation. Degraded historical document image are considered a combination of multilayer information that are foreground layer, background layer and degraded layer. The image processing technique is applied to restore and enhance the quality of degraded document images. In general, there are three steps for restoration of historical document images: pre-processing, binarization and post-processing [17].in this pre-processing step eliminate the noise on the image, binarizaion[10] step to transform gray level image into binary image and post-processing step to enhance the quality of binary image. According to image processing techniques, the binarization methods play an important role in the document image processing. The binarization method classified as global and local thresholding. The binarization methods can be classified as global and local thresholding. A global thresholding, such as Otsu s, Kittler s and Kapur s methods and it provide a single threshold to classify an image into foreground and background, a Copyright to IJIRSET DOI:10.15680/IJIRSET.2017.0607254 14398

local thresholding calculate an adaptive threshold value in local block. The block size must be small to indicate local details and large to remove noise. Bernsen s, Niblack s and Sauvola s methods [7] are well-known local thresholding methods. The Niblack s method calculates local threshold by using the mean and standard deviation value of gray-level image in a local block. Let s g(x, y) is a gray level image, and μ(g(x, y)) and σ(g(x, y)) is the average and standard deviation of gray level values of g(x, y). A variable k is used to adjust a ratio of foreground pixels particularly for edge of character.the threshold of Niblack s method defined as TNibg(x, y) is calculated by the following formula: TNibg(x, y) = μ (g(x, y)) + (k. σ. g(x, y)) --------------- (1) The Niblack s method generates poor quality result. Because of this, the Sauvola s method as an improved Niblack s method is proposed to solve this problem. A variable r is added to Niblack s formula to change the behaviour from static to dynamic range standard deviation. The threshold of Sauvola s method defined as TSaug(x, y) is calculated by using the following formula: TSaug(x, y) = μ g(x, y). (1 k). 1.(,) ---------------- (2) Niblack s method is applied to estimate foreground regions and then background regions are estimated sequentially. The background regions estimation is guided by the value of the initial binary image. After the binarization process by using local thresholding, post-processing is performed to reduce noise and enhance the quality of text regions. II. LETERATURE SURVEY Zhixin Shi and Srirangaraj Setlur and Venu Govindaraju [9] proposed a Transform based strategy for improving computerized pictures of palm leaf structures. The technique utilizes an alertly chose pivoting background color in a linear transform to improve the legibility of the forefront content.at that point a blend of two other image processing algorithms. Histogram normalization and background normalization are used to the transformed image. Nikolaos NTOGAS and Dimitrios VENTZAS [10] propose a straightforward and strong binarization procedure. This work concentrates on text image enhancement and restoration, denoising and binarization using Mat lab. Binarization is obtained by global and local thresholding and at last refinement is applied to further clarify text and foreground compared to background. Ketki R. Ingole and V K. Shandilya[11] propose a technique for Historical Manuscripts document enhancement that improves the quality of historical Arabic manuscripts which shows uneven background and low contrast due to manufacturing and the effect of getting old and degradation then a background normalization algorithm applied to smoothen out the background and produces more legible images to the eye. B. Gangamma[15] proposed a Simple and efficient method for degraded historical document enhancement. This method enhances images using adaptive histogram equalization for setting contrast followed by gray scale morphological operations to eliminate noise, eliminate background from images with pure foreground contents. Blurred and skewed images are not in the scope of this technique. The binarization process separates foreground text from background. If the histogram of the image is bimodal or sparsely distributed, then binarization process separates text from background. Dimitrios Ventzas propose a work on denoising and binarization. That was introduce an innovative sequential procedure for digital image acquisition of historical documents including image preparation, image type classification according to their condition and their spatial structure, global and local features or both, including document image data mining. Image processing pixel alterations, allow one-pass iterations only by near neighborhood of alteration reprocessing algorithms. Gangamma and Srikanta Murthy [13] proposed a combination of spatial domain methods along with set theory operations to enhance the historical manuscript images. Bilateral filter is an efficient in eliminating the noise without smoothing the edges. Mathematical morphology which is based on set theory approach uses simple operations which are computationally less complex. This eliminates noise, rough background and improves the contrast of the script image. The restored images will have understandable consistent background and foreground with enhanced character Copyright to IJIRSET DOI:10.15680/IJIRSET.2017.0607254 14399

emergence. The enhanced document image can be used further to segment the document into lines, words and character for recognition purpose. And result of this planned technique is compared with Mean and Gaussian filter, and proved to be better than these techniques. D.N. Satange and Swati S.present[16] a recursive technique which includes iterated steps that makes it more stretchy regarding the needs of the user for enhancing and cleaning of historical manuscript documents. In this paper five filtering algorithms were applied on Salt & Pepper noise which was developed in handwritten Devnagari documents during image capturing and transmission. Rupinder K and Jaspreet K proposed a technique which works for both sided documents at the same time and give best results than existing techniques to remove the show-through from historical manuscript documents. In this by text segmentation through binarization the foreground and background should be separated. Using binarization a digital image converts into 0 (White) and 1 (Black) that is background as white and text as black, so by this text became more clear and read able, also it require less memory for storage. III. PROPOSED METHODOLOGY AND DISCUSSION This section presents the description of the proposed method based on a hybrid binarization method for restoration of degraded historical document images. The overview of the proposed method is illustrated in Fig. 1 the proposed method consists of five stages. 1. The noise elimination stage aims to eliminate noise areas by using a Wiener filter. The Wiener filter is a proper method and is proved an efficient technique for degraded document image filtering. The original graylevel image will be separated into 5 5 local blocks around corresponding pixel (x, y). Let μ and σ be mean and variance in a local block, Avg (σ) be an average variance of the original image. The gray-level value of the original and filtered image of pixel (x, y) are defined as Go(x, y) and Gw (x, y) respectively. The Go(x, y) is transformed to Gw (x, y) 2. Binarization (local) stage aims to extracts the foreground pixels from the binary images of three well known binarization methods. Such as Niblack s method, Sauvola s [7] and laplacian gauss s method. The filtered image Gw(x, y) is transformed to three binary images B1(x, y), B2(x, y) and B3(x, y). The extracted foreground layer in form of binary image defined as Bf(x, y). 3. The degradation of the background layer estimation stage is based on the cluster analysis method, estimates the degradation of the background layer by replacing the foreground area with the estimated background which is the average value of cluster pixel. The gray value of pixel (x, y) of the degradation of the background layer defined as Gdb(x, y). 4. The thresholding stage transforms gray-level image into a binary image by calculating the threshold value in accordance with the gray value of the estimated degradation of the background layer, which is derived from otsu s method is combined with logistic sigmoid function. Based on the threshold Tada(x, y), the binary image defined as Bada(x, y). 5. The vicinity analysis stage enhances the quality of the binary image by analysing and categorizing the pixels of binary image into the correct group. Copyright to IJIRSET DOI:10.15680/IJIRSET.2017.0607254 14400

Fig 1- The proposed method. IV. EXPERIMENTAL RESULT In order to investigate and demonstrate the advantage of the proposed method, the experiments are on 110 degraded historical document images. The experimental results are evaluated by using following parameters such as Mean Square Error (MSE), Power to Signal Noise Ratio (PSNR), Average Difference (AD), and Structural Content (SC). The output of proposed method is compared with the Niblack s and sauvola s method. S.No Parameters Description 1. Mean Square Error (MSE) MSE = 1 (x(i, j) y(i, j)) MN 2. Power to Signal Noise Ratio(PSNR) 3. Average Difference(AD) 4. Structural Content(SC) PSNR = 10 log ( ) AD = 1 (x(i, j) y(i, j)) MN (y(i, j) SC = (x(i, j) ) ) Table1: Existing Measure of Quality Metrics Copyright to IJIRSET DOI:10.15680/IJIRSET.2017.0607254 14401

HW Methods MSE PSNR AD SC IMAGE1 Niblack 461.21 31.49 211.26 1.38 Sauvola 463.82 31.46 211.84 1.35 Proposed 25.07 44.13-16.91 1.24 IMAGE2 Niblack 430.27 31.79 202.17 1.52 Sauvola 432.5 31.77 202.66 3.85 Proposed 31.4 43.16-13.03 1.25 IMAGE3 Niblack 148.39 36.41 115.89 4.12 Sauvola 149.84 36.37 116.48 2.5 Proposed 142.02 36.6-97.26 3.6 IMAGE4 Niblack 283.63 33.6 165.09 2.09 Sauvola 285.57 33.57 165.64 1.77 Proposed 71.57 39.58-59.82 1.98 Table2: Three Method Performance Comparisons. (A) ORIGINAL IMAGE(HW) (B) NIBLACK S METHOD (C) SAUVOLA S METHOD (D) PROPOSED METHOD FIGURE 1: EXPERIMENTAL RESULT OF THREE METHODS. Copyright to IJIRSET DOI:10.15680/IJIRSET.2017.0607254 14402

V. CONCLUSION In this research, the new binarization method based on an adaptive multilayer-information for the restoration of degraded historical document images is proposed. The experiments are implemented by using MATLAB. The experimental results of the proposed method with 110 document images perform and analysed by evaluating PSNR, MSE, AD and SC. Moreover, the proposed method demonstrates superior performance against two well-known adaptive binarization methods on various degraded historical handwritten and machine printed document images. Furthermore, the proposed method can be applied with any degraded document images which have the same characteristics as the test set. But the parameters and techniques used in this method must be adjusted to be suitable for those document images. REFERENCES 1) N.Maheshwari,P.singh,A.Maloo A Review of Digital Image Enhancement method of degraded Indian Ancient Manuscripts International Journal for scientific research & development vol.3,issue 03,2015. 2) R.Hedjam, M.cheriet Novel data representation for text extraction from multispectral historical document Images International conferences on Document analysis and Recognition 2011. 3) P.Kale, S.T Gandhe Enhancement of old images by hybrid binarization Method International Journal of advanced Research in computer Engineering & Technology vol 3 Issue 9, September 2014. 4) NTOGAS, NIKOLASO- A Binarization algorithm for historical manuscripts 12 th WSEAS International conference on communication Heraklion, Greece, July 2008. 5) P.Jadhav, S.Shaikh A Review Of Damaged Manuscripts Using Binarization Technique International Journal of Engineering & computer Science.vol 5, Issue 2016. 6) A.Mukherjee, K. Soumen Enhancement of Image Resolution By Binarization International Journal of computer Application.vol-10, Issue 2010. 7) T.Romen Singh, O.Imoch Singh A New Local Adaptive Thresholding Technique in Binarization IJCSI vol-8, Issue 2011. 8) Niti Kamboj, V.Kumar Degraded Document Image Enhancement Using Global Thresholding International Journal for Technological Research in Engineering vol.1, Issue 2014. 9) Zhixin Shi, Srirangaraj Setlur and Venu Govindaraju, Digital Enhancement of Palm Leaf Manuscript Images using Normalization Techniques, Dec. 2006. 10) Nikolaos, Ventzas, Dimitrios a binarization algorithm for historical manuscripts 12 th WSEAS International Conference on communication, Heraklion, Greece, July 23-25, 2008 Feb. 11) Ms. Ketki R. Ingole, Prof. V K. Shandilya, Image Restoration of Historical Manuscripts, Aug. 2010. 12) K. Sitti Rachmawati Yahya, S. N. H. Sheikh Abdullah, K. Omar, M. S. Zakaria, Review on Image Enhancement Methods of Old Manuscript with Damaged Background, April 2010. 13) B Gangamma, Srikanta Murthy K Enhancement of Degraded Historical Kannada Documents, Sept. 2011. 14) Dimitrios Ventzas, Nikolaos Ntogas and Maria-Malamo Ventza Digital Restoration by Denoising and Binarization of Historical Manuscripts Images, July 2012. 15) B. Gangamma, Srikanta Murthy K, Arun Vikas Singh Restoration of Degraded Historical Document Image, May 2012. 16) Prof. D.N. Satange1, Ms. Swati S. Bobde2, Ms. Snehal D. Chikate Historical Document Preservation using Image Processing Technique, IJCSMC, Vol. 2, Issue. 4, April 2013, pg.247 255. 17) Hugvainn Zarien, Senior Member, Jinendra Pallavi, Research Fellow Restoration of Degraded Historical Document Image: An Adaptive Multilayer-Information Binarization Technique, IEEE JOURNAL ON SELECTED AREAS IN IMAGE PROCESSING, VOL. NO. 67, JANUARY 2015. Copyright to IJIRSET DOI:10.15680/IJIRSET.2017.0607254 14403