Digital Imaging Systems for Historical Documents

Similar documents
DIGITAL IMAGE PROCESSING UNIT III

Image Enhancement. DD2423 Image Analysis and Computer Vision. Computational Vision and Active Perception School of Computer Science and Communication

DIGITAL IMAGE PROCESSING (COM-3371) Week 2 - January 14, 2002

Image Evaluation and Analysis of Ink Jet Printing System (I) - MTF Measurement and Analysis of Ink Jet Images -

Effect of Ink Spread and Opitcal Dot Gain on the MTF of Ink Jet Image C. Koopipat, N. Tsumura, M. Fujino*, and Y. Miyake

CoE4TN4 Image Processing. Chapter 3: Intensity Transformation and Spatial Filtering

Table of contents. Vision industrielle 2002/2003. Local and semi-local smoothing. Linear noise filtering: example. Convolution: introduction

Digital Image Processing

Midterm Review. Image Processing CSE 166 Lecture 10

CoE4TN4 Image Processing. Chapter 4 Filtering in the Frequency Domain

1.Discuss the frequency domain techniques of image enhancement in detail.

Filtering in the spatial domain (Spatial Filtering)

Automatic processing to restore data of MODIS band 6

Frequency Domain Enhancement

Noise and Restoration of Images

IMAGE ENHANCEMENT IN SPATIAL DOMAIN

ELEC Dr Reji Mathew Electrical Engineering UNSW

Background. Computer Vision & Digital Image Processing. Improved Bartlane transmitted image. Example Bartlane transmitted image

Lecture #10. EECS490: Digital Image Processing

Defocusing and Deblurring by Using with Fourier Transfer

Image Evaluation and Analysis of Ink Jet Printing System (I) MTF Measurement and Analysis of Ink Jet Images

Enhanced Method for Image Restoration using Spatial Domain

Digital Image Processing. Image Enhancement: Filtering in the Frequency Domain

Image Processing for feature extraction

International Journal of Advancedd Research in Biology, Ecology, Science and Technology (IJARBEST)

Image Enhancement using Histogram Equalization and Spatial Filtering

A TRUE WIENER FILTER IMPLEMENTATION FOR IMPROVING SIGNAL TO NOISE AND. K.W. Mitchell and R.S. Gilmore

International Journal of Engineering and Emerging Technology, Vol. 2, No. 1, January June 2017

Today. Defocus. Deconvolution / inverse filters. MIT 2.71/2.710 Optics 12/12/05 wk15-a-1

Sensitivity analysis of phase diversity technique for high resolution earth observing telescopes

Digital Image Processing

Image analysis. CS/CME/BIOPHYS/BMI 279 Fall 2015 Ron Dror

Image Enhancement in the Spatial Domain (Part 1)

Module 3: Video Sampling Lecture 18: Filtering operations in Camera and display devices. The Lecture Contains: Effect of Temporal Aperture:

Image acquisition. In both cases, the digital sensing element is one of the following: Line array Area array. Single sensor

Digital Image Processing

Restoration of Motion Blurred Document Images

Refined Slanted-Edge Measurement for Practical Camera and Scanner Testing

A Kalman-Filtering Approach to High Dynamic Range Imaging for Measurement Applications

IMAGE PROCESSING: AREA OPERATIONS (FILTERING)

Image Deblurring. This chapter describes how to deblur an image using the toolbox deblurring functions.

TDI2131 Digital Image Processing

Implementation of Adaptive Coded Aperture Imaging using a Digital Micro-Mirror Device for Defocus Deblurring

CS534 Introduction to Computer Vision. Linear Filters. Ahmed Elgammal Dept. of Computer Science Rutgers University

Automated License Plate Recognition for Toll Booth Application

BSB663 Image Processing Pinar Duygulu. Slides are adapted from Gonzales & Woods, Emmanuel Agu Suleyman Tosun

Midterm Examination CS 534: Computational Photography

30 lesions. 30 lesions. false positive fraction

EEL 6562 Image Processing and Computer Vision Image Restoration

Εισαγωγική στην Οπτική Απεικόνιση

Spatially Varying Color Correction Matrices for Reduced Noise

Deconvolution , , Computational Photography Fall 2018, Lecture 12

Optical Performance of Nikon F-Mount Lenses. Landon Carter May 11, Measurement and Instrumentation

Chapter 3. Study and Analysis of Different Noise Reduction Filters

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Mechanical Engineering Department. 2.71/2.710 Final Exam. May 21, Duration: 3 hours (9 am-12 noon)

Non Linear Image Enhancement

Postprocessing of nonuniform MRI

Part I Feature Extraction (1) Image Enhancement. CSc I6716 Spring Local, meaningful, detectable parts of the image.

Image Quality Assessment for Defocused Blur Images

A Study of Slanted-Edge MTF Stability and Repeatability

DIGITAL IMAGE DE-NOISING FILTERS A COMPREHENSIVE STUDY

Image acquisition. Midterm Review. Digitization, line of image. Digitization, whole image. Geometric transformations. Interpolation 10/26/2016

EFFECT OF DEGRADATION ON MULTISPECTRAL SATELLITE IMAGE

Blind Single-Image Super Resolution Reconstruction with Defocus Blur

Exercise Problems: Information Theory and Coding

UM-Based Image Enhancement in Low-Light Situations

Interpolation of CFA Color Images with Hybrid Image Denoising

Image Restoration using Modified Lucy Richardson Algorithm in the Presence of Gaussian and Motion Blur

Digital Imaging Rochester Institute of Technology

Improving Signal- to- noise Ratio in Remotely Sensed Imagery Using an Invertible Blur Technique

EC-433 Digital Image Processing

Image Enhancement in the Spatial Domain

Chapter 6. [6]Preprocessing

Image Restoration. Lecture 7, March 23 rd, Lexing Xie. EE4830 Digital Image Processing

Fast Inverse Halftoning

Using the Normalized Image Log-Slope, part 2

SUPER RESOLUTION INTRODUCTION

Achim J. Lilienthal Mobile Robotics and Olfaction Lab, AASS, Örebro University

A Novel Image Deblurring Method to Improve Iris Recognition Accuracy

Image De-noising Using Linear and Decision Based Median Filters

Analysis of infrared images in integrated-circuit techniques by mathematical filtering

A Comprehensive Review on Image Restoration Techniques

Lecture Notes 10 Image Sensor Optics. Imaging optics. Pixel optics. Microlens

ECC419 IMAGE PROCESSING

Fourier Transforms and the Frequency Domain

Toward Non-stationary Blind Image Deblurring: Models and Techniques

Visibility of Uncorrelated Image Noise

3D light microscopy techniques

Evaluation of Legibility

Comparison of Reconstruction Algorithms for Images from Sparse-Aperture Systems

loss of detail in highlights and shadows (noise reduction)

Visual perception basics. Image aquisition system. IE PŁ P. Strumiłło

LPCC filters realization as binary amplitude hologram in 4-f correlator: range limitation of hologram pixels representation

Applications of Maskless Lithography for the Production of Large Area Substrates Using the SF-100 ELITE. Jay Sasserath, PhD

SENSOR HARDENING THROUGH TRANSLATION OF THE DETECTOR FROM THE FOCAL PLANE. Thesis. Submitted to. The School of Engineering of the UNIVERSITY OF DAYTON

Optimizing throughput with Machine Vision Lighting. Whitepaper

Matlab (see Homework 1: Intro to Matlab) Linear Filters (Reading: 7.1, ) Correlation. Convolution. Linear Filtering (warm-up slide) R ij

Solution Set #2

LAB MANUAL SUBJECT: IMAGE PROCESSING BE (COMPUTER) SEM VII

Digital Images & Image Quality

Transcription:

Digital Imaging Systems for Historical Documents Improvement Legibility by Frequency Filters Kimiyoshi Miyata* and Hiroshi Kurushima** * Department Museum Science, ** Department History National Museum Japanese History Sakura-shi, Chiba Pref., Japan Abstract As the first step in historical research it is very important to read historical documents. Historians like to peruse original documents directly, but as in practice this may be difficult conventional photographic systems are ten used. Historical documents vary greatly in type and size, therefore the application digital imaging systems promises to contribute to the development historical research. In this article, we at first point out the basic requirements a digital imaging system used for historical research. Then according to these requirements, we introduce an processing technique to improve the legibility historical documents by spatial frequency filters. Since the paper used in some historical documents is very thin, letters written on the reverse side the paper can be observed from the front side. As a result, letters on both sides the paper become mixed up when the documents are read. This is one reason for the legibility the documents to be degraded. We separate these two kinds letters having different frequency components by two kinds spatial frequency filters, which are fundamentally analogous to the un-sharp mask filter technique. These filters have different responsibilities in the frequency component to determine whether the letters are on the front side or not. The experimental results showed that the letters on the front side were separated well, but further considerations and additional experiments are necessary to improve the legibility the historical documents. Introduction In historical research based on historical documents, the first step is to read the documents carefully and deeply. Many kinds valuable documents are stored in private houses. Because in most cases the researchers cannot take these documents away, conventional photographic imaging systems such as micro-film, photo prints, and film readers are widely used so that the documents can be read in the researchers' laboratories. Recent digital imaging systems promise to benefit historical research with instant checking, low cost and some support digital processing techniques. Since the paper used in some historical documents is very thin, letters written on the reverse side the paper can be observed from the front. As a result, letters on both sides the paper become mixed up when the documents are read. This is one reason for the legibility the documents to be degraded. In this study, spatial frequency filters are designed to extract the letters written on the front side a document from s taken by a digital camera. In the design the filters, we suppose that the letters written on both sides originally have the same sharp edges, which have a high frequency component, but the letters on the reverse side are blurred by a low-pass filter effect the paper. We separate these two kinds letters having different frequency components by using two kinds spatial frequency filters, which are fundamentally analogous to the un-sharp mask filter technique. These filters have different responsibilities in the frequency component to determine whether the letters are on the front side or not. The details are described in the following sections. Basic Requirements for Historical Research Listed below are the basic requirements a digital imaging system for historical research carried out as an investigation by historians. ( Ease Use Usually historians have not had special training for taking s. Their investigations may be limited to a short period time, and therefore ease use is very important. ( Reasonable Image Quality Because high quality s can be taken by a photographer after the investigation, a reasonable quality is suitable at the investigation stage. The quality required in the investigation is sufficient for the documents to be read. Excessively high quality is not required. 291

( Instant Image Checking Many documents are fragile, making it difficult to take repeated s. It is therefore best to be able to check an immediately after it is shot. illuminant digital camera (4) Low Cost The number historical documents used in investigations is huge, therefore the cost taking s is a severe problem. host PC (5) Flexible and Simple Set-Up Imaging System Generally, there is no photographic studio at investigation sites, therefore imaging conditions such as lighting conditions are very poor. Furthermore, historical documents vary greatly in size and type, and usually there is not enough space to take an. A flexible and simple set-up is also important in the digital imaging system. (6) Support System The following support systems are helpful for historians: digital processing techniques including enlargement or placement s in a side-by-side position; some techniques for easy accessibility including data retrieval; data sharing through network systems; and easy maintenance the data. Reliability for long-term storage is also required. The improvement legibility described in this article is referred to as part the support system. Improvement Legibility Because the letters written in historical documents show slight gradation, with partial blurring or fadig, a conventional thresholding method applied to digital s has difficulty in determining whether the letters were written on the front or reverse side the paper. A method based on the spatial frequency information the letters is therefore proposed in this research. In terms spatial frequency, there are two kinds blur level for the contours the letters. The contours letters written on the front side the paper are sharp, whereas the contours letters on the reverse side are relatively blurred. If shift invariance can be assumed and the surface the paper is flat, the difference in blur level in the digital tells us which letters are written on the front side. After the separation the letters based on this idea, some kinds post- processing such as hi-pass filtering and mirror reverse for the reverse side letters can be applied to improve legibility. Figure 1 shows the experimental set-up, and Figure 2 is a flowchart the experiment. test sample tripod Figure 1. Experimental set-up. acquisition acquisition pre-processing pre-processing estimation estimation reflectance reflectance un-uniformity un-uniformity correction correction illumination illumination determination determination component component processing processing to to improve improve legibility legibility labeling labeling letter letter area area hi-pass hi-pass and and low-pass low-pass filtering filtering synthesis synthesis filtered filtered s s processed processed Figure 2. Flowchart the experiment. Image Acquisition A digital camera, which is a single lens reflex type, is used in this experiment. This camera has a CCD sensor to yield 8 bit/pixel in the R, G and B color channels. A single illuminant is used to provide a simplified set-up according to the basic requirements mentioned in the previous section. Un-uniformity is a problem under single illuminant lighting conditions. This is therefore corrected by a method mentioned in a later section. Figure 3 shows a test sample printed on both sides the printing paper using an ink jet color printer. There are 4 Japanese syllabary characters on the front side, and 3 letters on the reverse side. All the letters are printed using black ink only. The letters on the reverse side can be observed through the paper, and are mixed with the letters on the front side. This is a cause degradation in legibility. 292

2. Determination Spectral Component If a component in the reflectance the ink used in the historical document is given as prior information, it could provide useful information for extracting the letter area. However, the wavelength λ = 550 nm is used in this experiment. The showing single wavelength reflectance is used in the following sections. Processing to Improve Legibility Pre-processing Figure 3. Test sample. 2. Estimation Spectral Reflectance Many recent studies have addressed estimating the reflectance objects. Insar as the basic requirements this research are concerned, obtaining the reflectance historical documents fers many advantages. Therefore one method named the Wiener estimation method is applied to estimate the reflectance historical documents. The Wiener estimation matrix M is determined as follows. 1 M = R rv R vv -1 ( R rv = < rv t > ( R vv = < vv t > ( Vector r is the measured reflectance the Macbeth Color Checker, and vector v is the sensor response including higher order terms when the Checker is taken as a digital. Matrix R rv is a cross-correlation matrix between vector r and v. Matrix R vv is an autocorrelation matrix vector v. The symbol < > shows the ensemble average, and t shows the transpose the vector. Spectral data f(x,y,λ) is calculated from vector v which is a sensor response vector v including higher order pixel value in the digital f(x,y) by using the matrix M as follows. f(x,y,λ) = Mv (4) 2. Un-uniformity Correction Illumination The test sample and a white reference are taken under the same lighting conditions and camera settings. Each pixel value in both digital s is converted from a digital signal to reflectance using the Wiener estimation method, then the un-uniformity is corrected by the following equation. f'(x,y,λ) = f(x,y,λ)/f paper (x,y,λ) (5) In this experiment, conventional printing paper is used as a white reference in accordance with the case--use reason cited in the basic requirements. 3. Labeling Letter Area If we can consider only the frequency response in the imaging system, the taken g(x,y) is represented from the system response h(x,y) and the original f(x,y) as follows. g(x,y) = h(x,y)*f(x,y) (6) where symbol * means convolution integral. This equation is shown in the Fourier domain as follows. G(u,v) = H(u,v)F(u,v) (7) If the imaging system is shift invariant, H(u,v) is unique. The sharpness difference in G(u,v) is, therefore, caused by a difference in F(u,v). On the other hand, if the sharpness in F(u,v) is constant but the sharpness at the same area in G(u,v) is different, it is referred to as a change in H(u,v). In the un-sharp masking method, sharp regions such as edge areas are detected by low-pass filtering because sharp areas are more blurred by the low-pass filter than us-sharp areas such as smooth parts in the. If a different lowpass filter is applied to an, the different level sharp areas can be detected. In this experiment the difference means whether the letters are on the front side or the reverse. The letters written on the reverse side are more blurred than the letters on the front because letters on the reverse side are observed through the paper, which can be referred to as a light scattering layer. This characteristic can be applied to separate the letters on the front or reverse side with the following equation. { } { } l ' Φ 1 H σ (u,v)f(u,v) ( 1 x,y)= Φ 1 H σ (u,v)f( u,v) 2 where Φ -1 is an inverse Fourier transformation, H σ (u,v) is a Gaussian type low-pass filter, which has mean 0 and variance σ 2 as follows. (8) H σ (u,v)= exp u2 + v 2 2σ 2 (9) In this experiment, σ 2 2 > σ 2 1 is assumed, and its value is σ 2 1 is 1.0 and σ 2 2 is 400.0. The labeling the letter area is carried out by the following equation. 293

front l ' (x,y) t 1 l(x,y) = reverse t 1 > l ' (x,y) t 2 paper otherwise (10) The threshold t 1 and t 2 are determined experimentally. In this experiment, t 1 and t 2 are 18.0 and 12.0, respectively. 3. Hi-pass and Low-pass Filtering Because the MTF the human visual system has directional frequency response, 2 a hi-pass filtering method considering the dependency is introduced to obtain a reasonable visual filtering effect without unwanted artifacts after the filtering. In this experiment, the hi-pass filter is defined by the equations as follows where α and β are coefficients to control the filter effects. (a) Result the labeling. H h (u,v)=α k(w)exp u2 + v 2 2σ 2 (1 ( ) k ( w) = 1 β sin 2φ,φ = tan 1 v u (1 The low-pass filter is determined in this experiment as follows. H l (u,v) = k(w)exp u 2 +v 2 2σ 2 (1 (b) Extraction front side letters. 3. Synthesis Filtered Image The hi-pass filter is used only for the area detected as a target area, and the low-pass filter is affected for the resultant area. Actually the synthesis process is carried out using the labeling result in a pixel by pixel process. If the labeling l(x,y) shows the target area, then the pixel value the hi-pass filtered is selected as the pixel value the processed. On the other hand, if the labeling l(x,y) doesn't show the target area, the pixel value the low-pass filtered is selected. Figure 4(a) shows the result the labeling. In Figure 4(a), black, white, and gray areas show the front side, reverse side, and other parts respectively. Figure 4(b) and (c) show the results the synthesis for letters written on the front side and reverse side respectively. Figure 4(c) is mirror inverted. Discussion The experimental results showed that the designed filters were effective at extracting letters written on the front side the paper. However, for letters on the reverse side, the contours letters on the front side were falsely detected as part the letters on the reverse side. In addition, the thresholds to use in the separation the letter areas were determined experimentally. An analytical method to determine the thresholds from the results PSF measurement the paper 3 is important in future work. (c) Extraction reverse side letters. (mirror reversed) Figure 4. Results the experiment. In the proposed application a method for real historical documents, it is a problem that the surface the documents is not flat. The un-flat surface will cause a change blur level in the taken, therefore a correction method for this un-flat surface is required. In the correction, it would be effective to use information depth. The method proposed in this experiment has similarity to the Depth from Defocus (DFD) method in the sense using information blur level. 4 In the DFD, the depth map is obtained from changes sharpness in the, and it is analogous to the method introduced in this experiment. The combination DFD and the proposed method will be promised. 294

Conclusions The basic requirements a digital imaging system for historical documents were pointed out, and one them, the legibility the documents, was improved in this study. However, many things remain to be solved, and the improvement legibility has to be evaluated quantitatively by historians. Furthermore, a digital imaging system that can get texture information on the surface the historical documents and high accuracy color information is required for the application digital imaging systems to historical research. In addition, this application system could be used in galleries at museums as a part the support system for visitors. References 1. Norimichi Tsumura, et al, Estimation reflectance from multi-band s by multiple regression analysis, Japanese Journal Optics, Vol. 27, No. 7, pp. 384-391, 1998 (in Japanese) 2. Tetsuya Ishihara, et al, Dependence Directivity in Spatial Frequency Response the Human Eye ( -Mathematical Modeling Modulation Transfer Function-, Journal The Society Photographic Science and Technology Japan, Vol. 65, No. 2, pp.128-133, 2002 (in Japanese) 3. Chawan Koopipat, et al, Image Evaluation and Analysis Ink Jet Printing System (I) MTF Measurement and Analysis Ink Jet Images, Journal Imaging Science and Technology, Vol. 45, No. 6, pp. 591-597, 2001 4. Alex Paul Pentland, A New Sense for Depth Field, IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol. 9, No. 4, pp. 523-531, 1987 Biography Kimiyoshi Miyata received his ME and Ph.D. degrees in Imaging Science from Chiba University in 1992 and 2000 respectively. After working at Mitsubishi Electric Corporation for 9 years, he joined the Department Museum Science at the National Museum Japanese History in 2001. His research interests concern applications digital imaging studies to museum activities. In 2000 he was awarded the Progressing Award and Itek Award from SPSTJ and IS&T respectively. 295