AN INTEGRATED SYSTEM FOR HANDWRITTEN DOCUMENT IMAGE PROCESSING

Size: px
Start display at page:

Download "AN INTEGRATED SYSTEM FOR HANDWRITTEN DOCUMENT IMAGE PROCESSING"

Transcription

1 International Journal of Pattern Recognition and Artificial Intelligence Vol. 17, No. 4 (2003) c World Scientific Publishing Company AN INTEGRATED SYSTEM FOR HANDWRITTEN DOCUMENT IMAGE PROCESSING E. KAVALLIERATOU, N. DROMAZOU, N. FAKOTAKIS and G. KOKKINAKIS Wire Communications Laboratory, University of Patras, Patras, Greece ergina@wcl.ee.upatras.gr In this paper we attempt to face common problems of handwritten documents such as nonparallel text lines in a page, hill and dale writing, slanted and connected characters. Towards this end an integrated system for document image preprocessing is presented. This system consists of the following modules: skew angle estimation and correction, line and word segmentation, slope and slant correction. The skew angle correction, slope correction and slant removing algorithms are based on a novel method that is a combination of the projection profile technique and the Wigner Ville distribution. Furthermore, the skew angle correction algorithm can cope with pages whose text line skew angles vary, and handle them by areas. Our system can be used as a preprocessing stage to any handwriting character recognition or segmentation system as well as to any writer identification system. It was tested in a wide variety of handwritten document images of unconstrained English and Modern Greek text from about 100 writers. Additionally, combinations of the above algorithms have been used in the framework of the ACCeSS system (European project LE , aiming at the automatic processing of application forms of insurance companies) as well as in the processing of GRUHD and IAM-B databases for automating the procedure of extracting data. Keywords: Image feature; document image; skew angle estimation; slant correction. 1. Introduction A major problem of pattern recognition is the Optical Character Recognition (OCR). The OCR systems attempt to facilitate the everyday use of computers aiming at the transformation of large amounts of documents, either printed or handwritten, into electronic form for further processing. Since the early 1960s, numerous systems have been proposed. In contemporary times, the recognition of printed and handprinted isolated characters is performed with accuracy that exceeds 90%. 22 However, the recognition of unconstrained handwriting still remains an open research problem. Typical problems that complicate the automatic recognition of unconstrained handwriting are the nonparallel text lines in a page, the hill and dale writing, as well as the slanted and connected characters. These problems are illustrated in Fig

2 618 E. Kavallieratou et al. non-parallels text lines hill and dale writing Slanted and connected characters Fig. 1. Typical problems of handwritten documents. Currently, there are two main trends in the recognition of unconstrained handwriting. Some researchers perform character segmentation before the actual character recognition, 6 while others avoid the segmentation stage. 18 Nevertheless, whatever the application and the methodology are, the preprocessing stage, facing the problems mentioned in the previous paragraph, is very important both for character segmentation and character recognition. An insufficient solution of these preprocessing tasks may cause considerable losses of accuracy in any further processing procedure. Several methods have been proposed to solve the problem of skew detection. 10,11,15,16,20,21,27,29 Analytical reviews regarding these methods can be found in Refs. 2, 12 and 19. Although many of these approaches are satisfactorily accurate and fast, they are designed to handle documents in general. Consequently, they are not always able to deal with specific particularities of a handwritten document, such as the correction of nonparallel text lines within the same page. Nevertheless, Spitz 25 presents a technique able to detect multiple skew angles, applied mostly to printed compressed documents. The few proposed methods for handwritten or mixed (i.e. handwritten and printed) documents concern specific applications. 30 Chin 7 analyzes this problem and underlines that the methods that handle printed pages successfully do not manage to handle handwritten pages that well, or even if they do the computational cost is greater. On the other hand, the most commonly used method for slant estimation is the calculation of the average angle of near-vertical strokes. 5,6,18 This approach requires the detection of the edges of the characters and its accuracy strongly depends on the specific characters that compose the word. Shridar 24 presents two more methods

3 Handwritten Document Image Processing 619 Document image Skew Correction Line and Word segmentation Slope Correction Slant Removal Word image Fig. 2. The proposed system. for slant estimation and correction. In the first one, the vertical projection profile is used while the second one makes use of the chain code method. However, the evaluation of the slant correction approaches may be subjective since the slants can vary even within a single word. Additionally, in the relevant literature a slant correction procedure is rarely evaluated separately, so that comparative results cannot be given. In Refs. 23 and 24, two different slope correction methods are proposed. The first makes use of the least-squared error line fit to the bottom profile of the line image and the second attempts to find the baseline of the word in order to correct its skewing angle. In this paper, we present an integrated system, shown in Fig. 2, for handwritten document image analysis appropriate to be used as a preprocessor for handwritten character recognition systems. In particular, the presented system copes with three problems: (i) Skew estimation and correction of the document image. That is, the angle formed by the horizontal axis of the document image and the text lines. A new method based on the Wigner Ville distribution of the horizontal projection profile of the document image is proposed. Our approach is capable of localizing the areas of the handwritten document images with different skew angles automatically and handling each of them separately. Moreover, it is fast and accurate. (ii) Line and word segmentation. A variation of an already existing approach described by Shridar and Kimura 24 is followed since it combines ease of implementation and high accuracy results. (iii) Slope and slant correction. The term slope of a word is referred to the angle formed by the horizontal axis of the word and the corresponding text line while the term slant of characters is referred to characters whose vertical strokes are inclined regarding the vertical axis of the word. A similar method with the one used for skew estimation is followed. This approach is based on a simple algorithm and achieves very low response time cost. Moreover, it is not dependent on specific characters. In the next section a brief presentation of the Wigner Ville distribution, as it is used in the proposed system, is given. The modules of the system, mentioned above, are described in detail in Secs Finally, some experimental results and the conclusions drawn by this study are included in Secs. 6 and 7, respectively.

4 620 E. Kavallieratou et al. 2. Wigner Ville Distribution The Wigner Ville (WV) time and frequency distribution, W (t, ω), is a well-known representative of the energy distributions and member of the Cohen s class W (t, ω) = T s (t kt )e 2j ωkt s(t + kt ) π k= where s(t) represents the analytical signal and 1/T the sampling frequency. For a discrete signal s(n) and T = 1, the Wigner Ville Distribution (WVD) can be expressed as: W (n, θ) = 1 s (n k)2 2jθk s(n + k). π k= Moreover, the horizontal and vertical projection profiles of a black-and-white document image f(x, y) are: P h (x) = y f(x, y) P ν (y) = x f(x, y). By introducing the above equations in the WVD of a discrete signal, the spacefrequency distribution is expressed as: W (y, θ) = 1 P (y k)e 2jθk P (y + k). π k= The Wigner Ville function is a particularly popular distribution due to the large amount of desirable mathematical properties it satisfies. 26 Mecklenbrauker 8 proves the uniqueness of the WVD in the sense that it is that single energy distribution that possesses all the started desirable properties. This justifies the numerous applications of WVD in pattern recognition, 3,9 synthesis, 28 seismic signal, 4 and optics 14 etc. In the case of our application, the superiority of WVD over the other Cohen s class distributions has been proved in Ref. 12. In more detail, the tests with several distributions, members of the Cohen s class, proved WVD to be the best compromise between computational cost and accuracy. 3. Document Skew Angle Estimation and Correction As already mentioned, skew estimation and correction are performed in the first stage of our system by using the projection profile technique and the WVD. Specifically, the maximum intensity of the WVD of the horizontal histogram of a document image is used as the criterion for estimating its skew angle. Moreover, once the dominating skew angle of the entire document page has been detected and

5 Handwritten Document Image Processing 621 Fig. 3. Example of document page. corrected, the areas with different skew angles are localized by the deep and wide valleys of the curve of maximum intensity, and the skew estimation procedure is repeated for each area. The term dominant skew angle, here, is used for the skew angle that is met in a wider scale in the document image, as it will be later shown. The horizontal histogram of a properly oriented document page presents the maximum peaks and the most intent alternations between peaks and dips than any other histogram-by-angle of the same page. 21,24 The WVD of the histograms represent their time-frequency distribution, where in this case the time increases according to the height of the page. Consequently, the WVD presents maximum intensity for the histograms of 0 and 180 corresponding to the properly-oriented and the reverse one respectively, which contain the major peaks and dips alternations.

6 622 E. Kavallieratou et al. Fig. 4. The horizontal histograms and the corresponding curves of maximum intensity for several skew angles concerning the document image of Fig. 4. The closer the skew angle to 0 and 180, the larger the values of the maximum intensity. Hence our algorithm is able to detect skew angles varying between 89 and +89 in respect to the notional horizontal axis of properly-oriented document. Otherwise, the resultant page would be the reverse one. In Fig. 4, the horizontal histograms and the corresponding curves of maximum intensity for several skew angles, concerning the document image of Fig. 3, are shown. It has to be noted that the calculation of the histograms and the further application of the Wigner Ville function for every skew angle are not required, since it would imply unnecessary computational complexity and cost. Instead, at the beginning, an initial step (big step) is used for a first estimation, and then a rotation degree by degree is applied within a smaller range. The big step should be such that the whole area between +89 and 89 and only that should be covered. In the opposite case, if the covered area is greater than that of ±89, it is possible to end up with the page reversed. On the other hand, if the covered area is smaller than that and the skew angle greater than the covered area, the system will fail to correct the page. Then, in order to cover the whole area between +89 and 89, we calculated the sum of necessary rotations for various steps, including the rotations by the big step as well as the remaining rotations in the smaller range. The results are illustrated in Fig. 5. We note that for a big step of 12, the number of required rotations is minimized. In order to make use of it, the WVD and its maximum intensity are calculated for each histogram. Thus, a curve of maximum intensity corresponds to each angle. The

7 Handwritten Document Image Processing 623 amount of rotations big step Fig. 5. The relation between big step and rotations. The required rotations are minimized for a big step equal to 12. Fig. 6. The document image of Fig. 1 after the skew angle correction. curve that presents the most maxima throughout the space domain is selected. Moreover, in order to focus on the peaks of the curves we check only the curve values above a threshold. A tenth of the maximum peak was proved to be a good threshold in our experiments over 2000 documents (see Sec. 6).

8 624 E. Kavallieratou et al. In handwritten documents more than one skew angle are very often met. Most of the proposed approaches handle this case by estimating and correcting the skewing according to the dominant angle without taking the skew angle variations into consideration. Our approach deals with this problem, by processing the page by areas. Initially, the dominant skew angle, as already mentioned, is calculated and the page is corrected according to that. This correction is essentially in order to proceed to the localization of the areas with different skew angle on the page. The segmentation into areas is performed by taking into account that the change of the angle of successive text lines causes wider valleys than the usual of the histogram which are translated into dips at the curve of maximum intensity of its Wigner Ville distribution. In order to localize these dips and use them as boundaries of areas of different skew angle, we follow this procedure: The WVDs and the corresponding curves of maximum intensity for an angle range of ± 6 around the dominant skew angle are calculated. A step of 0.2 is used. Step smaller than that proved unnecessarily, since it cannot be detected in handwritten pages. The average value of the curves of maximum intensity is calculated. A threshold depending on the average value is determined (the 3/4 of the average value of the average curve is a good threshold). a The dips of the average curve of maximum intensity below this threshold define the boundaries of the areas. The dominant curve of maximum intensity in each area provides the corresponding skew angle. The angle of 6 either way of the document angle was used here in order to cover the omitted area of the initial step. Moreover, the assumption, that the variation in skew angles in handwritten text does not usually exceed that area, has been made here. However, any angle greater than that can be used, if necessary. In case the text features multiple changes in the line width, the above threshold may divide the page into more than the appropriate areas. Nevertheless, this is not an important shortcoming since each zone will be corrected separately. In Fig. 6, the document image of Fig. 1 after the skew angle correction is shown. 4. Line and Word Segmentation 4.1. Line segmentation The horizontal projection profile is most commonly employed for the line segmentation. If the page is skew corrected and the lines well separated, the histogram a Detailed experiments over unconstrained handwritten text from 1000 documents of GRUHD database derived this threshold. Values greater than that failed to localize in some cases all the areas with variation in skew angle while values smaller than that tend to split the text in many areas without variations in skew angle.

9 Handwritten Document Image Processing 625 will present well-separated peaks and valleys. However, in handwritten documents this method can be problematic because of the hill and dale writing as well as the ascenders and descenders of the characters that very often begin in a line and end up in another. Thus, the valleys of the histogram are not very clear. And a separation algorithm based strictly on this technique would cause the loss of important information (part of lines) and the introducing of noise (portions of ascenders and descenders). Although problematic, this method can give a good estimation of the number of the lines included in a document page. And this is the information that we extract from the horizontal projection profile of the page. The valleys with minima less than a certain threshold (60 pixels found to be a good threshold, Fig. 7) are considered to be likely beginners of line segments. Subsequently the area is examined pixel by pixel until an entire white path is outlined. The area of the valley is examined starting from the left edge of the page. While white pixels are met we continue towards the right edge of the page. When black pixels are met we move up and down looking for alternative paths. The zone of search is limited between the peaks of the histogram of the previous and next possible areas of text lines. In cases that two lines are merged because of long ascenders or descenders the local histogram is calculated and the path goes on passing from the minimum of the histogram. When we reach the right edge a crooked line has been outlined consisting of as much white pixels as possible. In Fig. 8, the document image of Fig. 1 after the line segmentation is shown. 120 success 100 rate (%) threshold (pixels) Fig. 7. The success rate of localization of line segments according to the used threshold after experiments performed using GRUHD database. 13

10 626 E. Kavallieratou et al. Fig. 8. The document image of Fig. 1 after the line segmentation Word Segmentation The vertical projection profile of each text line is employed in order to perform word segmentation. In contrast to the line segmentation, in handwriting the words are almost always well separated. However, the valleys of the histogram cannot be certain word boundaries, since untouched characters can also cause dips. These two cases can be distinguished by examining the width of the valley. In normal handwriting a gap between words is not normally shorter than the mean character width, while a gap in the word is unlikely to be longer than that. By mean width of character, we consider the width of characters such as a, b, c, d, etc. excluding the characters i, l, j, m, w that are either too narrow (i, l), or too wide (m, w). Thus the width of a character is a good criterion for the estimation of the word boundaries. Although the character width differs between characters and writers, a rough estimation of the mean width could be made by accepting that excluding the ascenders and descenders the characters with mean width (as defined above), present width equal to their height. Then by locating the height of the main body of the words, we have an approximate but satisfactory approach of the required threshold. In order to achieve that, we calculated the horizontal projection profile of a part of a text line. Although a part with more than several characters is appropriate, a tenth of the text line was used in our case. The upper and lower parts of the line where the value of the histogram falls under the 1/3 of its peak value (threshold extracted experimentally) are excluded. The remaining height is the required threshold. The valleys of the vertical projection profile of a text line with width greater than the above threshold are considered to be the boundaries between words. If it is known that the whole document is written by one person this estimation is enough to be carried out just once in a page otherwise it is safer to be done several times.

11 Handwritten Document Image Processing 627 Table 1. Statistical values measured over 1000 documents of GRUHD 13 database scanned at 300 dpi. Mean character width Mean character height Mean character spacing Mean word spacing 21,7 pixels 23,1 pixels 8,7 pixels 46,3 pixels Error Error Fig. 9. The document image of Fig. 1 after the word segmentation. In Fig. 9, the document image of Fig. 1 after the word segmentation is shown. 5. Slope and Slant Correction 5.1. Slope correction The skew angle estimation and correction of the document improves the whole document image and facilitates the further processing stages, especially if it handles the page by zones in respect to the different skew angles. However, the problem introduced by the hill and dale writing, still remains since despite the fact that the text lines have been corrected by the dominant angle, skewed words may still exist. As already mentioned our method deals with this problem by using a similar method to the document skew estimation (Sec. 3), based on the horizontal projection profile of the word and the WVD. The idea here is that since the envelope of the histogram is getting smoother when the word tends to be oriented at the vertical position (±90 ), and presents most peaks at the horizontal position (0 and 180 ), the corresponding WVD presents maximum intensity in the latter case. Thus, our algorithm consists of the following steps: (1) The word image is rotated in the range of ±89 using a one-degree step. (2) For each angle, the horizontal projection profile as well as the WVD are calculated.

12 628 E. Kavallieratou et al. (3) The curve of maximum intensity is extracted from each WVD. (4) The acquired curves are compared and the one with the maximum peak is selected. The corresponding rotation angle is the estimated skew angle of the word. It has to be noted that this algorithm is not able to handle very short words (i.e. less than four characters) that contain ascenders and/or descenders. In such cases the application of the presented algorithm may cause undesirable slopes. Due to this fact, we apply the threshold used in the word segmentation module in order to filter out words that fall into that category. Thus, the slope correction algorithm is not applied to words that meet the following criteria: word length < 3 main body height AND word height > main body height. For example, words such as The and by are likely to be filtered out. However, short words are not characterized by large slope angles. In the majority of cases, therefore, the skew correction of the entire text line will probably correct the slope of short words. In Fig. 10, the document image of Fig. 1 after the slope correction is shown. Fig. 10. The document image of Fig. 1 after the slope correction. The words, though independent word images by now, have been placed according to their original position in order to facilitate comparison.

13 Handwritten Document Image Processing Slant removal The character segmentation, a prerequisite stage in some systems, is a very difficult task of handwriting recognition. However, the slanted characters complicate it even more. On the other hand, even in handwriting recognition systems that do not perform character segmentation, the slanted characters cause various problems. A handwriting recognition system that does not deal with the slanted characters during the preprocessing stage, requires much more data for its training, in order to cover every case of inclined characters. In our system, we cope with this problem by automatically detecting the slant for each word and removing it. The same method as before is also used here, though the vertical projection profile of the word image is now used. A nonslanted word presents clear gaps between its characters, even if the characters are connected. These gaps introduce deep valleys in the vertical histogram of the word, while the characters draw, more or less sharp peaks. Instead, in a slanted word, the inclined characters cover the gaps between the characters and this is interpreted by less deep dips and smoother peaks in the vertical projection profile. The WVD is also used here to detect the projection profile with the most intent alternations. The procedure that is followed is the same as before and the slant that corresponds to the curve of maximum intensity with the greatest peak is selected. In order to slant a word image to right or left we follow the procedure below. The word image is segmented in equally-wide horizontal zones. The lowest zone is considered to be the base. The zone above the base is shifted one pixel to the right Fig. 11. The document image of Fig. 1 after the slant removal. The words, though independent word images by now, have been placed according to their original position in order to facilitate comparison.

14 630 E. Kavallieratou et al. or to the left. The next zone (if exists) is shifted two pixels to the same direction etc. The more the zones, the greater the slant angle. The maximum slant angle corresponds to one-pixel-high zones (i.e. when the amount of zones equals to the height h of the word image in pixels). In this case, the higher zone is shifted by h 1 pixels and the corresponding slant angle θ is the maximum one and it can be calculated as: tan θ = (h 1)/h 1 θ = 45, since h 1. In Fig. 11, the document image of Fig. 1 after the slant removal is shown. 6. Experimental Results The performance of the presented system has been tested on a collection of document images covering different scanning resolutions from dpi. In more detail, the test collection comprised roughly 100 unconstrained handwriting documents (mainly student notes), in both English and Modern Greek, b taken from corresponding writers. No special criteria were taken into account as far as it concerns the complexity and difficulty during the selection of those documents. As a result, documents of any length and complexity were included. Moreover, particular combinations of the presented algorithms have been used in several applications for automatic document image processing. Specifically, the skew angle correction, word segmentation, slope correction and slant removal algorithms were used in the system ACCeSS (European project LE ) aiming at the automatic processing application forms of insurance companies. The line segmentation, word segmentation, slope correction and slant removal algorithms were applied to GRUHD 13 and IAM-DB 17 databases in order to automate the conversion of the data in a form appropriate for further processing. The skew angle correction achieves a success rate of 99.4% with a confidence range of ±0.2 dealing with documents of skew angle ranging from 89 to +89. The response time strongly depends on the document resolution as well as the different skew angles that may be met in the document. However, since the accuracy of this algorithm is resolution-independent as it is shown in Table 2, we can drastically improve the time cost by transforming each document image into 50 dpi. On the other hand, the proposed technique for estimating the areas with different skew angles in a single document page achieves a success rate of roughly 85%. In Table 2, some experimental data of skew angle correction are presented. In general, the line and word segmentation algorithms cope with quite hard cases of high-density text as the example of Fig. 12, taken from GRUHD database. However, occasionally parts of descenders and ascenders are likely to be attached into an adjacent line, especially when characters of successive text lines are connected. b Both, English and Greek Alphabets, belong to the Alphabets of Indo-European languages and they do not require different processing as far as it concerns the preprocessing stages of OCR systems.

15 Handwritten Document Image Processing 631 Table 2. Experimental results of skew angle correction. Image Resolution Rate of Text Skew Angle Estimated No in dpi in the Page Border Graphics in Degrees Skew Angle % No No % No No % Yes No % No Yes % No Yes % No No % Yes No % Yes No % Yes Yes % No Yes % No Yes % Yes No % Yes No % No Yes % No No % No No % Yes No % No Yes % No No % No Yes % No No % No No % Yes No % Yes Yes % No No % Yes Yes % Yes No % Yes No % No Yes % No No Fig. 12. A difficult case of line segmentation as it is faced by our system. Moreover, the word segmentation algorithm may fail to distinguish consecutive words in the case of extremely slanted characters. The performance of the slope correction and slant segmentation algorithms proved to be satisfactory as well. The former reaches the success rate of 97%,

16 632 E. Kavallieratou et al. Skew Angle Correction Line Segmentation Word Segmentation Slope correction Slant Removal Fig. 13. The relative computational cost. bearing in mind that the short words (as defined in Sec. 5.1) are filtered out, while the latter overcomes the 98.5%. It is worth noting that even in the case of variant slanted characters within the same word, the slant removal algorithm generally improves the word image producing natural resultant images and facilitating the task of either character segmentation or character recognition. The computational cost is very much dependable on the size of the page as well as of the length of the text on the page. In the above-described experiments the mean computational cost was around 25 secs per page using a Pentium III at 1 GHz. However we consider more interesting the relative computational cost as it is presented in Fig Conclusions In this paper we presented an integrated system for handwritten image processing appropriate to be used as a preprocessing stage in any character segmentation or character recognition system. The proposed system consists of four main modules, namely skew angle estimation and correction, line and word segmentation, slope correction and slant removing stemming from the implementation of already existing as well as novel algorithms. A well known time-frequency distribution, the WVD, in combination with the projection profile technique is used in order to detect the skew angle, either of page or word images, as well as the slant of the characters. The idea behind these algorithms is that the alterations between peaks and dips of both the horizontal histogram of a properly oriented page/word image and the vertical histogram of a nonslanted word will be the most intent. According to the requirements of a certain application, either the entire system or specific combinations of its modules can be used. Moreover, since there are no special restrictions regarding the form of exchanging data, any module can be substituted by an already existing module that performs the same or a similar preprocessing task. In past, a similar work has been performed 30 for form-filling applications.

17 Handwritten Document Image Processing 633 References 1. A. Amiri, A. C. Downton, S. J. Hanlon, C. G. Leedham, S. M. Lucas and D. Monger, OSCAR: a visual programming toolkit for offline handwritten forms recognition, Proc. IWFHR4, 4th Int. Workshop on Frontiers in Handwriting Recognition, Taiwan, December 1994, pp A. Bagdanov and S. Kanai, Evaluation of document image skew techniques, Proc. SPIE, 1996, pp B. Boashash, B. Lovell and L. White, Time frequency analysis and pattern recognition using singular value decomposition of the Wigner Ville distribution, Advanced Algorithms and Architecture for Signal Processing, Proc. SPIE 828 (1987) P. Boles and B. Boashash, The cross Wigner Ville distribution a two dimensional analysis method for the processing of vibrosis seismic signals, Proc. IEEE ICASP 87, 1988, pp R. M. Bozinovic and S. N. Srihari, Off-line cursive script word recognition, IEEE Trans. PAMI, 11, 1 (1989) M. Y. Chen, A. Kundu, J. Zhou and S. N. Srihari, Off-line handwritten word recognition using HMM, U.S. Postal Service, 5th Adv. Technol. Conf., Washington, DC, 1992, pp W. Chin, A. Harvey and A. Jennings, Skew detection in handwritten scripts, Proc. IEEE Speech and Image Technologies for Computing and Telecommunications, 1997, pp T. A. Claasen and W. F. Mecklenbrauker, The Wigner distribution: tool for timefrequency signal analysis, Phillips J. Res. 35, 1 3 (1980) , and G. Cristobal, J. Bescos and J. Santamaria, Application of Wigner distribution for image representation and analysis, Proc IEEE 8th Int. Conf. Patt. Recogn., 1986, pp B. Gatos, N. Papamarkos and C. Chamzas, Skew detection and text line position determination in digitized documents, Patt. Recogn. 30, 9 (1997) H. Jiang, C. Han and K. Fan, A fast approach to the detection and correction of skew documents, Patt. Recogn. Lett. 18 (1997) E. Kavallieratos, N. Fakotakis and G. Kokkinakis, Skew angle estimation using Cohen s class distributions, Patt. Recogn. Lett. 20, (1999) E. Kavallieratou, N. Liolios, E. Koutsogiorgos, N. Fakotakis and G. Kokkinakis, The GRUHD database of modern Greek unconstrained handwriting, LREC (1999) O. Kenny and B. Boashash, An optical signal processing for time-frequency signal analysis using the Wigner Ville distribution, J. Elec. Electron. Eng. (1998) D. S. Le, G. R. Thoma and H. Wechsler, Automated page orientation and skew angle detection for binary document image, Patt. Recogn. 27, 10 (1994) J. Liu, C. Lee and R. Shu, An efficient method for the skew normalization of a document image, Proc. 12th Int. Conf. Pattern Recognition 1992, pp U. Marti and H. Bunke, A full English sentence database for off-line handwriting recognition, Proc. 5th Int. Conf. Document Analysis and Recognition, ICDAR 99, Bangalore, 1999, pp M. Mohamed and P. Gader, Handwritten word recognition using segmentation-free hidden Markov modeling and segmentation-based dynamic programming techniques, IEEE Trans. PAMI 18, 5 (1996) L. O Gorman, The document spectrum for page layout analysis, IEEE Trans. Patt. Anal. Mach. Intell. 15, 11 (1993)

18 634 E. Kavallieratou et al. 20. T. Pavlidis and J. Zhou, Page segmentation by white streams, Proc. 1st Int. Conf. Document Analysis and Recognition (ICDAR), International Association of Pattern Recognition, 1991, pp G. S. Peake and T. N. Tan, A general algorithm for document skew angle estimation, IEEE Int. Conf. Image Process. 2 (1997) H. Penz, I. Bajla, A. Vrabl, W. Krattenthaler and K. Mayer, Fast real-time recognition and quality inspection of printed characters via point-correlation, Proc. SPIE 4303 (2001) A. W. Senior and A. J. Robinson, An off-line cursive handwriting recognition system, IEEE Trans. Patt. Anal. Mach. Intell. 20, 3 (1998) M. Shridar and F. Kimura, Handwritten address interpretation using word recognition with and without lexicon, Proc. IEEE Int. Conf. Systems, Man and Cybernetics, Piscataway, NJ, USA, Vol. 3, 1995, pp Spitz, Analysis of compressed document images for document skews, multiple skew and logotype detection, CVIU 70, 3 (1998) Ville, Theorie et applicacions de la notion de signal analytique, Cable et Trasmission A2 (1948) J. Wang, M. K. H. Leung and S. C. Hui, Cursive word reference line detection, Patt. Recogn. 30, 3 (1997) K. B. Yu and S. Cheng, Signal synthesis from Wigner distribution, Proc. IEEE ICASSP 85, 1985, pp B. Yu and A. K. Jain, A robust and fast skew detection algorithm for generic documents, Patt. Recogn. 29, 10 (1996) B. Yu and A. K. Jain, A generic system for form dropout, IEEE Trans. Patt. Anal. Mach. Intell. 18, 11 (1996)

19 Handwritten Document Image Processing 635 E. Kavallieratou received her Diploma in electrical and computer engineering in 1996 from the Polytechnic School of the University of Patras and her Ph.D. in handwritten optical character recognition and document image processing from the same department in During the academic year she was a member of the Signals, Systems and Radiocomunications Laboratory of the Department of Telecommunications Engineering of the Polytechnic School of Madrid, working on Image Processing. During 2000 and 2001, she joined as guest researcher at the Institute of Communication Acoustics of Ruhr- Universitaet Bochum, Germany. Since 2002, she has been Assistant Professor of Audio Processing in Department of Audio and Musical Instruments Technology in Technological Educational Institute of Epirus, Greece. Her research interests include optical character recognition, document image analysis, signal processing, audio and image processing. N. Dromazou received her Diploma in electrical and computer engineering in 2000 from the Polytechnic School of the University of Patras. During the academic year she completed her thesis project in the field of optical character recognition. During this period she worked as a research assistant in the Wire Communications Laboratory of the Electrical and Computer Engineering Department for the creation of an Integrated System for Handwritten Document Image Processing. She also worked as a research assistant in the creation of a Greek handwritten character database. Since 2000, she has been working in the design center of ATMEL HELLAS as a software engineer specified in the telecommunications department.

20 636 E. Kavallieratou et al. N. Fakotakis received the B.Sc. degree from the University of London (UK) in electronics in 1978, the M.Sc. degree in electronics from the University of Wales (UK), and the Ph.D. in speech processing from the University of Patras (Greece) in From 1986 to 1992, he was a Lecturer in the Electrical and Computer Engineering Department of the University of Patras, from 1992 to 1999, he was an Assistant Professor and since 1999, he is an Associate Professor in the area of speech and natural language processing and Head of the Speech and Language Processing Group at the Wire Communications Laboratory. He is the author of over 100 publications in the area of speech and natural language engineering. Dr. Fakotakis is a member of the Executive Board of ELSNET (European Language and Speech Network of Excellence), Editorin-Chief of the European Student Journal on Language and Speech, WEB-SLS. He is also a member of IEEE, TEE, EURASIP, ESCA. His current research interests include speech modeling, speech recognition/understanding, speaker recognition, spoken dialogue processing, natural language processing and optical character recognition. G. K. Kokkinakis received the Diploma in electrical engineering (Dipl.-Ing.) in 1961, the Doctor s Degree in engineering (Dr.-Ing) in 1966 and the Diploma in engineering economics (Dipl.-Wirt.-Ing) in 1967, all from the Technical University of Munich (Technische Hochschule Munchen). During he served at the Ministry of Coordination in Athens. Since 1969 he is with the Department of Electrical Engineering at the University of Patras, where he has organized and is directing the Wire Communications Laboratory (WCL). The WCL is a partner in several EU projects and in other R&D projects financed by the Greek Telecom Industry, the Greek General Secretariat for Research and Technology, etc. He has published several books on telecommunications and electrotechnology and over 250 technical papers, articles and reports on telecommunications and speech technology. Dr. Kokkinakis is a senior member of IEEE and a member of the Technical Chamber of Greece (TEE), the VDE (Verein Deutscher Elektrotechniker), ESCA (European Speech Communication Association), the EURASIP (European Association for Signal Processing), the EEEE (Greek Operations Research Society), and the Linguistics Society of America (LSA). Since 1997, he is a member of the board of ISCA. His current activity in research and development, which coincides with the activity of WCL, includes the analysis, synthesis recognition and linguistic processing of the Greek language and the design and optimization of telecom networks.

21

Locally baseline detection for online Arabic script based languages character recognition

Locally baseline detection for online Arabic script based languages character recognition International Journal of the Physical Sciences Vol. 5(7), pp. 955-959, July 2010 Available online at http://www.academicjournals.org/ijps ISSN 1992-1950 2010 Academic Journals Full Length Research Paper

More information

A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron

A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron Proc. National Conference on Recent Trends in Intelligent Computing (2006) 86-92 A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron

More information

Contrast adaptive binarization of low quality document images

Contrast adaptive binarization of low quality document images Contrast adaptive binarization of low quality document images Meng-Ling Feng a) and Yap-Peng Tan b) School of Electrical and Electronic Engineering, Nanyang Technological University, Nanyang Avenue, Singapore

More information

Method for Real Time Text Extraction of Digital Manga Comic

Method for Real Time Text Extraction of Digital Manga Comic Method for Real Time Text Extraction of Digital Manga Comic Kohei Arai Information Science Department Saga University Saga, 840-0027, Japan Herman Tolle Software Engineering Department Brawijaya University

More information

Automatic Licenses Plate Recognition System

Automatic Licenses Plate Recognition System Automatic Licenses Plate Recognition System Garima R. Yadav Dept. of Electronics & Comm. Engineering Marathwada Institute of Technology, Aurangabad (Maharashtra), India yadavgarima08@gmail.com Prof. H.K.

More information

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition Hetal R. Thaker Atmiya Institute of Technology & science, Kalawad Road, Rajkot Gujarat, India C. K. Kumbharana,

More information

A New Character Segmentation Approach for Off-Line Cursive Handwritten Words

A New Character Segmentation Approach for Off-Line Cursive Handwritten Words Available online at www.sciencedirect.com Procedia Computer Science 17 (2013 ) 88 95 Information Technology and Quantitative Management (ITQM2013) A New Character Segmentation Approach for Off-Line Cursive

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

Extraction and Recognition of Text From Digital English Comic Image Using Median Filter

Extraction and Recognition of Text From Digital English Comic Image Using Median Filter Extraction and Recognition of Text From Digital English Comic Image Using Median Filter S.Ranjini 1 Research Scholar,Department of Information technology Bharathiar University Coimbatore,India ranjinisengottaiyan@gmail.com

More information

License Plate Localisation based on Morphological Operations

License Plate Localisation based on Morphological Operations License Plate Localisation based on Morphological Operations Xiaojun Zhai, Faycal Benssali and Soodamani Ramalingam School of Engineering & Technology University of Hertfordshire, UH Hatfield, UK Abstract

More information

A Fast Segmentation Algorithm for Bi-Level Image Compression using JBIG2

A Fast Segmentation Algorithm for Bi-Level Image Compression using JBIG2 A Fast Segmentation Algorithm for Bi-Level Image Compression using JBIG2 Dave A. D. Tompkins and Faouzi Kossentini Signal Processing and Multimedia Group Department of Electrical and Computer Engineering

More information

Detection and Verification of Missing Components in SMD using AOI Techniques

Detection and Verification of Missing Components in SMD using AOI Techniques , pp.13-22 http://dx.doi.org/10.14257/ijcg.2016.7.2.02 Detection and Verification of Missing Components in SMD using AOI Techniques Sharat Chandra Bhardwaj Graphic Era University, India bhardwaj.sharat@gmail.com

More information

An Improved Bernsen Algorithm Approaches For License Plate Recognition

An Improved Bernsen Algorithm Approaches For License Plate Recognition IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) ISSN: 78-834, ISBN: 78-8735. Volume 3, Issue 4 (Sep-Oct. 01), PP 01-05 An Improved Bernsen Algorithm Approaches For License Plate Recognition

More information

Combined Approach for Face Detection, Eye Region Detection and Eye State Analysis- Extended Paper

Combined Approach for Face Detection, Eye Region Detection and Eye State Analysis- Extended Paper International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 10, Issue 9 (September 2014), PP.57-68 Combined Approach for Face Detection, Eye

More information

Multicomponent Multidimensional Signals

Multicomponent Multidimensional Signals Multidimensional Systems and Signal Processing, 9, 391 398 (1998) c 1998 Kluwer Academic Publishers, Boston. Manufactured in The Netherlands. Multicomponent Multidimensional Signals JOSEPH P. HAVLICEK*

More information

Algorithm for Detection and Elimination of False Minutiae in Fingerprint Images

Algorithm for Detection and Elimination of False Minutiae in Fingerprint Images Algorithm for Detection and Elimination of False Minutiae in Fingerprint Images Seonjoo Kim, Dongjae Lee, and Jaihie Kim Department of Electrical and Electronics Engineering,Yonsei University, Seoul, Korea

More information

Study and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for feature extraction

Study and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for feature extraction International Journal of Scientific and Research Publications, Volume 4, Issue 7, July 2014 1 Study and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for

More information

Robust Voice Activity Detection Based on Discrete Wavelet. Transform

Robust Voice Activity Detection Based on Discrete Wavelet. Transform Robust Voice Activity Detection Based on Discrete Wavelet Transform Kun-Ching Wang Department of Information Technology & Communication Shin Chien University kunching@mail.kh.usc.edu.tw Abstract This paper

More information

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis Mohini Avatade & S.L. Sahare Electronics & Telecommunication Department, Cummins

More information

IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP

IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP LIU Ying 1,HAN Yan-bin 2 and ZHANG Yu-lin 3 1 School of Information Science and Engineering, University of Jinan, Jinan 250022, PR China

More information

Segmentation of Fingerprint Images Using Linear Classifier

Segmentation of Fingerprint Images Using Linear Classifier EURASIP Journal on Applied Signal Processing 24:4, 48 494 c 24 Hindawi Publishing Corporation Segmentation of Fingerprint Images Using Linear Classifier Xinjian Chen Intelligent Bioinformatics Systems

More information

Content Based Image Retrieval Using Color Histogram

Content Based Image Retrieval Using Color Histogram Content Based Image Retrieval Using Color Histogram Nitin Jain Assistant Professor, Lokmanya Tilak College of Engineering, Navi Mumbai, India. Dr. S. S. Salankar Professor, G.H. Raisoni College of Engineering,

More information

R. K. Sharma School of Mathematics and Computer Applications Thapar University Patiala, Punjab, India

R. K. Sharma School of Mathematics and Computer Applications Thapar University Patiala, Punjab, India Segmentation of Touching Characters in Upper Zone in Printed Gurmukhi Script M. K. Jindal Department of Computer Science and Applications Panjab University Regional Centre Muktsar, Punjab, India +919814637188,

More information

Image binarization techniques for degraded document images: A review

Image binarization techniques for degraded document images: A review Image binarization techniques for degraded document images: A review Binarization techniques 1 Amoli Panchal, 2 Chintan Panchal, 3 Bhargav Shah 1 Student, 2 Assistant Professor, 3 Assistant Professor 1

More information

Recovery of badly degraded Document images using Binarization Technique

Recovery of badly degraded Document images using Binarization Technique International Journal of Scientific and Research Publications, Volume 4, Issue 5, May 2014 1 Recovery of badly degraded Document images using Binarization Technique Prof. S. P. Godse, Samadhan Nimbhore,

More information

Real Time Word to Picture Translation for Chinese Restaurant Menus

Real Time Word to Picture Translation for Chinese Restaurant Menus Real Time Word to Picture Translation for Chinese Restaurant Menus Michelle Jin, Ling Xiao Wang, Boyang Zhang Email: mzjin12, lx2wang, boyangz @stanford.edu EE268 Project Report, Spring 2014 Abstract--We

More information

AN APPROACH ON RECOGNITION OF HAND-WRITTEN LETTERS

AN APPROACH ON RECOGNITION OF HAND-WRITTEN LETTERS AN APPROACH ON RECOGNITION OF HAND-WRITTEN LETTERS Ahmet ÇINAR, Erdal ÖZBAY Fırat University Faculty Of Engineering Computer Engineering 23119 Elazig TURKEY acinar@firat.edu.tr Abstract In this study,

More information

Multi-Script Line identification from Indian Documents

Multi-Script Line identification from Indian Documents Multi-Script Line identification from Indian Documents U. Pal, S. Sinha and B. B. Chaudhuri Computer Vision and Pattern Recognition Unit Indian Statistical Institute 203 B. T. Road, Kolkata-700108, INDIA

More information

A Review of Optical Character Recognition System for Recognition of Printed Text

A Review of Optical Character Recognition System for Recognition of Printed Text IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 3, Ver. II (May Jun. 2015), PP 28-33 www.iosrjournals.org A Review of Optical Character Recognition

More information

Adaptive Fingerprint Binarization by Frequency Domain Analysis

Adaptive Fingerprint Binarization by Frequency Domain Analysis Adaptive Fingerprint Binarization by Frequency Domain Analysis Josef Ström Bartůněk, Mikael Nilsson, Jörgen Nordberg, Ingvar Claesson Department of Signal Processing, School of Engineering, Blekinge Institute

More information

Colored Rubber Stamp Removal from Document Images

Colored Rubber Stamp Removal from Document Images Colored Rubber Stamp Removal from Document Images Soumyadeep Dey, Jayanta Mukherjee, Shamik Sural, and Partha Bhowmick Indian Institute of Technology, Kharagpur {soumyadeepdey@sit,jay@cse,shamik@sit,pb@cse}.iitkgp.ernet.in

More information

Robust Document Image Binarization Techniques

Robust Document Image Binarization Techniques Robust Document Image Binarization Techniques T. Srikanth M-Tech Student, Malla Reddy Institute of Technology and Science, Maisammaguda, Dulapally, Secunderabad. Abstract: Segmentation of text from badly

More information

An Hybrid MLP-SVM Handwritten Digit Recognizer

An Hybrid MLP-SVM Handwritten Digit Recognizer An Hybrid MLP-SVM Handwritten Digit Recognizer A. Bellili ½ ¾ M. Gilloux ¾ P. Gallinari ½ ½ LIP6, Université Pierre et Marie Curie ¾ La Poste 4, Place Jussieu 10, rue de l Ile Mabon, BP 86334 75252 Paris

More information

Defense Technical Information Center Compilation Part Notice

Defense Technical Information Center Compilation Part Notice UNCLASSIFIED Defense Technical Information Center Compilation Part Notice ADPO 11345 TITLE: Measurement of the Spatial Frequency Response [SFR] of Digital Still-Picture Cameras Using a Modified Slanted

More information

Locating the Query Block in a Source Document Image

Locating the Query Block in a Source Document Image Locating the Query Block in a Source Document Image Naveena M and G Hemanth Kumar Department of Studies in Computer Science, University of Mysore, Manasagangotri-570006, Mysore, INDIA. Abstract: - In automatic

More information

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Kiwon Yun, Junyeong Yang, and Hyeran Byun Dept. of Computer Science, Yonsei University, Seoul, Korea, 120-749

More information

Fast pseudo-semantic segmentation for joint region-based hierarchical and multiresolution representation

Fast pseudo-semantic segmentation for joint region-based hierarchical and multiresolution representation Author manuscript, published in "SPIE Electronic Imaging - Visual Communications and Image Processing, San Francisco : United States (2012)" Fast pseudo-semantic segmentation for joint region-based hierarchical

More information

Optical Character Recognition for Hindi

Optical Character Recognition for Hindi Optical Character Recognition for Hindi Prasanta Pratim Bairagi Assistant Professor, Department of CSE, Assam down town University, Assam, India ---------------------------------------------------------------------***---------------------------------------------------------------------

More information

A Novel Morphological Method for Detection and Recognition of Vehicle License Plates

A Novel Morphological Method for Detection and Recognition of Vehicle License Plates American Journal of Applied Sciences 6 (12): 2066-2070, 2009 ISSN 1546-9239 2009 Science Publications A Novel Morphological Method for Detection and Recognition of Vehicle License Plates 1 S.H. Mohades

More information

Mr. Chairpersons, Ladies, and Gentlemen, it is indeed a great honor for me to be given this opportunity to address you on the occasion of this

Mr. Chairpersons, Ladies, and Gentlemen, it is indeed a great honor for me to be given this opportunity to address you on the occasion of this Mr. Chairpersons, Ladies, and Gentlemen, it is indeed a great honor for me to be given this opportunity to address you on the occasion of this special meeting. My English name is Sean Zhu, I come from

More information

Fuzzy Statistics Based Multi-HE for Image Enhancement with Brightness Preserving Behaviour

Fuzzy Statistics Based Multi-HE for Image Enhancement with Brightness Preserving Behaviour International Journal of Engineering and Management Research, Volume-3, Issue-3, June 2013 ISSN No.: 2250-0758 Pages: 47-51 www.ijemr.net Fuzzy Statistics Based Multi-HE for Image Enhancement with Brightness

More information

AN EFFICIENT THINNING ALGORITHM FOR ARABIC OCR SYSTEMS

AN EFFICIENT THINNING ALGORITHM FOR ARABIC OCR SYSTEMS AN EFFICIENT THINNING ALGORITHM FOR ARABIC OCR SYSTEMS Mohamed A. Ali Department of Computer Science, Sabha University, Sabha, Libya fadeel1@sebhau.edu.ly ABSTRACT This paper address an efficient iterative

More information

Sound Source Localization using HRTF database

Sound Source Localization using HRTF database ICCAS June -, KINTEX, Gyeonggi-Do, Korea Sound Source Localization using HRTF database Sungmok Hwang*, Youngjin Park and Younsik Park * Center for Noise and Vibration Control, Dept. of Mech. Eng., KAIST,

More information

IEEE Signal Processing Letters: SPL Distance-Reciprocal Distortion Measure for Binary Document Images

IEEE Signal Processing Letters: SPL Distance-Reciprocal Distortion Measure for Binary Document Images IEEE SIGNAL PROCESSING LETTERS, VOL. X, NO. Y, Z 2003 1 IEEE Signal Processing Letters: SPL-00466-2002 1) Paper Title Distance-Reciprocal Distortion Measure for Binary Document Images 2) Authors Haiping

More information

An Algorithm and Implementation for Image Segmentation

An Algorithm and Implementation for Image Segmentation , pp.125-132 http://dx.doi.org/10.14257/ijsip.2016.9.3.11 An Algorithm and Implementation for Image Segmentation Li Haitao 1 and Li Shengpu 2 1 College of Computer and Information Technology, Shangqiu

More information

Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts

Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts Olarik Surinta and Rapeeporn Chamchong Department of Management Information Systems and Computer Science Faculty of Informatics,

More information

An Efficient Method for Contrast Enhancement in Still Images using Histogram Modification Framework

An Efficient Method for Contrast Enhancement in Still Images using Histogram Modification Framework Journal of Computer Science 8 (5): 775-779, 2012 ISSN 1549-3636 2012 Science Publications An Efficient Method for Contrast Enhancement in Still Images using Histogram Modification Framework 1 Ravichandran,

More information

A new method to recognize Dimension Sets and its application in Architectural Drawings. I. Introduction

A new method to recognize Dimension Sets and its application in Architectural Drawings. I. Introduction A new method to recognize Dimension Sets and its application in Architectural Drawings Yalin Wang, Long Tang, Zesheng Tang P O Box 84-187, Tsinghua University Postoffice Beijing 100084, PRChina Email:

More information

Skeletonization Algorithm for an Arabic Handwriting

Skeletonization Algorithm for an Arabic Handwriting Skeletonization Algorithm for an Arabic Handwriting MOHAMED A. ALI, KASMIRAN BIN JUMARI Dept. of Elc., Elc. and sys, Fuculty of Eng., Pusat Komputer Universiti Kebangsaan Malaysia Bangi, Selangor 43600

More information

Speech Synthesis using Mel-Cepstral Coefficient Feature

Speech Synthesis using Mel-Cepstral Coefficient Feature Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract

More information

Stamp detection in scanned documents

Stamp detection in scanned documents Annales UMCS Informatica AI X, 1 (2010) 61-68 DOI: 10.2478/v10065-010-0036-6 Stamp detection in scanned documents Paweł Forczmański Chair of Multimedia Systems, West Pomeranian University of Technology,

More information

Image Enhancement using Histogram Equalization and Spatial Filtering

Image Enhancement using Histogram Equalization and Spatial Filtering Image Enhancement using Histogram Equalization and Spatial Filtering Fari Muhammad Abubakar 1 1 Department of Electronics Engineering Tianjin University of Technology and Education (TUTE) Tianjin, P.R.

More information

NEURALNETWORK BASED CLASSIFICATION OF LASER-DOPPLER FLOWMETRY SIGNALS

NEURALNETWORK BASED CLASSIFICATION OF LASER-DOPPLER FLOWMETRY SIGNALS NEURALNETWORK BASED CLASSIFICATION OF LASER-DOPPLER FLOWMETRY SIGNALS N. G. Panagiotidis, A. Delopoulos and S. D. Kollias National Technical University of Athens Department of Electrical and Computer Engineering

More information

IMPROVEMENT USING WEIGHTED METHOD FOR HISTOGRAM EQUALIZATION IN PRESERVING THE COLOR QUALITIES OF RGB IMAGE

IMPROVEMENT USING WEIGHTED METHOD FOR HISTOGRAM EQUALIZATION IN PRESERVING THE COLOR QUALITIES OF RGB IMAGE Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 5, May 2014, pg.913

More information

Coding and Analysis of Cracked Road Image Using Radon Transform and Turbo codes

Coding and Analysis of Cracked Road Image Using Radon Transform and Turbo codes Coding and Analysis of Cracked Road Image Using Radon Transform and Turbo codes G.Bhaskar 1, G.V.Sridhar 2 1 Post Graduate student, Al Ameer College Of Engineering, Visakhapatnam, A.P, India 2 Associate

More information

Iris Recognition using Histogram Analysis

Iris Recognition using Histogram Analysis Iris Recognition using Histogram Analysis Robert W. Ives, Anthony J. Guidry and Delores M. Etter Electrical Engineering Department, U.S. Naval Academy Annapolis, MD 21402-5025 Abstract- Iris recognition

More information

A DEVELOPED UNSHARP MASKING METHOD FOR IMAGES CONTRAST ENHANCEMENT

A DEVELOPED UNSHARP MASKING METHOD FOR IMAGES CONTRAST ENHANCEMENT 2011 8th International Multi-Conference on Systems, Signals & Devices A DEVELOPED UNSHARP MASKING METHOD FOR IMAGES CONTRAST ENHANCEMENT Ahmed Zaafouri, Mounir Sayadi and Farhat Fnaiech SICISI Unit, ESSTT,

More information

Implementation of Text to Speech Conversion

Implementation of Text to Speech Conversion Implementation of Text to Speech Conversion Chaw Su Thu Thu 1, Theingi Zin 2 1 Department of Electronic Engineering, Mandalay Technological University, Mandalay 2 Department of Electronic Engineering,

More information

Subband Analysis of Time Delay Estimation in STFT Domain

Subband Analysis of Time Delay Estimation in STFT Domain PAGE 211 Subband Analysis of Time Delay Estimation in STFT Domain S. Wang, D. Sen and W. Lu School of Electrical Engineering & Telecommunications University of ew South Wales, Sydney, Australia sh.wang@student.unsw.edu.au,

More information

http://www.diva-portal.org This is the published version of a paper presented at SAI Annual Conference on Areas of Intelligent Systems and Artificial Intelligence and their Applications to the Real World

More information

An Improved Binarization Method for Degraded Document Seema Pardhi 1, Dr. G. U. Kharat 2

An Improved Binarization Method for Degraded Document Seema Pardhi 1, Dr. G. U. Kharat 2 An Improved Binarization Method for Degraded Document Seema Pardhi 1, Dr. G. U. Kharat 2 1, Student, SPCOE, Department of E&TC Engineering, Dumbarwadi, Otur 2, Professor, SPCOE, Department of E&TC Engineering,

More information

Chapter 17. Shape-Based Operations

Chapter 17. Shape-Based Operations Chapter 17 Shape-Based Operations An shape-based operation identifies or acts on groups of pixels that belong to the same object or image component. We have already seen how components may be identified

More information

A Comprehensive Survey on Kannada Handwritten Character Recognition and Dataset Preparation

A Comprehensive Survey on Kannada Handwritten Character Recognition and Dataset Preparation A Comprehensive Survey on Kannada Handwritten Character Recognition and Dataset Preparation Kiran Y. C Research Scholar, Jain University Associate Professor, Dept. of ISE Dayananda Sagar College of Engineering

More information

An Analysis of Image Denoising and Restoration of Handwritten Degraded Document Images

An Analysis of Image Denoising and Restoration of Handwritten Degraded Document Images Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 12, December 2014,

More information

MAV-ID card processing using camera images

MAV-ID card processing using camera images EE 5359 MULTIMEDIA PROCESSING SPRING 2013 PROJECT PROPOSAL MAV-ID card processing using camera images Under guidance of DR K R RAO DEPARTMENT OF ELECTRICAL ENGINEERING UNIVERSITY OF TEXAS AT ARLINGTON

More information

Extraction of Newspaper Headlines from Microfilm for Automatic Indexing

Extraction of Newspaper Headlines from Microfilm for Automatic Indexing Extraction of Newspaper Headlines from Microfilm for Automatic Indexing Chew Lim Tan 1, Qing Hong Liu 2 1 School of Computing, National University of Singapore, 3 Science Drive 2, Singapore 117543 Email:

More information

An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi

An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi Department of E&TC Engineering,PVPIT,Bavdhan,Pune ABSTRACT: In the last decades vehicle license plate recognition systems

More information

THE Touchless SDK released by Microsoft provides the

THE Touchless SDK released by Microsoft provides the 1 Touchless Writer: Object Tracking & Neural Network Recognition Yang Wu & Lu Yu The Milton W. Holcombe Department of Electrical and Computer Engineering Clemson University, Clemson, SC 29631 E-mail {wuyang,

More information

Keyword: Morphological operation, template matching, license plate localization, character recognition.

Keyword: Morphological operation, template matching, license plate localization, character recognition. Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Automatic

More information

Image Enhancement in Spatial Domain

Image Enhancement in Spatial Domain Image Enhancement in Spatial Domain 2 Image enhancement is a process, rather a preprocessing step, through which an original image is made suitable for a specific application. The application scenarios

More information

Document Processing for Automatic Color form Dropout

Document Processing for Automatic Color form Dropout Rochester Institute of Technology RIT Scholar Works Articles 12-7-2001 Document Processing for Automatic Color form Dropout Andreas E. Savakis Rochester Institute of Technology Christopher R. Brown Microwave

More information

An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods

An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods 19 An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods T.Arunachalam* Post Graduate Student, P.G. Dept. of Computer Science, Govt Arts College, Melur - 625 106 Email-Arunac682@gmail.com

More information

Compression Method for Handwritten Document Images in Devnagri Script

Compression Method for Handwritten Document Images in Devnagri Script Compression Method for Handwritten Document Images in Devnagri Script Smita V. Khangar, Dr. Latesh G. Malik Department of Computer Science and Engineering, Nagpur University G.H. Raisoni College of Engineering,

More information

An Algorithm for Fingerprint Image Postprocessing

An Algorithm for Fingerprint Image Postprocessing An Algorithm for Fingerprint Image Postprocessing Marius Tico, Pauli Kuosmanen Tampere University of Technology Digital Media Institute EO.BOX 553, FIN-33101, Tampere, FINLAND tico@cs.tut.fi Abstract Most

More information

Modelling, Simulation and Computing Laboratory (msclab) School of Engineering and Information Technology, Universiti Malaysia Sabah, Malaysia

Modelling, Simulation and Computing Laboratory (msclab) School of Engineering and Information Technology, Universiti Malaysia Sabah, Malaysia 1.0 Introduction During the recent years, image processing based vehicle license plate localisation and recognition has been widely used in numerous areas:- a) Entrance admission b) Speed control Modelling,

More information

Refined Slanted-Edge Measurement for Practical Camera and Scanner Testing

Refined Slanted-Edge Measurement for Practical Camera and Scanner Testing Refined Slanted-Edge Measurement for Practical Camera and Scanner Testing Peter D. Burns and Don Williams Eastman Kodak Company Rochester, NY USA Abstract It has been almost five years since the ISO adopted

More information

NOISE ESTIMATION IN A SINGLE CHANNEL

NOISE ESTIMATION IN A SINGLE CHANNEL SPEECH ENHANCEMENT FOR CROSS-TALK INTERFERENCE by Levent M. Arslan and John H.L. Hansen Robust Speech Processing Laboratory Department of Electrical Engineering Box 99 Duke University Durham, North Carolina

More information

Automatic License Plate Recognition System using Histogram Graph Algorithm

Automatic License Plate Recognition System using Histogram Graph Algorithm Automatic License Plate Recognition System using Histogram Graph Algorithm Divyang Goswami 1, M.Tech Electronics & Communication Engineering Department Marudhar Engineering College, Raisar Bikaner, Rajasthan,

More information

Abstract. Most OCR systems decompose the process into several stages:

Abstract. Most OCR systems decompose the process into several stages: Artificial Neural Network Based On Optical Character Recognition Sameeksha Barve Computer Science Department Jawaharlal Institute of Technology, Khargone (M.P) Abstract The recognition of optical characters

More information

ANALYSIS OF PARTIAL IRIS RECOGNITION

ANALYSIS OF PARTIAL IRIS RECOGNITION ANALYSIS OF PARTIAL IRIS RECOGNITION Yingzi Du, Robert Ives, Bradford Bonney, Delores Etter Electrical Engineering Department, U.S. Naval Academy, Annapolis, MD, USA 21402 ABSTRACT In this paper, we investigate

More information

Recursive Plateau Histogram Equalization for the Contrast Enhancement of the Infrared Images

Recursive Plateau Histogram Equalization for the Contrast Enhancement of the Infrared Images 2 3rd International Conference on Computer and Electrical Engineering ICCEE 2) IPCSIT vol. 53 22) 22) IACSIT Press, Singapore DOI:.7763/IPCSIT.22.V53.No..7 Recursive Plateau Histogram Equalization for

More information

Machine-printed and hand-written text lines identi cation

Machine-printed and hand-written text lines identi cation Pattern Recognition Letters 22 2001) 431±441 www.elsevier.nl/locate/patrec Machine-printed and hand-written text lines identi cation U. Pal, B.B. Chaudhuri * Computer Vision and Pattern Recognition Unit,

More information

Author(s) Corr, Philip J.; Silvestre, Guenole C.; Bleakley, Christopher J. The Irish Pattern Recognition & Classification Society

Author(s) Corr, Philip J.; Silvestre, Guenole C.; Bleakley, Christopher J. The Irish Pattern Recognition & Classification Society Provided by the author(s) and University College Dublin Library in accordance with publisher policies. Please cite the published version when available. Title Open Source Dataset and Deep Learning Models

More information

A DUAL TREE COMPLEX WAVELET TRANSFORM CONSTRUCTION AND ITS APPLICATION TO IMAGE DENOISING

A DUAL TREE COMPLEX WAVELET TRANSFORM CONSTRUCTION AND ITS APPLICATION TO IMAGE DENOISING A DUAL TREE COMPLEX WAVELET TRANSFORM CONSTRUCTION AND ITS APPLICATION TO IMAGE DENOISING Sathesh Assistant professor / ECE / School of Electrical Science Karunya University, Coimbatore, 641114, India

More information

MLP for Adaptive Postprocessing Block-Coded Images

MLP for Adaptive Postprocessing Block-Coded Images 1450 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 10, NO. 8, DECEMBER 2000 MLP for Adaptive Postprocessing Block-Coded Images Guoping Qiu, Member, IEEE Abstract A new technique

More information

Time-Frequency Distributions for Automatic Speech Recognition

Time-Frequency Distributions for Automatic Speech Recognition 196 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 9, NO. 3, MARCH 2001 Time-Frequency Distributions for Automatic Speech Recognition Alexandros Potamianos, Member, IEEE, and Petros Maragos, Fellow,

More information

Multimodal Face Recognition using Hybrid Correlation Filters

Multimodal Face Recognition using Hybrid Correlation Filters Multimodal Face Recognition using Hybrid Correlation Filters Anamika Dubey, Abhishek Sharma Electrical Engineering Department, Indian Institute of Technology Roorkee, India {ana.iitr, abhisharayiya}@gmail.com

More information

A new seal verification for Chinese color seal

A new seal verification for Chinese color seal Edith Cowan University Research Online ECU Publications 2011 2011 A new seal verification for Chinese color seal Zhihu Huang Jinsong Leng Edith Cowan University 10.4028/www.scientific.net/AMM.58-60.2558

More information

Subminiature Multi-stage Band-Pass Filter Based on LTCC Technology Research

Subminiature Multi-stage Band-Pass Filter Based on LTCC Technology Research International Journal of Information and Electronics Engineering, Vol. 6, No. 2, March 2016 Subminiature Multi-stage Band-Pass Filter Based on LTCC Technology Research Bowen Li and Yongsheng Dai Abstract

More information

Libyan Licenses Plate Recognition Using Template Matching Method

Libyan Licenses Plate Recognition Using Template Matching Method Journal of Computer and Communications, 2016, 4, 62-71 Published Online May 2016 in SciRes. http://www.scirp.org/journal/jcc http://dx.doi.org/10.4236/jcc.2016.47009 Libyan Licenses Plate Recognition Using

More information

8.2 IMAGE PROCESSING VERSUS IMAGE ANALYSIS Image processing: The collection of routines and

8.2 IMAGE PROCESSING VERSUS IMAGE ANALYSIS Image processing: The collection of routines and 8.1 INTRODUCTION In this chapter, we will study and discuss some fundamental techniques for image processing and image analysis, with a few examples of routines developed for certain purposes. 8.2 IMAGE

More information

Proposed Method for Off-line Signature Recognition and Verification using Neural Network

Proposed Method for Off-line Signature Recognition and Verification using Neural Network e-issn: 2349-9745 p-issn: 2393-8161 Scientific Journal Impact Factor (SJIF): 1.711 International Journal of Modern Trends in Engineering and Research www.ijmter.com Proposed Method for Off-line Signature

More information

Effective and Efficient Fingerprint Image Postprocessing

Effective and Efficient Fingerprint Image Postprocessing Effective and Efficient Fingerprint Image Postprocessing Haiping Lu, Xudong Jiang and Wei-Yun Yau Laboratories for Information Technology 21 Heng Mui Keng Terrace, Singapore 119613 Email: hplu@lit.org.sg

More information

Drum Transcription Based on Independent Subspace Analysis

Drum Transcription Based on Independent Subspace Analysis Report for EE 391 Special Studies and Reports for Electrical Engineering Drum Transcription Based on Independent Subspace Analysis Yinyi Guo Center for Computer Research in Music and Acoustics, Stanford,

More information

Open Access An Improved Character Recognition Algorithm for License Plate Based on BP Neural Network

Open Access An Improved Character Recognition Algorithm for License Plate Based on BP Neural Network Send Orders for Reprints to reprints@benthamscience.ae 202 The Open Electrical & Electronic Engineering Journal, 2014, 8, 202-207 Open Access An Improved Character Recognition Algorithm for License Plate

More information

A Chinese License Plate Recognition System

A Chinese License Plate Recognition System A Chinese License Plate Recognition System Bai Yanping, Hu Hongping, Li Fei Key Laboratory of Instrument Science and Dynamic Measurement North University of China, No xueyuan road, TaiYuan, ShanXi 00051,

More information

Iris Segmentation & Recognition in Unconstrained Environment

Iris Segmentation & Recognition in Unconstrained Environment www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume - 3 Issue -8 August, 2014 Page No. 7514-7518 Iris Segmentation & Recognition in Unconstrained Environment ABSTRACT

More information

Implementation of License Plate Recognition System in ARM Cortex A8 Board

Implementation of License Plate Recognition System in ARM Cortex A8 Board www..org 9 Implementation of License Plate Recognition System in ARM Cortex A8 Board S. Uma 1, M.Sharmila 2 1 Assistant Professor, 2 Research Scholar, Department of Electrical and Electronics Engg, College

More information

A Custom-made MATLAB Based Software to Manage Leakage Current Waveforms

A Custom-made MATLAB Based Software to Manage Leakage Current Waveforms ETASR - Engineering, Technology & Applied Science Research Vol. 1, No.2, 2011, 36-42 36 A Custom-made MATLAB Based Software to Manage Leakage Current Waveforms Dionisios Pylarinos High Voltage Lab University

More information

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position

Applying the Filtered Back-Projection Method to Extract Signal at Specific Position Applying the Filtered Back-Projection Method to Extract Signal at Specific Position 1 Chia-Ming Chang and Chun-Hao Peng Department of Computer Science and Engineering, Tatung University, Taipei, Taiwan

More information