Improving Optical Character Recognition Process for Low Resolution

Size: px
Start display at page:

Download "Improving Optical Character Recognition Process for Low Resolution"

Transcription

1 Improving Optical Character Recognition Process for Low Resolution Images 1 Imad Qasim Habeeb, 2 Shahrul Azmi Mohd Yusof, 3 Faudziah B. Ahmad 1, First Author Iraqi Commission for Computers and Informatics, Iraq, emadkassam@yahoo.com *2,Corresponding Author Universiti Utara Malaysia, shahrulazmi@uum.edu.my 3 Universiti Utara Malaysia, fudz@uum.edu.my Abstract Optical Character Recognition (OCR) systems often generate errors for images with noise or with low scanning resolution. In this paper, a novel approach that can be used to improve and restore the quality of any clean lower resolution images for easy recognition by OCR process. The method relies on the production of four copies of the original image so that each picture undergoes different restoration processes. These four copies of the images are then passed to a single OCR engine in parallel. In addition to that, the method does not need any traditional alignment between the four resulting texts, which is time consuming and needs complex calculation. It implements a new procedure to choose the best among them and can be applied without prior training on errors. The experimental results show improvement in word error rate for low resolution images by more than 67%. 1. Introduction Keywords: OCR, Low resolution image, Alignment resulting text, Multi inputs The process of optical character recognition (OCR) extracts the text in images so that it can be modified and searched [1]. Output of OCR systems often produces errors when the images contain noise or the scanning resolution is low [2, 3]. The optimal resolution to scan images for most OCR systems is 300 dots per inch (dpi) [4]. Fig. 1 shows output of two OCR engines with scanning resolution of 300 dpi and 72 dpi respectively. It shows the effect of low resolution image to OCR systems. The two OCR engines used are Tesseract [5] which is supported by Google Inc, and Asprise [6] which is a commercial software. Figure 1. The difference in OCR output for two images having different resolution Low resolution images can be extracted from sequence of low quality video [3, 7] or when dealing with available images having low resolution like what is available in thousands of documents' images in the Internet. Increasing image resolution after the scanning process will not add more details to the image unless the original scanning resolution is high [8]. This research designed effective method that can be used to deal with any available image that has a low resolution to improve and restore its quality for easy recognition by OCR process. To determine word error rate (WER) for images having low resolution, several documents containing words were scanned twice at 72 dpi, the first with a gray level, and the second with one bit black/white. The texts were extracted from Wikipedia's website. The documents International Journal of Advancements in Computing Technology(IJACT) Volume 6, Number 3, May

2 contain normal text without any layout or images. All documents' images were passed to two OCR engines, Tesseract version 3.02 and Asprise version 4.0 respectively. The output of both engines gave an average WER of greater than 64% which is high. The resulting WER for both engines is shown in Table 1. Due to the high rate of errors, a new method aimed at improving the image quality so as to reduce WER is proposed. Table 1. Word error rate for two types of images by two engines Engine Tesseract version 3.02 Asprise version 4.0 WER for Gray images WER for Binary images Average WER 65% 79% 72% 57% 71% 64% Resolution of the image represents its quality, i.e. the higher the resolution the higher the quality. The image becomes clearer, sharper, and more detailed when the resolution is high. On the other hand, its file size becomes larger and the number of pixels increase [9]. For example, in case of scanning a single page of text as image with resolution 75 dpi and store it in a bmp format, the size of file is nearly 353KB. And when the same page is scanned at 300 dpi, the size of file is almost 5240 KB; the rate of the increase in file size and number of pixels is practically 15 times; this means more processing time for OCR systems. For this reason, the proposed method clearly will decrease file size in the hard disk, and will increase speed of the OCR systems, though the scope of this research is limited to reducing the OCR errors for low resolution images. Furthermore, these images are so blurry because the pixels are not enough to represent all the details of the characters, this causes several letters seems to be touching each causing difficult to differentiate the outlines of these characters. Such poor-quality images can cause OCR segmentation and features extraction processes to become more complex. For examples, many characters' blocks in the segmentation process may contain noise, sometimes part of character in one block and the rest in another block; in other cases, a single block contains two or more touching characters [3]. Using several values for the threshold when converting grayscale image to a binary image can lead to different results from a single OCR engine [10, 11]. The proposed method will improve this situation by not classifying the values of images between 0 and 255, but to group them into three classes. After that, various operations are performed such as characters cleaning and restoration based on these classes to produce four images. These images are sent to same OCR engine to produce four outputs, where the best is chosen among them. The details of these operations are given in section 3. Using more than one input will lead to the problem of the alignment of resulting texts of OCR engines. This problem needs a long time and complex calculation, especially for number of the characters greater than 2500 [12, 13]. The proposed method used multiple inputs with alignment between the words only, while the previous works on related methods require alignment line by line or alignment between completes resulting texts before words or characters were aligned. Furthermore, the alignment of the words is easier and need simple alignment methods because the number of the characters in any words at most does not exceed 20 [13] as described in section 3. The resulting errors from OCR process can be classified into two types: non-word errors and real word errors. The first means: the words that do not exist in the lexicon, such as the word "foed". The second means: the words that exist in the lexicon, but unsuitable for the sentence, such as the word "too" in sentence "I want too eat" [14, 15]. This method in addition to improving the accuracy of the image before entering the OCR engine can perform multiple-pass decoding in detecting and correcting these errors without prior training. The contributions of this research are: (1) a new method that improves WER of OCR systems when the inputs are free of noisy images having low resolution, and (2) A new alignment method that uses multi-inputs with alignment between the words only. The paper is organized in 14

3 five sections: section 1 presented the introduction; section 2 discusses related work on OCR error correction; section 3 explains the proposed method and its implementation. In section 4, interface and data collected are defined, also experimental results and evaluation. The last section includes conclusions and future work of our research. 2. Related work of OCR Error Correction The proposed method involves three themes: multi-inputs, low-resolution images, and OCR post-processing error correction. A lot of researches relies on multi-inputs for OCR engines to improve accuracy of OCR. For examples, Lopresti and Zhou [16] stated that scanning a page multiple times, then entering images into OCR engine, and running a voting procedure to select the best among them will eliminate 20 to 50% from text errors resulting from a single OCR engine. The method does not require training; the alignment between the output texts was done line by line, with scanning resolution of 300dpi. Lund and Ringger [12] take advantage of the differences between the outputs of three OCR engines to improve accuracy. Their method enhanced the A* algorithm used in the alignments of the three OCR outputs, which resulted in reduction of WER from 22.9% to 10.3%. The alignment process used complete output texts of three OCR engines, while test images were scanned at 400 dpi. As second attempt, Lund and Ringger [17] created a decision list from in-domain training data that was used to select the best output of the three OCR engines. A* algorithm with Reverse Dijkstra admissible was used in the alignment of the complete output texts from the three engines. The method leads to a 19.5% improvement in WER compared to the best single OCR. Progressive alignment of five different OCR Engines was presented in the third attempt of Lund, Walker, and Ringger [18]. The method used maximum entropy model to select the best output from five different OCR Engines. Total output texts are used in alignment of five OCR engines; the scanning resolution was 1500 dpi for the documents' images. A 24.6% improvement in WER relative to the optimal one from the five OCR engines was attained. At fourth attempt, Lund, Kennar and Ringger [11] improved OCR accuracy by using seven values to the binarization threshold for single image, so that multiple images were passed to the same OCR engine. Progressive alignment of the outputs for total texts was used. The test images were scanned at 400 dots per inch. 2.68% of all tokens used in the test corpus were corrected by the method. The method proposed in this research compared to the prior methods, in addition to multi-inputs, will employ: (1) algorithm to clean, and restore the characters, (2) no previous training on errors, and (3) testing stage uses low scanning resolution (72 dpi) images. For the researchers that considered low-resolution images: Jacobs, Simard, Viola, and Rinker [8] presented a camera-based OCR system that can improve and recognize poor-quality documents' images. The system used machine learning approach and consists of two parts: character recognizer and word recognizer. Character recognizer is implemented using neural network which is used to predict the character at specific location in the image, while the word recognizer is used to find the word inside a given box in the image. The system trained on a large amount of data with recognition accuracy of between 80-95% on images captured with size of and font size of 10-point. This accuracy is achieved when the system uses a language model. On the other hand, the processing time was slow taking about 2 minutes and 40 seconds to produce the output for medium-sized paragraph. Ma and Agam [3] presented a super resolution framework that was based on machine learning for low resolution images using K- means algorithm. The goal of the framework was: reconstructing an excellent resolution image from a low-quality image to enhance accuracy of OCR. The results of method showed 50% error rate reduction. The testing images were re-sampled from high-resolution images of to lower resolution ratios of (1:2, 1:3, and 1:4 sampling rates) so that it can be used in the experiments. However, the proposed method in this study will improve WER compared to the previous methods. The last theme in the related work is OCR post-processing error correction, which means correcting the errors after OCR engine generates the text. It can be divided into several categories: (1) proofreading-based correction, (2) lexicon-based correction, and (3) contextbased correction [19]. Proofreading-based correction requires humans to read and rewrite text produced from the OCR process. This is inefficient as it is time-consuming, especially when the 15

4 number of words is in the thousands. While the lexicon-based correction is used to identify the non-words errors; the error happens when the word resulted from the OCR does not exist in the lexicon [14]. Lastly, the context-based takes into account the words surrounding the wrong word. It is more complex than the previous techniques and can detect real word errors [19]. Examples of OCR post-processing error correction methods include: Naseem and Hussain [20] used the similarity in shape among characters in words. Guyon and Pereira [21] proposed grammar rules. Lapata and Keller [22] suggested word count, i.e., the frequency of the word in the web or corpus is used in selecting the right word. Mays, Damerau, and Mercer [23] used two techniques in their method: first was a dictionary to identify and process non-word errors, and the second was a language model to identify and correct the grammar errors. Liu, Babad, Sun, and Chan [24] presented a matrix of sequence and count of characters for all words resulted from OCR, where the incorrect word was replaced with any word that has the highest count in the matrix. Tong, Zhai, Milic, and Evans [25] proposed confusion sets that used common errors in words. Bassil and Alwani [19] used Google's spelling and suggestions. Choudhury, et al. [26] suggested probability based language model. The proposed method in this research will not use complex method, but will use an only lexicon as an integral part in the correction process. 3. The Proposed method in this research The research involves six major steps: (S1) extract words' images from document image and store in an array, (S2) pass each word image in the array sequentially to four processes, (S3) perform cleaning, restoration, and resample on each word image in any of the four processes based on different conditions, (S4) each OCR engine will receive words images from one process in sequence, (S5) apply a procedure to select the best word resulting from the four OCR engines, and (S6) compile all words in one output text. Figure 3 shows the proposed method. Figure 2. The proposed method framework In S1, the document image is checked for gray scale. If the image is colored, then the document image is converted to gray-scale before further processing. This method does not accept binary images. This method does not accept binary images. The document image is searched through to locate words and denote the words as blocks. Thus, each block will contain a word image. At the end, words of the document image are extracted and stored as an array of word images. The first reason for extracting only words' images is if the document image is passed to multiple OCR directly, problem of the alignment of output texts will occur. More so, it is computationally complex and take long time for lengthy sequences [12, 13]. The proposed method does not need this type of alignment as described more in the decision and collection stage. The second reason is most OCR systems begin with identifying words in the image and then split words into letters' blocks before features' extraction [27], thus it can reduce 16

5 processing time by selecting words in the image once, rather than repeat it in each OCR engine. It is to be noted that the words are easy to be identified because of the presence of spaces between them. The spaces between the letters are very small and sometimes attached to each other, especially when the scanning resolution of image is low [3]. An example of spaces between two words or two characters is shown in Fig. 1. Next, in S1, each pixel value in the word image is compared with a threshold value '220'. If the pixel value is greater than 220, then the value is changed to 255. The aim is to remove some weak pixels. Furthermore, search each word image from left to right to find any vertical line having a height start from top of word image to bottom of it, and having a width as one pixel; this line must not contain any pixel value less than 160, if found it, then all values of the pixels in this line become equal to 255. This is to facilitate the splitting of word image into letters (Fig. 4). The threshold values '220' and '160' were chosen based on experiments that were conducted to specifically identify threshold values so that the process does not lead to neglect or distort other important information. The output of S1 is an array of words images. Figure 3. Word image before and after preprocessing & extraction stage Next in S2, each word image is passed in sequence to four processes (S3). Here, in S3, the proposed method classifies and converts the pixel values of a word image into three classes ranging between 0 and 255. This is followed by performing restoration to the characters. The classification and conversion of the values change four times by using a variable threshold. This means value of threshold named x changes during each process. It takes the value (130, 150, 170, and 190) for processes (1, 2, 3, and 4) respectively (Fig. 5). Steps S3 and S4 are performed in a multithreads manner, (in parallel) to reduce processing time [28]. Furthermore, OCR engines in this method are not different but are multiple copies of the Tesseract engine version In Figure 4. Processes conditions 17

6 process 1, several operations are performed for each word image. Firstly, each pixel value smaller than x becomes zero; the reason is: to confirm the strong pixels. On the other hand, any value of pixel greater than x remains the same, the reason is: the restoration process will be performed on them. The second operation in process 1 identifying all pixels having values between (x+1) and (x+20), located beside pixels having values equal to zero as shown in fig. 5. These will become primary starting pixels for the process of restoration. Figure 5. Example of position of starting pixels In the restoration stage, each primary starting pixel has cycle of operations: (1) value of starting pixel is changed to zero, and (2) all the neighboring pixels from all sides with values not equal to zero are arranged in ascending order so that the pixel having the smallest value becomes a secondary starting pixel on condition that its value is between (x+1) and (x+20). If these conditions are met, then all previous operations (1 and 2) are performed for the new starting pixel and so on, otherwise the cycle is ended for current starting pixel, and another cycle for next primary starting pixel is initiated. The last operation in process1 is increasing the resolution of each word image to 300 dpi because most of OCR engines are optimized at this resolution. All the operations are the same in processes 2, 3 and 4, except that the value of the variable x becomes 150, 170 and 190 respectively. Previous values of x were selected based on the results from a series of experiments that were conducted to choose the best values between 0 and 255. As previously mentioned, the processes 1, 2, 3 and 4 in addition to the OCR engines are implemented in parallel to reduce the processing time. For this, any word image will be sent to the four processes to produce four words images (S5), each one passing through one OCR engine to turn into a word, so that the final results are four words sent to S6. Fig. 6 show the steps in decision & collection stage. Figure 6. The steps in decision & collection stage The first step in decision & collection stage selects only unique words from the four words resulting from OCR engines. Next, checking each word if it belongs to the lexicon. If there is just one, it is 18

7 marked as correct, otherwise, if more than one belongs to the lexicon, the valid word that contains most frequently among non shared letters will be selected. The third is creating an array named "a_shared" containing the letters that exist in the four words. Array "a_shared" is created using characters-based bigram model. For example, if the four words resulted from OCR engines are "imoge", "imagc", "imdgc", and "imogc", then array "a_shared" would contain ("im", "m?", "?g", "g?"). The fourth step suggests a list of words not exceed five words from the lexicon for each unique word. The last step will choose any word from suggestions' list having the largest number of letters in the array "a_shared". If more than one satisfies the previous conditions, only word that contains most frequently among non shared letters will be selected. The resulting words will be used to build the output text by putting space between them. 4. Results and evaluation To evaluate the proposed method, a prototype is developed using VB.NET. It uses Tesseract version 3.02 as OCR engine to convert images into text. Tesseract engine is a software library supported by Google Inc [5]. The prototype will pass documents images to OCR engine before and after using the proposed method and display the results. Experiments will test several English documents containing words; the texts in these documents are from Wikipedia's website and acts as reference or standard text. The texts do not contain any layout or images having five font sizes and five font types. Sclite toolkit supported by the National Institute of Standards and Technology was used to compute WER, by comparing a reference text with OCR output text [1, 29]. To generate two types of test images from the reference, the text is first printed on papers. Then the hardcopy are applied to two types of experiments: (1) the documents are scanned at 72 dpi with a gray level in a normal scanner to produce 487x662 images, and (2) the same documents are snapped by a 1024x768 cameras. All images result from this camera are resample to produce several low resolution 500x680 images, the default resolution of the camera is 75 dpi. The two groups of images resulted from the scanner, and the camera are tested. Results showed that the output of Tesseract engine without using the proposed method was very poor having an average WER of more than 69% and 78% respectively for both scanner images and camera images. Table 2 shows the comparison results. Source of Images Table 2. Results of the proposal method testing Average WER (Using Tesseract engine only) Average WER (Using Tesseract engine and the proposed method) Scanner 69.35% 2.21% Camera 78.82% 4.63% From Table 2, it can be observed that the average WER for scanner images is 69.35% when using only Tesseract engine and 2.21 % when using Tesseract engine and the proposed method. Average WER for camera images is 78.82% when using only Tesseract engine and is 4.63% when using both Tesseract engine and the proposed method. The difference in accuracy between scanner images and camera images is due to variations in lighting between camera and scanner, and properties of each device. The results in Table 2 also show that the proposed method improves the average WER. The alignment of multiple outputs of the proposed method was compared with two methods named "1" [18] and "2" [4]. Factors of comparison are: type of alignment and probability of the error in the alignment as shown in Table 3. First row shows that the alignment used in the proposed method requires less complex calculations because it was dealing with only four words each time, in contrast to the other two methods that deal with long sequences of strings. The second row reveals there is a possibility of error in the alignment for the methods 1 and 2 19

8 [4, 18], while it is zero for proposed method. All the experiments in this research confirmed that proposed method is better than the other two methods. Table 3. Comparison of the proposed method with two related methods Type of alignment and probability of the error in the alignment Method 1 Method 2 The Proposed method Alignment type sequence by sequence page by page word by word Probability of the error in the alignment Cannot guarantee 100% error free in alignment [18] Cannot guarantee 100% error free in alignment [4] Guaranteed 100% error free in alignment 5. Conclusion and Future Work OCR systems accuracy is very high in ideal conditions, but the error rate increases when the images contain noise, the scanning resolution is low, and a cursive written typed languages. This paper presents a new method which can restore characters' quality in weak resolution images before passing them to OCR engines. The method excludes traditional alignment among resulting texts used by related methods; and also no training is needed on errors before executing it. Furthermore, it performs a procedure to select the best among the output texts and correct wrong words if they occurred in the output texts. In addition to that, this method can be used for any language with simple modification. The experiment results show that this method will reduce WER of output text considerably. Further research can be done for more improvement in WER for systems of OCR for the cases of: the noisy images, low resolution images, or cursive writing languages. In addition to that, there is a need to minimize data set size on hard disk of N-grams based-language models, which perform well in correcting the errors that resulted after OCR process. This is done without affecting the speed of data access for its effective inclusion in desktop OCR applications. 6. References [1] S. Impedovo, L. Ottaviano, and S. Occhinegro, "Optical character recognition a survey," International Journal of Pattern Recognition and Artificial Intelligence, vol. 5, pp. 1-24, [2] B. Alex, C. Grover, E. Klein, and R. Tobin, "Digitised Historical Text: Does it have to be mediocre?," [3] D. Ma and G. Agam, "A super resolution framework for low resolution document image OCR," in IS&T/SPIE Electronic Imaging, 2013, pp P-86580P-9. [4] M. Volk, L. Furrer, and R. Sennrich, "Strategies for Reducing and Correcting OCR Errors," in Language Technology for Cultural Heritage, ed: Springer, 2011, pp [5] Google Inc. (2014, January 02). Tesseract-ocr v3.02. Available: [6] LAB Asprise. (2014, January 05). Asprise OCR SDK library v4.0. Available: [7] D. Ma and G. Agam, "Lecture video segmentation and indexing," in IS&T/SPIE Electronic Imaging, 2012, pp V-82970V-8. [8] C. Jacobs, P. Y. Simard, P. Viola, and J. Rinker, "Text recognition of low-resolution document images," in Document Analysis and Recognition, Proceedings. Eighth International Conference on, 2005, pp [9] B. J. Dawson, "Method and apparatus for dynamically selecting an image compression process based on image size and color resolution," ed: Google Patents, [10] M. R. Gupta, N. P. Jacobson, and E. K. Garcia, "OCR binarization and image pre-processing for searching historical documents," Pattern Recognition, vol. 40, pp ,

9 [11] W. B. Lund, D. J. Kennard, and E. K. Ringger, "Why multiple document image binarizations improve OCR," presented at the Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing, Washington, District of Columbia, [12] W. B. Lund and E. K. Ringger, "Improving optical character recognition through efficient multiple system alignment," in Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries, 2009, pp [13] I. Elias, "Settling the intractability of multiple alignment," Journal of Computational Biology, vol. 13, pp , [14] J. F. Daðason, "Post-Correction of Icelandic OCR Text," (Master's thesis, University of Iceland, Reykjavik, Iceland), [15] X. Sun, J. Gao, D. Micol, and C. Quirk, "Learning phrase-based spelling error models from clickthrough data," in Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, 2010, pp [16] D. Lopresti and J. Zhou, "Using consensus sequence voting to correct OCR errors," Computer Vision and Image Understanding, vol. 67, pp , [17] W. B. Lund and E. K. Ringger, "Error Correction with In-Domain Training Across Multiple OCR System Outputs," in Document Analysis and Recognition (ICDAR), 2011 International Conference on, 2011, pp [18] W. B. Lund, D. D. Walker, and E. K. Ringger, "Progressive alignment and discriminative error correction for multiple OCR engines," in Document Analysis and Recognition (ICDAR), 2011 International Conference on, 2011, pp [19] Y. Bassil and M. Alwani, "Ocr post-processing error correction algorithm using google online spelling suggestion," arxiv preprint arxiv: , [20] T. Naseem and S. Hussain, "A novel approach for ranking spelling error corrections for Urdu," Language Resources and Evaluation, vol. 41, pp , [21] I. Guyon and F. Pereira, "Design of a linguistic postprocessor using variable memory length Markov models," in Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on, 1995, pp [22] M. Lapata and F. Keller, "The Web as a baseline: Evaluating the performance of unsupervised Web-based models for a range of NLP tasks," in Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004, pp [23] E. Mays, F. J. Damerau, and R. L. Mercer, "Context based spelling correction," Information Processing & Management, vol. 27, pp , [24] L.-M. Liu, Y. M. Babad, W. Sun, and K.-K. Chan, "Adaptive post-processing of OCR text via knowledge acquisition," in Proceedings of the 19th annual conference on Computer Science, 1991, pp [25] X. Tong, C. Zhai, N. Milic-Frayling, and D. A. Evans, "OCR Correction and Query Expansion for Retrieval on OCR Data -- CLARIT TREC-5 Confusion Track Report," in TREC, [26] M. Choudhury, R. Saraf, V. Jain, A. Mukherjee, S. Sarkar, and A. Basu, "Investigation and modeling of the structure of texting language," International Journal of Document Analysis and Recognition (IJDAR), vol. 10, pp , [27] M. Labidi, M. Khemakhem, and M. Jemni, "Grid 5000 Based Large Scale OCR Using the DTW Algorithm: Case of the Arabic Cursive Writing," Recent Advances in Document Recognition and Understanding, p. 73, [28] Microsoft Corporation. (2014, January 08). Multithreading in Visual Basic. Available: 21

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition Hetal R. Thaker Atmiya Institute of Technology & science, Kalawad Road, Rajkot Gujarat, India C. K. Kumbharana,

More information

Digitization Errors In Hungarian Documents

Digitization Errors In Hungarian Documents Digitization Errors In Hungarian Documents Máté Pataki 1 Tamás Füzessy 2 1 Department of Distributed Systems Computer and Automation Research Institute of the Hungarian Academy of Sciences 2 FreeSoft Nyrt.

More information

A Review of Optical Character Recognition System for Recognition of Printed Text

A Review of Optical Character Recognition System for Recognition of Printed Text IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 3, Ver. II (May Jun. 2015), PP 28-33 www.iosrjournals.org A Review of Optical Character Recognition

More information

A Fast Segmentation Algorithm for Bi-Level Image Compression using JBIG2

A Fast Segmentation Algorithm for Bi-Level Image Compression using JBIG2 A Fast Segmentation Algorithm for Bi-Level Image Compression using JBIG2 Dave A. D. Tompkins and Faouzi Kossentini Signal Processing and Multimedia Group Department of Electrical and Computer Engineering

More information

Study and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for feature extraction

Study and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for feature extraction International Journal of Scientific and Research Publications, Volume 4, Issue 7, July 2014 1 Study and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for

More information

Real Time Word to Picture Translation for Chinese Restaurant Menus

Real Time Word to Picture Translation for Chinese Restaurant Menus Real Time Word to Picture Translation for Chinese Restaurant Menus Michelle Jin, Ling Xiao Wang, Boyang Zhang Email: mzjin12, lx2wang, boyangz @stanford.edu EE268 Project Report, Spring 2014 Abstract--We

More information

Compression Method for Handwritten Document Images in Devnagri Script

Compression Method for Handwritten Document Images in Devnagri Script Compression Method for Handwritten Document Images in Devnagri Script Smita V. Khangar, Dr. Latesh G. Malik Department of Computer Science and Engineering, Nagpur University G.H. Raisoni College of Engineering,

More information

R. K. Sharma School of Mathematics and Computer Applications Thapar University Patiala, Punjab, India

R. K. Sharma School of Mathematics and Computer Applications Thapar University Patiala, Punjab, India Segmentation of Touching Characters in Upper Zone in Printed Gurmukhi Script M. K. Jindal Department of Computer Science and Applications Panjab University Regional Centre Muktsar, Punjab, India +919814637188,

More information

Malaysian Car Number Plate Detection System Based on Template Matching and Colour Information

Malaysian Car Number Plate Detection System Based on Template Matching and Colour Information Malaysian Car Number Plate Detection System Based on Template Matching and Colour Information Mohd Firdaus Zakaria, Shahrel A. Suandi Intelligent Biometric Group, School of Electrical and Electronics Engineering,

More information

Lane Detection in Automotive

Lane Detection in Automotive Lane Detection in Automotive Contents Introduction... 2 Image Processing... 2 Reading an image... 3 RGB to Gray... 3 Mean and Gaussian filtering... 5 Defining our Region of Interest... 6 BirdsEyeView Transformation...

More information

Implementation of License Plate Recognition System in ARM Cortex A8 Board

Implementation of License Plate Recognition System in ARM Cortex A8 Board www..org 9 Implementation of License Plate Recognition System in ARM Cortex A8 Board S. Uma 1, M.Sharmila 2 1 Assistant Professor, 2 Research Scholar, Department of Electrical and Electronics Engg, College

More information

Automatic Electricity Meter Reading Based on Image Processing

Automatic Electricity Meter Reading Based on Image Processing Automatic Electricity Meter Reading Based on Image Processing Lamiaa A. Elrefaei *,+,1, Asrar Bajaber *,2, Sumayyah Natheir *,3, Nada AbuSanab *,4, Marwa Bazi *,5 * Computer Science Department Faculty

More information

Locally baseline detection for online Arabic script based languages character recognition

Locally baseline detection for online Arabic script based languages character recognition International Journal of the Physical Sciences Vol. 5(7), pp. 955-959, July 2010 Available online at http://www.academicjournals.org/ijps ISSN 1992-1950 2010 Academic Journals Full Length Research Paper

More information

An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi

An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi Department of E&TC Engineering,PVPIT,Bavdhan,Pune ABSTRACT: In the last decades vehicle license plate recognition systems

More information

Iraqi Car License Plate Recognition Using OCR

Iraqi Car License Plate Recognition Using OCR Iraqi Car License Plate Recognition Using OCR Safaa S. Omran Computer Engineering Techniques College of Electrical and Electronic Techniques Baghdad, Iraq omran_safaa@ymail.com Jumana A. Jarallah Computer

More information

A NOVEL APPROACH FOR CHARACTER RECOGNITION OF VEHICLE NUMBER PLATES USING CLASSIFICATION

A NOVEL APPROACH FOR CHARACTER RECOGNITION OF VEHICLE NUMBER PLATES USING CLASSIFICATION A NOVEL APPROACH FOR CHARACTER RECOGNITION OF VEHICLE NUMBER PLATES USING CLASSIFICATION Nora Naik Assistant Professor, Dept. of Computer Engineering, Agnel Institute of Technology & Design, Goa, India

More information

Extraction and Recognition of Text From Digital English Comic Image Using Median Filter

Extraction and Recognition of Text From Digital English Comic Image Using Median Filter Extraction and Recognition of Text From Digital English Comic Image Using Median Filter S.Ranjini 1 Research Scholar,Department of Information technology Bharathiar University Coimbatore,India ranjinisengottaiyan@gmail.com

More information

Keyword: Morphological operation, template matching, license plate localization, character recognition.

Keyword: Morphological operation, template matching, license plate localization, character recognition. Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Automatic

More information

An Analysis of Image Denoising and Restoration of Handwritten Degraded Document Images

An Analysis of Image Denoising and Restoration of Handwritten Degraded Document Images Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 12, December 2014,

More information

A New Character Segmentation Approach for Off-Line Cursive Handwritten Words

A New Character Segmentation Approach for Off-Line Cursive Handwritten Words Available online at www.sciencedirect.com Procedia Computer Science 17 (2013 ) 88 95 Information Technology and Quantitative Management (ITQM2013) A New Character Segmentation Approach for Off-Line Cursive

More information

Libyan Licenses Plate Recognition Using Template Matching Method

Libyan Licenses Plate Recognition Using Template Matching Method Journal of Computer and Communications, 2016, 4, 62-71 Published Online May 2016 in SciRes. http://www.scirp.org/journal/jcc http://dx.doi.org/10.4236/jcc.2016.47009 Libyan Licenses Plate Recognition Using

More information

Text Extraction from Images

Text Extraction from Images Text Extraction from Images Paraag Agrawal #1, Rohit Varma *2 # Information Technology, University of Pune, India 1 paraagagrawal@hotmail.com * Information Technology, University of Pune, India 2 catchrohitvarma@gmail.com

More information

AN EFFICIENT APPROACH FOR VISION INSPECTION OF IC CHIPS LIEW KOK WAH

AN EFFICIENT APPROACH FOR VISION INSPECTION OF IC CHIPS LIEW KOK WAH AN EFFICIENT APPROACH FOR VISION INSPECTION OF IC CHIPS LIEW KOK WAH Report submitted in partial fulfillment of the requirements for the award of the degree of Bachelor of Computer Systems & Software Engineering

More information

Background. Computer Vision & Digital Image Processing. Improved Bartlane transmitted image. Example Bartlane transmitted image

Background. Computer Vision & Digital Image Processing. Improved Bartlane transmitted image. Example Bartlane transmitted image Background Computer Vision & Digital Image Processing Introduction to Digital Image Processing Interest comes from two primary backgrounds Improvement of pictorial information for human perception How

More information

Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval

Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval Sheraz Ahmed, Koichi Kise, Masakazu Iwamura, Marcus Liwicki, and Andreas Dengel German Research Center for

More information

Method for Real Time Text Extraction of Digital Manga Comic

Method for Real Time Text Extraction of Digital Manga Comic Method for Real Time Text Extraction of Digital Manga Comic Kohei Arai Information Science Department Saga University Saga, 840-0027, Japan Herman Tolle Software Engineering Department Brawijaya University

More information

VLSI Implementation of Impulse Noise Suppression in Images

VLSI Implementation of Impulse Noise Suppression in Images VLSI Implementation of Impulse Noise Suppression in Images T. Satyanarayana 1, A. Ravi Chandra 2 1 PG Student, VRS & YRN College of Engg. & Tech.(affiliated to JNTUK), Chirala 2 Assistant Professor, Department

More information

Proposed Method for Off-line Signature Recognition and Verification using Neural Network

Proposed Method for Off-line Signature Recognition and Verification using Neural Network e-issn: 2349-9745 p-issn: 2393-8161 Scientific Journal Impact Factor (SJIF): 1.711 International Journal of Modern Trends in Engineering and Research www.ijmter.com Proposed Method for Off-line Signature

More information

Vehicle Number Plate Recognition with Bilinear Interpolation and Plotting Horizontal and Vertical Edge Processing Histogram with Sound Signals

Vehicle Number Plate Recognition with Bilinear Interpolation and Plotting Horizontal and Vertical Edge Processing Histogram with Sound Signals Vehicle Number Plate Recognition with Bilinear Interpolation and Plotting Horizontal and Vertical Edge Processing Histogram with Sound Signals Aarti 1, Dr. Neetu Sharma 2 1 DEPArtment Of Computer Science

More information

Recognition System for Pakistani Paper Currency

Recognition System for Pakistani Paper Currency World Applied Sciences Journal 28 (12): 2069-2075, 2013 ISSN 1818-4952 IDOSI Publications, 2013 DOI: 10.5829/idosi.wasj.2013.28.12.300 Recognition System for Pakistani Paper Currency 1 2 Ahmed Ali and

More information

Image Deblurring and Noise Reduction in Python TJHSST Senior Research Project Computer Systems Lab

Image Deblurring and Noise Reduction in Python TJHSST Senior Research Project Computer Systems Lab Image Deblurring and Noise Reduction in Python TJHSST Senior Research Project Computer Systems Lab 2009-2010 Vincent DeVito June 16, 2010 Abstract In the world of photography and machine vision, blurry

More information

Book Scanning Technologies and Techniques. Mike Mansfield Director of Content Engineering Ancestry.com / Genealogy.com

Book Scanning Technologies and Techniques. Mike Mansfield Director of Content Engineering Ancestry.com / Genealogy.com Book Scanning Technologies and Techniques Mike Mansfield Director of Content Engineering Ancestry.com / Genealogy.com Outline Project Analysis Scanning Parameters Book Scanners Project Analysis Overview

More information

Image Rendering for Digital Fax

Image Rendering for Digital Fax Rendering for Digital Fax Guotong Feng a, Michael G. Fuchs b and Charles A. Bouman a a Purdue University, West Lafayette, IN b Hewlett-Packard Company, Boise, ID ABSTRACT Conventional halftoning methods

More information

Contents 1 Introduction Optical Character Recognition Systems Soft Computing Techniques for Optical Character Recognition Systems

Contents 1 Introduction Optical Character Recognition Systems Soft Computing Techniques for Optical Character Recognition Systems Contents 1 Introduction.... 1 1.1 Organization of the Monograph.... 1 1.2 Notation.... 3 1.3 State of Art.... 4 1.4 Research Issues and Challenges.... 5 1.5 Figures.... 5 1.6 MATLAB OCR Toolbox.... 5 References....

More information

Colored Rubber Stamp Removal from Document Images

Colored Rubber Stamp Removal from Document Images Colored Rubber Stamp Removal from Document Images Soumyadeep Dey, Jayanta Mukherjee, Shamik Sural, and Partha Bhowmick Indian Institute of Technology, Kharagpur {soumyadeepdey@sit,jay@cse,shamik@sit,pb@cse}.iitkgp.ernet.in

More information

Using Barcodes in Documents Best Practices

Using Barcodes in Documents Best Practices Using Barcodes in Documents Best Practices Barcode Basics A basic understanding of barcode recognition algorithms can help you to make implementation choices that optimize your success. Basic one-dimensional

More information

A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron

A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron Proc. National Conference on Recent Trends in Intelligent Computing (2006) 86-92 A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron

More information

MAV-ID card processing using camera images

MAV-ID card processing using camera images EE 5359 MULTIMEDIA PROCESSING SPRING 2013 PROJECT PROPOSAL MAV-ID card processing using camera images Under guidance of DR K R RAO DEPARTMENT OF ELECTRICAL ENGINEERING UNIVERSITY OF TEXAS AT ARLINGTON

More information

An Improved Bernsen Algorithm Approaches For License Plate Recognition

An Improved Bernsen Algorithm Approaches For License Plate Recognition IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) ISSN: 78-834, ISBN: 78-8735. Volume 3, Issue 4 (Sep-Oct. 01), PP 01-05 An Improved Bernsen Algorithm Approaches For License Plate Recognition

More information

AUTOMATIC IRAQI CARS NUMBER PLATES EXTRACTION

AUTOMATIC IRAQI CARS NUMBER PLATES EXTRACTION AUTOMATIC IRAQI CARS NUMBER PLATES EXTRACTION Safaa S. Omran 1 Jumana A. Jarallah 2 1 Electrical Engineering Technical College / Middle Technical University 2 Electrical Engineering Technical College /

More information

10mW CMOS Retina and Classifier for Handheld, 1000Images/s Optical Character Recognition System

10mW CMOS Retina and Classifier for Handheld, 1000Images/s Optical Character Recognition System TP 12.1 10mW CMOS Retina and Classifier for Handheld, 1000Images/s Optical Character Recognition System Peter Masa, Pascal Heim, Edo Franzi, Xavier Arreguit, Friedrich Heitger, Pierre Francois Ruedi, Pascal

More information

Sri Shakthi Institute of Engg and Technology, Coimbatore, TN, India.

Sri Shakthi Institute of Engg and Technology, Coimbatore, TN, India. Intelligent Forms Processing System Tharani B 1, Ramalakshmi. R 2, Pavithra. S 3, Reka. V. S 4, Sivaranjani. J 5 1 Assistant Professor, 2,3,4,5 UG Students, Dept. of ECE Sri Shakthi Institute of Engg and

More information

Images and Graphics. 4. Images and Graphics - Copyright Denis Hamelin - Ryerson University

Images and Graphics. 4. Images and Graphics - Copyright Denis Hamelin - Ryerson University Images and Graphics Images and Graphics Graphics and images are non-textual information that can be displayed and printed. Graphics (vector graphics) are an assemblage of lines, curves or circles with

More information

A Method of Multi-License Plate Location in Road Bayonet Image

A Method of Multi-License Plate Location in Road Bayonet Image A Method of Multi-License Plate Location in Road Bayonet Image Ying Qian The lab of Graphics and Multimedia Chongqing University of Posts and Telecommunications Chongqing, China Zhi Li The lab of Graphics

More information

Image Finder Mobile Application Based on Neural Networks

Image Finder Mobile Application Based on Neural Networks Image Finder Mobile Application Based on Neural Networks Nabil M. Hewahi Department of Computer Science, College of Information Technology, University of Bahrain, Sakheer P.O. Box 32038, Kingdom of Bahrain

More information

Memory-Efficient Algorithms for Raster Document Image Compression*

Memory-Efficient Algorithms for Raster Document Image Compression* Memory-Efficient Algorithms for Raster Document Image Compression* Maribel Figuera School of Electrical & Computer Engineering Ph.D. Final Examination June 13, 2008 Committee Members: Prof. Charles A.

More information

Recursive Text Segmentation for Color Images for Indonesian Automated Document Reader

Recursive Text Segmentation for Color Images for Indonesian Automated Document Reader Recursive Text Segmentation for Color Images for Indonesian Automated Document Reader Teresa Vania Tjahja 1, Anto Satriyo Nugroho #2, Nur Aziza Azis #, Rose Maulidiyatul Hikmah #, James Purnama Faculty

More information

A Vehicular Visual Tracking System Incorporating Global Positioning System

A Vehicular Visual Tracking System Incorporating Global Positioning System A Vehicular Visual Tracking System Incorporating Global Positioning System Hsien-Chou Liao and Yu-Shiang Wang Abstract Surveillance system is widely used in the traffic monitoring. The deployment of cameras

More information

Image Extraction using Image Mining Technique

Image Extraction using Image Mining Technique IOSR Journal of Engineering (IOSRJEN) e-issn: 2250-3021, p-issn: 2278-8719 Vol. 3, Issue 9 (September. 2013), V2 PP 36-42 Image Extraction using Image Mining Technique Prof. Samir Kumar Bandyopadhyay,

More information

Removal of Gaussian noise on the image edges using the Prewitt operator and threshold function technical

Removal of Gaussian noise on the image edges using the Prewitt operator and threshold function technical IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 15, Issue 2 (Nov. - Dec. 2013), PP 81-85 Removal of Gaussian noise on the image edges using the Prewitt operator

More information

IEEE Signal Processing Letters: SPL Distance-Reciprocal Distortion Measure for Binary Document Images

IEEE Signal Processing Letters: SPL Distance-Reciprocal Distortion Measure for Binary Document Images IEEE SIGNAL PROCESSING LETTERS, VOL. X, NO. Y, Z 2003 1 IEEE Signal Processing Letters: SPL-00466-2002 1) Paper Title Distance-Reciprocal Distortion Measure for Binary Document Images 2) Authors Haiping

More information

COMPARATIVE PERFORMANCE ANALYSIS OF HAND GESTURE RECOGNITION TECHNIQUES

COMPARATIVE PERFORMANCE ANALYSIS OF HAND GESTURE RECOGNITION TECHNIQUES International Journal of Advanced Research in Engineering and Technology (IJARET) Volume 9, Issue 3, May - June 2018, pp. 177 185, Article ID: IJARET_09_03_023 Available online at http://www.iaeme.com/ijaret/issues.asp?jtype=ijaret&vtype=9&itype=3

More information

Automatic Licenses Plate Recognition System

Automatic Licenses Plate Recognition System Automatic Licenses Plate Recognition System Garima R. Yadav Dept. of Electronics & Comm. Engineering Marathwada Institute of Technology, Aurangabad (Maharashtra), India yadavgarima08@gmail.com Prof. H.K.

More information

Bangla Optical Digits Recognition using Edge Detection Method

Bangla Optical Digits Recognition using Edge Detection Method IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735. Volume 7, Issue 3 (Sep. - Oct. 2013), PP 19-24 Bangla Optical Digits Recognition using Edge Detection

More information

A Solution for Identification of Bird s Nests on Transmission Lines with UAV Patrol. Qinghua Wang

A Solution for Identification of Bird s Nests on Transmission Lines with UAV Patrol. Qinghua Wang International Conference on Artificial Intelligence and Engineering Applications (AIEA 2016) A Solution for Identification of Bird s Nests on Transmission Lines with UAV Patrol Qinghua Wang Fuzhou Power

More information

Effective and Efficient Fingerprint Image Postprocessing

Effective and Efficient Fingerprint Image Postprocessing Effective and Efficient Fingerprint Image Postprocessing Haiping Lu, Xudong Jiang and Wei-Yun Yau Laboratories for Information Technology 21 Heng Mui Keng Terrace, Singapore 119613 Email: hplu@lit.org.sg

More information

Image optimization guide

Image optimization guide Image Optimization guide for Image Submittal Images can play a crucial role in the successful execution of a book project by enhancing the text and giving the reader insight into your story. Although your

More information

Image to Sound Conversion

Image to Sound Conversion Volume 1, Issue 6, November 2013 International Journal of Advance Research in Computer Science and Management Studies Research Paper Available online at: www.ijarcsms.com Image to Sound Conversion Jaiprakash

More information

An Enhanced Approach in Run Length Encoding Scheme (EARLE)

An Enhanced Approach in Run Length Encoding Scheme (EARLE) An Enhanced Approach in Run Length Encoding Scheme (EARLE) A. Nagarajan, Assistant Professor, Dept of Master of Computer Applications PSNA College of Engineering &Technology Dindigul. Abstract: Image compression

More information

Extraction of Newspaper Headlines from Microfilm for Automatic Indexing

Extraction of Newspaper Headlines from Microfilm for Automatic Indexing Extraction of Newspaper Headlines from Microfilm for Automatic Indexing Chew Lim Tan 1, Qing Hong Liu 2 1 School of Computing, National University of Singapore, 3 Science Drive 2, Singapore 117543 Email:

More information

RESEARCH PAPER FOR ARBITRARY ORIENTED TEAM TEXT DETECTION IN VIDEO IMAGES USING CONNECTED COMPONENT ANALYSIS

RESEARCH PAPER FOR ARBITRARY ORIENTED TEAM TEXT DETECTION IN VIDEO IMAGES USING CONNECTED COMPONENT ANALYSIS International Journal of Latest Trends in Engineering and Technology Vol.(7)Issue(4), pp.137-141 DOI: http://dx.doi.org/10.21172/1.74.018 e-issn:2278-621x RESEARCH PAPER FOR ARBITRARY ORIENTED TEAM TEXT

More information

Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts

Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts Olarik Surinta and Rapeeporn Chamchong Department of Management Information Systems and Computer Science Faculty of Informatics,

More information

International Journal of Scientific & Engineering Research, Volume 5, Issue 5, May ISSN

International Journal of Scientific & Engineering Research, Volume 5, Issue 5, May ISSN International Journal of Scientific & Engineering Research, Volume 5, Issue 5, May-2014 601 Automatic license plate recognition using Image Enhancement technique With Hidden Markov Model G. Angel, J. Rethna

More information

Abstract. Most OCR systems decompose the process into several stages:

Abstract. Most OCR systems decompose the process into several stages: Artificial Neural Network Based On Optical Character Recognition Sameeksha Barve Computer Science Department Jawaharlal Institute of Technology, Khargone (M.P) Abstract The recognition of optical characters

More information

INDIAN VEHICLE LICENSE PLATE EXTRACTION AND SEGMENTATION

INDIAN VEHICLE LICENSE PLATE EXTRACTION AND SEGMENTATION International Journal of Computer Science and Communication Vol. 2, No. 2, July-December 2011, pp. 593-599 INDIAN VEHICLE LICENSE PLATE EXTRACTION AND SEGMENTATION Chetan Sharma 1 and Amandeep Kaur 2 1

More information

Performance Evaluation of Edge Detection Techniques for Square Pixel and Hexagon Pixel images

Performance Evaluation of Edge Detection Techniques for Square Pixel and Hexagon Pixel images Performance Evaluation of Edge Detection Techniques for Square Pixel and Hexagon Pixel images Keshav Thakur 1, Er Pooja Gupta 2,Dr.Kuldip Pahwa 3, 1,M.Tech Final Year Student, Deptt. of ECE, MMU Ambala,

More information

Implementation of Text to Speech Conversion

Implementation of Text to Speech Conversion Implementation of Text to Speech Conversion Chaw Su Thu Thu 1, Theingi Zin 2 1 Department of Electronic Engineering, Mandalay Technological University, Mandalay 2 Department of Electronic Engineering,

More information

PHASE PRESERVING DENOISING AND BINARIZATION OF ANCIENT DOCUMENT IMAGE

PHASE PRESERVING DENOISING AND BINARIZATION OF ANCIENT DOCUMENT IMAGE Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 7, July 2015, pg.16

More information

Recognition of very low-resolution characters from motion images captured by a portable digital camera

Recognition of very low-resolution characters from motion images captured by a portable digital camera Recognition of very low-resolution characters from motion images captured by a portable digital camera Shinsuke Yanadume 1, Yoshito Mekada 2, Ichiro Ide 1, Hiroshi Murase 1 1 Graduate School of Information

More information

A new method to recognize Dimension Sets and its application in Architectural Drawings. I. Introduction

A new method to recognize Dimension Sets and its application in Architectural Drawings. I. Introduction A new method to recognize Dimension Sets and its application in Architectural Drawings Yalin Wang, Long Tang, Zesheng Tang P O Box 84-187, Tsinghua University Postoffice Beijing 100084, PRChina Email:

More information

APPLICATION OF COMPUTER VISION FOR DETERMINATION OF SYMMETRICAL OBJECT POSITION IN THREE DIMENSIONAL SPACE

APPLICATION OF COMPUTER VISION FOR DETERMINATION OF SYMMETRICAL OBJECT POSITION IN THREE DIMENSIONAL SPACE APPLICATION OF COMPUTER VISION FOR DETERMINATION OF SYMMETRICAL OBJECT POSITION IN THREE DIMENSIONAL SPACE Najirah Umar 1 1 Jurusan Teknik Informatika, STMIK Handayani Makassar Email : najirah_stmikh@yahoo.com

More information

CS 548: Computer Vision REVIEW: Digital Image Basics. Spring 2016 Dr. Michael J. Reale

CS 548: Computer Vision REVIEW: Digital Image Basics. Spring 2016 Dr. Michael J. Reale CS 548: Computer Vision REVIEW: Digital Image Basics Spring 2016 Dr. Michael J. Reale Human Vision System: Cones and Rods Two types of receptors in eye: Cones Brightness and color Photopic vision = bright-light

More information

THERMAL DETECTION OF WATER SATURATION SPOTS FOR LANDSLIDE PREDICTION

THERMAL DETECTION OF WATER SATURATION SPOTS FOR LANDSLIDE PREDICTION THERMAL DETECTION OF WATER SATURATION SPOTS FOR LANDSLIDE PREDICTION Aufa Zin, Kamarul Hawari and Norliana Khamisan Faculty of Electrical and Electronics Engineering, Universiti Malaysia Pahang, Pekan,

More information

in the list below are available in the Pro version of Scan2CAD

in the list below are available in the Pro version of Scan2CAD Scan2CAD features Features marked only. in the list below are available in the Pro version of Scan2CAD Scan Scan from inside Scan2CAD using TWAIN (Acquire). Use any TWAIN-compliant scanner of any size.

More information

Displacement Measurement of Burr Arch-Truss Under Dynamic Loading Based on Image Processing Technology

Displacement Measurement of Burr Arch-Truss Under Dynamic Loading Based on Image Processing Technology 6 th International Conference on Advances in Experimental Structural Engineering 11 th International Workshop on Advanced Smart Materials and Smart Structures Technology August 1-2, 2015, University of

More information

Finger print Recognization. By M R Rahul Raj K Muralidhar A Papi Reddy

Finger print Recognization. By M R Rahul Raj K Muralidhar A Papi Reddy Finger print Recognization By M R Rahul Raj K Muralidhar A Papi Reddy Introduction Finger print recognization system is under biometric application used to increase the user security. Generally the biometric

More information

FPGA based Real-time Automatic Number Plate Recognition System for Modern License Plates in Sri Lanka

FPGA based Real-time Automatic Number Plate Recognition System for Modern License Plates in Sri Lanka RESEARCH ARTICLE OPEN ACCESS FPGA based Real-time Automatic Number Plate Recognition System for Modern License Plates in Sri Lanka Swapna Premasiri 1, Lahiru Wijesinghe 1, Randika Perera 1 1. Department

More information

Fundamentals of Multimedia

Fundamentals of Multimedia Fundamentals of Multimedia Lecture 2 Graphics & Image Data Representation Mahmoud El-Gayyar elgayyar@ci.suez.edu.eg Outline Black & white imags 1 bit images 8-bit gray-level images Image histogram Dithering

More information

SCIENCE & TECHNOLOGY

SCIENCE & TECHNOLOGY Pertanika J. Sci. & Technol. 25 (S): 163-172 (2017) SCIENCE & TECHNOLOGY Journal homepage: http://www.pertanika.upm.edu.my/ Performance Comparison of Min-Max Normalisation on Frontal Face Detection Using

More information

PosterArtist Quick Guide

PosterArtist Quick Guide PosterArtist Quick Guide Create posters in four easy steps STEP STEP STEP STEP Use Auto Design to Create Posters Simply select a poster type and a design image to automatically create high-quality posters.

More information

FACE RECOGNITION USING NEURAL NETWORKS

FACE RECOGNITION USING NEURAL NETWORKS Int. J. Elec&Electr.Eng&Telecoms. 2014 Vinoda Yaragatti and Bhaskar B, 2014 Research Paper ISSN 2319 2518 www.ijeetc.com Vol. 3, No. 3, July 2014 2014 IJEETC. All Rights Reserved FACE RECOGNITION USING

More information

An Optimal Text Recognition and Translation System for Smart phones Using Genetic Programming and Cloud Ashish Emmanuel S, Dr. S.

An Optimal Text Recognition and Translation System for Smart phones Using Genetic Programming and Cloud Ashish Emmanuel S, Dr. S. An Optimal Text Recognition and Translation System for Smart phones Using Genetic Programming and Cloud Ashish Emmanuel S, Dr. S.Nithyanandam Abstract An Optimal Text Recognition and Translation System

More information

Number Plate Recognition Using Segmentation

Number Plate Recognition Using Segmentation Number Plate Recognition Using Segmentation Rupali Kate M.Tech. Electronics(VLSI) BVCOE. Pune 411043, Maharashtra, India. Dr. Chitode. J. S BVCOE. Pune 411043 Abstract Automatic Number Plate Recognition

More information

Image segmentation method and device.

Image segmentation method and device. 1 van 11 25-7-2008 0:31 SEARCH: GO TO ADVANCED SEARCH LOGIN: Login Create Free Account HOME SEARCH PATENTS CHEMICAL SEARCH DATA SERVICES HELP Title: Document Type and Number: Abstract: Image segmentation

More information

Efficient Car License Plate Detection and Recognition by Using Vertical Edge Based Method

Efficient Car License Plate Detection and Recognition by Using Vertical Edge Based Method Efficient Car License Plate Detection and Recognition by Using Vertical Edge Based Method M. Veerraju *1, S. Saidarao *2 1 Student, (M.Tech), Department of ECE, NIE, Macherla, Andrapradesh, India. E-Mail:

More information

Smart License Plate Recognition Using Optical Character Recognition Based on the Multicopter

Smart License Plate Recognition Using Optical Character Recognition Based on the Multicopter Smart License Plate Recognition Using Optical Character Recognition Based on the Multicopter Sanjaa Bold Department of Computer Hardware and Networking. University of the humanities Ulaanbaatar, Mongolia

More information

COLOR IMAGE SEGMENTATION USING K-MEANS CLASSIFICATION ON RGB HISTOGRAM SADIA BASAR, AWAIS ADNAN, NAILA HABIB KHAN, SHAHAB HAIDER

COLOR IMAGE SEGMENTATION USING K-MEANS CLASSIFICATION ON RGB HISTOGRAM SADIA BASAR, AWAIS ADNAN, NAILA HABIB KHAN, SHAHAB HAIDER COLOR IMAGE SEGMENTATION USING K-MEANS CLASSIFICATION ON RGB HISTOGRAM SADIA BASAR, AWAIS ADNAN, NAILA HABIB KHAN, SHAHAB HAIDER Department of Computer Science, Institute of Management Sciences, 1-A, Sector

More information

Resizing Images By Laurence Fenn

Resizing Images By Laurence Fenn Resizing Images By Laurence Fenn This article is an expansion of the talk I recently gave at the computer club about resizing images on your PC and getting the best results. I ve taken the basic notes

More information

Visual Interpretation of Hand Gestures as a Practical Interface Modality

Visual Interpretation of Hand Gestures as a Practical Interface Modality Visual Interpretation of Hand Gestures as a Practical Interface Modality Frederik C. M. Kjeldsen Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate

More information

A Vehicular Visual Tracking System Incorporating Global Positioning System

A Vehicular Visual Tracking System Incorporating Global Positioning System A Vehicular Visual Tracking System Incorporating Global Positioning System Hsien-Chou Liao and Yu-Shiang Wang Abstract Surveillance system is widely used in the traffic monitoring. The deployment of cameras

More information

XM: The AOI camera technology of the future

XM: The AOI camera technology of the future No. 29 05/2013 Viscom Extremely fast and with the highest inspection depth XM: The AOI camera technology of the future The demands on systems for the automatic optical inspection (AOI) of soldered electronic

More information

Integrated Digital System for Yarn Surface Quality Evaluation using Computer Vision and Artificial Intelligence

Integrated Digital System for Yarn Surface Quality Evaluation using Computer Vision and Artificial Intelligence Integrated Digital System for Yarn Surface Quality Evaluation using Computer Vision and Artificial Intelligence Sheng Yan LI, Jie FENG, Bin Gang XU, and Xiao Ming TAO Institute of Textiles and Clothing,

More information

A Vehicular Visual Tracking System Incorporating Global Positioning System

A Vehicular Visual Tracking System Incorporating Global Positioning System Vol:5, :6, 20 A Vehicular Visual Tracking System Incorporating Global Positioning System Hsien-Chou Liao and Yu-Shiang Wang International Science Index, Computer and Information Engineering Vol:5, :6,

More information

Digitizing Color. Place Value in a Decimal Number. Place Value in a Binary Number. Chapter 11: Light, Sound, Magic: Representing Multimedia Digitally

Digitizing Color. Place Value in a Decimal Number. Place Value in a Binary Number. Chapter 11: Light, Sound, Magic: Representing Multimedia Digitally Chapter 11: Light, Sound, Magic: Representing Multimedia Digitally Fluency with Information Technology Third Edition by Lawrence Snyder Digitizing Color RGB Colors: Binary Representation Giving the intensities

More information

ROBOT VISION. Dr.M.Madhavi, MED, MVSREC

ROBOT VISION. Dr.M.Madhavi, MED, MVSREC ROBOT VISION Dr.M.Madhavi, MED, MVSREC Robotic vision may be defined as the process of acquiring and extracting information from images of 3-D world. Robotic vision is primarily targeted at manipulation

More information

Practical Image and Video Processing Using MATLAB

Practical Image and Video Processing Using MATLAB Practical Image and Video Processing Using MATLAB Chapter 1 Introduction and overview What will we learn? What is image processing? What are the main applications of image processing? What is an image?

More information

Digital Photogrammetry. Presented by: Dr. Hamid Ebadi

Digital Photogrammetry. Presented by: Dr. Hamid Ebadi Digital Photogrammetry Presented by: Dr. Hamid Ebadi Background First Generation Analog Photogrammetry Analytical Photogrammetry Digital Photogrammetry Photogrammetric Generations 2000 digital photogrammetry

More information

IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP

IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP LIU Ying 1,HAN Yan-bin 2 and ZHANG Yu-lin 3 1 School of Information Science and Engineering, University of Jinan, Jinan 250022, PR China

More information

Bitmap Image Formats

Bitmap Image Formats LECTURE 5 Bitmap Image Formats CS 5513 Multimedia Systems Spring 2009 Imran Ihsan Principal Design Consultant OPUSVII www.opuseven.com Faculty of Engineering & Applied Sciences 1. Image Formats To store

More information

Touchless Fingerprint Recognization System

Touchless Fingerprint Recognization System e-issn 2455 1392 Volume 2 Issue 4, April 2016 pp. 501-505 Scientific Journal Impact Factor : 3.468 http://www.ijcter.com Touchless Fingerprint Recognization System Biju V. G 1., Anu S Nair 2, Albin Joseph

More information