Improving Optical Character Recognition Process for Low Resolution
|
|
- Hector Felix Dalton
- 5 years ago
- Views:
Transcription
1 Improving Optical Character Recognition Process for Low Resolution Images 1 Imad Qasim Habeeb, 2 Shahrul Azmi Mohd Yusof, 3 Faudziah B. Ahmad 1, First Author Iraqi Commission for Computers and Informatics, Iraq, emadkassam@yahoo.com *2,Corresponding Author Universiti Utara Malaysia, shahrulazmi@uum.edu.my 3 Universiti Utara Malaysia, fudz@uum.edu.my Abstract Optical Character Recognition (OCR) systems often generate errors for images with noise or with low scanning resolution. In this paper, a novel approach that can be used to improve and restore the quality of any clean lower resolution images for easy recognition by OCR process. The method relies on the production of four copies of the original image so that each picture undergoes different restoration processes. These four copies of the images are then passed to a single OCR engine in parallel. In addition to that, the method does not need any traditional alignment between the four resulting texts, which is time consuming and needs complex calculation. It implements a new procedure to choose the best among them and can be applied without prior training on errors. The experimental results show improvement in word error rate for low resolution images by more than 67%. 1. Introduction Keywords: OCR, Low resolution image, Alignment resulting text, Multi inputs The process of optical character recognition (OCR) extracts the text in images so that it can be modified and searched [1]. Output of OCR systems often produces errors when the images contain noise or the scanning resolution is low [2, 3]. The optimal resolution to scan images for most OCR systems is 300 dots per inch (dpi) [4]. Fig. 1 shows output of two OCR engines with scanning resolution of 300 dpi and 72 dpi respectively. It shows the effect of low resolution image to OCR systems. The two OCR engines used are Tesseract [5] which is supported by Google Inc, and Asprise [6] which is a commercial software. Figure 1. The difference in OCR output for two images having different resolution Low resolution images can be extracted from sequence of low quality video [3, 7] or when dealing with available images having low resolution like what is available in thousands of documents' images in the Internet. Increasing image resolution after the scanning process will not add more details to the image unless the original scanning resolution is high [8]. This research designed effective method that can be used to deal with any available image that has a low resolution to improve and restore its quality for easy recognition by OCR process. To determine word error rate (WER) for images having low resolution, several documents containing words were scanned twice at 72 dpi, the first with a gray level, and the second with one bit black/white. The texts were extracted from Wikipedia's website. The documents International Journal of Advancements in Computing Technology(IJACT) Volume 6, Number 3, May
2 contain normal text without any layout or images. All documents' images were passed to two OCR engines, Tesseract version 3.02 and Asprise version 4.0 respectively. The output of both engines gave an average WER of greater than 64% which is high. The resulting WER for both engines is shown in Table 1. Due to the high rate of errors, a new method aimed at improving the image quality so as to reduce WER is proposed. Table 1. Word error rate for two types of images by two engines Engine Tesseract version 3.02 Asprise version 4.0 WER for Gray images WER for Binary images Average WER 65% 79% 72% 57% 71% 64% Resolution of the image represents its quality, i.e. the higher the resolution the higher the quality. The image becomes clearer, sharper, and more detailed when the resolution is high. On the other hand, its file size becomes larger and the number of pixels increase [9]. For example, in case of scanning a single page of text as image with resolution 75 dpi and store it in a bmp format, the size of file is nearly 353KB. And when the same page is scanned at 300 dpi, the size of file is almost 5240 KB; the rate of the increase in file size and number of pixels is practically 15 times; this means more processing time for OCR systems. For this reason, the proposed method clearly will decrease file size in the hard disk, and will increase speed of the OCR systems, though the scope of this research is limited to reducing the OCR errors for low resolution images. Furthermore, these images are so blurry because the pixels are not enough to represent all the details of the characters, this causes several letters seems to be touching each causing difficult to differentiate the outlines of these characters. Such poor-quality images can cause OCR segmentation and features extraction processes to become more complex. For examples, many characters' blocks in the segmentation process may contain noise, sometimes part of character in one block and the rest in another block; in other cases, a single block contains two or more touching characters [3]. Using several values for the threshold when converting grayscale image to a binary image can lead to different results from a single OCR engine [10, 11]. The proposed method will improve this situation by not classifying the values of images between 0 and 255, but to group them into three classes. After that, various operations are performed such as characters cleaning and restoration based on these classes to produce four images. These images are sent to same OCR engine to produce four outputs, where the best is chosen among them. The details of these operations are given in section 3. Using more than one input will lead to the problem of the alignment of resulting texts of OCR engines. This problem needs a long time and complex calculation, especially for number of the characters greater than 2500 [12, 13]. The proposed method used multiple inputs with alignment between the words only, while the previous works on related methods require alignment line by line or alignment between completes resulting texts before words or characters were aligned. Furthermore, the alignment of the words is easier and need simple alignment methods because the number of the characters in any words at most does not exceed 20 [13] as described in section 3. The resulting errors from OCR process can be classified into two types: non-word errors and real word errors. The first means: the words that do not exist in the lexicon, such as the word "foed". The second means: the words that exist in the lexicon, but unsuitable for the sentence, such as the word "too" in sentence "I want too eat" [14, 15]. This method in addition to improving the accuracy of the image before entering the OCR engine can perform multiple-pass decoding in detecting and correcting these errors without prior training. The contributions of this research are: (1) a new method that improves WER of OCR systems when the inputs are free of noisy images having low resolution, and (2) A new alignment method that uses multi-inputs with alignment between the words only. The paper is organized in 14
3 five sections: section 1 presented the introduction; section 2 discusses related work on OCR error correction; section 3 explains the proposed method and its implementation. In section 4, interface and data collected are defined, also experimental results and evaluation. The last section includes conclusions and future work of our research. 2. Related work of OCR Error Correction The proposed method involves three themes: multi-inputs, low-resolution images, and OCR post-processing error correction. A lot of researches relies on multi-inputs for OCR engines to improve accuracy of OCR. For examples, Lopresti and Zhou [16] stated that scanning a page multiple times, then entering images into OCR engine, and running a voting procedure to select the best among them will eliminate 20 to 50% from text errors resulting from a single OCR engine. The method does not require training; the alignment between the output texts was done line by line, with scanning resolution of 300dpi. Lund and Ringger [12] take advantage of the differences between the outputs of three OCR engines to improve accuracy. Their method enhanced the A* algorithm used in the alignments of the three OCR outputs, which resulted in reduction of WER from 22.9% to 10.3%. The alignment process used complete output texts of three OCR engines, while test images were scanned at 400 dpi. As second attempt, Lund and Ringger [17] created a decision list from in-domain training data that was used to select the best output of the three OCR engines. A* algorithm with Reverse Dijkstra admissible was used in the alignment of the complete output texts from the three engines. The method leads to a 19.5% improvement in WER compared to the best single OCR. Progressive alignment of five different OCR Engines was presented in the third attempt of Lund, Walker, and Ringger [18]. The method used maximum entropy model to select the best output from five different OCR Engines. Total output texts are used in alignment of five OCR engines; the scanning resolution was 1500 dpi for the documents' images. A 24.6% improvement in WER relative to the optimal one from the five OCR engines was attained. At fourth attempt, Lund, Kennar and Ringger [11] improved OCR accuracy by using seven values to the binarization threshold for single image, so that multiple images were passed to the same OCR engine. Progressive alignment of the outputs for total texts was used. The test images were scanned at 400 dots per inch. 2.68% of all tokens used in the test corpus were corrected by the method. The method proposed in this research compared to the prior methods, in addition to multi-inputs, will employ: (1) algorithm to clean, and restore the characters, (2) no previous training on errors, and (3) testing stage uses low scanning resolution (72 dpi) images. For the researchers that considered low-resolution images: Jacobs, Simard, Viola, and Rinker [8] presented a camera-based OCR system that can improve and recognize poor-quality documents' images. The system used machine learning approach and consists of two parts: character recognizer and word recognizer. Character recognizer is implemented using neural network which is used to predict the character at specific location in the image, while the word recognizer is used to find the word inside a given box in the image. The system trained on a large amount of data with recognition accuracy of between 80-95% on images captured with size of and font size of 10-point. This accuracy is achieved when the system uses a language model. On the other hand, the processing time was slow taking about 2 minutes and 40 seconds to produce the output for medium-sized paragraph. Ma and Agam [3] presented a super resolution framework that was based on machine learning for low resolution images using K- means algorithm. The goal of the framework was: reconstructing an excellent resolution image from a low-quality image to enhance accuracy of OCR. The results of method showed 50% error rate reduction. The testing images were re-sampled from high-resolution images of to lower resolution ratios of (1:2, 1:3, and 1:4 sampling rates) so that it can be used in the experiments. However, the proposed method in this study will improve WER compared to the previous methods. The last theme in the related work is OCR post-processing error correction, which means correcting the errors after OCR engine generates the text. It can be divided into several categories: (1) proofreading-based correction, (2) lexicon-based correction, and (3) contextbased correction [19]. Proofreading-based correction requires humans to read and rewrite text produced from the OCR process. This is inefficient as it is time-consuming, especially when the 15
4 number of words is in the thousands. While the lexicon-based correction is used to identify the non-words errors; the error happens when the word resulted from the OCR does not exist in the lexicon [14]. Lastly, the context-based takes into account the words surrounding the wrong word. It is more complex than the previous techniques and can detect real word errors [19]. Examples of OCR post-processing error correction methods include: Naseem and Hussain [20] used the similarity in shape among characters in words. Guyon and Pereira [21] proposed grammar rules. Lapata and Keller [22] suggested word count, i.e., the frequency of the word in the web or corpus is used in selecting the right word. Mays, Damerau, and Mercer [23] used two techniques in their method: first was a dictionary to identify and process non-word errors, and the second was a language model to identify and correct the grammar errors. Liu, Babad, Sun, and Chan [24] presented a matrix of sequence and count of characters for all words resulted from OCR, where the incorrect word was replaced with any word that has the highest count in the matrix. Tong, Zhai, Milic, and Evans [25] proposed confusion sets that used common errors in words. Bassil and Alwani [19] used Google's spelling and suggestions. Choudhury, et al. [26] suggested probability based language model. The proposed method in this research will not use complex method, but will use an only lexicon as an integral part in the correction process. 3. The Proposed method in this research The research involves six major steps: (S1) extract words' images from document image and store in an array, (S2) pass each word image in the array sequentially to four processes, (S3) perform cleaning, restoration, and resample on each word image in any of the four processes based on different conditions, (S4) each OCR engine will receive words images from one process in sequence, (S5) apply a procedure to select the best word resulting from the four OCR engines, and (S6) compile all words in one output text. Figure 3 shows the proposed method. Figure 2. The proposed method framework In S1, the document image is checked for gray scale. If the image is colored, then the document image is converted to gray-scale before further processing. This method does not accept binary images. This method does not accept binary images. The document image is searched through to locate words and denote the words as blocks. Thus, each block will contain a word image. At the end, words of the document image are extracted and stored as an array of word images. The first reason for extracting only words' images is if the document image is passed to multiple OCR directly, problem of the alignment of output texts will occur. More so, it is computationally complex and take long time for lengthy sequences [12, 13]. The proposed method does not need this type of alignment as described more in the decision and collection stage. The second reason is most OCR systems begin with identifying words in the image and then split words into letters' blocks before features' extraction [27], thus it can reduce 16
5 processing time by selecting words in the image once, rather than repeat it in each OCR engine. It is to be noted that the words are easy to be identified because of the presence of spaces between them. The spaces between the letters are very small and sometimes attached to each other, especially when the scanning resolution of image is low [3]. An example of spaces between two words or two characters is shown in Fig. 1. Next, in S1, each pixel value in the word image is compared with a threshold value '220'. If the pixel value is greater than 220, then the value is changed to 255. The aim is to remove some weak pixels. Furthermore, search each word image from left to right to find any vertical line having a height start from top of word image to bottom of it, and having a width as one pixel; this line must not contain any pixel value less than 160, if found it, then all values of the pixels in this line become equal to 255. This is to facilitate the splitting of word image into letters (Fig. 4). The threshold values '220' and '160' were chosen based on experiments that were conducted to specifically identify threshold values so that the process does not lead to neglect or distort other important information. The output of S1 is an array of words images. Figure 3. Word image before and after preprocessing & extraction stage Next in S2, each word image is passed in sequence to four processes (S3). Here, in S3, the proposed method classifies and converts the pixel values of a word image into three classes ranging between 0 and 255. This is followed by performing restoration to the characters. The classification and conversion of the values change four times by using a variable threshold. This means value of threshold named x changes during each process. It takes the value (130, 150, 170, and 190) for processes (1, 2, 3, and 4) respectively (Fig. 5). Steps S3 and S4 are performed in a multithreads manner, (in parallel) to reduce processing time [28]. Furthermore, OCR engines in this method are not different but are multiple copies of the Tesseract engine version In Figure 4. Processes conditions 17
6 process 1, several operations are performed for each word image. Firstly, each pixel value smaller than x becomes zero; the reason is: to confirm the strong pixels. On the other hand, any value of pixel greater than x remains the same, the reason is: the restoration process will be performed on them. The second operation in process 1 identifying all pixels having values between (x+1) and (x+20), located beside pixels having values equal to zero as shown in fig. 5. These will become primary starting pixels for the process of restoration. Figure 5. Example of position of starting pixels In the restoration stage, each primary starting pixel has cycle of operations: (1) value of starting pixel is changed to zero, and (2) all the neighboring pixels from all sides with values not equal to zero are arranged in ascending order so that the pixel having the smallest value becomes a secondary starting pixel on condition that its value is between (x+1) and (x+20). If these conditions are met, then all previous operations (1 and 2) are performed for the new starting pixel and so on, otherwise the cycle is ended for current starting pixel, and another cycle for next primary starting pixel is initiated. The last operation in process1 is increasing the resolution of each word image to 300 dpi because most of OCR engines are optimized at this resolution. All the operations are the same in processes 2, 3 and 4, except that the value of the variable x becomes 150, 170 and 190 respectively. Previous values of x were selected based on the results from a series of experiments that were conducted to choose the best values between 0 and 255. As previously mentioned, the processes 1, 2, 3 and 4 in addition to the OCR engines are implemented in parallel to reduce the processing time. For this, any word image will be sent to the four processes to produce four words images (S5), each one passing through one OCR engine to turn into a word, so that the final results are four words sent to S6. Fig. 6 show the steps in decision & collection stage. Figure 6. The steps in decision & collection stage The first step in decision & collection stage selects only unique words from the four words resulting from OCR engines. Next, checking each word if it belongs to the lexicon. If there is just one, it is 18
7 marked as correct, otherwise, if more than one belongs to the lexicon, the valid word that contains most frequently among non shared letters will be selected. The third is creating an array named "a_shared" containing the letters that exist in the four words. Array "a_shared" is created using characters-based bigram model. For example, if the four words resulted from OCR engines are "imoge", "imagc", "imdgc", and "imogc", then array "a_shared" would contain ("im", "m?", "?g", "g?"). The fourth step suggests a list of words not exceed five words from the lexicon for each unique word. The last step will choose any word from suggestions' list having the largest number of letters in the array "a_shared". If more than one satisfies the previous conditions, only word that contains most frequently among non shared letters will be selected. The resulting words will be used to build the output text by putting space between them. 4. Results and evaluation To evaluate the proposed method, a prototype is developed using VB.NET. It uses Tesseract version 3.02 as OCR engine to convert images into text. Tesseract engine is a software library supported by Google Inc [5]. The prototype will pass documents images to OCR engine before and after using the proposed method and display the results. Experiments will test several English documents containing words; the texts in these documents are from Wikipedia's website and acts as reference or standard text. The texts do not contain any layout or images having five font sizes and five font types. Sclite toolkit supported by the National Institute of Standards and Technology was used to compute WER, by comparing a reference text with OCR output text [1, 29]. To generate two types of test images from the reference, the text is first printed on papers. Then the hardcopy are applied to two types of experiments: (1) the documents are scanned at 72 dpi with a gray level in a normal scanner to produce 487x662 images, and (2) the same documents are snapped by a 1024x768 cameras. All images result from this camera are resample to produce several low resolution 500x680 images, the default resolution of the camera is 75 dpi. The two groups of images resulted from the scanner, and the camera are tested. Results showed that the output of Tesseract engine without using the proposed method was very poor having an average WER of more than 69% and 78% respectively for both scanner images and camera images. Table 2 shows the comparison results. Source of Images Table 2. Results of the proposal method testing Average WER (Using Tesseract engine only) Average WER (Using Tesseract engine and the proposed method) Scanner 69.35% 2.21% Camera 78.82% 4.63% From Table 2, it can be observed that the average WER for scanner images is 69.35% when using only Tesseract engine and 2.21 % when using Tesseract engine and the proposed method. Average WER for camera images is 78.82% when using only Tesseract engine and is 4.63% when using both Tesseract engine and the proposed method. The difference in accuracy between scanner images and camera images is due to variations in lighting between camera and scanner, and properties of each device. The results in Table 2 also show that the proposed method improves the average WER. The alignment of multiple outputs of the proposed method was compared with two methods named "1" [18] and "2" [4]. Factors of comparison are: type of alignment and probability of the error in the alignment as shown in Table 3. First row shows that the alignment used in the proposed method requires less complex calculations because it was dealing with only four words each time, in contrast to the other two methods that deal with long sequences of strings. The second row reveals there is a possibility of error in the alignment for the methods 1 and 2 19
8 [4, 18], while it is zero for proposed method. All the experiments in this research confirmed that proposed method is better than the other two methods. Table 3. Comparison of the proposed method with two related methods Type of alignment and probability of the error in the alignment Method 1 Method 2 The Proposed method Alignment type sequence by sequence page by page word by word Probability of the error in the alignment Cannot guarantee 100% error free in alignment [18] Cannot guarantee 100% error free in alignment [4] Guaranteed 100% error free in alignment 5. Conclusion and Future Work OCR systems accuracy is very high in ideal conditions, but the error rate increases when the images contain noise, the scanning resolution is low, and a cursive written typed languages. This paper presents a new method which can restore characters' quality in weak resolution images before passing them to OCR engines. The method excludes traditional alignment among resulting texts used by related methods; and also no training is needed on errors before executing it. Furthermore, it performs a procedure to select the best among the output texts and correct wrong words if they occurred in the output texts. In addition to that, this method can be used for any language with simple modification. The experiment results show that this method will reduce WER of output text considerably. Further research can be done for more improvement in WER for systems of OCR for the cases of: the noisy images, low resolution images, or cursive writing languages. In addition to that, there is a need to minimize data set size on hard disk of N-grams based-language models, which perform well in correcting the errors that resulted after OCR process. This is done without affecting the speed of data access for its effective inclusion in desktop OCR applications. 6. References [1] S. Impedovo, L. Ottaviano, and S. Occhinegro, "Optical character recognition a survey," International Journal of Pattern Recognition and Artificial Intelligence, vol. 5, pp. 1-24, [2] B. Alex, C. Grover, E. Klein, and R. Tobin, "Digitised Historical Text: Does it have to be mediocre?," [3] D. Ma and G. Agam, "A super resolution framework for low resolution document image OCR," in IS&T/SPIE Electronic Imaging, 2013, pp P-86580P-9. [4] M. Volk, L. Furrer, and R. Sennrich, "Strategies for Reducing and Correcting OCR Errors," in Language Technology for Cultural Heritage, ed: Springer, 2011, pp [5] Google Inc. (2014, January 02). Tesseract-ocr v3.02. Available: [6] LAB Asprise. (2014, January 05). Asprise OCR SDK library v4.0. Available: [7] D. Ma and G. Agam, "Lecture video segmentation and indexing," in IS&T/SPIE Electronic Imaging, 2012, pp V-82970V-8. [8] C. Jacobs, P. Y. Simard, P. Viola, and J. Rinker, "Text recognition of low-resolution document images," in Document Analysis and Recognition, Proceedings. Eighth International Conference on, 2005, pp [9] B. J. Dawson, "Method and apparatus for dynamically selecting an image compression process based on image size and color resolution," ed: Google Patents, [10] M. R. Gupta, N. P. Jacobson, and E. K. Garcia, "OCR binarization and image pre-processing for searching historical documents," Pattern Recognition, vol. 40, pp ,
9 [11] W. B. Lund, D. J. Kennard, and E. K. Ringger, "Why multiple document image binarizations improve OCR," presented at the Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing, Washington, District of Columbia, [12] W. B. Lund and E. K. Ringger, "Improving optical character recognition through efficient multiple system alignment," in Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries, 2009, pp [13] I. Elias, "Settling the intractability of multiple alignment," Journal of Computational Biology, vol. 13, pp , [14] J. F. Daðason, "Post-Correction of Icelandic OCR Text," (Master's thesis, University of Iceland, Reykjavik, Iceland), [15] X. Sun, J. Gao, D. Micol, and C. Quirk, "Learning phrase-based spelling error models from clickthrough data," in Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, 2010, pp [16] D. Lopresti and J. Zhou, "Using consensus sequence voting to correct OCR errors," Computer Vision and Image Understanding, vol. 67, pp , [17] W. B. Lund and E. K. Ringger, "Error Correction with In-Domain Training Across Multiple OCR System Outputs," in Document Analysis and Recognition (ICDAR), 2011 International Conference on, 2011, pp [18] W. B. Lund, D. D. Walker, and E. K. Ringger, "Progressive alignment and discriminative error correction for multiple OCR engines," in Document Analysis and Recognition (ICDAR), 2011 International Conference on, 2011, pp [19] Y. Bassil and M. Alwani, "Ocr post-processing error correction algorithm using google online spelling suggestion," arxiv preprint arxiv: , [20] T. Naseem and S. Hussain, "A novel approach for ranking spelling error corrections for Urdu," Language Resources and Evaluation, vol. 41, pp , [21] I. Guyon and F. Pereira, "Design of a linguistic postprocessor using variable memory length Markov models," in Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on, 1995, pp [22] M. Lapata and F. Keller, "The Web as a baseline: Evaluating the performance of unsupervised Web-based models for a range of NLP tasks," in Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004, pp [23] E. Mays, F. J. Damerau, and R. L. Mercer, "Context based spelling correction," Information Processing & Management, vol. 27, pp , [24] L.-M. Liu, Y. M. Babad, W. Sun, and K.-K. Chan, "Adaptive post-processing of OCR text via knowledge acquisition," in Proceedings of the 19th annual conference on Computer Science, 1991, pp [25] X. Tong, C. Zhai, N. Milic-Frayling, and D. A. Evans, "OCR Correction and Query Expansion for Retrieval on OCR Data -- CLARIT TREC-5 Confusion Track Report," in TREC, [26] M. Choudhury, R. Saraf, V. Jain, A. Mukherjee, S. Sarkar, and A. Basu, "Investigation and modeling of the structure of texting language," International Journal of Document Analysis and Recognition (IJDAR), vol. 10, pp , [27] M. Labidi, M. Khemakhem, and M. Jemni, "Grid 5000 Based Large Scale OCR Using the DTW Algorithm: Case of the Arabic Cursive Writing," Recent Advances in Document Recognition and Understanding, p. 73, [28] Microsoft Corporation. (2014, January 08). Multithreading in Visual Basic. Available: 21
Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition
Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition Hetal R. Thaker Atmiya Institute of Technology & science, Kalawad Road, Rajkot Gujarat, India C. K. Kumbharana,
More informationDigitization Errors In Hungarian Documents
Digitization Errors In Hungarian Documents Máté Pataki 1 Tamás Füzessy 2 1 Department of Distributed Systems Computer and Automation Research Institute of the Hungarian Academy of Sciences 2 FreeSoft Nyrt.
More informationA Review of Optical Character Recognition System for Recognition of Printed Text
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 3, Ver. II (May Jun. 2015), PP 28-33 www.iosrjournals.org A Review of Optical Character Recognition
More informationA Fast Segmentation Algorithm for Bi-Level Image Compression using JBIG2
A Fast Segmentation Algorithm for Bi-Level Image Compression using JBIG2 Dave A. D. Tompkins and Faouzi Kossentini Signal Processing and Multimedia Group Department of Electrical and Computer Engineering
More informationStudy and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for feature extraction
International Journal of Scientific and Research Publications, Volume 4, Issue 7, July 2014 1 Study and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for
More informationReal Time Word to Picture Translation for Chinese Restaurant Menus
Real Time Word to Picture Translation for Chinese Restaurant Menus Michelle Jin, Ling Xiao Wang, Boyang Zhang Email: mzjin12, lx2wang, boyangz @stanford.edu EE268 Project Report, Spring 2014 Abstract--We
More informationCompression Method for Handwritten Document Images in Devnagri Script
Compression Method for Handwritten Document Images in Devnagri Script Smita V. Khangar, Dr. Latesh G. Malik Department of Computer Science and Engineering, Nagpur University G.H. Raisoni College of Engineering,
More informationR. K. Sharma School of Mathematics and Computer Applications Thapar University Patiala, Punjab, India
Segmentation of Touching Characters in Upper Zone in Printed Gurmukhi Script M. K. Jindal Department of Computer Science and Applications Panjab University Regional Centre Muktsar, Punjab, India +919814637188,
More informationMalaysian Car Number Plate Detection System Based on Template Matching and Colour Information
Malaysian Car Number Plate Detection System Based on Template Matching and Colour Information Mohd Firdaus Zakaria, Shahrel A. Suandi Intelligent Biometric Group, School of Electrical and Electronics Engineering,
More informationLane Detection in Automotive
Lane Detection in Automotive Contents Introduction... 2 Image Processing... 2 Reading an image... 3 RGB to Gray... 3 Mean and Gaussian filtering... 5 Defining our Region of Interest... 6 BirdsEyeView Transformation...
More informationImplementation of License Plate Recognition System in ARM Cortex A8 Board
www..org 9 Implementation of License Plate Recognition System in ARM Cortex A8 Board S. Uma 1, M.Sharmila 2 1 Assistant Professor, 2 Research Scholar, Department of Electrical and Electronics Engg, College
More informationAutomatic Electricity Meter Reading Based on Image Processing
Automatic Electricity Meter Reading Based on Image Processing Lamiaa A. Elrefaei *,+,1, Asrar Bajaber *,2, Sumayyah Natheir *,3, Nada AbuSanab *,4, Marwa Bazi *,5 * Computer Science Department Faculty
More informationLocally baseline detection for online Arabic script based languages character recognition
International Journal of the Physical Sciences Vol. 5(7), pp. 955-959, July 2010 Available online at http://www.academicjournals.org/ijps ISSN 1992-1950 2010 Academic Journals Full Length Research Paper
More informationAn Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi
An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi Department of E&TC Engineering,PVPIT,Bavdhan,Pune ABSTRACT: In the last decades vehicle license plate recognition systems
More informationIraqi Car License Plate Recognition Using OCR
Iraqi Car License Plate Recognition Using OCR Safaa S. Omran Computer Engineering Techniques College of Electrical and Electronic Techniques Baghdad, Iraq omran_safaa@ymail.com Jumana A. Jarallah Computer
More informationA NOVEL APPROACH FOR CHARACTER RECOGNITION OF VEHICLE NUMBER PLATES USING CLASSIFICATION
A NOVEL APPROACH FOR CHARACTER RECOGNITION OF VEHICLE NUMBER PLATES USING CLASSIFICATION Nora Naik Assistant Professor, Dept. of Computer Engineering, Agnel Institute of Technology & Design, Goa, India
More informationExtraction and Recognition of Text From Digital English Comic Image Using Median Filter
Extraction and Recognition of Text From Digital English Comic Image Using Median Filter S.Ranjini 1 Research Scholar,Department of Information technology Bharathiar University Coimbatore,India ranjinisengottaiyan@gmail.com
More informationKeyword: Morphological operation, template matching, license plate localization, character recognition.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Automatic
More informationAn Analysis of Image Denoising and Restoration of Handwritten Degraded Document Images
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 12, December 2014,
More informationA New Character Segmentation Approach for Off-Line Cursive Handwritten Words
Available online at www.sciencedirect.com Procedia Computer Science 17 (2013 ) 88 95 Information Technology and Quantitative Management (ITQM2013) A New Character Segmentation Approach for Off-Line Cursive
More informationLibyan Licenses Plate Recognition Using Template Matching Method
Journal of Computer and Communications, 2016, 4, 62-71 Published Online May 2016 in SciRes. http://www.scirp.org/journal/jcc http://dx.doi.org/10.4236/jcc.2016.47009 Libyan Licenses Plate Recognition Using
More informationText Extraction from Images
Text Extraction from Images Paraag Agrawal #1, Rohit Varma *2 # Information Technology, University of Pune, India 1 paraagagrawal@hotmail.com * Information Technology, University of Pune, India 2 catchrohitvarma@gmail.com
More informationAN EFFICIENT APPROACH FOR VISION INSPECTION OF IC CHIPS LIEW KOK WAH
AN EFFICIENT APPROACH FOR VISION INSPECTION OF IC CHIPS LIEW KOK WAH Report submitted in partial fulfillment of the requirements for the award of the degree of Bachelor of Computer Systems & Software Engineering
More informationBackground. Computer Vision & Digital Image Processing. Improved Bartlane transmitted image. Example Bartlane transmitted image
Background Computer Vision & Digital Image Processing Introduction to Digital Image Processing Interest comes from two primary backgrounds Improvement of pictorial information for human perception How
More informationAutomatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval
Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval Sheraz Ahmed, Koichi Kise, Masakazu Iwamura, Marcus Liwicki, and Andreas Dengel German Research Center for
More informationMethod for Real Time Text Extraction of Digital Manga Comic
Method for Real Time Text Extraction of Digital Manga Comic Kohei Arai Information Science Department Saga University Saga, 840-0027, Japan Herman Tolle Software Engineering Department Brawijaya University
More informationVLSI Implementation of Impulse Noise Suppression in Images
VLSI Implementation of Impulse Noise Suppression in Images T. Satyanarayana 1, A. Ravi Chandra 2 1 PG Student, VRS & YRN College of Engg. & Tech.(affiliated to JNTUK), Chirala 2 Assistant Professor, Department
More informationProposed Method for Off-line Signature Recognition and Verification using Neural Network
e-issn: 2349-9745 p-issn: 2393-8161 Scientific Journal Impact Factor (SJIF): 1.711 International Journal of Modern Trends in Engineering and Research www.ijmter.com Proposed Method for Off-line Signature
More informationVehicle Number Plate Recognition with Bilinear Interpolation and Plotting Horizontal and Vertical Edge Processing Histogram with Sound Signals
Vehicle Number Plate Recognition with Bilinear Interpolation and Plotting Horizontal and Vertical Edge Processing Histogram with Sound Signals Aarti 1, Dr. Neetu Sharma 2 1 DEPArtment Of Computer Science
More informationRecognition System for Pakistani Paper Currency
World Applied Sciences Journal 28 (12): 2069-2075, 2013 ISSN 1818-4952 IDOSI Publications, 2013 DOI: 10.5829/idosi.wasj.2013.28.12.300 Recognition System for Pakistani Paper Currency 1 2 Ahmed Ali and
More informationImage Deblurring and Noise Reduction in Python TJHSST Senior Research Project Computer Systems Lab
Image Deblurring and Noise Reduction in Python TJHSST Senior Research Project Computer Systems Lab 2009-2010 Vincent DeVito June 16, 2010 Abstract In the world of photography and machine vision, blurry
More informationBook Scanning Technologies and Techniques. Mike Mansfield Director of Content Engineering Ancestry.com / Genealogy.com
Book Scanning Technologies and Techniques Mike Mansfield Director of Content Engineering Ancestry.com / Genealogy.com Outline Project Analysis Scanning Parameters Book Scanners Project Analysis Overview
More informationImage Rendering for Digital Fax
Rendering for Digital Fax Guotong Feng a, Michael G. Fuchs b and Charles A. Bouman a a Purdue University, West Lafayette, IN b Hewlett-Packard Company, Boise, ID ABSTRACT Conventional halftoning methods
More informationContents 1 Introduction Optical Character Recognition Systems Soft Computing Techniques for Optical Character Recognition Systems
Contents 1 Introduction.... 1 1.1 Organization of the Monograph.... 1 1.2 Notation.... 3 1.3 State of Art.... 4 1.4 Research Issues and Challenges.... 5 1.5 Figures.... 5 1.6 MATLAB OCR Toolbox.... 5 References....
More informationColored Rubber Stamp Removal from Document Images
Colored Rubber Stamp Removal from Document Images Soumyadeep Dey, Jayanta Mukherjee, Shamik Sural, and Partha Bhowmick Indian Institute of Technology, Kharagpur {soumyadeepdey@sit,jay@cse,shamik@sit,pb@cse}.iitkgp.ernet.in
More informationUsing Barcodes in Documents Best Practices
Using Barcodes in Documents Best Practices Barcode Basics A basic understanding of barcode recognition algorithms can help you to make implementation choices that optimize your success. Basic one-dimensional
More informationA comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron
Proc. National Conference on Recent Trends in Intelligent Computing (2006) 86-92 A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron
More informationMAV-ID card processing using camera images
EE 5359 MULTIMEDIA PROCESSING SPRING 2013 PROJECT PROPOSAL MAV-ID card processing using camera images Under guidance of DR K R RAO DEPARTMENT OF ELECTRICAL ENGINEERING UNIVERSITY OF TEXAS AT ARLINGTON
More informationAn Improved Bernsen Algorithm Approaches For License Plate Recognition
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) ISSN: 78-834, ISBN: 78-8735. Volume 3, Issue 4 (Sep-Oct. 01), PP 01-05 An Improved Bernsen Algorithm Approaches For License Plate Recognition
More informationAUTOMATIC IRAQI CARS NUMBER PLATES EXTRACTION
AUTOMATIC IRAQI CARS NUMBER PLATES EXTRACTION Safaa S. Omran 1 Jumana A. Jarallah 2 1 Electrical Engineering Technical College / Middle Technical University 2 Electrical Engineering Technical College /
More information10mW CMOS Retina and Classifier for Handheld, 1000Images/s Optical Character Recognition System
TP 12.1 10mW CMOS Retina and Classifier for Handheld, 1000Images/s Optical Character Recognition System Peter Masa, Pascal Heim, Edo Franzi, Xavier Arreguit, Friedrich Heitger, Pierre Francois Ruedi, Pascal
More informationSri Shakthi Institute of Engg and Technology, Coimbatore, TN, India.
Intelligent Forms Processing System Tharani B 1, Ramalakshmi. R 2, Pavithra. S 3, Reka. V. S 4, Sivaranjani. J 5 1 Assistant Professor, 2,3,4,5 UG Students, Dept. of ECE Sri Shakthi Institute of Engg and
More informationImages and Graphics. 4. Images and Graphics - Copyright Denis Hamelin - Ryerson University
Images and Graphics Images and Graphics Graphics and images are non-textual information that can be displayed and printed. Graphics (vector graphics) are an assemblage of lines, curves or circles with
More informationA Method of Multi-License Plate Location in Road Bayonet Image
A Method of Multi-License Plate Location in Road Bayonet Image Ying Qian The lab of Graphics and Multimedia Chongqing University of Posts and Telecommunications Chongqing, China Zhi Li The lab of Graphics
More informationImage Finder Mobile Application Based on Neural Networks
Image Finder Mobile Application Based on Neural Networks Nabil M. Hewahi Department of Computer Science, College of Information Technology, University of Bahrain, Sakheer P.O. Box 32038, Kingdom of Bahrain
More informationMemory-Efficient Algorithms for Raster Document Image Compression*
Memory-Efficient Algorithms for Raster Document Image Compression* Maribel Figuera School of Electrical & Computer Engineering Ph.D. Final Examination June 13, 2008 Committee Members: Prof. Charles A.
More informationRecursive Text Segmentation for Color Images for Indonesian Automated Document Reader
Recursive Text Segmentation for Color Images for Indonesian Automated Document Reader Teresa Vania Tjahja 1, Anto Satriyo Nugroho #2, Nur Aziza Azis #, Rose Maulidiyatul Hikmah #, James Purnama Faculty
More informationA Vehicular Visual Tracking System Incorporating Global Positioning System
A Vehicular Visual Tracking System Incorporating Global Positioning System Hsien-Chou Liao and Yu-Shiang Wang Abstract Surveillance system is widely used in the traffic monitoring. The deployment of cameras
More informationImage Extraction using Image Mining Technique
IOSR Journal of Engineering (IOSRJEN) e-issn: 2250-3021, p-issn: 2278-8719 Vol. 3, Issue 9 (September. 2013), V2 PP 36-42 Image Extraction using Image Mining Technique Prof. Samir Kumar Bandyopadhyay,
More informationRemoval of Gaussian noise on the image edges using the Prewitt operator and threshold function technical
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 15, Issue 2 (Nov. - Dec. 2013), PP 81-85 Removal of Gaussian noise on the image edges using the Prewitt operator
More informationIEEE Signal Processing Letters: SPL Distance-Reciprocal Distortion Measure for Binary Document Images
IEEE SIGNAL PROCESSING LETTERS, VOL. X, NO. Y, Z 2003 1 IEEE Signal Processing Letters: SPL-00466-2002 1) Paper Title Distance-Reciprocal Distortion Measure for Binary Document Images 2) Authors Haiping
More informationCOMPARATIVE PERFORMANCE ANALYSIS OF HAND GESTURE RECOGNITION TECHNIQUES
International Journal of Advanced Research in Engineering and Technology (IJARET) Volume 9, Issue 3, May - June 2018, pp. 177 185, Article ID: IJARET_09_03_023 Available online at http://www.iaeme.com/ijaret/issues.asp?jtype=ijaret&vtype=9&itype=3
More informationAutomatic Licenses Plate Recognition System
Automatic Licenses Plate Recognition System Garima R. Yadav Dept. of Electronics & Comm. Engineering Marathwada Institute of Technology, Aurangabad (Maharashtra), India yadavgarima08@gmail.com Prof. H.K.
More informationBangla Optical Digits Recognition using Edge Detection Method
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735. Volume 7, Issue 3 (Sep. - Oct. 2013), PP 19-24 Bangla Optical Digits Recognition using Edge Detection
More informationA Solution for Identification of Bird s Nests on Transmission Lines with UAV Patrol. Qinghua Wang
International Conference on Artificial Intelligence and Engineering Applications (AIEA 2016) A Solution for Identification of Bird s Nests on Transmission Lines with UAV Patrol Qinghua Wang Fuzhou Power
More informationEffective and Efficient Fingerprint Image Postprocessing
Effective and Efficient Fingerprint Image Postprocessing Haiping Lu, Xudong Jiang and Wei-Yun Yau Laboratories for Information Technology 21 Heng Mui Keng Terrace, Singapore 119613 Email: hplu@lit.org.sg
More informationImage optimization guide
Image Optimization guide for Image Submittal Images can play a crucial role in the successful execution of a book project by enhancing the text and giving the reader insight into your story. Although your
More informationImage to Sound Conversion
Volume 1, Issue 6, November 2013 International Journal of Advance Research in Computer Science and Management Studies Research Paper Available online at: www.ijarcsms.com Image to Sound Conversion Jaiprakash
More informationAn Enhanced Approach in Run Length Encoding Scheme (EARLE)
An Enhanced Approach in Run Length Encoding Scheme (EARLE) A. Nagarajan, Assistant Professor, Dept of Master of Computer Applications PSNA College of Engineering &Technology Dindigul. Abstract: Image compression
More informationExtraction of Newspaper Headlines from Microfilm for Automatic Indexing
Extraction of Newspaper Headlines from Microfilm for Automatic Indexing Chew Lim Tan 1, Qing Hong Liu 2 1 School of Computing, National University of Singapore, 3 Science Drive 2, Singapore 117543 Email:
More informationRESEARCH PAPER FOR ARBITRARY ORIENTED TEAM TEXT DETECTION IN VIDEO IMAGES USING CONNECTED COMPONENT ANALYSIS
International Journal of Latest Trends in Engineering and Technology Vol.(7)Issue(4), pp.137-141 DOI: http://dx.doi.org/10.21172/1.74.018 e-issn:2278-621x RESEARCH PAPER FOR ARBITRARY ORIENTED TEAM TEXT
More informationImage Segmentation of Historical Handwriting from Palm Leaf Manuscripts
Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts Olarik Surinta and Rapeeporn Chamchong Department of Management Information Systems and Computer Science Faculty of Informatics,
More informationInternational Journal of Scientific & Engineering Research, Volume 5, Issue 5, May ISSN
International Journal of Scientific & Engineering Research, Volume 5, Issue 5, May-2014 601 Automatic license plate recognition using Image Enhancement technique With Hidden Markov Model G. Angel, J. Rethna
More informationAbstract. Most OCR systems decompose the process into several stages:
Artificial Neural Network Based On Optical Character Recognition Sameeksha Barve Computer Science Department Jawaharlal Institute of Technology, Khargone (M.P) Abstract The recognition of optical characters
More informationINDIAN VEHICLE LICENSE PLATE EXTRACTION AND SEGMENTATION
International Journal of Computer Science and Communication Vol. 2, No. 2, July-December 2011, pp. 593-599 INDIAN VEHICLE LICENSE PLATE EXTRACTION AND SEGMENTATION Chetan Sharma 1 and Amandeep Kaur 2 1
More informationPerformance Evaluation of Edge Detection Techniques for Square Pixel and Hexagon Pixel images
Performance Evaluation of Edge Detection Techniques for Square Pixel and Hexagon Pixel images Keshav Thakur 1, Er Pooja Gupta 2,Dr.Kuldip Pahwa 3, 1,M.Tech Final Year Student, Deptt. of ECE, MMU Ambala,
More informationImplementation of Text to Speech Conversion
Implementation of Text to Speech Conversion Chaw Su Thu Thu 1, Theingi Zin 2 1 Department of Electronic Engineering, Mandalay Technological University, Mandalay 2 Department of Electronic Engineering,
More informationPHASE PRESERVING DENOISING AND BINARIZATION OF ANCIENT DOCUMENT IMAGE
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 7, July 2015, pg.16
More informationRecognition of very low-resolution characters from motion images captured by a portable digital camera
Recognition of very low-resolution characters from motion images captured by a portable digital camera Shinsuke Yanadume 1, Yoshito Mekada 2, Ichiro Ide 1, Hiroshi Murase 1 1 Graduate School of Information
More informationA new method to recognize Dimension Sets and its application in Architectural Drawings. I. Introduction
A new method to recognize Dimension Sets and its application in Architectural Drawings Yalin Wang, Long Tang, Zesheng Tang P O Box 84-187, Tsinghua University Postoffice Beijing 100084, PRChina Email:
More informationAPPLICATION OF COMPUTER VISION FOR DETERMINATION OF SYMMETRICAL OBJECT POSITION IN THREE DIMENSIONAL SPACE
APPLICATION OF COMPUTER VISION FOR DETERMINATION OF SYMMETRICAL OBJECT POSITION IN THREE DIMENSIONAL SPACE Najirah Umar 1 1 Jurusan Teknik Informatika, STMIK Handayani Makassar Email : najirah_stmikh@yahoo.com
More informationCS 548: Computer Vision REVIEW: Digital Image Basics. Spring 2016 Dr. Michael J. Reale
CS 548: Computer Vision REVIEW: Digital Image Basics Spring 2016 Dr. Michael J. Reale Human Vision System: Cones and Rods Two types of receptors in eye: Cones Brightness and color Photopic vision = bright-light
More informationTHERMAL DETECTION OF WATER SATURATION SPOTS FOR LANDSLIDE PREDICTION
THERMAL DETECTION OF WATER SATURATION SPOTS FOR LANDSLIDE PREDICTION Aufa Zin, Kamarul Hawari and Norliana Khamisan Faculty of Electrical and Electronics Engineering, Universiti Malaysia Pahang, Pekan,
More informationin the list below are available in the Pro version of Scan2CAD
Scan2CAD features Features marked only. in the list below are available in the Pro version of Scan2CAD Scan Scan from inside Scan2CAD using TWAIN (Acquire). Use any TWAIN-compliant scanner of any size.
More informationDisplacement Measurement of Burr Arch-Truss Under Dynamic Loading Based on Image Processing Technology
6 th International Conference on Advances in Experimental Structural Engineering 11 th International Workshop on Advanced Smart Materials and Smart Structures Technology August 1-2, 2015, University of
More informationFinger print Recognization. By M R Rahul Raj K Muralidhar A Papi Reddy
Finger print Recognization By M R Rahul Raj K Muralidhar A Papi Reddy Introduction Finger print recognization system is under biometric application used to increase the user security. Generally the biometric
More informationFPGA based Real-time Automatic Number Plate Recognition System for Modern License Plates in Sri Lanka
RESEARCH ARTICLE OPEN ACCESS FPGA based Real-time Automatic Number Plate Recognition System for Modern License Plates in Sri Lanka Swapna Premasiri 1, Lahiru Wijesinghe 1, Randika Perera 1 1. Department
More informationFundamentals of Multimedia
Fundamentals of Multimedia Lecture 2 Graphics & Image Data Representation Mahmoud El-Gayyar elgayyar@ci.suez.edu.eg Outline Black & white imags 1 bit images 8-bit gray-level images Image histogram Dithering
More informationSCIENCE & TECHNOLOGY
Pertanika J. Sci. & Technol. 25 (S): 163-172 (2017) SCIENCE & TECHNOLOGY Journal homepage: http://www.pertanika.upm.edu.my/ Performance Comparison of Min-Max Normalisation on Frontal Face Detection Using
More informationPosterArtist Quick Guide
PosterArtist Quick Guide Create posters in four easy steps STEP STEP STEP STEP Use Auto Design to Create Posters Simply select a poster type and a design image to automatically create high-quality posters.
More informationFACE RECOGNITION USING NEURAL NETWORKS
Int. J. Elec&Electr.Eng&Telecoms. 2014 Vinoda Yaragatti and Bhaskar B, 2014 Research Paper ISSN 2319 2518 www.ijeetc.com Vol. 3, No. 3, July 2014 2014 IJEETC. All Rights Reserved FACE RECOGNITION USING
More informationAn Optimal Text Recognition and Translation System for Smart phones Using Genetic Programming and Cloud Ashish Emmanuel S, Dr. S.
An Optimal Text Recognition and Translation System for Smart phones Using Genetic Programming and Cloud Ashish Emmanuel S, Dr. S.Nithyanandam Abstract An Optimal Text Recognition and Translation System
More informationNumber Plate Recognition Using Segmentation
Number Plate Recognition Using Segmentation Rupali Kate M.Tech. Electronics(VLSI) BVCOE. Pune 411043, Maharashtra, India. Dr. Chitode. J. S BVCOE. Pune 411043 Abstract Automatic Number Plate Recognition
More informationImage segmentation method and device.
1 van 11 25-7-2008 0:31 SEARCH: GO TO ADVANCED SEARCH LOGIN: Login Create Free Account HOME SEARCH PATENTS CHEMICAL SEARCH DATA SERVICES HELP Title: Document Type and Number: Abstract: Image segmentation
More informationEfficient Car License Plate Detection and Recognition by Using Vertical Edge Based Method
Efficient Car License Plate Detection and Recognition by Using Vertical Edge Based Method M. Veerraju *1, S. Saidarao *2 1 Student, (M.Tech), Department of ECE, NIE, Macherla, Andrapradesh, India. E-Mail:
More informationSmart License Plate Recognition Using Optical Character Recognition Based on the Multicopter
Smart License Plate Recognition Using Optical Character Recognition Based on the Multicopter Sanjaa Bold Department of Computer Hardware and Networking. University of the humanities Ulaanbaatar, Mongolia
More informationCOLOR IMAGE SEGMENTATION USING K-MEANS CLASSIFICATION ON RGB HISTOGRAM SADIA BASAR, AWAIS ADNAN, NAILA HABIB KHAN, SHAHAB HAIDER
COLOR IMAGE SEGMENTATION USING K-MEANS CLASSIFICATION ON RGB HISTOGRAM SADIA BASAR, AWAIS ADNAN, NAILA HABIB KHAN, SHAHAB HAIDER Department of Computer Science, Institute of Management Sciences, 1-A, Sector
More informationResizing Images By Laurence Fenn
Resizing Images By Laurence Fenn This article is an expansion of the talk I recently gave at the computer club about resizing images on your PC and getting the best results. I ve taken the basic notes
More informationVisual Interpretation of Hand Gestures as a Practical Interface Modality
Visual Interpretation of Hand Gestures as a Practical Interface Modality Frederik C. M. Kjeldsen Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate
More informationA Vehicular Visual Tracking System Incorporating Global Positioning System
A Vehicular Visual Tracking System Incorporating Global Positioning System Hsien-Chou Liao and Yu-Shiang Wang Abstract Surveillance system is widely used in the traffic monitoring. The deployment of cameras
More informationXM: The AOI camera technology of the future
No. 29 05/2013 Viscom Extremely fast and with the highest inspection depth XM: The AOI camera technology of the future The demands on systems for the automatic optical inspection (AOI) of soldered electronic
More informationIntegrated Digital System for Yarn Surface Quality Evaluation using Computer Vision and Artificial Intelligence
Integrated Digital System for Yarn Surface Quality Evaluation using Computer Vision and Artificial Intelligence Sheng Yan LI, Jie FENG, Bin Gang XU, and Xiao Ming TAO Institute of Textiles and Clothing,
More informationA Vehicular Visual Tracking System Incorporating Global Positioning System
Vol:5, :6, 20 A Vehicular Visual Tracking System Incorporating Global Positioning System Hsien-Chou Liao and Yu-Shiang Wang International Science Index, Computer and Information Engineering Vol:5, :6,
More informationDigitizing Color. Place Value in a Decimal Number. Place Value in a Binary Number. Chapter 11: Light, Sound, Magic: Representing Multimedia Digitally
Chapter 11: Light, Sound, Magic: Representing Multimedia Digitally Fluency with Information Technology Third Edition by Lawrence Snyder Digitizing Color RGB Colors: Binary Representation Giving the intensities
More informationROBOT VISION. Dr.M.Madhavi, MED, MVSREC
ROBOT VISION Dr.M.Madhavi, MED, MVSREC Robotic vision may be defined as the process of acquiring and extracting information from images of 3-D world. Robotic vision is primarily targeted at manipulation
More informationPractical Image and Video Processing Using MATLAB
Practical Image and Video Processing Using MATLAB Chapter 1 Introduction and overview What will we learn? What is image processing? What are the main applications of image processing? What is an image?
More informationDigital Photogrammetry. Presented by: Dr. Hamid Ebadi
Digital Photogrammetry Presented by: Dr. Hamid Ebadi Background First Generation Analog Photogrammetry Analytical Photogrammetry Digital Photogrammetry Photogrammetric Generations 2000 digital photogrammetry
More informationIMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP
IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP LIU Ying 1,HAN Yan-bin 2 and ZHANG Yu-lin 3 1 School of Information Science and Engineering, University of Jinan, Jinan 250022, PR China
More informationBitmap Image Formats
LECTURE 5 Bitmap Image Formats CS 5513 Multimedia Systems Spring 2009 Imran Ihsan Principal Design Consultant OPUSVII www.opuseven.com Faculty of Engineering & Applied Sciences 1. Image Formats To store
More informationTouchless Fingerprint Recognization System
e-issn 2455 1392 Volume 2 Issue 4, April 2016 pp. 501-505 Scientific Journal Impact Factor : 3.468 http://www.ijcter.com Touchless Fingerprint Recognization System Biju V. G 1., Anu S Nair 2, Albin Joseph
More information