Aks: A Database for Detection and Extraction of Devanagari Text in Camera Based Images

Size: px
Start display at page:

Download "Aks: A Database for Detection and Extraction of Devanagari Text in Camera Based Images"

Transcription

1 Aks: A Database for Detection and Extraction of Devanagari Text in Camera Based Images Ganesh K Sethi #1, Rajesh K Bawa *2 # Assistant Professor, Department of Computer Science, Multani Mal Modi College, Patiala, India * Professor, Department of Computer Science, Punjabi University, Patiala, India Abstract With the advent of digital cameras and other hand held imaging devices a new type of text containing images have emerged that are unable to handle with traditional optical character recognition (OCR) technology. These camera based images imposes a number of challenges that are absent in scanned or born-digital images. To detect and extract text from camera based images, Text Information Extraction (TIE) process is carried out that detects presence of text in an image and separates it from the background. In this paper a detailed comparison between camera based, born digital and scanned images is presented. A database of camera based images containing text particularly in Devanagari script is created. A survey of various available bench mark databases is done and keeping in view the challenges of camera based images an exhaustive dataset of images is prepared. The paper also discusses the evaluation metrics used to compute the accuracy of text detection and extraction from camera based images. Keywords Born Digital Images, Camera Images, Scanned Images, Text Information Extraction (TIE), optical character recognition (OCR). I. INTRODUCTION OCR is a field related to recognition of characters in scanned documents, PDF files or images captured by digital camera. The commercial OCRs were originated about five decades back that resulted in automatic interpretation of hard copy documents. Since then a lot of improvement has been done and the accuracy of commercial OCRs in certain languages with clear text and clean background is more than 99%. Traditional OCRs use a flatbed scanner to input image with a clean background, even illumination and minimal or no skew. With the advent of digital cameras, PDAs and mobile-cameras a new type of documents has originated. Documents are now not confined to the scanned images, but also contain images that contain text on real world objects taken from real scenes. Images captured with a digital camera like sign boards on roads, buildings, vehicle license plates, clothes contain text that need to be understood. This has emerged a sub-field of OCR for camera based digital images. A wide variety of images is taken with hand held devices like digital camera, PDAs, mobile cameras and hand held scanners. These documents differ from traditional scanned images due to complex background, uneven illumination, variable font size and style, and arbitrary layouts of text in image [1]. The traditional OCR technology that works for scanned images is not applicable here. This failure of traditional OCR system gave birth to new area of research- Camera Based Document Analysis and Recognition. Since 2003, a Robust Reading Competition for extracting and recognizing text in digital images is held with International Conference of Document analysis and Recognition (ICDAR) for finding the best method. Due to lot of challenges, the accuracy of localizing, extracting and recognizing text in camera based images is not up to the mark till date. For recognizing text in camera based images, text in these images has to be completely extracted before it is passed to an OCR. The process carried out before passing image to OCR is Text Information Extraction (TIE). Text information extraction from digital images is a task performed on digital images to extract and recognize text content in them. Natural scenes images contain large amounts of information, which are often required to be automatically recognized and processed. TIE is an active and important area of research in digital image processing because of potential applications in mobile robot navigation, vehicle number plates detection and recognition, object identification, reading foreign language text, content based indexing and searching and helping visually impaired persons to know their environment in a better way. The input to TIE system is an image that is captured with a digital camera or any other hand held device. To check accuracy of any TIE system, an exhaustive set of images is required that must include all the challenging images. In present study, we have captured the image under different conditions to create a dataset of Devanagari text images. Multi-script images, frequently occurring in real world environment, are also taken into account. Section II of paper discusses the difference between camera, scanned and born digital images. In section III various bench mark databases have been discussed. Evaluation metrics are given in section IV and proposed work is presented in section V. ISSN: Page 32

2 II. WHAT IS AN IMAGE? The data input to any OCR or TIE system is an image. The evolution of digital image may be by conversion of a paper into digital form using scanner-scanned image, capturing image using a digital camera-camera based or it may be synthetically created using software-born digital. A. Scanned Image Scanned image is formed by scanning a document using a scanner against a plain background. Scanners are used to digitize the old manuscripts, books and other documents. The quality of scanned image depends on the quality of scanner and the quality of document to be digitized. Usually, scanners are used for document images and there is no problem of uneven illumination, background noise or texture. Further scanned images mostly have black text on a white background or vice-versa. The commercial available OCRs are able to recognize text in the scanned images having text in a particular orientation by applying a series of steps like preprocessing, segmentation, feature extraction and classification. The accuracy of recognition in certain languages is more than 99%. So extracting and recognizing text from scanned images is relatively solved problem for some scripts, while the work is going on for other languages. B. Born Digital Image Born digital images are synthetically formed images using software tools. These images are mostly used in web pages, messages and on social-media (whattsapp, facebook etc.) to insert textual information. The concept of using images to entrench text originates from a variety of needs, for example in advertisements in order to attract attention, in titles and headings to beautify, in spam s to avoid text-based filtering, to hide information, for generating CAPTCHA tests. Borndigital text images resemble to other complex-text containing images but they present certain distinct characteristics. In such images, the resolution is kept inherently low (to be transmitted online and displayed on mobile-screen/monitor), and the text is normally super-imposed on the image. The automatic detection and extraction of text from borndigital images is an interesting area as it would provide a technology for a number of applications such as improved indexing, retrieval of web images and content filtering (e.g. advertisements or spam s) etc. The presence of text in born-digital images has been quantified in the past [2]-[3]. Research has shown that a considerable amount of text on web pages is presented in images (17%), while an important fraction of this text (76%) is not to be found anywhere else on the Web [2]. The work has been published in specifically focused on borndigital images [4]-[6]. Figure 2: Born-digital Images taken from internet Figure 2: Scanned Images C. Camera Based Image With the evolution of low cost consumer-end digital cameras, a new challenge has opened up for the document analysis community. Camera based images are the real scene images captured using a digital camera or any hand-held device like mobile, PDA or a tablet having digital camera. The images are captured from real scenic environments. As the resolution of devices varies, so the resolution of camera images varies greatly. Further, the camera images have lot of distortions like uneven illumination, complex backgrounds, motion blur, ISSN: Page 33

3 Criteria Background Sharpness Lens position Colours Contrast Words/ Characters arrange ment Surface Rotation Fontsize Fonttype Number of lines Camera Based Image Heterogeneous, any color possibly motion blur Variable, geometric and perspective distortions present high variation of colors depends on colors, shadows, lighting, illumination, texture May have horizontal and vertical lines, curved, wavy text freestanding (detached) or attached to objects with arbitrary non- planar surfaces arbitrary rotations Large variation in font sizes in single image Text superimpo sed on a plain or textured background machineprinted, handwritten, special effects (textured or 3D) often only single line or few words in Scanned Image Homogene ous, usually white Usually sharp Fixed, document lies on scanner's glass black text on white background good (black/dark text on white/light background) clear horizontal lines text attached to plain paper horizontally aligned text lines or rotated by 90 degrees limited number of font sizes, usually same throughout image machineprinted, handwritten usually several lines of text Born Digital Usually single color Usually sharp Artificially created Variable colours Good (dark/light text superimposed on light/dark background) Depends on target application Arbitrary rotations Variable fonts size mingled together Superimposed typed text Depends on target application image Table 1: A comparison between Camera based, Scanned and Digital image and perspective deformation. Camera based images, besides document images, contain natural scene images containing sign-boards, vehicle number plates, street names, electricity meters, and so on. The extraction of text from these camera based images have enormous applications viz. license plate recognition, helping visually impaired, understanding foreign languages, industrial automation, web search etc. For roman scripts, research on camera based scene image text detection and recognition started in mid-90s [7]-[8] and since then has seen a substantial amount of growth with a large number of approaches published in the last two decades [8]-[12]. Figure 3: Camera-Based Images from ICDAR and our dataset III. BENCH MARKS DATABASES AND ROBUST READING COMPETITIONS A number of methods for text localization, segmentation and recognition of text from camera images are available. To check the accuracy and to compare different methods, a standard or benchmark dataset is required. A number of datasets are available for the Roman, Korean, Chinese and mutiscripts that are being used for ranking different algorithms in various competitions. A. ICDAR 2003 Data Set The ICDAR 2003 data set was created for the Robust Reading competition [14]. The data set includes images of text in the real environment captured with hand held devices. This data set is used for the tasks of character recognition, word recognition, text localization and end-to-end recognition. The dataset is organized into Sample, Trial and Competition datasets that contain 20, 258 and 251 color images respectively. The number of words in corresponding datasets is 171, 1157 and 1111 with 854, 6185 and 5430 characters of Roman script. Ground truth information is specified by word bounding box locations and labels as well as character locations and labels for both the training and test sets. Figure 4 shows sample images from this data set. ISSN: Page 34

4 B. ICDAR 2011 Data Set A benchmark data set was created for the ICDAR 2011 Robust Reading competition [15] which was an extension of the ICDAR 2003 data set. The repeated words were removed and with few additions. The dataset contains 255 color images with 1189 words. There is also a training set of a similar size. Ground truth information is provided for word bounding box locations and labels, which makes this suitable for the tasks of end-to-end recognition, text localization and word recognition. Unlike the ICDAR 2003 data set, this does not contain ground truth character location and label information. Sample images from the ICDAR 2011 data set are shown in figure 4. ground truth labels are provided. In addition, there is a lexicon given for every word, which contains the ground truth label plus business names in the locality extracted using the `Search Nearby' feature in Google Maps. The lexicons consist of around 50 unique words each. The task is to locate all the words in an image that appear in its lexicon. While there may be other text in the image, the target is to detect only the lexicon words. This contrasts from the more general OCR problem Figure 5: Sample images from the SVT data set Figure 4: Sample Images taken from ICDAR 2003 and 2011 Data Set C. ICDAR 2013 Data Set The latest version of the ICDAR data set is ICDAR 2013 [16]. This is also based on the previous data sets, but ground truth errors are fixed and image duplicates are removed. It contains 233 color images with 1095 words. The training set is of a similar size. Ground truth information is provided for word bounding box locations and labels, which makes this suitable for the tasks of end-to-end recognition, text localization and word recognition. In addition, pixellevel ground truth labels are provided for the first time for text segments, enabling the task of text segmentation. D. The Street View Text Dataset As a part of Google Street View project, Wang and Belongie [17] introduced street view text (SVT) data set. For harvesting and labeling the images from Google Street View Amazon's Mechanical Turk tool was used. Several Human Intelligence Tasks (HITs) were created that were completed on Mechanical Turk for building the data set. The Street View Text (SVT) data set was specifically created for the word spotting problems in street view images. The dataset consists of 250 images containing 647 words. In each image, word bounding box locations and E. MSRA Text Detection 500 Database (MSRA- TD500) C. Yao, X. Bai, W. Liu, Y. Ma and Z. Tu [18] collected and released publicly MSRA Text Detection 500 Database (MSRA-TD500) as a benchmark to evaluate different text detection algorithms in natural images. The database consists of 500 natural images, taken from outdoor (streets) and indoor (office and mall) scenes using a pocket camera. The resolutions of the images are in range of 1296x864 to 1920x1280. The text in the image may be in different languages (English, Chinese, or mixture of both), size, fonts, colors and orientations. The dataset is further divided into two parts: testing set and training set. The training set is formed by taking 300 images randomly selected from the original dataset and the remaining 200 images are used for testing. The fundamental unit for detection in the dataset is text line instead of word. The reason is due to the fact that it is hard to partition Chinese script text lines into individual words based on their spacing. Even for English language text lines, it is difficult partition words in natural scene images. Figure 6: Sample images from MSRA-TD500data set ISSN: Page 35

5 F. NEOCR: Natural Environment OCR Dataset R. Nagy, A. Dicker and K. Meyer [19]-[20] developed Natural Environment OCR Dataset. The dataset comprise of a broad range of natural scene images with rich annotation for OCR that covers a variety of features. These features distinguish natural scene text images from scanned images. The NEOCR consists of 659 natural scene images having a total of 5238 bounding boxes containing text. Images were captured with different digital cameras set at various camera settings to include text in real world with arbitrary colors, font types and font sizes, lighting effects, variety of textures or occlusion. Images containing text were then manually selected and for image annotation, the web-based tool of [21] for the LabelMe dataset [22] was used. Annotations are provided in XML for each image separately describing global image features, bounding boxes of text and its special characteristics. Figure 7: Example images from the NEOCR dataset. G. KAIST Scene Text Database The KAIST scene text dataset consists of 3000 images captured from variable environments, including indoor and outdoor scenes under different illumination conditions (strong artificial lights, day light, night, etc). Figure 8: Sample Images taken from KAIST Scene Text Database Images were captured either by using a highresolution digital camera or a low-resolution mobile phone camera. The images were captured at different resolutions but later resized to 640x480. The KAIST scene text database is arranged into different categories according to the script of the scene text captured: Korean, English (Number), and Mixed (English + Korean+ Number). For each category images are placed into two classes-digital camera and mobile camera. Further images are classifies as outdoor, indoor, shadow effect, night etc. The scene text in the images is mainly representative of frequent text used in Korean streets or shops. IV. EVALUATION METRICS A. Text Localization task The localized text in an image is specified by a bounding box around each word of the text. In text localization task of ICDAR 2005 Lucas et. al. [14], the organizers provided bounding boxes of words for each of the images. The ground truth text localized images are provided as separate text files (one per image) in which each line specifies the coordinates of one word's bounding rectangle and its transcription in a comma separated format. For a given image, with all the word-region rectangles, let T be ground-truth set of targets and the E be set returned by the system under test. The number of boxes which are correct is denoted by c. Then precision, p is defined as the number of correct estimated rectangles divided by the total number of rectangles: c p (1) E Recall, r is defined as the number of correct estimates divided by the total number of targets. c p (2) T The above measures of accuracy are quote unrealistic as a little deviation from the exact area results in a lot of deviation in the final quantitative aspect. So Lucas et. al. [14] adopted a flexible measure using notion of area-match. An area match m a between two rectangles r 1 and r 2 is defined as twice the common area divided by the total of the areas of two rectangles. m a 2. a( r a( r ) 1 1 r 2 ) a( r ) 2 (3) where a(r) is the area of rectangle r. The value of m is one for same rectangles (100% matching) and a zero for rectangles that have no intersection (entirely different). For each rectangle in the set of targets we find the closest match in the set of estimates, and vice versa. So, the best matching for a rectangle r, m(r, R) in set of Rectangles R is defined as: m( r, R) maxm a( r, r') r' R (4) So finally precision and recall are re-defined as ISSN: Page 36

6 p r r E e r T t m( r, T) T E m( r e, T) or ( r, E) m t e m( r, E) t (5) (6) is the best match for a rectangle re or rt in a set of rectangles T or E, respectively. The two measures are clubbed into single measure, called f score, computed as: f 1 / p (1 )/ r (7) Wolf and Jolion [23] pointed several drawbacks in above methods of evaluation as they don t give exact information about the proportion of the correctly detected words and the number of false alarms. Further, without creating ambiguity in their interpretation, these cannot be accumulated across multiple images. So another evaluation procedure based on the object count was proposed. Apart from one to one match, one-to-many and many-to-one matches are also counted with little penalty. B. Text Segmentation task In robust reading competition of ICDAR 2011, the text segmentation was introduced as a separate task. For comparing the results of different algorithms, the evaluation procedure has to calculate the precision, recall and f-score values. A ground truth data is provided for the task. In ground truth data, text is represented as black against a white background. Segmentation task is to segment the text from the input image and the text pixels are represented in black against a white background. The precision and recall values are calculated between the result of the method and the ground-truth for each image at the pixel-level. F-score, calculated as the harmonic mean of precision and recall values for the image set are used to rank the different methods. The metrics, however, existed earlier for scanned documents images as Document Image Binarization Contest (DIBCO) [17]. These metrics are calculated from the pixel level comparison of segmented image and the ground-truth image by using True Positive pixels (TP), False Positive pixels and False Negative pixels: True Positives: True positives are those pixels in the image which are text pixels and have been detected by the algorithm as also as text. False Positives: False positives are those pixels in the image which are actually not text pixels, but have been detected by the algorithm as text pixels. False Negatives: False negatives are those pixels in the image which are actually text pixels, but have not been detected by the algorithm as text pixels. Precision Rate: is defined as the ratio of correctly detected pixels to the sum of correctly detected pixels and false positives as represented in equation below: TP Precision Rate *100 (8) TP FP Recall Rate: is the ratio of correctly detected pixels to the sum of correctly detected pixels and false negatives as represented in equation below: TP RecallRate *100 (9) TP FN F-Measure: F-score is the harmonic mean of the recall and precision rates. 2* RecallRate *PrecisionRate F Score (10) RecallRate PrecisionRate Figure 9: Ground Truth data for pixel level segmentation task (KAIST Dataset) V. PROPOSED WORK A. Aks: Database for proposed work For Devanagari script there is no benchmark database of camera captured images. Aks, the proposed database of images is created by taking images from different environments. Camera captured images has a useful application in understanding the natural scene texts. So most of the database is dedicated towards natural scene text images. The proposed dataset is divided into two categories (figure 10): Devanagari Images and Multi-script images. In each category we further collected images in Natural light (Outdoor and indoor), Artificial light (Outdoor and indoor) and no light at all. Images of book titles are also taken. The images are captured from road-sides (sign boards), name plates (indoor and outdoor), hospitals, schools, historic monuments, book titles etc to cover different type of backgrounds. The captured images have different type of fonts, variable font size. The image may contain single word, multiple words arranged in Different patterns of lines. The text appearing in the image may be Machine-printed or handwritten. The ISSN: Page 37

7 B. Ground Truth Data TIE system extract text from an image and the extracted text image can be further fed to an OCR for its recognition. The recognition of text depends on the quality of OCR and also on the text image that is given as input to the OCR. The accuracy of TIE is measured as discussed in section IV. To calculate the required metrics, the output image of TIE has to be compared with some ground truth image that is created manually. To create ground truth data, we used the toolkit developed by [24]. The ground truth data for some images is shown in figure 11. Figure 10: Aks: Dataset of Camera Based Images images are captured either with a high resolution digital camera or with a mobile camera. Further we used capturing device with different resolutions. Sample images from our dataset are shown in figure 11: (a) Natural Light (Outdoor) (b) Natural Light (Indoor) \ (c) Artificial Light ` Figure 11: Ground Truth Images from Aks VI. CONCLUSION (d) No Light Figure 10: Sample Images from Aks In this paper a collection of camera based images for Devanagari script along with ground truth data is discussed. The proposed database will be used to detect and extract Devanagari text from these camera based images. Since the ground truth data for images is also created, the database can be used to compare the efficiency of different algorithms. ISSN: Page 38

8 REFERENCES [1] J. Liang, D. Doermann, H. Li, Camera-based analysis of text and documents: a survey, International Journal on Document Analysis and Recognition (IJDAR), Volume (7), Issue 2-3, pp , [2] A. Antonacopoulos, D. Karatzas, J. Ortiz Lopez, Accessing Textual Information Embedded in Internet Images, Proc. of SPIE, Internet Imaging II, San Jose, USA, Vol. 4311, pp , January [3] T. Kanungo and C. Lee What fraction of images on the web contain text? International Workshop on Web Document Analysis (WDA), pp 43-46, [4] D. Karatzas, A. Antonacopoulos, Colour Text Segmentation in Web Images Based on Human Perception, Image and Vision Computing, Vol. 25, Issue 5, Elsevier, pp , May [5] D. Lopresti, J. Zhou, Locating and recognizing text in WWW images, Information Retrieval, 2, pp , [6] S.J. Perantonis, B. Gatos and V. Maragos, A novel Web image processing algorithm for text area identification that helps commercial OCR engines to improve their Web image recognition efficiency, Second Int. Workshop on Web Document Analysis (WDA2003), pp , Edinburgh, Scotland, August 2003 [7] J. Ohya, A. Shio, and S. Akamatsu, Recognizing characters in scene images, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 16, no. 2, pp , Feb [8] Y. Zhong, K. Karu, and A. Jain, Locating text in complex color images, Pattern Recognition, vol. 28, no. 10, pp , Oct [9] P. Clark and M. Mirmehdi, Recognising text in real scenes, Int. Jour. on Document Analysis and Recognition, vol. 4, no. 4, pp , [10] C. Mancas Thillou and B. Gosselin, Color text extraction with selective metric-based clustering, Computer Vision and Image Understanding, vol. 107, no. 1-2, pp , Jul [11] J. Weinman, E. Learned Miller, and A. Hanson, Scene text recognition using similarity and a lexicon with sparse belief propagation, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 31, no. 10, pp , Oct [12] J. Park, G. Lee, E. Kim, J. Lim, S. Kim, H. Yang, M. Lee, and S. Hwang, Automatic detection and recognition of korean text in outdoor signboard images, Pattern Recognition Letters, vol. 31, no. 12, pp , Sep [13] Y. Pan, X. Hou, and C. Liu, A hybrid approach to detect and localize texts in natural scene images, IEEE Trans. on Image Processing, vol. 20, no. 3, pp , Mar [14] S.M. Lucas, A. Panaretos, L. Sosa, A. Tang, S. Wong, and, R. Young, ICDAR 2003 robust reading competitions In Proceedings of International Conference on Document Analysis and Recognition, pages IEEE Computer Society, [15] A. Shahab, F. Shafait, and A. Dengel, ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images, In Proceedings of the International Conference on Document Analysis and Recognition, pages IEEE Computer Society, [16] D. Karatzas, F. Shafait, S. Uchida, M. Iwamura, L. G. i. Bigorda, S. R. Mestre, J. Mas, D. F. Mota, J. A. Almazan, and D. L. Heras, ICDAR 2013 Robust Reading Competition, In Proceedings of the 12th nternational Conference on Document Analysis and Recognition, pages , [17] K. Wang and S. Belongie, Word spotting in the wild, In Proceedings of 11th ECCV, pages , 2010, [18] C. Yao, X. Bai, W. Liu, Y. Ma and Z. Tu, Detecting Texts of Arbitrary Orientations in Natural Images, CVPR 2012 [19] R. Nagy, A. Dicker and K. Meyer-Wegener, "NEOCR: A Configurable Dataset for Natural Image Text Recognition". In Proceedings of CBDAR Workshop 2011 at ICDAR pp , September [20] R. Nagy, A. Dicker, and K. Meyer-Wegener, "Definition and Evaluation of the NEOCR Dataset for Natural-Image Text Recognition". University of Erlangen, Dept. of Computer Science, Technical Reports, CS-2011, 07, September [21] B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman, Labelme: A database and web-based tool for image annotation, IJCV, vol. 77, pp , May [22] LabelMe Dataset. [Online]. Available: csail.mit.edu/ [23] C. Wolf and J.M. Jolion, "Object Count / Area Graphs for the Evaluation of Object Detection and Segmentation Algorithms", International Journal of Document Analysis, vol. 8, no. 4, pp , [24] T. Kasar, D. Kumar, M.N. Anil Prasad, D. Girish and A.G. Ramakrishnan, MAST: Multi-Script Annotation toolkit for Scenic Text, Proc. Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data (J-MOCR- AND), Sept. 17, Beijing, China. ISSN: Page 39

Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval

Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval Sheraz Ahmed, Koichi Kise, Masakazu Iwamura, Marcus Liwicki, and Andreas Dengel German Research Center for

More information

An Analysis of Image Denoising and Restoration of Handwritten Degraded Document Images

An Analysis of Image Denoising and Restoration of Handwritten Degraded Document Images Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 12, December 2014,

More information

Locating the Query Block in a Source Document Image

Locating the Query Block in a Source Document Image Locating the Query Block in a Source Document Image Naveena M and G Hemanth Kumar Department of Studies in Computer Science, University of Mysore, Manasagangotri-570006, Mysore, INDIA. Abstract: - In automatic

More information

Extraction and Recognition of Text From Digital English Comic Image Using Median Filter

Extraction and Recognition of Text From Digital English Comic Image Using Median Filter Extraction and Recognition of Text From Digital English Comic Image Using Median Filter S.Ranjini 1 Research Scholar,Department of Information technology Bharathiar University Coimbatore,India ranjinisengottaiyan@gmail.com

More information

An Improved Bernsen Algorithm Approaches For License Plate Recognition

An Improved Bernsen Algorithm Approaches For License Plate Recognition IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) ISSN: 78-834, ISBN: 78-8735. Volume 3, Issue 4 (Sep-Oct. 01), PP 01-05 An Improved Bernsen Algorithm Approaches For License Plate Recognition

More information

International Conference on Computer, Communication, Control and Information Technology (C 3 IT 2009) Paper Code: DSIP-024

International Conference on Computer, Communication, Control and Information Technology (C 3 IT 2009) Paper Code: DSIP-024 Paper Code: DSIP-024 Oral 270 A NOVEL SCHEME FOR BINARIZATION OF VEHICLE IMAGES USING HIERARCHICAL HISTOGRAM EQUALIZATION TECHNIQUE Satadal Saha 1, Subhadip Basu 2 *, Mita Nasipuri 2, Dipak Kumar Basu

More information

Automatic Licenses Plate Recognition System

Automatic Licenses Plate Recognition System Automatic Licenses Plate Recognition System Garima R. Yadav Dept. of Electronics & Comm. Engineering Marathwada Institute of Technology, Aurangabad (Maharashtra), India yadavgarima08@gmail.com Prof. H.K.

More information

Effect of Ground Truth on Image Binarization

Effect of Ground Truth on Image Binarization 2012 10th IAPR International Workshop on Document Analysis Systems Effect of Ground Truth on Image Binarization Elisa H. Barney Smith Boise State University Boise, Idaho, USA EBarneySmith@BoiseState.edu

More information

Locally baseline detection for online Arabic script based languages character recognition

Locally baseline detection for online Arabic script based languages character recognition International Journal of the Physical Sciences Vol. 5(7), pp. 955-959, July 2010 Available online at http://www.academicjournals.org/ijps ISSN 1992-1950 2010 Academic Journals Full Length Research Paper

More information

A Review of Optical Character Recognition System for Recognition of Printed Text

A Review of Optical Character Recognition System for Recognition of Printed Text IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 3, Ver. II (May Jun. 2015), PP 28-33 www.iosrjournals.org A Review of Optical Character Recognition

More information

PHASE PRESERVING DENOISING AND BINARIZATION OF ANCIENT DOCUMENT IMAGE

PHASE PRESERVING DENOISING AND BINARIZATION OF ANCIENT DOCUMENT IMAGE Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 7, July 2015, pg.16

More information

International Journal of Advanced Research in Computer Science and Software Engineering

International Journal of Advanced Research in Computer Science and Software Engineering Volume 3, Issue 4, April 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Novel Approach

More information

Automatic Enhancement and Binarization of Degraded Document Images

Automatic Enhancement and Binarization of Degraded Document Images Automatic Enhancement and Binarization of Degraded Document Images Jon Parker 1,2, Ophir Frieder 1, and Gideon Frieder 1 1 Department of Computer Science Georgetown University Washington DC, USA {jon,

More information

Scene Text Recognition with Bilateral Regression

Scene Text Recognition with Bilateral Regression Scene Text Recognition with Bilateral Regression Jacqueline Feild and Erik Learned-Miller Technical Report UM-CS-2012-021 University of Massachusetts Amherst Abstract This paper focuses on improving the

More information

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition Hetal R. Thaker Atmiya Institute of Technology & science, Kalawad Road, Rajkot Gujarat, India C. K. Kumbharana,

More information

Recovery of badly degraded Document images using Binarization Technique

Recovery of badly degraded Document images using Binarization Technique International Journal of Scientific and Research Publications, Volume 4, Issue 5, May 2014 1 Recovery of badly degraded Document images using Binarization Technique Prof. S. P. Godse, Samadhan Nimbhore,

More information

Integrated Digital System for Yarn Surface Quality Evaluation using Computer Vision and Artificial Intelligence

Integrated Digital System for Yarn Surface Quality Evaluation using Computer Vision and Artificial Intelligence Integrated Digital System for Yarn Surface Quality Evaluation using Computer Vision and Artificial Intelligence Sheng Yan LI, Jie FENG, Bin Gang XU, and Xiao Ming TAO Institute of Textiles and Clothing,

More information

A Comparative Analysis of Different Edge Based Algorithms for Mobile/Camera Captured Images

A Comparative Analysis of Different Edge Based Algorithms for Mobile/Camera Captured Images A Comparative Analysis of Different Edge Based Algorithms for Mobile/Camera Captured Images H.K.Chethan Research Scholar, Department of Studies in Computer Science, University of Mysore, Mysore-570006,

More information

http://www.diva-portal.org This is the published version of a paper presented at SAI Annual Conference on Areas of Intelligent Systems and Artificial Intelligence and their Applications to the Real World

More information

Keywords Unidirectional scanning, Bidirectional scanning, Overlapping region, Mosaic image, Split image

Keywords Unidirectional scanning, Bidirectional scanning, Overlapping region, Mosaic image, Split image Volume 6, Issue 2, February 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com An Improved

More information

SmartATID: A mobile captured Arabic Text Images Dataset for multi-purpose recognition tasks

SmartATID: A mobile captured Arabic Text Images Dataset for multi-purpose recognition tasks SmartATID: A mobile captured Arabic Text Images Dataset for multi-purpose recognition tasks Fatma Chabchoub, Yousri Kessentini, Slim Kanoun, Véronique Eglin To cite this version: Fatma Chabchoub, Yousri

More information

Laser Printer Source Forensics for Arbitrary Chinese Characters

Laser Printer Source Forensics for Arbitrary Chinese Characters Laser Printer Source Forensics for Arbitrary Chinese Characters Xiangwei Kong, Xin gang You,, Bo Wang, Shize Shang and Linjie Shen Information Security Research Center, Dalian University of Technology,

More information

VEHICLE LICENSE PLATE DETECTION ALGORITHM BASED ON STATISTICAL CHARACTERISTICS IN HSI COLOR MODEL

VEHICLE LICENSE PLATE DETECTION ALGORITHM BASED ON STATISTICAL CHARACTERISTICS IN HSI COLOR MODEL VEHICLE LICENSE PLATE DETECTION ALGORITHM BASED ON STATISTICAL CHARACTERISTICS IN HSI COLOR MODEL Instructor : Dr. K. R. Rao Presented by: Prasanna Venkatesh Palani (1000660520) prasannaven.palani@mavs.uta.edu

More information

RESEARCH PAPER FOR ARBITRARY ORIENTED TEAM TEXT DETECTION IN VIDEO IMAGES USING CONNECTED COMPONENT ANALYSIS

RESEARCH PAPER FOR ARBITRARY ORIENTED TEAM TEXT DETECTION IN VIDEO IMAGES USING CONNECTED COMPONENT ANALYSIS International Journal of Latest Trends in Engineering and Technology Vol.(7)Issue(4), pp.137-141 DOI: http://dx.doi.org/10.21172/1.74.018 e-issn:2278-621x RESEARCH PAPER FOR ARBITRARY ORIENTED TEAM TEXT

More information

Lecture 7: Scene Text Detection and Recognition. Dr. Cong Yao Megvii (Face++) Researcher

Lecture 7: Scene Text Detection and Recognition. Dr. Cong Yao Megvii (Face++) Researcher Lecture 7: Scene Text Detection and Recognition Dr. Cong Yao Megvii (Face++) Researcher yaocong@megvii.com Outline Background and Introduction Conventional Methods Deep Learning Methods Datasets and Competitions

More information

Sri Shakthi Institute of Engg and Technology, Coimbatore, TN, India.

Sri Shakthi Institute of Engg and Technology, Coimbatore, TN, India. Intelligent Forms Processing System Tharani B 1, Ramalakshmi. R 2, Pavithra. S 3, Reka. V. S 4, Sivaranjani. J 5 1 Assistant Professor, 2,3,4,5 UG Students, Dept. of ECE Sri Shakthi Institute of Engg and

More information

Text Extraction from Images

Text Extraction from Images Text Extraction from Images Paraag Agrawal #1, Rohit Varma *2 # Information Technology, University of Pune, India 1 paraagagrawal@hotmail.com * Information Technology, University of Pune, India 2 catchrohitvarma@gmail.com

More information

A Novel Method for Enhancing Satellite & Land Survey Images Using Color Filter Array Interpolation Technique (CFA)

A Novel Method for Enhancing Satellite & Land Survey Images Using Color Filter Array Interpolation Technique (CFA) A Novel Method for Enhancing Satellite & Land Survey Images Using Color Filter Array Interpolation Technique (CFA) Suma Chappidi 1, Sandeep Kumar Mekapothula 2 1 PG Scholar, Department of ECE, RISE Krishna

More information

Number Plate Recognition Using Segmentation

Number Plate Recognition Using Segmentation Number Plate Recognition Using Segmentation Rupali Kate M.Tech. Electronics(VLSI) BVCOE. Pune 411043, Maharashtra, India. Dr. Chitode. J. S BVCOE. Pune 411043 Abstract Automatic Number Plate Recognition

More information

Image binarization techniques for degraded document images: A review

Image binarization techniques for degraded document images: A review Image binarization techniques for degraded document images: A review Binarization techniques 1 Amoli Panchal, 2 Chintan Panchal, 3 Bhargav Shah 1 Student, 2 Assistant Professor, 3 Assistant Professor 1

More information

An Improved Binarization Method for Degraded Document Seema Pardhi 1, Dr. G. U. Kharat 2

An Improved Binarization Method for Degraded Document Seema Pardhi 1, Dr. G. U. Kharat 2 An Improved Binarization Method for Degraded Document Seema Pardhi 1, Dr. G. U. Kharat 2 1, Student, SPCOE, Department of E&TC Engineering, Dumbarwadi, Otur 2, Professor, SPCOE, Department of E&TC Engineering,

More information

Noise Removal and Binarization of Scanned Document Images Using Clustering of Features

Noise Removal and Binarization of Scanned Document Images Using Clustering of Features , March 15-17, 2017, Hong Kong Noise Removal and Binarization of Scanned Document Images Using Clustering of Features Atena Farahmand, Abdolhossein Sarrafzadeh and Jamshid Shanbehzadeh, Abstract- Old documents

More information

A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron

A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron Proc. National Conference on Recent Trends in Intelligent Computing (2006) 86-92 A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron

More information

INDIAN VEHICLE LICENSE PLATE EXTRACTION AND SEGMENTATION

INDIAN VEHICLE LICENSE PLATE EXTRACTION AND SEGMENTATION International Journal of Computer Science and Communication Vol. 2, No. 2, July-December 2011, pp. 593-599 INDIAN VEHICLE LICENSE PLATE EXTRACTION AND SEGMENTATION Chetan Sharma 1 and Amandeep Kaur 2 1

More information

True Color Distributions of Scene Text and Background

True Color Distributions of Scene Text and Background True Color Distributions of Scene Text and Background Renwu Gao, Shoma Eguchi, Seiichi Uchida Kyushu University Fukuoka, Japan Email: {kou, eguchi}@human.ait.kyushu-u.ac.jp, uchida@ait.kyushu-u.ac.jp Abstract

More information

Optical Character Recognition for Hindi

Optical Character Recognition for Hindi Optical Character Recognition for Hindi Prasanta Pratim Bairagi Assistant Professor, Department of CSE, Assam down town University, Assam, India ---------------------------------------------------------------------***---------------------------------------------------------------------

More information

MAV-ID card processing using camera images

MAV-ID card processing using camera images EE 5359 MULTIMEDIA PROCESSING SPRING 2013 PROJECT PROPOSAL MAV-ID card processing using camera images Under guidance of DR K R RAO DEPARTMENT OF ELECTRICAL ENGINEERING UNIVERSITY OF TEXAS AT ARLINGTON

More information

Recognizing Words in Scenes with a Head-Mounted Eye-Tracker

Recognizing Words in Scenes with a Head-Mounted Eye-Tracker Recognizing Words in Scenes with a Head-Mounted Eye-Tracker Takuya Kobayashi, Takumi Toyama, Faisal Shafait, Masakazu Iwamura, Koichi Kise and Andreas Dengel Graduate School of Engineering Osaka Prefecture

More information

Nigerian Vehicle License Plate Recognition System using Artificial Neural Network

Nigerian Vehicle License Plate Recognition System using Artificial Neural Network Nigerian Vehicle License Plate Recognition System using Artificial Neural Network Amusan D.G 1, Arulogun O.T 2 and Falohun A.S 3 Open and Distance Learning Centre, Ladoke Akintola University of Technology,

More information

An Optimal Text Recognition and Translation System for Smart phones Using Genetic Programming and Cloud Ashish Emmanuel S, Dr. S.

An Optimal Text Recognition and Translation System for Smart phones Using Genetic Programming and Cloud Ashish Emmanuel S, Dr. S. An Optimal Text Recognition and Translation System for Smart phones Using Genetic Programming and Cloud Ashish Emmanuel S, Dr. S.Nithyanandam Abstract An Optimal Text Recognition and Translation System

More information

Wavelet-based Image Splicing Forgery Detection

Wavelet-based Image Splicing Forgery Detection Wavelet-based Image Splicing Forgery Detection 1 Tulsi Thakur M.Tech (CSE) Student, Department of Computer Technology, basiltulsi@gmail.com 2 Dr. Kavita Singh Head & Associate Professor, Department of

More information

An Analysis of Binarization Ground Truthing

An Analysis of Binarization Ground Truthing Boise State University ScholarWorks Electrical and Computer Engineering Faculty Publications and Presentations Department of Electrical and Computer Engineering 6-1-2010 An Analysis of Binarization Ground

More information

R. K. Sharma School of Mathematics and Computer Applications Thapar University Patiala, Punjab, India

R. K. Sharma School of Mathematics and Computer Applications Thapar University Patiala, Punjab, India Segmentation of Touching Characters in Upper Zone in Printed Gurmukhi Script M. K. Jindal Department of Computer Science and Applications Panjab University Regional Centre Muktsar, Punjab, India +919814637188,

More information

International Journal of Innovative Research in Engineering Science and Technology APRIL 2018 ISSN X

International Journal of Innovative Research in Engineering Science and Technology APRIL 2018 ISSN X HIGH DYNAMIC RANGE OF MULTISPECTRAL ACQUISITION USING SPATIAL IMAGES 1 M.Kavitha, M.Tech., 2 N.Kannan, M.E., and 3 S.Dharanya, M.E., 1 Assistant Professor/ CSE, Dhirajlal Gandhi College of Technology,

More information

License Plate Localisation based on Morphological Operations

License Plate Localisation based on Morphological Operations License Plate Localisation based on Morphological Operations Xiaojun Zhai, Faycal Benssali and Soodamani Ramalingam School of Engineering & Technology University of Hertfordshire, UH Hatfield, UK Abstract

More information

A Recognition of License Plate Images from Fast Moving Vehicles Using Blur Kernel Estimation

A Recognition of License Plate Images from Fast Moving Vehicles Using Blur Kernel Estimation A Recognition of License Plate Images from Fast Moving Vehicles Using Blur Kernel Estimation Kalaivani.R 1, Poovendran.R 2 P.G. Student, Dept. of ECE, Adhiyamaan College of Engineering, Hosur, Tamil Nadu,

More information

Improved SIFT Matching for Image Pairs with a Scale Difference

Improved SIFT Matching for Image Pairs with a Scale Difference Improved SIFT Matching for Image Pairs with a Scale Difference Y. Bastanlar, A. Temizel and Y. Yardımcı Informatics Institute, Middle East Technical University, Ankara, 06531, Turkey Published in IET Electronics,

More information

A new method to recognize Dimension Sets and its application in Architectural Drawings. I. Introduction

A new method to recognize Dimension Sets and its application in Architectural Drawings. I. Introduction A new method to recognize Dimension Sets and its application in Architectural Drawings Yalin Wang, Long Tang, Zesheng Tang P O Box 84-187, Tsinghua University Postoffice Beijing 100084, PRChina Email:

More information

Book Cover Recognition Project

Book Cover Recognition Project Book Cover Recognition Project Carolina Galleguillos Department of Computer Science University of California San Diego La Jolla, CA 92093-0404 cgallegu@cs.ucsd.edu Abstract The purpose of this project

More information

An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi

An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi Department of E&TC Engineering,PVPIT,Bavdhan,Pune ABSTRACT: In the last decades vehicle license plate recognition systems

More information

Libyan Licenses Plate Recognition Using Template Matching Method

Libyan Licenses Plate Recognition Using Template Matching Method Journal of Computer and Communications, 2016, 4, 62-71 Published Online May 2016 in SciRes. http://www.scirp.org/journal/jcc http://dx.doi.org/10.4236/jcc.2016.47009 Libyan Licenses Plate Recognition Using

More information

IEEE Signal Processing Letters: SPL Distance-Reciprocal Distortion Measure for Binary Document Images

IEEE Signal Processing Letters: SPL Distance-Reciprocal Distortion Measure for Binary Document Images IEEE SIGNAL PROCESSING LETTERS, VOL. X, NO. Y, Z 2003 1 IEEE Signal Processing Letters: SPL-00466-2002 1) Paper Title Distance-Reciprocal Distortion Measure for Binary Document Images 2) Authors Haiping

More information

Content Based Image Retrieval Using Color Histogram

Content Based Image Retrieval Using Color Histogram Content Based Image Retrieval Using Color Histogram Nitin Jain Assistant Professor, Lokmanya Tilak College of Engineering, Navi Mumbai, India. Dr. S. S. Salankar Professor, G.H. Raisoni College of Engineering,

More information

Linear Gaussian Method to Detect Blurry Digital Images using SIFT

Linear Gaussian Method to Detect Blurry Digital Images using SIFT IJCAES ISSN: 2231-4946 Volume III, Special Issue, November 2013 International Journal of Computer Applications in Engineering Sciences Special Issue on Emerging Research Areas in Computing(ERAC) www.caesjournals.org

More information

Implementation of Barcode Localization Technique using Morphological Operations

Implementation of Barcode Localization Technique using Morphological Operations Implementation of Barcode Localization Technique using Morphological Operations Savreet Kaur Student, Master of Technology, Department of Computer Engineering, ABSTRACT Barcode Localization is an extremely

More information

Vehicle License Plate Recognition System Using LoG Operator for Edge Detection and Radon Transform for Slant Correction

Vehicle License Plate Recognition System Using LoG Operator for Edge Detection and Radon Transform for Slant Correction Vehicle License Plate Recognition System Using LoG Operator for Edge Detection and Radon Transform for Slant Correction Jaya Gupta, Prof. Supriya Agrawal Computer Engineering Department, SVKM s NMIMS University

More information

World Journal of Engineering Research and Technology WJERT

World Journal of Engineering Research and Technology WJERT wjert, 2017, Vol. 3, Issue 3, 357-366 Original Article ISSN 2454-695X Shagun et al. WJERT www.wjert.org SJIF Impact Factor: 4.326 NUMBER PLATE RECOGNITION USING MATLAB 1 *Ms. Shagun Chaudhary and 2 Miss

More information

Multi-Script Line identification from Indian Documents

Multi-Script Line identification from Indian Documents Multi-Script Line identification from Indian Documents U. Pal, S. Sinha and B. B. Chaudhuri Computer Vision and Pattern Recognition Unit Indian Statistical Institute 203 B. T. Road, Kolkata-700108, INDIA

More information

Implementation of License Plate Recognition System in ARM Cortex A8 Board

Implementation of License Plate Recognition System in ARM Cortex A8 Board www..org 9 Implementation of License Plate Recognition System in ARM Cortex A8 Board S. Uma 1, M.Sharmila 2 1 Assistant Professor, 2 Research Scholar, Department of Electrical and Electronics Engg, College

More information

Face Detection System on Ada boost Algorithm Using Haar Classifiers

Face Detection System on Ada boost Algorithm Using Haar Classifiers Vol.2, Issue.6, Nov-Dec. 2012 pp-3996-4000 ISSN: 2249-6645 Face Detection System on Ada boost Algorithm Using Haar Classifiers M. Gopi Krishna, A. Srinivasulu, Prof (Dr.) T.K.Basak 1, 2 Department of Electronics

More information

Effects of the Unscented Kalman Filter Process for High Performance Face Detector

Effects of the Unscented Kalman Filter Process for High Performance Face Detector Effects of the Unscented Kalman Filter Process for High Performance Face Detector Bikash Lamsal and Naofumi Matsumoto Abstract This paper concerns with a high performance algorithm for human face detection

More information

Introduction to Video Forgery Detection: Part I

Introduction to Video Forgery Detection: Part I Introduction to Video Forgery Detection: Part I Detecting Forgery From Static-Scene Video Based on Inconsistency in Noise Level Functions IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 5,

More information

Recognition Of Vehicle Number Plate Using MATLAB

Recognition Of Vehicle Number Plate Using MATLAB Recognition Of Vehicle Number Plate Using MATLAB Mr. Ami Kumar Parida 1, SH Mayuri 2,Pallabi Nayk 3,Nidhi Bharti 4 1Asst. Professor, Gandhi Institute Of Engineering and Technology, Gunupur 234Under Graduate,

More information

Efficient Car License Plate Detection and Recognition by Using Vertical Edge Based Method

Efficient Car License Plate Detection and Recognition by Using Vertical Edge Based Method Efficient Car License Plate Detection and Recognition by Using Vertical Edge Based Method M. Veerraju *1, S. Saidarao *2 1 Student, (M.Tech), Department of ECE, NIE, Macherla, Andrapradesh, India. E-Mail:

More information

Malaysian Car Number Plate Detection System Based on Template Matching and Colour Information

Malaysian Car Number Plate Detection System Based on Template Matching and Colour Information Malaysian Car Number Plate Detection System Based on Template Matching and Colour Information Mohd Firdaus Zakaria, Shahrel A. Suandi Intelligent Biometric Group, School of Electrical and Electronics Engineering,

More information

MAJORITY VOTING IMAGE BINARIZATION

MAJORITY VOTING IMAGE BINARIZATION MAJORITY VOTING IMAGE BINARIZATION Alexandru PRUNCU 1* Cezar GHIMBAS 2 Radu BOERU 3 Vlad NECULAE 4 Costin-Anton BOIANGIU 5 ABSTRACT This paper presents a new binarization technique for text based images.

More information

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 05, 2016 ISSN (online):

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 05, 2016 ISSN (online): IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 05, 2016 ISSN (online): 2321-0613 Improved Document Image Binarization using Hybrid Thresholding Method Neha 1 Deepak 2

More information

Matlab Based Vehicle Number Plate Recognition

Matlab Based Vehicle Number Plate Recognition International Journal of Computational Intelligence Research ISSN 0973-1873 Volume 13, Number 9 (2017), pp. 2283-2288 Research India Publications http://www.ripublication.com Matlab Based Vehicle Number

More information

An Effective Method for Removing Scratches and Restoring Low -Quality QR Code Images

An Effective Method for Removing Scratches and Restoring Low -Quality QR Code Images An Effective Method for Removing Scratches and Restoring Low -Quality QR Code Images Ashna Thomas 1, Remya Paul 2 1 M.Tech Student (CSE), Mahatma Gandhi University Viswajyothi College of Engineering and

More information

Binarization of Historical Document Images Using the Local Maximum and Minimum

Binarization of Historical Document Images Using the Local Maximum and Minimum Binarization of Historical Document Images Using the Local Maximum and Minimum Bolan Su Department of Computer Science School of Computing National University of Singapore Computing 1, 13 Computing Drive

More information

` Jurnal Teknologi IDENTIFICATION OF MOST SUITABLE BINARISATION METHODS FOR ACEHNESE ANCIENT MANUSCRIPTS RESTORATION SOFTWARE USER GUIDE.

` Jurnal Teknologi IDENTIFICATION OF MOST SUITABLE BINARISATION METHODS FOR ACEHNESE ANCIENT MANUSCRIPTS RESTORATION SOFTWARE USER GUIDE. ` Jurnal Teknologi IDENTIFICATION OF MOST SUITABLE BINARISATION METHODS FOR ACEHNESE ANCIENT MANUSCRIPTS RESTORATION SOFTWARE USER GUIDE Fardian *, Fitri Arnia, Sayed Muchallil, Khairul Munadi Electrical

More information

Quality Measure of Multicamera Image for Geometric Distortion

Quality Measure of Multicamera Image for Geometric Distortion Quality Measure of Multicamera for Geometric Distortion Mahesh G. Chinchole 1, Prof. Sanjeev.N.Jain 2 M.E. II nd Year student 1, Professor 2, Department of Electronics Engineering, SSVPSBSD College of

More information

An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods

An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods 19 An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods T.Arunachalam* Post Graduate Student, P.G. Dept. of Computer Science, Govt Arts College, Melur - 625 106 Email-Arunac682@gmail.com

More information

A Novel Morphological Method for Detection and Recognition of Vehicle License Plates

A Novel Morphological Method for Detection and Recognition of Vehicle License Plates American Journal of Applied Sciences 6 (12): 2066-2070, 2009 ISSN 1546-9239 2009 Science Publications A Novel Morphological Method for Detection and Recognition of Vehicle License Plates 1 S.H. Mohades

More information

A Vehicular Visual Tracking System Incorporating Global Positioning System

A Vehicular Visual Tracking System Incorporating Global Positioning System A Vehicular Visual Tracking System Incorporating Global Positioning System Hsien-Chou Liao and Yu-Shiang Wang Abstract Surveillance system is widely used in the traffic monitoring. The deployment of cameras

More information

Number Plate Detection with a Multi-Convolutional Neural Network Approach with Optical Character Recognition for Mobile Devices

Number Plate Detection with a Multi-Convolutional Neural Network Approach with Optical Character Recognition for Mobile Devices J Inf Process Syst, Vol.12, No.1, pp.100~108, March 2016 http://dx.doi.org/10.3745/jips.04.0022 ISSN 1976-913X (Print) ISSN 2092-805X (Electronic) Number Plate Detection with a Multi-Convolutional Neural

More information

A New Character Segmentation Approach for Off-Line Cursive Handwritten Words

A New Character Segmentation Approach for Off-Line Cursive Handwritten Words Available online at www.sciencedirect.com Procedia Computer Science 17 (2013 ) 88 95 Information Technology and Quantitative Management (ITQM2013) A New Character Segmentation Approach for Off-Line Cursive

More information

Image Extraction using Image Mining Technique

Image Extraction using Image Mining Technique IOSR Journal of Engineering (IOSRJEN) e-issn: 2250-3021, p-issn: 2278-8719 Vol. 3, Issue 9 (September. 2013), V2 PP 36-42 Image Extraction using Image Mining Technique Prof. Samir Kumar Bandyopadhyay,

More information

A No Reference Image Blur Detection using CPBD Metric and Deblurring of Gaussian Blurred Images using Lucy-Richardson Algorithm

A No Reference Image Blur Detection using CPBD Metric and Deblurring of Gaussian Blurred Images using Lucy-Richardson Algorithm A No Reference Image Blur Detection using CPBD Metric and Deblurring of Gaussian Blurred Images using Lucy-Richardson Algorithm Suresh S. Zadage, G. U. Kharat Abstract This paper addresses sharpness of

More information

Face detection, face alignment, and face image parsing

Face detection, face alignment, and face image parsing Lecture overview Face detection, face alignment, and face image parsing Brandon M. Smith Guest Lecturer, CS 534 Monday, October 21, 2013 Brief introduction to local features Face detection Face alignment

More information

AN APPROACH TO EXTRACT LINE, WORD AND CHARACTER FROM SCENE TEXT IMAGE

AN APPROACH TO EXTRACT LINE, WORD AND CHARACTER FROM SCENE TEXT IMAGE AN APPROACH TO EXTRACT LINE, WORD AND CHARACTER FROM SCENE TEXT IMAGE DANESHWARI A NOOLA 1, M M KODABAGI 2 daneshwari.noola@gmail.com malik123_mk@rediffmail.com Abstract:- Text translation from Scene Image

More information

A Study on Single Camera Based ANPR System for Improvement of Vehicle Number Plate Recognition on Multi-lane Roads

A Study on Single Camera Based ANPR System for Improvement of Vehicle Number Plate Recognition on Multi-lane Roads Invention Journal of Research Technology in Engineering & Management (IJRTEM) ISSN: 2455-3689 www.ijrtem.com Volume 2 Issue 1 ǁ January. 2018 ǁ PP 11-16 A Study on Single Camera Based ANPR System for Improvement

More information

Color Image Segmentation Using K-Means Clustering and Otsu s Adaptive Thresholding

Color Image Segmentation Using K-Means Clustering and Otsu s Adaptive Thresholding Color Image Segmentation Using K-Means Clustering and Otsu s Adaptive Thresholding Vijay Jumb, Mandar Sohani, Avinash Shrivas Abstract In this paper, an approach for color image segmentation is presented.

More information

Open Access An Improved Character Recognition Algorithm for License Plate Based on BP Neural Network

Open Access An Improved Character Recognition Algorithm for License Plate Based on BP Neural Network Send Orders for Reprints to reprints@benthamscience.ae 202 The Open Electrical & Electronic Engineering Journal, 2014, 8, 202-207 Open Access An Improved Character Recognition Algorithm for License Plate

More information

DENSE-CLUSTER BASED VOTING APPROACH FOR LICENSE PLATE IDENTIFICATION

DENSE-CLUSTER BASED VOTING APPROACH FOR LICENSE PLATE IDENTIFICATION Journal of Engineering Science and Technology Special Issue on ICCSIT 208, July (208) 34-47 School of Engineering, Taylor s University DENSE-CLUSTER BASED VOTING APPROACH FOR LICENSE PLATE IDENTIFICATION

More information

Combined Approach for Face Detection, Eye Region Detection and Eye State Analysis- Extended Paper

Combined Approach for Face Detection, Eye Region Detection and Eye State Analysis- Extended Paper International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 10, Issue 9 (September 2014), PP.57-68 Combined Approach for Face Detection, Eye

More information

Robust Document Image Binarization Techniques

Robust Document Image Binarization Techniques Robust Document Image Binarization Techniques T. Srikanth M-Tech Student, Malla Reddy Institute of Technology and Science, Maisammaguda, Dulapally, Secunderabad. Abstract: Segmentation of text from badly

More information

Localization of License Plates from Surveillance Camera Images: A Color Feature Based ANN Approach

Localization of License Plates from Surveillance Camera Images: A Color Feature Based ANN Approach Localization of License Plates from Surveillance Camera Images: A Color Feature Based ANN Approach Satadal Saha Sr. Lecturer MCKV Institute of Engg. Liluah Subhadip Basu Sr. Lecturer Jadavpur University

More information

Adaptive Feature Analysis Based SAR Image Classification

Adaptive Feature Analysis Based SAR Image Classification I J C T A, 10(9), 2017, pp. 973-977 International Science Press ISSN: 0974-5572 Adaptive Feature Analysis Based SAR Image Classification Debabrata Samanta*, Abul Hasnat** and Mousumi Paul*** ABSTRACT SAR

More information

Real Time ALPR for Vehicle Identification Using Neural Network

Real Time ALPR for Vehicle Identification Using Neural Network _ Real Time ALPR for Vehicle Identification Using Neural Network Anushree Deshmukh M.E Student Terna Engineering College,Navi Mumbai Email: anushree_deshmukh@yahoo.co.in Abstract With the rapid growth

More information

A Method for Estimating Meanings for Groups of Shapes in Presentation Slides

A Method for Estimating Meanings for Groups of Shapes in Presentation Slides A Method for Estimating Meanings for Groups of Shapes in Presentation Slides Yuki Sakuragi, Atsushi Aoyama, Fuminori Kimura, and Akira Maeda Abstract This paper proposes a method for estimating the meanings

More information

Toward an Augmented Reality System for Violin Learning Support

Toward an Augmented Reality System for Violin Learning Support Toward an Augmented Reality System for Violin Learning Support Hiroyuki Shiino, François de Sorbier, and Hideo Saito Graduate School of Science and Technology, Keio University, Yokohama, Japan {shiino,fdesorbi,saito}@hvrl.ics.keio.ac.jp

More information

Segmentation using Saturation Thresholding and its Application in Content-Based Retrieval of Images

Segmentation using Saturation Thresholding and its Application in Content-Based Retrieval of Images Segmentation using Saturation Thresholding and its Application in Content-Based Retrieval of Images A. Vadivel 1, M. Mohan 1, Shamik Sural 2 and A.K.Majumdar 1 1 Department of Computer Science and Engineering,

More information

Smart Number Plate Identification Using Back Propagation Neural Network

Smart Number Plate Identification Using Back Propagation Neural Network Smart Number Plate Identification Using Back Propagation Neural Network Prof. Pankaj Salunkhe 1, Mr. Akshay Dhawale 2 1 Head of Department (Electronics & Telecommunication Engineering), YTIET, Bhivpuri

More information

Method for Real Time Text Extraction of Digital Manga Comic

Method for Real Time Text Extraction of Digital Manga Comic Method for Real Time Text Extraction of Digital Manga Comic Kohei Arai Information Science Department Saga University Saga, 840-0027, Japan Herman Tolle Software Engineering Department Brawijaya University

More information

Truthing for Pixel-Accurate Segmentation

Truthing for Pixel-Accurate Segmentation Truthing for Pixel-Accurate Segmentation Michael A. Moll, Henry S. Baird & Chang An Computer Science & Engineering Dept, Lehigh University 19 Memorial Drive West, Bethlehem, Pennsylvania 18017 USA E-mail:

More information

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Kiwon Yun, Junyeong Yang, and Hyeran Byun Dept. of Computer Science, Yonsei University, Seoul, Korea, 120-749

More information

Intelligent Nighttime Video Surveillance Using Multi-Intensity Infrared Illuminator

Intelligent Nighttime Video Surveillance Using Multi-Intensity Infrared Illuminator , October 19-21, 2011, San Francisco, USA Intelligent Nighttime Video Surveillance Using Multi-Intensity Infrared Illuminator Peggy Joy Lu, Jen-Hui Chuang, and Horng-Horng Lin Abstract In nighttime video

More information

OBJECTIVE OF THE BOOK ORGANIZATION OF THE BOOK

OBJECTIVE OF THE BOOK ORGANIZATION OF THE BOOK xv Preface Advancement in technology leads to wide spread use of mounting cameras to capture video imagery. Such surveillance cameras are predominant in commercial institutions through recording the cameras

More information

IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP

IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP IMAGE TYPE WATER METER CHARACTER RECOGNITION BASED ON EMBEDDED DSP LIU Ying 1,HAN Yan-bin 2 and ZHANG Yu-lin 3 1 School of Information Science and Engineering, University of Jinan, Jinan 250022, PR China

More information