International Journal of Advanced Research in Computer Science and Software Engineering

Similar documents
A Review of Optical Character Recognition System for Recognition of Printed Text

Current Tendency of Listening FM Radio of Journalism Students in DIU

Bangla Optical Digits Recognition using Edge Detection Method

A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron

Rony Parvej s EEE. Lecture 5 & 6: MIST Special. Update: 30 April, fecabook.com/ronyiut

Handwritten Character Recognition using Different Kernel based SVM Classifier and MLP Neural Network (A COMPARISON)

ISO/IEC JTC 1/SC 2 N 4181/WG2 N4014 DATE:

MCQ ( Non-Departmental) 60 x 1/2 =30

Line Segmentation and Orientation Algorithm for Automatic Bengali License Plate Localization and Recognition

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition

Syllabus Class IV Notes:

Implementation of Text to Speech Conversion

Compression Method for Handwritten Document Images in Devnagri Script

Recognition System for Pakistani Paper Currency

MAV-ID card processing using camera images

Keyword: Morphological operation, template matching, license plate localization, character recognition.

Abstract. Most OCR systems decompose the process into several stages:

A NOVEL APPROACH FOR CHARACTER RECOGNITION OF VEHICLE NUMBER PLATES USING CLASSIFICATION

Iraqi Car License Plate Recognition Using OCR

Matlab Based Vehicle Number Plate Recognition

A Novel Approach for Image Cropping and Automatic Contact Extraction from Images

Autumn 2015 Vol. 02 Issue 03 01st October, 2014 EDITORIAL: WHY IIEST NEEDS TO DEVELOP A CAMPUS MASTER PLAN. Editorial Team This Issue.

Localization of License Plates from Surveillance Camera Images: A Color Feature Based ANN Approach

Vehicle Number Plate Recognition with Bilinear Interpolation and Plotting Horizontal and Vertical Edge Processing Histogram with Sound Signals

Proposed Method for Off-line Signature Recognition and Verification using Neural Network

VEHICLE LICENSE PLATE DETECTION ALGORITHM BASED ON STATISTICAL CHARACTERISTICS IN HSI COLOR MODEL

Extraction and Recognition of Text From Digital English Comic Image Using Median Filter

An Hybrid MLP-SVM Handwritten Digit Recognizer

Addis Ababa University School of Graduate Studies Addis Ababa Institute of Technology

Keywords OCR, Scripts, Hierarchical Classification, Contour, Projections.

Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts

Image Finder Mobile Application Based on Neural Networks

CHARACTERS RECONGNIZATION OF AUTOMOBILE LICENSE PLATES ON THE DIGITAL IMAGE Rajasekhar Junjunuri* 1, Sandeep Kotta 1

Chapter 6. [6]Preprocessing

Implementation of License Plate Recognition System in ARM Cortex A8 Board

World Journal of Engineering Research and Technology WJERT

Student: Nizar Cherkaoui. Advisor: Dr. Chia-Ling Tsai (Computer Science Dept.) Advisor: Dr. Eric Muller (Biology Dept.)

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 05, 2016 ISSN (online):

Number Plate Recognition Using Segmentation

An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi

Bangladeshi Vehicle Digital License Plate Recognition for Metropolitan Cities Using Support Vector Machine

Optical Character Recognition for Hindi

Multi-Script Line identification from Indian Documents

Exercise questions for Machine vision

Text Extraction from Images

Figure 1. Artificial Neural Network structure. B. Spiking Neural Networks Spiking Neural networks (SNNs) fall into the third generation of neural netw

Study and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for feature extraction

Volume 7, Issue 5, May 2017

PRODUCT RECOGNITION USING LABEL AND BARCODES

PHASE PRESERVING DENOISING AND BINARIZATION OF ANCIENT DOCUMENT IMAGE

FACE RECOGNITION USING NEURAL NETWORKS

Mobile Based Application to Scan the Number Plate and To Verify the Owner Details

The Real Time Vechicle License Plate Identification System

Contents 1 Introduction Optical Character Recognition Systems Soft Computing Techniques for Optical Character Recognition Systems

AUTOMATIC LICENSE PLATE RECOGNITION USING IMAGE PROCESSING AND NEURAL NETWORK

AUTOMATIC IRAQI CARS NUMBER PLATES EXTRACTION

International Conference on Computer, Communication, Control and Information Technology (C 3 IT 2009) Paper Code: DSIP-024

R. K. Sharma School of Mathematics and Computer Applications Thapar University Patiala, Punjab, India

Number Plate Detection with a Multi-Convolutional Neural Network Approach with Optical Character Recognition for Mobile Devices

AN EFFICIENT THINNING ALGORITHM FOR ARABIC OCR SYSTEMS

Artificial Neural Networks. Artificial Intelligence Santa Clara, 2016

Image Extraction using Image Mining Technique

AUTOMATIC NUMBER PLATE DETECTION USING IMAGE PROCESSING AND PAYMENT AT TOLL PLAZA

Libyan Licenses Plate Recognition Using Template Matching Method

INDIAN VEHICLE LICENSE PLATE EXTRACTION AND SEGMENTATION

Motion Detection Keyvan Yaghmayi

######################################################################

Automatic Licenses Plate Recognition System

Brain Tumor Segmentation of MRI Images Using SVM Classifier Abstract: Keywords: INTRODUCTION RELATED WORK A UGC Recommended Journal

Identification of Fake Currency Based on HSV Feature Extraction of Currency Note

An Improved Bernsen Algorithm Approaches For License Plate Recognition

A New Character Segmentation Approach for Off-Line Cursive Handwritten Words

Digital Image Processing Lec.(3) 4 th class

Recursive Text Segmentation for Color Images for Indonesian Automated Document Reader

Multiple-Layer Networks. and. Backpropagation Algorithms

A Chinese License Plate Recognition System

A Comprehensive Survey on Kannada Handwritten Character Recognition and Dataset Preparation

MINE 432 Industrial Automation and Robotics

Indian Coin Matching and Counting Using Edge Detection Technique

Number Plate Recognition System using OCR for Automatic Toll Collection

Skeletonization Algorithm for an Arabic Handwriting

Smart License Plate Recognition Using Optical Character Recognition Based on the Multicopter

Automated License Plate Recognition for Toll Booth Application

A Training Based Approach for Vehicle Plate Recognition (VPR)

Automatic Car License Plate Detection System for Odd and Even Series

Segmentation of Microscopic Bone Images

Analysis and Identification of Rice Granules Using Image Processing and Neural Network

Image Processing and Artificial Neural Network techniques in Identifying Defects of Textile Products

A new seal verification for Chinese color seal

FPGA based Real-time Automatic Number Plate Recognition System for Modern License Plates in Sri Lanka

Hand & Upper Body Based Hybrid Gesture Recognition

Automatics Vehicle License Plate Recognition using MATLAB

Indexed Color. A browser may support only a certain number of specific colors, creating a palette from which to choose

Finger print Recognization. By M R Rahul Raj K Muralidhar A Papi Reddy

Estimation of Moisture Content in Soil Using Image Processing

Real Time ALPR for Vehicle Identification Using Neural Network

AN EFFICIENT APPROACH FOR VISION INSPECTION OF IC CHIPS LIEW KOK WAH

IJRASET 2015: All Rights are Reserved

DISEASE DETECTION OF TOMATO PLANT LEAF USING ANDROID APPLICATION

Optical Character Recognition with Neural Network

Transcription:

Volume 2, Issue 11, November 2012 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Handwritten Bangla Character Recognition Using Neural Network Md. Alamgir Badsha * Department of CSE, Chittagong University of Engineering and Technology, Chittagong- 4349, Bangladesh Md. Akkas Ali* Lecturer, Dept. of CSE & IT, University of Information Technology and Sciences Dhaka-1212, Bangladesh Dr. Kaushik Deb Department of CSE, Chittagong University of Engineering and Technology, Chittagong- 4349, Bangladesh Md. Nuruzzaman Bhuiyan Lecturer, Dept. of CSE & IT, University of Information Technology and Sciences Dhaka-1212, Bangladesh Abstract This paper presents an off-line recognition system for Bangla handwritten characters using Backpropagation Feed-forward neural network. Character is identified by analysing its shape and comparing its features that distinguishes each character. Also an analysis was carried out to determine the number of hidden layer nodes to achieve high performance of Back-propagation network in the recognition of handwritten Bangla characters. First, we try to create binary image. Then, extract the feature and form input vector. Then, apply the input vector in the neural network. The experimental result has analysed and result shows that the proposed recognition method gives 84% accuracy and less computational cost than other method. fed- Keywords Bangla Character Recognition, Feature Extraction, Pre-processing, RGB image, Pixel grabbing, forward neural network. I. INTRODUCTION For Optical character recognition (OCR) is the translation of optically scanned bitmaps of printed or written text characters into character codes, such as ASCII. This is an efficient way to turn hard-copy materials into data files that can be edited and otherwise manipulated on a computer [1]. This is the technology long used by libraries and government agencies to make lengthy documents quickly available electronically. Advances in OCR technology have spurred its increasing use by enterprises. For many document-input tasks, OCR is the most cost-effective and speedy method available [2]. And each year, the technology frees acres of storage space once given over to file cabinets and boxes full of paper documents. Before OCR can be used, the source material must be scanned using an optical scanner (and sometimes a specialized circuit board in the PC) to read in the page as a bitmap (a pattern of dots) [3]. Software to recognize the images is also required. The OCR software then processes these scans to differentiate between images and text and determine what letters are represented in the light and dark areas. Older OCR systems match these images against stored bitmaps based on specific fonts. The hit-or-miss results of such pattern-recognition systems helped establish OCR s reputation for inaccuracy. Today s OCR engines add the multiple algorithms of neural network technology to analyze the stroke edge, the line of discontinuity between the text characters, and the background [4]. Allowing for irregularities of printed ink on paper, each algorithm averages the light and dark along the side of a stroke, matches it to known characters and makes a best guess as to which character it is. The OCR software then averages or polls the results from all the algorithms to obtain a single reading. II. OBJECTIVE To presents a new technique for Handwritten Bangla Character Recognition system. Also reduce the recognition problem and the computational time. The aim of Bangla character recognition is to provide an efficient and accurate mechanism to recognize the handwritten bangle character. Our aim is to recognize 50 basic bangle characters using neural network. The block diagram of our proposed model is given below: III. PROPOSED METHOD A. Process of Character Recognition system Basically we can divide whole process of Handwritten character Recognition system is two major parts: 1. Pre-processing and Feature Extraction. 2. Recognize with Neural network. 2012, IJARCSSE All Rights Reserved Page 307

A.1. Pre-processing and Feature Extraction Here, we have prepared the input image for feature extraction and then extract the feature of it. Using these features we have prepared an input matrix for Feed-forward neural network. Start Input image Resized input image Convert resized RGB image into gray scale images Convert the Gray scale images to Binary images Feature extraction Generate input matrix Convert this matrix into 25*25 binary matrixes Feed-forward neural network Target Matrix Output formatter Output Fig. 1 Our proposed model End A.1.1. Resizing the captured image Here the captured image size is so high i.e. high resolution. So, the size of the input image must be reduced. The reduction is done so carefully that the aspect ratio remains same. For example, if the size of an input image is 640 480 then the aspect ratio of it is 4/3. If we preserve the aspect ratio and try to reduce the size of the image, the output of this process may be 400 300 or, 300 225 or, 100 75 image [5]. We are very careful about the preservation of aspect ratio because if the aspect ratio is not preserved the shape of the image will change. A.1.2. RGB to Gray converter It is the process converts the resized RGB image into Gray scale image. We need to do this conversion because we need Binary image for operation and the Binary image can be constructed from Gray scale image by applying thresholding operation [6]. During the thresholding process, individual pixels in an image are marked as "object" pixels if their value is greater than some threshold value (assuming an object to be brighter than the background) and as "background" pixels otherwise. This convention is known as threshold above. Variants include threshold below, which is opposite of threshold above; threshold inside, where a pixel is labelled "object" if its value is between two thresholds; and threshold outside, which is the opposite of threshold inside. Typically, an object pixel is given a value of 1 while a background pixel is given a value of 0. Finally, a binary image is created by coloring each pixel white or black, depending on a pixel's labels. A.1.3. Gray scale to Binary image converter A binary image is a digital image that has only two possible values for each pixel. Typically the two colors used for a binary image are black and white though any two colors can be used. The color used for the object(s) in the image is the foreground color while the rest of the image is the background color. Thresholding method is used to separate object and background, which is divided image into two modes. The way to resolve both categories is by assigning a thresholding value T. Each point (x, y) which have value f(x, y) T is called foreground object, and each point (x, y) which have value f(x, y) > T is called background object. A thresholded image g(x, y) is defined as, g(x, y) = { 0 ; If f(x, y) > T 1; If f(x, y) <= T 2012, IJARCSSE All Rights Reserved Page 308

Pixel which have value 1 correspond the object and pixel which have value 0 correspond background. T is a constants, this approach is called global thresholding. A.1.4. Feature extraction Next and the most important feature of Bangla character recognition is feature extraction. In this system I am considering a few steps for extracting a vector. Our main target is finding a vector from the image. So image is processed and then binary image is created [7]. So we have only 2 types of data on the image. Those are 1 for the white space and 0 for the black space. Now we have to pass the following steps for creating input vector for a particular character or image. Those are: 1. Pixel grabbing 2. Finding probability of making square A.1.4.1. Pixel grabbing from image As we are considering binary image and we also fixed the image size, so we can easily get 250 X 250 pixels from a particular image containing Bangla character or word. One thing is clear that we can grab and separate only character portion from the digital image. In specific, we took a Bangla character contained image. And obviously it s a binary image. A.1.4.2. Finding probability of making square Now we are going to sample the entire image into a specified portion so that we can get the vector easily. We specified an area of 25 X 25 pixels. For this we need to convert the 250 X 250 image into the 25 X 25 area. So for each sampled area we need to take 10 X 10 pixels from binary image. We can give a short example for that. Table 2 is the original binary image of 25 X 15 pixels. We need to sample it 5 X 3 pixels area. So, for each area we will consider 5 X 5 pixel from the binary image. Table 1 will show how pixels are classified for finding the probability of making square. TABLE I INITIAL PIXEL AND SEPARATING THE PIXEL Initial pixel data from image Separating the pixels 0000011000001000000000000 0000000000000000111000010 0100000111001000111000100 0010011111000100111000000 0000000111000010000000000 0111000000110001100100110 0100000000011111001100000 0011100000110001100100000 0111100000001111001101110 0001100000001111100100001 00000 11000 00100 00000 00000 00000 00000 00000 01110 00010 01000 00111 00100 01110 00100 00100 11111 00010 01110 00000 00000 00111 00001 00000 00000 01110 00000 11000 11001 00110 01000 00000 01111 10011 00000 00111 00000 11000 11001 00000 01111 00000 00111 10011 01110 00011 00000 00111 11001 00001 A.1.5. Target Matrix Generator We can easily generate the target matrix for feed-forward neural network. It is 50x 50 identity matrixes. The diagonal values are 1 and the value of remaining elements is 0. In each column there is only one element which the value is 1. It represents that the output neuron corresponding, this row number in on and other neurons are off. One output column is for input row. B. Recognized using neural network We have used feed-forward neural network. The neural network has three layers. The first layer takes input in of 25*25 matrixes. The second layer is hidden layer. It consists of 10 neurons. The third layer is output layer. It consists of 50 neurons. These neurons are called output neurons. The number of neuron in the hidden is chosen so carefully that the performance of the neural network is maximized. C. Output Formatter The output formatter takes the output of the simulation as input and conversion it into human understandable form. For this it uses a thresholding process. It takes 0.5 as threshold value and converts the simulated matrix into 1 0 binary matrix. If the value of an element of simulated matrix is less than 0.5 it is set to 0, otherwise the value is set to 1 [8]. We can easily identify the output of corresponding input from this matrix. We can also calculate the recognition rate from the matrix. The formula is given below: Recognition rate= (number of 1 s in the diagonal / total number of input) x100% IV. EXPERIMENTAL RESULTS 2012, IJARCSSE All Rights Reserved Page 309

We have used captured image for input image. We write down code on an M-file and execute it using MATLAB command prompt. Now we describe the experimental result in the following section of the input-output result. A. Implementation of Character recognition system The implementation of character recognition system, image acquisition and pre-treatment is the primary step of our implementation. The result of this step is given in the following description. A.1. Captured image A.2. Input Image Fig. 2 Captured image Fig.3 Input image A.3. Output of Gray to Binary conversion Fig. 4 Gray to Binary conversion A.4. Output of Feature Extraction Part Fig. 5 Output of Feature Extraction part B. Input-Output Analysis TABLE II INPUT-OUTPUT ANALYSIS Input Characters Recognized Characters অ অ আ ই ঈ উ ঊ ঋ এ 2012, IJARCSSE All Rights Reserved Page 310 আ ই ঈ উ ঋ এ

2012, IJARCSSE All Rights Reserved Page 311 ঐ ঐ ও ও ঔ ক ক খ খ গ গ ঘ ঘ ঙ চ চ ছ ছ জ জ ঝ ঝ ঞ ঞ ট ঠ ঠ ড ঢ ঢ ণ ণ দ দ ধ ধ ন ন প প ফ ফ ব ভ ভ ম য য র র ল ল ড় ড় ঢ় ৎ ৎ Input Character Recognized Character C. Recognized Rate Recognized rate: (42/50) x 100% = 84%

V. CONCLUSION In this paper we show faster method for recognizing bangle handwritten character using neural network. The recognition rate of our method is remarkable. The software is implemented by using MATLAB. The input images are taken by a camera and it must be in.jpg format. After, extract feature of binary image it apply into input layer of fed-forward neural network and take output from the output layer. REFERENCES [1] Velappa Ganapathy, and Kok Leong Liew, Handwritten Character Recognition Using Multiscale Neural Network Training Technique, World Academy of Science, Engineering and Technology 39 2008. [2] G.G Rajput, Fourier Descriptor based Isolated Marathi Handwritten Numeral Recognition, Gulbarga University, India, June 2010. [3] Adnan Mohammad Shoeb Shatil, Research Report on Bangla Optical Character Recognition Using Kohonen network, Center for Research on Bangla Language Processing, BRAC University Dhaka, Bangladesh. [4] Hasan Mohammad Kafi, Sign Alphabet and Digits Recognition Using Neural network, 2010. [5] Anik Saha, A Neural Approach in Bangla Handwritten Text Recognition, 2008. [6] Ety Dey, Recognition Bangla and English Text from the same Document, July 2009. [7] Mohammad Sirajul Islam and S.M. Wahidur Rahman, Bangla character Recognition, 2004. [8] Nibaran Das, Brindaban Das, Ram Sarkar, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri, Handwritten Bangla Basic and Compound character recognition using MLP and SVM classifier, February 2010. 2012, IJARCSSE All Rights Reserved Page 312