Camera Based Hand Gesture Recognition Daniel Snowden B.Sc. Computing with Artificial Intelligence 2005/2006
|
|
- Dwain Blankenship
- 5 years ago
- Views:
Transcription
1 Camera Based Hand Gesture Recognition Daniel Snowden B.Sc. Computing with Artificial Intelligence 2005/2006 The candidate confirms that the work submitted is their own and the appropriate credit has been given where reference has been made to the work of others. I understand that failure to attribute material which is obtained from another source may be considered as plagiarism. (Signature of student)
2 Summary The minimum requirement was to develop a system capable of recognising at least 2 gestures from at least 1 user. It has achieved in this regard. The system uses histograms of motion history images as templates for gestures. A similar MHI is built from the live input and a histogram of the image is compared against the templates. The system itself does produce recognitions and does not detect many gestures, although these are likely to be optimisation and template issues. In addition there were hardware and software issues which slowed the development of this project. The following report describes the development of this system. i
3 Acknowledgements I d like to thank my supervisor, Andy Bulpit, for his help throughout this project. I d also like to thank Chris Needham and Andrew Bennett (Leeds Vision Group) for their assistance in resolving technical issues surrounding the webcam and image processing library. And of course I d like to thank my family and friends who put up with me constantly talking about Computer Vision. ii
4 Contents 1 Introduction Background Literature Review Computer Vision Tracking Motion History Images Template Matching Markov Markov Models Recursive Induction Bayesian Networks Software Development Languages Development Methodologies System Design Vision Techniques Background Subtraction Image Segmentation Motion History Images Histograms Comparison iii
5 4 System Implementation Platforms Video Input Image Processing Library Vision Techniques Background Subtraction Image Segmentation Motion History Images Histograms Training Comparison Results and Testing Evaluation Performance of the System Potential Future Improvements Generation of Templates Optimisation of Recognition Threshold Bibliography 17 iv
6 Chapter 1 Introduction 1.1 Background to the Problem Computer Vision is an exciting field of research, providing the potential for creating new ways of interacting with computers. The aim of this project is to develop an application capable of recognising a range of gestures from a single user, through computer vision techniques. A requirement of this project is to produce a low cost, real-time solution. A potential use for the technology is to allow users to use gestures to interact with a virtual document, for example by pointing at an area to select it. The School of Computing has a number of Phillips web cams, one of which is to be used during this project. It is intended to develop a system which can use a video source (live video input or pre-recorded) to recognise at least 2 gestures from a single user. 1
7 Chapter 2 Literature Review 2.1 Computer Vision Tracking One of the tracking methods reviewed is the adaptive background mixture model[6] which can be used for real time tracking (Also known as the Grimson and Stauffer tracker). This is a method based on the common background subtraction methods, sometimes used in vision systems,citetowardsmodel. What is different about the adaptive background mixture model is that each pixel is modelled as a mixture of Gaussians, built up over a period of time and evaluated to determine what is and is not in the background. The computational power required for the algorithm could prove to be a limiting factor and as a result, only a small number of Gaussians may be able to be used. This may not be as problematic as it would first sound, due to the fact that the program will not be run on particularly noisy scenes (The program will be used in controlled situations, changes in lighting and scenary are not expected). There could be great benefit to using a tracking system which does not require human intervention when being initialised. Such a method is discussed by Yuan, Sclarlof and Athisos. They use a probabilistic approach, which optimises itself[9]. While this approach could certainly prove useful, it may take too long to implement. 2
8 2.1.2 Motion History Images The motion history image is a technique which allows the successive layering of image silhouettes are layered in to a template image. Each silhouette decreases in value with every successive frame[3] Template Matching A form of template matching is used by Bobick and Davis [2]. Their technique uses the previously mentioned motion history images, their approach involves comparing motion history and motion energy images against similar images which represent templates of known actions Markov Markov Models A number of different approaches to implementing hidden Markov models have been studied. These include pseudo 3-D models and human anatomy based coupled models,[7]. Both methods involve adapting the models to the specifics of the movement of human anatomy, i.e. the degrees of freedom of the human hand Recursive Induction One method studied is recursive induction learning, a method the program develops its own learning model from examples[10], these examples can be supplied in the initialisation. A feature of this approach is the development of decision trees, based on the degrees of freedom of the human hand. An approach which could prove useful considering that the degrees of freedom makes gesture recognition in the hand a difficult task Bayesian Networks A Bayesian framework based approach could be considered[11] as the approach described by Zhou and Huang has a relatively low hardware requirement (program was run on a 1GHz CPU), and it is readily able to track a hand in a relatively cluttered scene. 3
9 2.2 Software Development Languages The only programming language research has been in to understanding the usage of C++, rather than to decide on a language to use. The reason for this is that the project will be using the Leeds Real Time Image Library 1, which is written in the C++ language, which means that the program will need to be developed in the same language. A guide to using program libraries in C++ has been found, and will be used to integrate the image processing library in to the program to be built[8]. Static libraries are to be used rather then dynamic libraries, as this will enable to program to run on systems without the library (as the program can carry it s own statically linked copy), as portability may be required in the testing, rather than development stages Development Methodologies There are few development methodologies which are suitable for a project such as this, due to the experimental nature of computer vision. Traditional methodologies such as the waterfall model are not appropriate to a project such as this. A form of rapid prototyping is to be used (with an iterative approach), as this will allow the flexibility to make alterations to the program and the CV algorithms used. Prototyping can be used with the waterfall model or with other development approaches, this approach allows the development of functionality early on in the project and allows the project to be perfected through a number of iterations with good risk control[1]
10 Chapter 3 System Design 3.1 Vision Techniques Background Subtraction The background subtraction is provided by the Background Modelling Methods library from the Leeds Vision Group. (For SOC students, at the time of writing, this is available at ṽislib/libs/) Image Segmentation The resulting binary image from the background subtraction is to be run through an edge detector, this provides an outline of the hand. As the background subtraction should produce a white image of a hand on a black background, this technique should prove effective Motion History Images The resulting binary image from the image segmentation is to be added to a second image. As well as adding new images, a decay rate is factored in. This way, new motion appears the brightest while older motion gradually decays to black. This is relatively simple to implement in a greyscale image as each pixel is assigned a value from The binary image from the edge detector produces values of 0 5
11 Figure 3.1: The Overall Design Of The System 6
12 for(i=0; i<width; i++) { for(j=0; j<height; j++) { } } Figure 3.2: Iteration through all pixels in an image histogram a histogram c histogram e histogram g histogram b histogram d histogram f histogram h Figure 3.3: Division of MHI for no edges, and 255 for edges, therefore it is simple to build in a decay rate (based on subtracting a specific pixel value each iteration) Histograms An approach is used, which is based on work by Aaron Bobick and James Davis [2]. 1 dimensional histograms of pixel intensity are to be used to recognise the gestures. 1D histograms are used as greyscale images use a single value of pixel intensity (0-255). A histogram class is provided in the Leeds Real Time Image Processing Library. (If RGB or similar colourspaces were used, a 2 or 3d histogram would be used) 8 histograms are to be used, with the motion history images being split into 8 sections (4 rows, 2 columns), the contents of each one added to the appropriate histogram. A similar process is to be carried out through a separate training program. This program will build histograms for each gesture and the histograms saved for future comparison. This approach should not have problems associated with colour variations, because the histograms are built from the MHI, with this being a greyscale image (with intensity linked to time). In addition, histograms are invariant to rotation (due to the fact that the histograms only store the frequency of pixel 7
13 values, not the position). The pixel values are grouped in to a number of bins. It is hoped that clustering the pixels in this way will help limit the effects of minor variations in the pixel values and clean up the values Comparison The comparison will take place by calculating the mean difference between the histograms of templates and the current MHI. The templates can then be ranked according how close they are to the current image. The closest template will be chosen as the current gesture. A failsafe will be required to ensure that the system will know that the user is not making a gesture. 2 possibilities are to have a default template for no gestures or set a maximum deviation from the template whereby any template which exceeds it will be ignored. It is intended to test both techniques. 8
14 Chapter 4 System Implementation During the implementation of this project, it was decided to use the Eclipse IDE. The reason for this was my familiarity with Eclipse and the existence of a C/C++ development plugin which included build control (though GNU make). 4.1 Platforms Video Input The Phillips web cam caused a number of problems with building the system. For an as yet unknown reason, there was no ability to disable the automatic gain control. This had the result of the camera adjusting focus as the light levels differed. In turn, this meant that objects moving in a scene caused changes in focus. Because the images were changing constantly, this made it difficult to seperate foreground and background. The reason for this is that changes in lighting produce pixel values which are different from the background image and are therefore assumed to be foreground. This problem does not occur on SOC computers. As a result, in the later stages of the project, I was allowed access to the Vision Lab. This allowed data to be gathered without the automatic gain control issue. Another ongoing problem with the Phillips web cam is compatibility with operating systems. Although a driver exists, it was not compatible with Fedora Core 5. This was only a minor compatibility 9
15 Figure 4.1: Variations in lighting caused there to be many differences between the current image and the mean image problem as I was able to restore previous installations of version 4 on my home computers Image Processing Library A substantial amount of time was spent building the library to the detriment of other components of the project. 4.2 Vision Techniques Background Subtraction As mentioned in the previous section, the automatic gain control was an issue with the background subtraction, producing peculiar results. The binary image resulting from the edge detection should produce black pixels for the background, with white pixels showing the foreground. The library provided 2 trackers, a tracker based on single Gaussian hypothesis and a Grimson and Staufer tracker based on multiple Gaussian hypothesis[6]. It was decided to use the single Gaussian 10
16 (a) Background Subtraction (b) After Edge Detection Figure 4.2: Comparison of background subtraction with resulting sobel edge detection tracker as the Grimson and Stauffer tracker proved to be very computationally expensive Image Segmentation Although the Leeds Real Time Image Processing Library provides an edge detector, unfortunately it is a Sobel edge detector. A Canny edge detector would have been preferable due to the fact that it first applies a gradient edge detector, then finds edge pixels through non maximal suppression and hysteresis tracking. This is a more sophisticated technique, producing a more optimal image [5, 4]. Nonetheless, the sobel edge detector proved to be remarkably resilient, this could be due to the fact that the background subtraction produced a relatively clean image. A diagnostic mode was included in the application, which directed the output of the various stages of image processing to display windows. This enabled the performance of individual components of the system to be viewed. These were the background subtraction (image difference and mean image), edge detection and the motion history image Motion History Images The motion history images were not as easily implemented as the design in the previous chapter. The first problem was that the initial approach used a set of nested loops designed to iterate through the width and height. This approach proved to be computationally expensive. A faster approach was found to involve a single loop, iterating over the image using pointers rather than assignment. The use of pointers provided a substantial benefit in increasing the speed of the system. 11
17 Figure 4.3: Video output of the various stages of image processing The reason why this is important is that the library processes video in real time. As a result, delays in processing can result in obsolete data being written to the motion history image. Not all of the effects of the delay were obvious. One example being a ghosting effect, where an object in a scene (e.g. a hand) decays but reappears moments later. Another change from the initial design was the way decay functioned. As well as being slow, the subtraction method also tended to produce a ghosting effect, where pixels count down to a particular value but remain at that point as it is lower than the threshold value. It was decided to multiply the pixel values by a decay rate (with the decay rate being a value lower than 1, 0.8 was chosen) Histograms There were serious problems encountered when implementing the histograms. Due to delays at various stages, I was left with far less time than I had hoped for at this stage. The problem that emerged was that only 3 histograms were being populated. The histograms in this case were histograms a, b and c. Because the program iterates through each pixel, through width then height, it can be concluded that these are the first 3 histograms to be populated by the program. (The program treats the image as a 1d array of length w*d) At this stage, it was impossible to determine what exactly was causing this loss of functionality but 12
18 it is possible that it was the result of a computationally expensive loop and nested if statements. In the final version, a single histogram was used for each image because of the problems mentioned. I decided to make this modification as there was little time remaining to repair the program Training It was decided to use a separate training program to build the histograms for the templates. The histograms were cleared at each iteration and saved to file after being built. This allows the final template to be chosen by simply closing the program. Although this is not a particularly advanced technique, it has proven adequate for building the templates Comparison Development of the comparison mechanism was slower than expected. Rather than use a template to recognise no gesture, I decided to use the absolute difference calculated between the video histogram and templates. The way this worked was by creating a threshold, where any absolute difference which exceeds it is discounted. Choosing a reasonable value is a difficult process, however the testing mechanism helped considerably. In a file, the intended gesture, detected gesture and absolute difference were recorded. It was a simple matter to examine the differences present in false positives and to reduce the threshold accordingly. Of course on some occasions this prevented gestures from being picked up Results and Testing As mentioned, a file was used to record the data. The reason was to create a blind test, where seeing results does not affect the way I constructed gestures. A mechanism was put in place where I could simply click on the window with the LMB to state that point is intended and RMB for clench. 13
19 Chapter 5 Evaluation 5.1 Performance of the System The system was tested by using a selector which indicated the intended gesture. This is outputted to a file along with the recognised gesture and the absolute difference between the template histogram and the live video histogram. This data could then be used to create a cut-off value, where all differences that exceed it are ignored. In the final version of the program tested, 18 gestures were recognised. Out of those, 8 were false. 2 gestures were used in this test, a user pointing their hand and clenching their fist. Figure 5.1: An Example MHI Used For Construction of Histogram 14
20 no. Intended Gesture Detected Absolute Diff 1 clench point clench clench clench clench clench point clench point clench clench clench point clench point clench point point point point point point point point clench point clench clench clench clench clench clench clench clench clench Figure 5.2: Results of Testing 15
21 It should be noted that this test did not include the intended gestures that were not picked up (false negatives). It is highly likely that the poor quality results were partly the result of poor templates. Unfortunately there was not much time to develop more templates and a better approach to template generation. 5.2 Potential Future Improvements The system developed is very crude and there are many possible improvements that can be made Generation of Templates The generation of templates involved building a histogram of a single gesture, a more effective system that could be developed may involve mean histograms. By that I mean building histograms of several attempts at the same gesture and calculating the mean. This would provide a more robust template as the quality of the input gesture effects the performance of the entire system Optimisation of Recognition Threshold This is a reference to the cut-off value to prevent incorrect recognition. A future system could optimise this itself. The way this could be developed could involve using the intended gesture flags to recognise false positives. It could also be used to recognise false negatives. Using this approach, the system could be extended to decrease the threshold when it detects false positives (reduce the value of the minimum absolute difference) and raise it when it detects false negatives. It would have to use small increments of course. 16
22 Bibliography [1] Comparison of software development methodologies. crosstalk/1995/01/comparis.asp. Last Visited 07/12/2005. [2] Aaron Bobick and James Davis. Real-Time Recognition of Activity Using Temporal Templates [3] Gary R. Bradski and James W. Davis. Motion segmentation and pose recognition with history gradients. Machine Vision and Applications, pages , [4] Bob Fisher, Simon Perkins, Ashley Walker, and Eric Wolfart. Canny edge detector. http: // [5] Bob Fisher, Simon Perkins, Ashley Walker, and Eric Wolfart. Edge detectors. cee.hw.ac.uk/hipr/html/edgdetct.html. [6] Chris Stauffer and W.E.L Grimson. Adaptive Background Mixture Models for Real-Time Tracking. In IEEE Computer Vision and Pattern Recognition, pages , [7] Gerhard Rigoll Stefan Muller, Stefan Eickeler. Crane gesture recognition using pseudo 3-d hidden markov models. In Fourth IEEE Conference on Automatic Face and Gesture Recogntition, pages 1 4, [8] David A. Wheeler. Program library howto. program-library/. Last visited 03/12/2005. [9] Quan Yuan, Stan Sclaroff, and Vassilis Athisos. Automatic 2d hand tracking in video sequences. Technical report, Computer Science Department, Boston University, Available from http: // 17
23 [10] Meide Zhao, Francis K.H. Quek, and Xindong Wu. Recursive induction learning in hand gesture recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20: , Available from csdl.computer.org/comp/trans/tp/1998/ 11/i1174abs.htm. [11] Hanning Zhou and Thomas S. Huang. A bayesian framework for real-time 3d hand tracking in high clutter background. In The International Conference on Human Computer Interaction, volume 10, pages 1 7, Available from hzhou/pub/hcii03.pdf. 18
24 Appendix A - Personal Reflection The final year project is a piece work which I had dreaded for almost 2 years. After working on this project for almost a year, I can safely say that it has been an amazing learning experience as it is an individual piece of work with none of the rigid limitations seen in assignments and module criteria. This is the first piece of work, of this type, where I have had such a level of academic freedom. This has also been the single hardest piece of work I ve ever done. Even with that being the case, I have no regrets about choosing this particular topic. This project has helped me develop several important skills, including programming and analytical/problem solving skills as well as developing a better understanding of computer vision techniques. My greatest weakness has probably been time management and balancing this project with other assignments, which unfortunately, left little time for implementing the later stages of the project. I have found that it is easy to get caught up on a few specific components and to loose sight of the bigger picture. In this sense, many of the problems which occurred were my own fault. Of course there were problems which were outside my scope of control. For example, the problems I mentioned in the implementation chapter with regards to the camera. In this sense, it has been a valuable learning experience and in future, I will certainly make provisions for hardware/software problems with tasks. This certainly was a task I was working on until the 11th hour. In closing, I feel I have learned a lot from this project and having the opportunity to develop a Computer Vision project has added to what has already been a fascinating and enjoyable year. 19
Real-Time Face Detection and Tracking for High Resolution Smart Camera System
Digital Image Computing Techniques and Applications Real-Time Face Detection and Tracking for High Resolution Smart Camera System Y. M. Mustafah a,b, T. Shan a, A. W. Azman a,b, A. Bigdeli a, B. C. Lovell
More informationA Real Time Static & Dynamic Hand Gesture Recognition System
International Journal of Engineering Inventions e-issn: 2278-7461, p-issn: 2319-6491 Volume 4, Issue 12 [Aug. 2015] PP: 93-98 A Real Time Static & Dynamic Hand Gesture Recognition System N. Subhash Chandra
More informationWheeler-Classified Vehicle Detection System using CCTV Cameras
Wheeler-Classified Vehicle Detection System using CCTV Cameras Pratishtha Gupta Assistant Professor: Computer Science Banasthali University Jaipur, India G. N. Purohit Professor: Computer Science Banasthali
More informationMaking PHP See. Confoo Michael Maclean
Making PHP See Confoo 2011 Michael Maclean mgdm@php.net http://mgdm.net You want to do what? PHP has many ways to create graphics Cairo, ImageMagick, GraphicsMagick, GD... You want to do what? There aren't
More informationVICs: A Modular Vision-Based HCI Framework
VICs: A Modular Vision-Based HCI Framework The Visual Interaction Cues Project Guangqi Ye, Jason Corso Darius Burschka, & Greg Hager CIRL, 1 Today, I ll be presenting work that is part of an ongoing project
More informationImage Deblurring and Noise Reduction in Python TJHSST Senior Research Project Computer Systems Lab
Image Deblurring and Noise Reduction in Python TJHSST Senior Research Project Computer Systems Lab 2009-2010 Vincent DeVito June 16, 2010 Abstract In the world of photography and machine vision, blurry
More informationCOMPARATIVE PERFORMANCE ANALYSIS OF HAND GESTURE RECOGNITION TECHNIQUES
International Journal of Advanced Research in Engineering and Technology (IJARET) Volume 9, Issue 3, May - June 2018, pp. 177 185, Article ID: IJARET_09_03_023 Available online at http://www.iaeme.com/ijaret/issues.asp?jtype=ijaret&vtype=9&itype=3
More informationIntelligent Traffic Sign Detector: Adaptive Learning Based on Online Gathering of Training Samples
2011 IEEE Intelligent Vehicles Symposium (IV) Baden-Baden, Germany, June 5-9, 2011 Intelligent Traffic Sign Detector: Adaptive Learning Based on Online Gathering of Training Samples Daisuke Deguchi, Mitsunori
More informationAn Efficient Color Image Segmentation using Edge Detection and Thresholding Methods
19 An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods T.Arunachalam* Post Graduate Student, P.G. Dept. of Computer Science, Govt Arts College, Melur - 625 106 Email-Arunac682@gmail.com
More informationRecognition Of Vehicle Number Plate Using MATLAB
Recognition Of Vehicle Number Plate Using MATLAB Mr. Ami Kumar Parida 1, SH Mayuri 2,Pallabi Nayk 3,Nidhi Bharti 4 1Asst. Professor, Gandhi Institute Of Engineering and Technology, Gunupur 234Under Graduate,
More informationCROWD ANALYSIS WITH FISH EYE CAMERA
CROWD ANALYSIS WITH FISH EYE CAMERA Huseyin Oguzhan Tevetoglu 1 and Nihan Kahraman 2 1 Department of Electronic and Communication Engineering, Yıldız Technical University, Istanbul, Turkey 1 Netaş Telekomünikasyon
More informationCOLOR IMAGE SEGMENTATION USING K-MEANS CLASSIFICATION ON RGB HISTOGRAM SADIA BASAR, AWAIS ADNAN, NAILA HABIB KHAN, SHAHAB HAIDER
COLOR IMAGE SEGMENTATION USING K-MEANS CLASSIFICATION ON RGB HISTOGRAM SADIA BASAR, AWAIS ADNAN, NAILA HABIB KHAN, SHAHAB HAIDER Department of Computer Science, Institute of Management Sciences, 1-A, Sector
More informationFigure 1. Mr Bean cartoon
Dan Diggins MSc Computer Animation 2005 Major Animation Assignment Live Footage Tooning using FilterMan 1 Introduction This report discusses the processes and techniques used to convert live action footage
More informationFOCAL LENGTH CHANGE COMPENSATION FOR MONOCULAR SLAM
FOCAL LENGTH CHANGE COMPENSATION FOR MONOCULAR SLAM Takafumi Taketomi Nara Institute of Science and Technology, Japan Janne Heikkilä University of Oulu, Finland ABSTRACT In this paper, we propose a method
More informationReal Time Video Analysis using Smart Phone Camera for Stroboscopic Image
Real Time Video Analysis using Smart Phone Camera for Stroboscopic Image Somnath Mukherjee, Kritikal Solutions Pvt. Ltd. (India); Soumyajit Ganguly, International Institute of Information Technology (India)
More informationLane Detection in Automotive
Lane Detection in Automotive Contents Introduction... 2 Image Processing... 2 Reading an image... 3 RGB to Gray... 3 Mean and Gaussian filtering... 5 Defining our Region of Interest... 6 BirdsEyeView Transformation...
More informationE90 Project Proposal. 6 December 2006 Paul Azunre Thomas Murray David Wright
E90 Project Proposal 6 December 2006 Paul Azunre Thomas Murray David Wright Table of Contents Abstract 3 Introduction..4 Technical Discussion...4 Tracking Input..4 Haptic Feedack.6 Project Implementation....7
More informationPerformance Evaluation of Edge Detection Techniques for Square Pixel and Hexagon Pixel images
Performance Evaluation of Edge Detection Techniques for Square Pixel and Hexagon Pixel images Keshav Thakur 1, Er Pooja Gupta 2,Dr.Kuldip Pahwa 3, 1,M.Tech Final Year Student, Deptt. of ECE, MMU Ambala,
More informationVision-based User-interfaces for Pervasive Computing. CHI 2003 Tutorial Notes. Trevor Darrell Vision Interface Group MIT AI Lab
Vision-based User-interfaces for Pervasive Computing Tutorial Notes Vision Interface Group MIT AI Lab Table of contents Biographical sketch..ii Agenda..iii Objectives.. iv Abstract..v Introduction....1
More informationDesign of Temporally Dithered Codes for Increased Depth of Field in Structured Light Systems
Design of Temporally Dithered Codes for Increased Depth of Field in Structured Light Systems Ricardo R. Garcia University of California, Berkeley Berkeley, CA rrgarcia@eecs.berkeley.edu Abstract In recent
More informationSMARTPHONE SENSOR BASED GESTURE RECOGNITION LIBRARY
SMARTPHONE SENSOR BASED GESTURE RECOGNITION LIBRARY Sidhesh Badrinarayan 1, Saurabh Abhale 2 1,2 Department of Information Technology, Pune Institute of Computer Technology, Pune, India ABSTRACT: Gestures
More informationAN EFFICIENT APPROACH FOR VISION INSPECTION OF IC CHIPS LIEW KOK WAH
AN EFFICIENT APPROACH FOR VISION INSPECTION OF IC CHIPS LIEW KOK WAH Report submitted in partial fulfillment of the requirements for the award of the degree of Bachelor of Computer Systems & Software Engineering
More informationEnabling Cursor Control Using on Pinch Gesture Recognition
Enabling Cursor Control Using on Pinch Gesture Recognition Benjamin Baldus Debra Lauterbach Juan Lizarraga October 5, 2007 Abstract In this project we expect to develop a machine-user interface based on
More informationImage Extraction using Image Mining Technique
IOSR Journal of Engineering (IOSRJEN) e-issn: 2250-3021, p-issn: 2278-8719 Vol. 3, Issue 9 (September. 2013), V2 PP 36-42 Image Extraction using Image Mining Technique Prof. Samir Kumar Bandyopadhyay,
More informationA Comparison of Histogram and Template Matching for Face Verification
A Comparison of and Template Matching for Face Verification Chidambaram Chidambaram Universidade do Estado de Santa Catarina chidambaram@udesc.br Marlon Subtil Marçal, Leyza Baldo Dorini, Hugo Vieira Neto
More informationFace Detection System on Ada boost Algorithm Using Haar Classifiers
Vol.2, Issue.6, Nov-Dec. 2012 pp-3996-4000 ISSN: 2249-6645 Face Detection System on Ada boost Algorithm Using Haar Classifiers M. Gopi Krishna, A. Srinivasulu, Prof (Dr.) T.K.Basak 1, 2 Department of Electronics
More informationAn Improved Method of Computing Scale-Orientation Signatures
An Improved Method of Computing Scale-Orientation Signatures Chris Rose * and Chris Taylor Division of Imaging Science and Biomedical Engineering, University of Manchester, M13 9PT, UK Abstract: Scale-Orientation
More informationSuper resolution with Epitomes
Super resolution with Epitomes Aaron Brown University of Wisconsin Madison, WI Abstract Techniques exist for aligning and stitching photos of a scene and for interpolating image data to generate higher
More informationRECOGNITION OF EMERGENCY AND NON-EMERGENCY LIGHT USING MATROX AND VB6 MOHD NAZERI BIN MUHAMMAD
RECOGNITION OF EMERGENCY AND NON-EMERGENCY LIGHT USING MATROX AND VB6 MOHD NAZERI BIN MUHAMMAD This thesis is submitted as partial fulfillment of the requirements for the award of the Bachelor of Electrical
More informationImproved SIFT Matching for Image Pairs with a Scale Difference
Improved SIFT Matching for Image Pairs with a Scale Difference Y. Bastanlar, A. Temizel and Y. Yardımcı Informatics Institute, Middle East Technical University, Ankara, 06531, Turkey Published in IET Electronics,
More informationResearch Seminar. Stefano CARRINO fr.ch
Research Seminar Stefano CARRINO stefano.carrino@hefr.ch http://aramis.project.eia- fr.ch 26.03.2010 - based interaction Characterization Recognition Typical approach Design challenges, advantages, drawbacks
More informationMorphological Image Processing Approach of Vehicle Detection for Real-Time Traffic Analysis
Morphological Image Processing Approach of Vehicle Detection for Real-Time Traffic Analysis Prutha Y M *1, Department Of Computer Science and Engineering Affiliated to VTU Belgaum, Karnataka Rao Bahadur
More informationLane Detection in Automotive
Lane Detection in Automotive Contents Introduction... 2 Image Processing... 2 Reading an image... 3 RGB to Gray... 3 Mean and Gaussian filtering... 6 Defining our Region of Interest... 10 BirdsEyeView
More informationAnalysis of Various Methodology of Hand Gesture Recognition System using MATLAB
Analysis of Various Methodology of Hand Gesture Recognition System using MATLAB Komal Hasija 1, Rajani Mehta 2 Abstract Recognition is a very effective area of research in regard of security with the involvement
More informationApplications of Flash and No-Flash Image Pairs in Mobile Phone Photography
Applications of Flash and No-Flash Image Pairs in Mobile Phone Photography Xi Luo Stanford University 450 Serra Mall, Stanford, CA 94305 xluo2@stanford.edu Abstract The project explores various application
More informationTelling What-Is-What in Video. Gerard Medioni
Telling What-Is-What in Video Gerard Medioni medioni@usc.edu 1 Tracking Essential problem Establishes correspondences between elements in successive frames Basic problem easy 2 Many issues One target (pursuit)
More informationAn Improved Bernsen Algorithm Approaches For License Plate Recognition
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) ISSN: 78-834, ISBN: 78-8735. Volume 3, Issue 4 (Sep-Oct. 01), PP 01-05 An Improved Bernsen Algorithm Approaches For License Plate Recognition
More informationDETECTION AND RECOGNITION OF HAND GESTURES TO CONTROL THE SYSTEM APPLICATIONS BY NEURAL NETWORKS. P.Suganya, R.Sathya, K.
Volume 118 No. 10 2018, 399-405 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu doi: 10.12732/ijpam.v118i10.40 ijpam.eu DETECTION AND RECOGNITION OF HAND GESTURES
More informationA software video stabilization system for automotive oriented applications
A software video stabilization system for automotive oriented applications A. Broggi, P. Grisleri Dipartimento di Ingegneria dellinformazione Universita degli studi di Parma 43100 Parma, Italy Email: {broggi,
More informationHaptic presentation of 3D objects in virtual reality for the visually disabled
Haptic presentation of 3D objects in virtual reality for the visually disabled M Moranski, A Materka Institute of Electronics, Technical University of Lodz, Wolczanska 211/215, Lodz, POLAND marcin.moranski@p.lodz.pl,
More informationIntelligent agents (TME285) Lecture 4,
Intelligent agents (TME285) Lecture 4, 20180124 Image processing for IPAs + Advanced C# programming Assignment, Stage 1 Note, again, that to complete Stage 1, you must have a discussion with us, based
More informationStatic Hand Gesture Recognition based on DWT Feature Extraction Technique
IJIRST International Journal for Innovative Research in Science & Technology Volume 2 Issue 05 October 2015 ISSN (online): 2349-6010 Static Hand Gesture Recognition based on DWT Feature Extraction Technique
More informationColour Profiling Using Multiple Colour Spaces
Colour Profiling Using Multiple Colour Spaces Nicola Duffy and Gerard Lacey Computer Vision and Robotics Group, Trinity College, Dublin.Ireland duffynn@cs.tcd.ie Abstract This paper presents an original
More informationDifferent Hand Gesture Recognition Techniques Using Perceptron Network
Different Hand Gesture Recognition Techniques Using Perceptron Network Nidhi Chauhan Department of Computer Science & Engg. Suresh Gyan Vihar University, Jaipur(Raj.) Email: nidhi99.chauhan@gmail.com Abstract
More informationOpen Access An Improved Kernel Density Estimation Approach for Moving Objects Detection
Send Orders for Reprints to reprints@benthamscience.ae 768 The Open Automation and Control Systems Journal, 2014, 6, 768-781 Open Access An Improved Kernel Density Estimation Approach for Moving Objects
More informationA new seal verification for Chinese color seal
Edith Cowan University Research Online ECU Publications 2011 2011 A new seal verification for Chinese color seal Zhihu Huang Jinsong Leng Edith Cowan University 10.4028/www.scientific.net/AMM.58-60.2558
More informationAutomatic Electricity Meter Reading Based on Image Processing
Automatic Electricity Meter Reading Based on Image Processing Lamiaa A. Elrefaei *,+,1, Asrar Bajaber *,2, Sumayyah Natheir *,3, Nada AbuSanab *,4, Marwa Bazi *,5 * Computer Science Department Faculty
More informationHISTOGRAM BASED AUTOMATIC IMAGE SEGMENTATION USING WAVELETS FOR IMAGE ANALYSIS
HISTOGRAM BASED AUTOMATIC IMAGE SEGMENTATION USING WAVELETS FOR IMAGE ANALYSIS Samireddy Prasanna 1, N Ganesh 2 1 PG Student, 2 HOD, Dept of E.C.E, TPIST, Komatipalli, Bobbili, Andhra Pradesh, (India)
More informationEyes n Ears: A System for Attentive Teleconferencing
Eyes n Ears: A System for Attentive Teleconferencing B. Kapralos 1,3, M. Jenkin 1,3, E. Milios 2,3 and J. Tsotsos 1,3 1 Department of Computer Science, York University, North York, Canada M3J 1P3 2 Department
More informationMAV-ID card processing using camera images
EE 5359 MULTIMEDIA PROCESSING SPRING 2013 PROJECT PROPOSAL MAV-ID card processing using camera images Under guidance of DR K R RAO DEPARTMENT OF ELECTRICAL ENGINEERING UNIVERSITY OF TEXAS AT ARLINGTON
More informationBlur Estimation for Barcode Recognition in Out-of-Focus Images
Blur Estimation for Barcode Recognition in Out-of-Focus Images Duy Khuong Nguyen, The Duy Bui, and Thanh Ha Le Human Machine Interaction Laboratory University Engineering and Technology Vietnam National
More informationBackground Pixel Classification for Motion Detection in Video Image Sequences
Background Pixel Classification for Motion Detection in Video Image Sequences P. Gil-Jiménez, S. Maldonado-Bascón, R. Gil-Pita, and H. Gómez-Moreno Dpto. de Teoría de la señal y Comunicaciones. Universidad
More informationLiangliang Cao *, Jiebo Luo +, Thomas S. Huang *
Annotating ti Photo Collections by Label Propagation Liangliang Cao *, Jiebo Luo +, Thomas S. Huang * + Kodak Research Laboratories *University of Illinois at Urbana-Champaign (UIUC) ACM Multimedia 2008
More informationTan-Hsu Tan Dept. of Electrical Engineering National Taipei University of Technology Taipei, Taiwan (ROC)
Munkhjargal Gochoo, Damdinsuren Bayanduuren, Uyangaa Khuchit, Galbadrakh Battur School of Information and Communications Technology, Mongolian University of Science and Technology Ulaanbaatar, Mongolia
More informationChapter 4 SPEECH ENHANCEMENT
44 Chapter 4 SPEECH ENHANCEMENT 4.1 INTRODUCTION: Enhancement is defined as improvement in the value or Quality of something. Speech enhancement is defined as the improvement in intelligibility and/or
More informationComputing for Engineers in Python
Computing for Engineers in Python Lecture 10: Signal (Image) Processing Autumn 2011-12 Some slides incorporated from Benny Chor s course 1 Lecture 9: Highlights Sorting, searching and time complexity Preprocessing
More informationKeytar Hero. Bobby Barnett, Katy Kahla, James Kress, and Josh Tate. Teams 9 and 10 1
Teams 9 and 10 1 Keytar Hero Bobby Barnett, Katy Kahla, James Kress, and Josh Tate Abstract This paper talks about the implementation of a Keytar game on a DE2 FPGA that was influenced by Guitar Hero.
More informationDetection of License Plates of Vehicles
13 W. K. I. L Wanniarachchi 1, D. U. J. Sonnadara 2 and M. K. Jayananda 2 1 Faculty of Science and Technology, Uva Wellassa University, Sri Lanka 2 Department of Physics, University of Colombo, Sri Lanka
More informationgo1984 Performance Optimization
go1984 Performance Optimization Date: October 2007 Based on go1984 version 3.7.0.1 go1984 Performance Optimization http://www.go1984.com Alfred-Mozer-Str. 42 D-48527 Nordhorn Germany Telephone: +49 (0)5921
More informationCombined Approach for Face Detection, Eye Region Detection and Eye State Analysis- Extended Paper
International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 10, Issue 9 (September 2014), PP.57-68 Combined Approach for Face Detection, Eye
More informationCSC 320 H1S CSC320 Exam Study Guide (Last updated: April 2, 2015) Winter 2015
Question 1. Suppose you have an image I that contains an image of a left eye (the image is detailed enough that it makes a difference that it s the left eye). Write pseudocode to find other left eyes in
More informationStereo-based Hand Gesture Tracking and Recognition in Immersive Stereoscopic Displays. Habib Abi-Rached Thursday 17 February 2005.
Stereo-based Hand Gesture Tracking and Recognition in Immersive Stereoscopic Displays Habib Abi-Rached Thursday 17 February 2005. Objective Mission: Facilitate communication: Bandwidth. Intuitiveness.
More informationMalaysian Car Number Plate Detection System Based on Template Matching and Colour Information
Malaysian Car Number Plate Detection System Based on Template Matching and Colour Information Mohd Firdaus Zakaria, Shahrel A. Suandi Intelligent Biometric Group, School of Electrical and Electronics Engineering,
More informationEFFICIENT ATTENDANCE MANAGEMENT SYSTEM USING FACE DETECTION AND RECOGNITION
EFFICIENT ATTENDANCE MANAGEMENT SYSTEM USING FACE DETECTION AND RECOGNITION 1 Arun.A.V, 2 Bhatath.S, 3 Chethan.N, 4 Manmohan.C.M, 5 Hamsaveni M 1,2,3,4,5 Department of Computer Science and Engineering,
More informationAutomatic optical measurement of high density fiber connector
Key Engineering Materials Online: 2014-08-11 ISSN: 1662-9795, Vol. 625, pp 305-309 doi:10.4028/www.scientific.net/kem.625.305 2015 Trans Tech Publications, Switzerland Automatic optical measurement of
More informationTesting, Tuning, and Applications of Fast Physics-based Fog Removal
Testing, Tuning, and Applications of Fast Physics-based Fog Removal William Seale & Monica Thompson CS 534 Final Project Fall 2012 1 Abstract Physics-based fog removal is the method by which a standard
More informationVisual Search using Principal Component Analysis
Visual Search using Principal Component Analysis Project Report Umesh Rajashekar EE381K - Multidimensional Digital Signal Processing FALL 2000 The University of Texas at Austin Abstract The development
More informationAudio Restoration Based on DSP Tools
Audio Restoration Based on DSP Tools EECS 451 Final Project Report Nan Wu School of Electrical Engineering and Computer Science University of Michigan Ann Arbor, MI, United States wunan@umich.edu Abstract
More informationTransport System. Telematics. Nonlinear background estimation methods for video vehicle tracking systems
Archives of Volume 4 Transport System Issue 4 Telematics November 2011 Nonlinear background estimation methods for video vehicle tracking systems K. OKARMA a, P. MAZUREK a a Faculty of Motor Transport,
More informationAn Analysis of Image Denoising and Restoration of Handwritten Degraded Document Images
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 12, December 2014,
More informationAn Efficient Method for Vehicle License Plate Detection in Complex Scenes
Circuits and Systems, 011,, 30-35 doi:10.436/cs.011.4044 Published Online October 011 (http://.scirp.org/journal/cs) An Efficient Method for Vehicle License Plate Detection in Complex Scenes Abstract Mahmood
More informationAutomatic High Dynamic Range Image Generation for Dynamic Scenes
Automatic High Dynamic Range Image Generation for Dynamic Scenes IEEE Computer Graphics and Applications Vol. 28, Issue. 2, April 2008 Katrien Jacobs, Celine Loscos, and Greg Ward Presented by Yuan Xi
More informationAuto-tagging The Facebook
Auto-tagging The Facebook Jonathan Michelson and Jorge Ortiz Stanford University 2006 E-mail: JonMich@Stanford.edu, jorge.ortiz@stanford.com Introduction For those not familiar, The Facebook is an extremely
More informationLive Hand Gesture Recognition using an Android Device
Live Hand Gesture Recognition using an Android Device Mr. Yogesh B. Dongare Department of Computer Engineering. G.H.Raisoni College of Engineering and Management, Ahmednagar. Email- yogesh.dongare05@gmail.com
More informationHand & Upper Body Based Hybrid Gesture Recognition
Hand & Upper Body Based Hybrid Gesture Prerna Sharma #1, Naman Sharma *2 # Research Scholor, G. B. P. U. A. & T. Pantnagar, India * Ideal Institue of Technology, Ghaziabad, India Abstract Communication
More informationDeblurring. Basics, Problem definition and variants
Deblurring Basics, Problem definition and variants Kinds of blur Hand-shake Defocus Credit: Kenneth Josephson Motion Credit: Kenneth Josephson Kinds of blur Spatially invariant vs. Spatially varying
More informationHuman Computer Interaction by Gesture Recognition
IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 9, Issue 3, Ver. V (May - Jun. 2014), PP 30-35 Human Computer Interaction by Gesture Recognition
More informationVarious Calibration Functions for Webcams and AIBO under Linux
SISY 2006 4 th Serbian-Hungarian Joint Symposium on Intelligent Systems Various Calibration Functions for Webcams and AIBO under Linux Csaba Kertész, Zoltán Vámossy Faculty of Science, University of Szeged,
More informationEffects of the Unscented Kalman Filter Process for High Performance Face Detector
Effects of the Unscented Kalman Filter Process for High Performance Face Detector Bikash Lamsal and Naofumi Matsumoto Abstract This paper concerns with a high performance algorithm for human face detection
More informationReal Time Word to Picture Translation for Chinese Restaurant Menus
Real Time Word to Picture Translation for Chinese Restaurant Menus Michelle Jin, Ling Xiao Wang, Boyang Zhang Email: mzjin12, lx2wang, boyangz @stanford.edu EE268 Project Report, Spring 2014 Abstract--We
More informationDistributed Vision System: A Perceptual Information Infrastructure for Robot Navigation
Distributed Vision System: A Perceptual Information Infrastructure for Robot Navigation Hiroshi Ishiguro Department of Information Science, Kyoto University Sakyo-ku, Kyoto 606-01, Japan E-mail: ishiguro@kuis.kyoto-u.ac.jp
More informationTokyo Institute of Technology School of Engineering Bachelor Thesis. Real-Time Tennis Ball Speed Analysis System Based on Image Processing
Tokyo Institute of Technology School of Engineering Bachelor Thesis Real-Time Tennis Ball Speed Analysis System Based on Image Processing Supervisor: Associate Prof. Kenji Kise February, 2016 Submitter
More informationA COMPARISON OF ARTIFICIAL NEURAL NETWORKS AND OTHER STATISTICAL METHODS FOR ROTATING MACHINE
A COMPARISON OF ARTIFICIAL NEURAL NETWORKS AND OTHER STATISTICAL METHODS FOR ROTATING MACHINE CONDITION CLASSIFICATION A. C. McCormick and A. K. Nandi Abstract Statistical estimates of vibration signals
More information3D-Position Estimation for Hand Gesture Interface Using a Single Camera
3D-Position Estimation for Hand Gesture Interface Using a Single Camera Seung-Hwan Choi, Ji-Hyeong Han, and Jong-Hwan Kim Department of Electrical Engineering, KAIST, Gusung-Dong, Yusung-Gu, Daejeon, Republic
More informationVirtual Touch Human Computer Interaction at a Distance
International Journal of Computer Science and Telecommunications [Volume 4, Issue 5, May 2013] 18 ISSN 2047-3338 Virtual Touch Human Computer Interaction at a Distance Prasanna Dhisale, Puja Firodiya,
More informationRestoration of Motion Blurred Document Images
Restoration of Motion Blurred Document Images Bolan Su 12, Shijian Lu 2 and Tan Chew Lim 1 1 Department of Computer Science,School of Computing,National University of Singapore Computing 1, 13 Computing
More informationAn Efficient Approach for Iris Recognition by Improving Iris Segmentation and Iris Image Compression
An Efficient Approach for Iris Recognition by Improving Iris Segmentation and Iris Image Compression K. N. Jariwala, SVNIT, Surat, India U. D. Dalal, SVNIT, Surat, India Abstract The biometric person authentication
More informationImage processing for gesture recognition: from theory to practice. Michela Goffredo University Roma TRE
Image processing for gesture recognition: from theory to practice 2 Michela Goffredo University Roma TRE goffredo@uniroma3.it Image processing At this point we have all of the basics at our disposal. We
More informationApplying mathematics to digital image processing using a spreadsheet
Jeff Waldock Applying mathematics to digital image processing using a spreadsheet Jeff Waldock Department of Engineering and Mathematics Sheffield Hallam University j.waldock@shu.ac.uk Introduction When
More informationOPPORTUNISTIC TRAFFIC SENSING USING EXISTING VIDEO SOURCES (PHASE II)
CIVIL ENGINEERING STUDIES Illinois Center for Transportation Series No. 17-003 UILU-ENG-2017-2003 ISSN: 0197-9191 OPPORTUNISTIC TRAFFIC SENSING USING EXISTING VIDEO SOURCES (PHASE II) Prepared By Jakob
More informationSTARCRAFT 2 is a highly dynamic and non-linear game.
JOURNAL OF COMPUTER SCIENCE AND AWESOMENESS 1 Early Prediction of Outcome of a Starcraft 2 Game Replay David Leblanc, Sushil Louis, Outline Paper Some interesting things to say here. Abstract The goal
More informationDetection of Defects in Glass Using Edge Detection with Adaptive Histogram Equalization
Detection of Defects in Glass Using Edge Detection with Adaptive Histogram Equalization Nitin kumar 1, Ranjit kaur 2 M.Tech (ECE), UCoE, Punjabi University, Patiala, India 1 Associate Professor, UCoE,
More information3DExplorer Quickstart. Introduction Requirements Getting Started... 4
Page 1 of 43 Table of Contents Introduction... 2 Requirements... 3 Getting Started... 4 The 3DExplorer User Interface... 6 Description of the GUI Panes... 6 Description of the 3D Explorer Headbar... 7
More informationVision Review: Image Processing. Course web page:
Vision Review: Image Processing Course web page: www.cis.udel.edu/~cer/arv September 7, Announcements Homework and paper presentation guidelines are up on web page Readings for next Tuesday: Chapters 6,.,
More information-f/d-b '') o, q&r{laniels, Advisor. 20rt. lmage Processing of Petrographic and SEM lmages. By James Gonsiewski. The Ohio State University
lmage Processing of Petrographic and SEM lmages Senior Thesis Submitted in partial fulfillment of the requirements for the Bachelor of Science Degree At The Ohio State Universitv By By James Gonsiewski
More informationHigh Dynamic Range image capturing by Spatial Varying Exposed Color Filter Array with specific Demosaicking Algorithm
High Dynamic ange image capturing by Spatial Varying Exposed Color Filter Array with specific Demosaicking Algorithm Cheuk-Hong CHEN, Oscar C. AU, Ngai-Man CHEUN, Chun-Hung LIU, Ka-Yue YIP Department of
More informationSelection of Temporally Dithered Codes for Increasing Virtual Depth of Field in Structured Light Systems
Selection of Temporally Dithered Codes for Increasing Virtual Depth of Field in Structured Light Systems Abstract Temporally dithered codes have recently been used for depth reconstruction of fast dynamic
More informationDigital Image Processing Lec.(3) 4 th class
Digital Image Processing Lec.(3) 4 th class Image Types The image types we will consider are: 1. Binary Images Binary images are the simplest type of images and can take on two values, typically black
More informationVEHICLE LICENSE PLATE DETECTION ALGORITHM BASED ON STATISTICAL CHARACTERISTICS IN HSI COLOR MODEL
VEHICLE LICENSE PLATE DETECTION ALGORITHM BASED ON STATISTICAL CHARACTERISTICS IN HSI COLOR MODEL Instructor : Dr. K. R. Rao Presented by: Prasanna Venkatesh Palani (1000660520) prasannaven.palani@mavs.uta.edu
More informationAvailable online at ScienceDirect. Ehsan Golkar*, Anton Satria Prabuwono
Available online at www.sciencedirect.com ScienceDirect Procedia Technology 11 ( 2013 ) 771 777 The 4th International Conference on Electrical Engineering and Informatics (ICEEI 2013) Vision Based Length
More information