CROWD ANALYSIS WITH FISH EYE CAMERA

Similar documents
Real-Time Face Detection and Tracking for High Resolution Smart Camera System

Vehicle Detection using Images from Traffic Security Camera

Face Detection System on Ada boost Algorithm Using Haar Classifiers

Near Infrared Face Image Quality Assessment System of Video Sequences

OPEN CV BASED AUTONOMOUS RC-CAR

Exercise questions for Machine vision

SCIENCE & TECHNOLOGY

COMPARATIVE PERFORMANCE ANALYSIS OF HAND GESTURE RECOGNITION TECHNIQUES

Face Detection: A Literature Review

Automatic Licenses Plate Recognition System

Face Detection using 3-D Time-of-Flight and Colour Cameras

Automated Driving Car Using Image Processing

Wadehra Kartik, Kathpalia Mukul, Bahl Vasudha, International Journal of Advance Research, Ideas and Innovations in Technology

MICROCHIP PATTERN RECOGNITION BASED ON OPTICAL CORRELATOR

Master thesis: Author: Examiner: Tutor: Duration: 1. Introduction 2. Ghost Categories Figure 1 Ghost categories

FLASH LiDAR KEY BENEFITS

WHITE PAPER. Methods for Measuring Flat Panel Display Defects and Mura as Correlated to Human Visual Perception

Applying Automated Optical Inspection Ben Dawson, DALSA Coreco Inc., ipd Group (987)

An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods

An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi

Automatic Vehicles Detection from High Resolution Satellite Imagery Using Morphological Neural Networks

Wheeler-Classified Vehicle Detection System using CCTV Cameras

EFFICIENT ATTENDANCE MANAGEMENT SYSTEM USING FACE DETECTION AND RECOGNITION

EC-433 Digital Image Processing

Image Processing Based Vehicle Detection And Tracking System

Implementation of Face Detection System Based on ZYNQ FPGA Jing Feng1, a, Busheng Zheng1, b* and Hao Xiao1, c

AGRICULTURE, LIVESTOCK and FISHERIES

FSI Machine Vision Training Programs

VLSI Implementation of Impulse Noise Suppression in Images

Follower Robot Using Android Programming

Robust Hand Gesture Recognition for Robotic Hand Control

A Survey on Different Face Detection Algorithms in Image Processing

Development of an Automatic Camera Control System for Videoing a Normal Classroom to Realize a Distant Lecture

An Efficient Method for Vehicle License Plate Detection in Complex Scenes

Vision-based User-interfaces for Pervasive Computing. CHI 2003 Tutorial Notes. Trevor Darrell Vision Interface Group MIT AI Lab

An Un-awarely Collected Real World Face Database: The ISL-Door Face Database

Intelligent Traffic Sign Detector: Adaptive Learning Based on Online Gathering of Training Samples

Paper Sobel Operated Edge Detection Scheme using Image Processing for Detection of Metal Cracks

The Classification of Gun s Type Using Image Recognition Theory

A Method of Measuring Distances between Cars. Using Vehicle Black Box Images

Student Attendance Monitoring System Via Face Detection and Recognition System

Real Time Face Recognition using Raspberry Pi II

IMPLEMENTATION METHOD VIOLA JONES FOR DETECTION MANY FACES

Background Pixel Classification for Motion Detection in Video Image Sequences

Paper or poster submitted for Europto-SPIE / AFPAEC May Zurich, CH. Version 9-Apr-98 Printed on 05/15/98 3:49 PM

Keyword: Morphological operation, template matching, license plate localization, character recognition.

Implementation of License Plate Recognition System in ARM Cortex A8 Board

ORIFICE MEASUREMENT VERISENS APPLICATION DESCRIPTION: REQUIREMENTS APPLICATION CONSIDERATIONS RESOLUTION/ MEASUREMENT ACCURACY. Vision Technologies

Controlling Humanoid Robot Using Head Movements

IMAGE PROCESSING TECHNIQUES FOR CROWD DENSITY ESTIMATION USING A REFERENCE IMAGE

BASIC OPERATIONS IN IMAGE PROCESSING USING MATLAB

Video Synthesis System for Monitoring Closed Sections 1

Motion Detector Using High Level Feature Extraction

23270: AUGMENTED REALITY FOR NAVIGATION AND INFORMATIONAL ADAS. Sergii Bykov Technical Lead Machine Learning 12 Oct 2017

International Journal of Advanced Research in Computer Science and Software Engineering

FPGA Based Area Measurement of Irregular Objects

VC 14/15 TP2 Image Formation

Background Subtraction Fusing Colour, Intensity and Edge Cues

Image Extraction using Image Mining Technique

Morphological Image Processing Approach of Vehicle Detection for Real-Time Traffic Analysis

Real Time Video Analysis using Smart Phone Camera for Stroboscopic Image

Automatic Electricity Meter Reading Based on Image Processing

Automatic optical measurement of high density fiber connector

An Efficient Noise Removing Technique Using Mdbut Filter in Images

Image Capture and Problems

Limits of a Distributed Intelligent Networked Device in the Intelligence Space. 1 Brief History of the Intelligent Space

License Plate Localisation based on Morphological Operations

VC 11/12 T2 Image Formation

Authenticated Automated Teller Machine Using Raspberry Pi

USE OF HISTOGRAM EQUALIZATION IN IMAGE PROCESSING FOR IMAGE ENHANCEMENT

Blur Estimation for Barcode Recognition in Out-of-Focus Images

A Vehicle Speed Measurement System for Nighttime with Camera

A Solution for Identification of Bird s Nests on Transmission Lines with UAV Patrol. Qinghua Wang

MULTI-LAYERED HYBRID ARCHITECTURE TO SOLVE COMPLEX TASKS OF AN AUTONOMOUS MOBILE ROBOT

Face Tracking using Camshift in Head Gesture Recognition System

Visual Search using Principal Component Analysis

Live Hand Gesture Recognition using an Android Device

KODAK NEXFINITY Digital Press. 256 Shades of Gray

Research of an Algorithm on Face Detection

ME 6406 MACHINE VISION. Georgia Institute of Technology

Facial Biometric For Performance. Best Practice Guide

Color Image Segmentation in RGB Color Space Based on Color Saliency

Failure is a crucial part of the creative process. Authentic success arrives only after we have mastered failing better. George Bernard Shaw

Column-Parallel Architecture for Line-of-Sight Detection Image Sensor Based on Centroid Calculation

ROBOT VISION. Dr.M.Madhavi, MED, MVSREC

Detection of License Plate using Sliding Window, Histogram of Oriented Gradient, and Support Vector Machines Method

A Review of Optical Character Recognition System for Recognition of Printed Text

A VIDEO CAMERA ROAD SIGN SYSTEM OF THE EARLY WARNING FROM COLLISION WITH THE WILD ANIMALS

Modern Robotics with OpenCV. Widodo Budiharto

Imaging serial interface ROM

Iraqi Car License Plate Recognition Using OCR

Image Processing by Bilateral Filtering Method

An Improved Bernsen Algorithm Approaches For License Plate Recognition

Number Plate Recognition Using Segmentation

VC 16/17 TP2 Image Formation

On spatial resolution

Analyzing Traffic Density in Images with Low Temporal and Spatial Resolution

VISUAL FINGER INPUT SENSING ROBOT MOTION

Application of Machine Vision Technology in the Diagnosis of Maize Disease

Research on Pupil Segmentation and Localization in Micro Operation Hu BinLiang1, a, Chen GuoLiang2, b, Ma Hui2, c

Transcription:

CROWD ANALYSIS WITH FISH EYE CAMERA Huseyin Oguzhan Tevetoglu 1 and Nihan Kahraman 2 1 Department of Electronic and Communication Engineering, Yıldız Technical University, Istanbul, Turkey 1 Netaş Telekomünikasyon A.Ş., Istanbul, Turkey 2 Deparment of Electronic and Communication Engineering, Yıldız Technical University, Istanbul, Turkey ABSTRACT Nowadays crowd analysis, essential factor about decision management of brand strategy, is not a controllable field by individuals. Therefore a technology, software products is needed. In this paper we focused on what we have done about crowd analysis and examination the problem of human detection with fish-eye lenses cameras. In order to identify human density, one of the machine learning algorithm, which is Haar Classification algorithm, is used to distinguish human body under different conditions. First, motion analysis is used to search for meaningful data, and then the desired object is detected by the trained classifier. Significant data has been sent to the end user via socket programming and human density analysis is presented. KEYWORDS Fish-Eye, Human Detection, Haar Classification, Crowd Analysis, Machine Learning, Motion Analysis, Density Analysis 1. INTRODUCTION Trends in video processing and human tracking systems have become popular today. Particularly for security purposes, analysis of data from IP cameras used in various locations has gained importance. However, the camera array, which is designed to completely cover the scene, is disadvantageous both in terms of the number of the images and the size of the data to be processed. The fish-eye cameras function as both a fixed camera and a moving camera. A fixed and restricted area can be displayed with stationary cameras. Thanks to moving cameras which are manually controllable cameras, areas outside the area where you want to view still images can also be displayed. In this case, it is very likely that the movements in the region to be displayed constantly or continuously are missed. Based on this information, we can say that fish-eye cameras are very advantageous. Fisheye lenses are created by the do not correction of normal lens defects. These lenses, which we can obtain data in wide angle, provide the creation of hemispherical images. It can be said that firstly it is used in the field of metrology and its usage also increases in security systems today. Depending on the barrel bending error in the lens production, it is possible to obtain two types of images, 360 and 180 degrees. DOI: 10.5121/ijaceee.2018.6301 1

Resolution performance from a fish-eye camera is not equivalent to a fixed camera. This is why a rectangular 360 degree image is obtained from a rectangular image sensor. It is necessary to make a mathematical explanation; the resolution of 5 megapixels is 2592 x 1944 pixels. The rectangular image that is tried to be obtained is cropped when the circle is fitted because of a diameter of 1944 pixels. The actual resolution from a 5 Megapixel camera is about 3 megapixels, which is the number of pixels left in the field. A more important disadvantage of fisheye cameras is that the edges of the images are distorted. These distortions are particularly prominent in moving objects and human images. 2. REFERENCE STUDIES In the literature, there are human detection studies with both fixed cameras and fish-eye cameras. Below is a brief description of the methods used in these studies. Pfinder [1] is a study on real-time human body tracking. As in almost all work, this work also has background subtraction to get rid of constant background noises. YUV space conversion and normalization operations were applied to avoid the distortions caused by light and similar reasons. Pixels were updated using adaptive filtering at every image and neighbours relationships with nearby pixels were evaluated. As a result, the most suitable pixel center is selected and the similarity of the distribution of the human figure is reached. Pfinder is a work done using a fixed camera. Figure 1. Pfinder morphological growing process In another study with a fish-eye camera [2], background extraction has been done using Gradient based edge detection technique. In the remaining foreground regions, head and shoulder shapes were searched and tried to match the templates that created earlier. Below is a diagram of working. Figure 2. Working diagram[2] 2

In another work done, it was aimed to prevent occupational accidents by performing human detection with a fish-eye camera placed in front of heavy duty machines [3]. Object distortions from fish-eye cameras used to obtain a wide angle were the biggest challenge they faced. They created databases for object recognition and trained classifiers with high number of positive and negative samples. Figure 3. Picture of working[3] In this study, the extraction of Haar attributes proposed by Viola and Jones and the object classification with enhanced adaboost classifier has been used [4]. 3. STEPS OF DESIGN Object detection methods are used to determine moving objects in video sequences using various methods and to extract motion direction. This process is regarded as the first part of image monitoring systems and has a great impact on system performance. Finding the moving object by taking the background difference is a method used when the camera that receives the image is fixed and the target object is moving. The background image is determined by the image areas that remain in motion after the system starts running or before the system starts. And then the difference between two background frames is detected. The absolute value of the difference obtained is compared with a predetermined threshold level value. If the difference value is above this threshold value, it is decided that it is a change in the related coordinates. When this method is applied to sequential image frames in video images, the following formula applies. As in other studies, Gaussian operation must be performed on image frames before background subtraction. In this way, the change of light and various distortions of the obstacles are reduced. (1) Figure 4. Background subtraction [2] 3

After background subtraction is done, it should be determined whether moving objects are searched data. Haar classifier will be used for this. Trained classifiers are created by giving the marked samples to the algorithm for the determination of the objects, and by the learning process in the result of this. When training data set is being prepared; the data in which the specified object is found are marked as positive, and the data not found are marked as negative. In Haar Classifier training, the following frames and the like are used. Frames scan over positive data to detect dark / bright differences and create target values where the differences between black and white regions are obvious. Figure 5. Haar weak classifier During the detection of the object being searched, the small classifiers specified in the training are scanned on the frame and fixed in the regions that match the target values. In an input frame, many small frames may provide target values for different regions, but since the object size to be searched is approximately known, an object is detected in the area where a range of frames are densely mapped. In educational classrooms, the data used in the training set must be consistent with each other. Otherwise the faulty rate will be high. In the study performed, the human silhouette was seen in different shapes in the different coordinate sections because of the fish eye camera. To avoid this, the camera is positioned at a height of 5 meters. This time, as the pixel size of the human silhouette decreased, the number of detected features decreased and the educational success was adversely affected. Figure 6. Dimension and shape differences in image 4

As a solution, training was generating with different data sets for different coordinate zones of the same frame. Training was carried out using a total of 5000 positive, 3000 negative data sets. Figure 7. Training regions The coordinates of the detected people are sent to the client with the generated socket programming code. The UDP protocol is used because high data transfer is a matter of concern. The client presents the data it receives in a visual form that end user can understand. The following illustration shows the example. Figure 8. Crowded heat map The pseudo code and flow diagram of the generated algorithm is presented below. Table 1. Pseudo Code. 1 Start 2 Get Frame 3 Apply Gaussian Filter 4 Background Subtraction 5 Divide Frame to 3 Different Regions 6 Clear Remaining Remnants 7 Search For Moving Objects 8 Give Object to Classifier 9 Send Meaningful Data Via Socket 10 View Heat Map With Web Browser 5

4. RESULT Figure 9. Algorithm Flow Diagram Within the scope of the study, a solution is proposed for an affordable human detection and intensity map of the area that does not contain blind spots and covers the entire area. In the present case, classifying moving objects reduces the processing power. However, when the detected object stops moving, the previous position continues to be marked as positive. If there is no movement within 3 seconds, the region is given a reclassifier and it is confirmed that the detected object is human. Algorithm and classifier optimization studies are continuing to reduce the specified time threshold. It was mentioned above that the training was done with different data sets according to the separated regions in the image frames taken from the camera. However, it was observed that the detection success rate decreased in the transition locations between the two different regions. The work is ongoing to resolve this problem. The height of the position of the camera is affected by the algorithm performance ratio. With the created front ace, the user is able to enter the integrated height of the camera parametrically. It is planned to increase the performance ratio by changing the algorithm and classifier values according to the entered value. 6

REFERENCES [1] Christopher Richard Wren, Ali Azarbayejani, Trevor Darrell and Alex Paul Pentland., Pfinder: Real- Time Tracking of the Human Body '', IEEE Trans.on Pattern Analysis and Machine Intelligence. Vol 19, No 7,July 1999. [2] Mamoru Satio, Katsuhisa Kitaguchi and Gun Kimura, Masafumi Hashimoto, Human Detection from Fish-eye Image by Bayesian Combination of Probabilistic Apperance Models, IEEE Systems Mand and Cybernetics, Oct. 2010. [3] Vincent Fremont, Manh Tuan Bui, Djamal Boukerroui and Pierrick Letort., Vision-Based People Detection System for Heavy Machine Applications Jan. 2016 [4] P. Viola and M. Jones, Rapid object detection using a boosted cascade of simple features, presented at IEEE Conference on Computer Vision and Pattern Recognition, 2001 AUTHORS Nihan Kahraman is Assist. Prof. Dr. at Yıldız Technical University Electronic and Communication Engineering Department. She has completed many projects throughout her academic career. Her interests are listed below. - Analog and digital IC design - Microelectronics - Neural networks - Electronic design automation - Image processing Oguzhan Tevetoglu is a software architecture designer at Netas and also graduate student at YTU. He has taken many roles in embedded system projects. His interests are listed below. - Image processing - Embedded Systems - Neural networks - Computer network 7