Artwork Recognition for Panorama Images Based on Optimized ASIFT and Cubic Projection

Size: px
Start display at page:

Download "Artwork Recognition for Panorama Images Based on Optimized ASIFT and Cubic Projection"

Transcription

1 Artwork Recognition for Panorama Images Based on Optimized ASIFT and Cubic Projection Dayou Jiang and Jongweon Kim Abstract Few studies have been published on the object recognition for panorama images. To prevent the infringement of artworks in 360-degree images, we put forward an efficient method for artworks recognition inside 360-degree images in this paper. To start with, we employed the improved cubic projection to transform the distorted panorama image. Then, we used the optimized Affine Invariant Feature Transform (ASIFT) algorithm for extracting local features of transformed image. Finally, the feature point matching is based on one-to-one mapping constrain. The overall performance of the method is investigated on panorama dataset and the experimental results are compared with other well-known local feature extraction methods and original panorama image. The experimental results show that using the proposed method can improve around 30% of the accuracy for relatively higher distorted panorama images and reduce the computing time. Index Terms Panorama image, ASIFT, cubic projection. I. INTRODUCTION artwork, recognition, Recently, with the continuous improvement of panorama photography and the advent of easy-to-use 360-degree cameras such as SAMSUNG GEAR, and LG G5, 360-degree videos and images have immediately become very popular. A 360-degree image can easily capture all the viewing directions simultaneously and give users the sense of immersion when viewed [1]. In view of the above-mentioned facts, the potential risks of the copyrighted artworks have been photographed without permissions will be greater than before. Hence copyright infringement of the artworks in 360-degree images will be burning issues. The detection and recognition of unauthorized artworks for the panorama images is required. Although many researches on image recognition have been widely published, quite few studies have been conducted on 360-degree images, not to mention artworks inside panorama images. However, they stay much the same. Those useful methods have been explored for image recognition will sometimes worth applying for object recognition inside panorama images. Since the 1990s, image content based methods have been well accepted to solve image recognition problems. The image content is described by extracting some low visual features. Moreover, these methods can achieve a better performance both on the accuracy and the speed [2]. A variety of local feature extraction algorithms have been found Manuscript received October 10, 2017; revised January 19, Dayou Jiang is with Dept. of Copyright Protection, Sangmyung University, Seoul, Korea ( dyjiang@cclabs.kr).. Jongweon Kim is with Dept. of Electronics Engineering, Sangmyung University, Seoul, Korea (corresponding author; jwkim@smu.ac.kr). in recent years. Among them, the most representative and widely used one is the Scale Invariant Feature Transform (SIFT) [3]. Meanwhile, algorithms such as Speeded Up Robust Features (SURF) [4], Affine SIFT [5], Oriented FAST and Rotated BRIEF (ORB) [6], Binary Robust Invariant Scalable Keypoints (BRISK) [7], and Fast Retina Keypoint (FREAK) [8] have also been employed due to the fact that they can achieve quite good performance. Currently, the deep learning methods such as AlexNet [9], ZFNet [10], GoogLeNet [11], and ResNet [12] are usually exploited to train these local feature vectors to obtain the classification model for large dataset task -- Large Scale Visual Recognition Challenge (LSVRC). With regarding to panorama images recognition, Xiao [13] introduced the problem of scene viewpoint recognition and also studied the canonical view biases exhibited by people taking photos of places. Yang [14] addressed the problem of room structure recognition from a 360 cylindrical panorama. The original panorama was transformed into four perspective projected sub-images. An algorithm to detect and recognize road lane markings from panorama images was presented in [15]. Zhang [16] advocated the use of 360 full-view panoramas in scene understanding and proposed a whole-room context model in 3D. A region based convolutional neutral network (R-CNN) was implemented to train and test on an indoor panorama image dataset in the work [17]. The work [18] developed a novel panorama-to-panorama matching process which either by aggregating features of individual images in a group or by explicitly constructing a larger panorama. Then, an improved ASIFT algorithm for indoor panorama images matching was analyzed and compared with algorithms such as SIFT, SURF, and ASIFT in work [19]. The literatures concerning artwork recognition for panorama images are rare. An artwork identification methodology for 360-degree images which by transforming the 360-degree image into a three-dimensional sphere and wrapping it with a polyhedron was studied in work [20]. The results showed that the method can increase much of identification precision for artwork that displayed on a monitor in a seriously distorted position. What is more, different local features are analyzed for feature matching. However, the more polyhedron employed in the method can improve the performance much more, and that again means large time will be spent in feature detection and matching. Furthermore, visual experiences of panorama image will also become worse. The aim of this paper is to develop an efficient method for artworks recognition for 360-degree images. Therefore, two issues are considered in solving the problem: (1) Use less time doi: /ijmlc

2 to achieve better performance; (2) use feasible and simple projection to ensure good visual experiences of transformed images from panorama images. The remaining work of the paper is organized as follows: panorama image transform, feature extraction, and feature matching methods are discussed in Section II. Section III demonstrates the experimental results and performance of the proposed artworks recognition method for panorama artworks datasets. Finally, conclusions are given in Section IV. II. ALGORITHM IMPROVEMENT A. Panorama Image Transformation The captured panorama images by using 360 cameras are mainly in equi-rectangular format with an aspect ratio of 2:1. The equi-rectangular projection has been widely used to map the 3D scene onto a 2 dimensional plane. However, this projection has much serious distortions, especially in the two poles. Hence, using original equi-rectangular panorama image for artworks recognition is a worse method. Recently, few of attention on projection of 360-degree images and videos have been reported. Kim [21] proposed an automatic framework for generating content-aware 2D normal view perspective videos from 360 videos, which method is based on Pannini projection [22] model. A new Study of Oculus 360 Degree Video Streaming by using offset cubic map is developed in work [23]. Before long, the Google engineer Brown [24] presented Equi-Angular Cube maps method offer better results and more efficient use of resources, which looks to solve the VR video quality issues. has fewer distortions. Also, it is available and easily obtained. Therefore, this cubic map is selected and improved. The original cubic map converts the panorama into six faces of a cube with each face is rectilinear image. Fig. 1 shows an example of panoramic image with low distortion and small screen. Fig. 2 shows the standard cubic map image from the above mentioned panorama. The improved method meets the demand that the object should be displayed obviously in the transformed images. This can be achieved by adjusting the angle of pitch, yaw and the field of view (FOV). Fig. 3 shows a schematic diagram of yaw, pitch, roll camera (The image is redrawn from work [23]). In Fig. 4, here front face and back faces are highlighted. The x, y and z direction respectively control the pitch, yaw and roll of the view. For different distorted images in different screens, the angle of pitch and roll are changed to suit the change of those situations. The fixed value of FOV is 90. The transform matrix is given by: M roty( yaw) rotx( pitch) rotz( roll) (1) T The value of pitch for large monitor is changed from 20 to 50 degree of Southern Latitude and for small monitor its value is changed from 10 to 30 degree of Southern Latitude. z yaw y roll x pitch Fig. 3. Schematic diagram of yaw, pitch, roll camera. front Fig. 1. Example of a panorama image. left right back Fig. 4. Schematic diagram of standard cubic map. Fig. 2. Corresponding cubic map image of Fig. 1. As for panorama projection, map projections such as Mercator, Goode homolosine, Natural earth and Cubic map are on first consideration. However, most of them cannot be used for artworks recognition. On the one hand, the panorama should be transformed into 3D sphere and then map them to those projections, the whole of this processes spend too much time; on the other hand, those projections still have unavoidable distortions. The difference is cubic map which B. Optimized ASIFT The ASIFT algorithm is preceded by the following steps: 1) Images are transformed by changing the positions of latitude and the longitude. The images are rotated followed by tilts t. In the algorithm, t is the T-subsampling in height direction. t 1 cos (2) height ' height / t (3) 2) Perform the rotations and tilts by changing values of and to ensure the simulated images with views as many as possible. 55

3 3) Use SIFT algorithm to detect features for all simulated images. The irregular sampling of the parameters of and of ASIFT is shown in Fig. 5 as below (The image is redrawn from work [5]). y θ=45 φ Fig. 5. Sampling of the parameters θ and φ. The sampling steps were validated by SIFT with simulated tilt and t is experimentally fixed to 2.While is changed with the change of t. For example, when tilt = 2, t = 2, is set as 45 ; when tilt = 3, t = 2, is set as 30. Thus, with tilt =2, four simulation images with angle steps of 45 are generated; with tilt =3, six simulation images with angle steps of 30 are obtained. Fig. 2 shows the simulated images of ASIFT. The images from left to right on the top respectively represents: transform 1 (t =1, =0 ), transform 2 (t = 2, =0 ), transform 3 (t = 2, =45 ); the bottom images from left to right: transform 4 (t = 2, =90 ), transform 5 (t = 2, =135 ) and transform 6 (t =2, =0 ). As shown in Fig. 6, we can conclude that, the simulated images of transform 3 and transform 5 are larger than other affine transformed images which means need much time for feature detection and extraction, so to simplify the process of them may be a good idea. What is more, as for panorama images, the effect of rotation transformation in height direction is less, so use less rotation transforms can save the running time and achieve the relative performance as using the whole transforms. Also, to speed up the algorithm, CPU local pool is employed. Fig. 6. Examples of simulated ASIFT images. C. One-To-One Mapping The basic feature matching algorithm used in the x experiment is UBCMATCH [25], which finds the closest descriptor D_B in image B for each descriptor D_A in image A. Although the matches can be filtered for uniqueness by using a threshold which is the ratio of the distance between the best matching keypoint and the distance to the second best one, some error matches still appear. After several test evaluation, the error matched points usually located in a central place. Sometimes, the point in the original artwork can be matched with many pointes in the training image after using UBCMATCH with threshold of 3.0. Therefore, the one-to-one mapping is adopted to solve the problems of some one-to-many error matches. Fig. 7 shows that one keypoint of panorama has been matched with two different keypoints from original artwork image. An example of using one-to-one mapping for feature matching is shown in Fig. 8. Compared with only using UBCMATCH, using one-to-one mapping has reduced the number of mismatched keypoints. The result clearly illustrates the positive effect of one-to-one mapping. Fig. 7. Feature matching result of using ubcmatch. Fig. 8. Feature matching result of using one-to-one mapping. III. SIMULATION RESULTS Experiments are carried out on the artworks panorama images dataset which is created by the 50 famous artworks from the Google website. These images are shown in Fig. 9. The images are downloaded in JPG format with different sizes. Among them, the biggest file size is 7.22MB and the smallest file size is 89.1KB, while at least half of them are less than 1MB. The panorama images are captured and automatically stitched with LG 360 that has dual wide-angle cameras. The default size of panorama image is with 72 DPI. To make the simulation more reality, three different distorted panorama images are captured in three different positions and two different screens which sizes are 32 inch and 79 inch. Thus, the panorama artwork dataset in which 300 panorama images are evenly distributed in 6 kinds of categories is created. The larger distortion means the artworks inside the panorama images away from the equator much more. The experiments are performed in the computer that has Xeon (R) 3.50GHz CPU, 8.00GB RAM, and Windows 7 Professional K 64bit system. 56

4 that generated from cubic projection and original panorama. As for Low distortion image displayed in 32 inch screen monitor, the results of feature matching are shown in Fig Fig. 9. Well-known artworks used in the experiment. Considering that the big size can detect large feature points and most of them cannot be "true" matched. Therefore, reducing the experiment images can not only reduce the computing time but also improve the percentage of the "true" matches. The panorama images are resized to , the resized artworks' sizes are , and the generated transformed images' sizes are In the feature extraction process, the SIFT method uses the default parameters, by comparison, the SIFT based ASIFT method uses Parallel function with 4 local pools to accelerate processing. The original uses 5 times SIFT (the whole tilt 1 and tilt 2 of ASIFT), while the improved ASIFT uses 4 times SIFT (the whole 1, the first and third of tilt 2 and the first one of tilt 3 of ASIFT). Meanwhile, the threshold value of UBCMATCH for feature matching is 3.0 due to the fact that the value of 2.0 leads to too many error matches for higher distorted panorama images. As shown in Fig. 10 and Fig. 11, on the contrary, the number of matching lines between the panorama image and corresponding original images is less than with different original image, which will lead to false recognition. To ensure the reliability of the experiments, the preset parameters mentioned above are all selected after several test evaluation. Fig. 12. Feature matching results of using original panorama with Fig. 13. Feature matching results of using cubic projected image with Fig. 14. Feature matching results of using original panorama with different original artwork. Fig. 10. Feature matching results of using SIFT with corresponding original artwork under ubcmatch threshold 2.0. Fig. 11. Feature matching results of using SIFT with different original artwork under ubcmatch threshold 2.0. Three kinds of comparison experiments are conducted to measure the performance of the proposed method. First of all, the following example is constructed using SIFT for the purpose of evaluating the recognition performance of image Fig. 15. Feature matching results of using cubic projected image with The Fig. 12 and Fig. 14 show that original panorama has some true matches for the corresponding artwork, and some error matches with different artworks. Whereas, the Fig.12 and Fig. 13 show that cubic projected image has more true matches than panoramic image with corresponding original artwork. What is more, Fig. 13 and Fig. 15 show that cubic projected image has less error matches than panoramic image with Therefore, the results indicate that the proposed panorama transform method - cubic map- is more suitable for artwork recognition inside panorama. Secondly, the comparison experiments are conducted to estimate the performance of SIFT and ASIFT. Fig showed the feature matching results of using ASIFT algorithm. Since experiments had been tested by using SIFT, here, only the matching results of ASIFT with cubic projected 57

5 images are displayed. By comparison with Fig. 13 and Fig. 15, although the Fig.16 and Fig. 17 showed less matched points, in fact, ASIFT has outperformed SIFT in feature extraction and matching for more distorted images. Fig. 18 showed more true-matched points than Fig. 16. But Fig. 19 has more error-matched points than Fig. 17. Therefore, ASIFT that has more affine transforms may appear less efficient for artwork recognition in panorama. Further experiments on different distorted images should be conducted to verify the performance of ASIFT with different affine transforms. The following example is taken from the experiment which is conducted between ASIFT and optimized ASIFT. High distortion images displayed in 79 inch screen monitor are used. The experiments results are shown in Fig Fig. 20. Feature matching results of using ASIFT with 5 transforms (1-5) for Fig. 16. Feature matching results of using ASIFT with 2 transforms (1-2) for Fig. 21. Feature matching results of using ASIFT with 5 transforms (1-5) for Fig. 17. Feature matching results of using ASIFT with 2 transforms (1-2) for Fig. 22. Feature matching results of using ASIFT with 4 transforms (1,2,4,6) for Fig. 18. Feature matching results of using ASIFT with 5 transforms (1-5) for Fig. 19. Feature matching results of using ASIFT with 5 transforms (1-5) for Although the ASIFT has applied Parallel function with 4 local pools to accelerate processing, the large computing time of it is still a problem in real-time recognition. Therefore, find a way to use less time to achieve good performance is required. Fig. 23. Feature matching results of using ASIFT with 4 transforms (1,2,4,6) for Finally, the whole panorama artwork dataset is used for measuring the performance of proposed method. In all the experiments, the parameters are preset and mentioned above. Numbers of the matched points are computed and sorted in descend order. Since those parameters are specially selected after several test evaluation, the error matches can be ignored. So, to simplify the compute process, all the matches of points are regarded as the true matches. The recognition leads to true or false is determined by the numbers of each image matching. If the artwork image in panorama has the most numbers of the matched pointers when it is matched with its corresponding 58

6 original artwork, the recognition result will lead to true, otherwise, it will be adjudged false. The accuracy is the percentage of true matched images in each panorama category. The experimental results of the proposed method compared with other methods shown in Fig. 24. Fig. 24. Results of using proposed method on artwork dataset. Here, in the Fig. 24, 'C' represents cubic map projection; 'O' stands for original panorama; 'S' is SIFT feature method; 'AS' is Affine SIFT method; '1-5' refers to affine transform from 1 to 5; '1246' is the 1st, 2nd, 4th and 6th affine transforms. As shown in Fig. 24, for small screen with high distortion, although the proposed method-cas(1246)-has relatively lower accuracy for artwork recognition, it is the best one among these methods. What is more, for large screen panorama images, low distorted and middle distorted panorama images with small screen, the proposed method showed better performance than others methods and the recognition accuracy are all larger than 85%. The conclusions can be reached that the using the proposed panorama transform method can improve large with compared to original panorama image; the ASIFT has great advantage over SIFT for image recognition; and the optimized ASIFT can use less time to achieve the passable even better performance than ASIFT. In sum, the proposed artwork recognition method for 360-degree image which based on cubic projection and optimized ASIFT is an efficient method. IV. CONCLUSION A Few researches on artworks recognition for panorama images have been performed. The advantage of the proposed method is that it develops an efficient artworks recognition method for panorama images with well visual experience and better performance. In the paper, we used cubic project to transform distorted panorama images and employed optimized ASIFT to reduce the computing time and improve the recognition accuracy. Besides, we adopted one-to-one mapping constrain to remove large error feature matches. The results demonstrate that both the performance accuracy and computing time have been improved obviously. Even if the panorama images are seriously distorted, the satisfied results also can be obtained. However, the algorithm may have shortcomings in the reality of situations where artworks can be displayed anywhere. Also, the method will be more efficient by using GPU to speed up feature extraction and matching tasks. Therefore, powerful GPU will be considered to accelerate the algorithm. In addition, large artworks datasets that included situations as many as possible will be done in the future work of the paper. ACKNOWLEDGMENT This work was supported by Ministry of Culture, Sports and Tourism (MCST) and Korea Creative Content Agency (KOCCA) in the Culture Technology (CT) Research & Development Program REFERENCES [1] T. Ho and M. Budagavi, Dual-fisheye lens stitching for 360-degree imaging, IEEE International Conference on Acoustics, Speech and Signal Processing, [2] G. Zhen, L. Zhuo, J. Zhang, and X. G. Li, A comparative study of local feature extraction algorithms for web pornographic image recognition, Informatics and Computing, pp , [3] D. G. Lowe, Distinctive image features from scale-invariant keypoints, International Journal of Computer Vision, vo1. 60, no. 2, pp , [4] H. Bay, T. Tuytelaars, and L. V. Gool, SURF: Speeded up robust features, in Proc. European Conference on Computer Vision (ECCV), 2006, vol. 3951, no. 3, pp [5] J. M. Morel and G. Yu, ASIFT: A new framework for fully affine invariant image comparison, SIAM Journal on Imaging Sciences, vol. 2, no. 2, pp , [6] E. Rublee, V. Rabaud, K. Konolige, and G. Bradski, ORB: An efficient alterative to SIFT or SURF, in Proc. International Conference on Computer Vision (ICCV), 2011, pp [7] L. Stefan, M. Chli, and R. Y. Siegwart, BRISK: Binary robust invariant scalable keypoints, in Proc. International Conference on Computer Vision (ICCV), 2011, pp [8] A. Alahi, R. Ortiz, and P. Vandergheynst, FREAK: Fast retina keypoint, in Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012, pp [9] A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems, pp , [10] M. D. Zeiler and R. Fergus, Visualizing and understanding convolutional networks, in Proc. European Conference on Computer Vision, 2014, pp

7 [11] C. Szegedy et al., Going deeper with convolutions, in Proc. IEEE Conference on Computer Vision and Pattern Recognition, 2015, pp [12] K. M. He, X. Y. Zhang, S. Q. Ren, and J. Sun, Deep residual learning for image recognition, in Proc. IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp [13] J. X. Xiao, K. A. Ehinger, A. Oliva, and A. Torralba, Recognizing scene viewpoint using panorama place representation, in Proc. IEEE Conference on Computer Vision and Pattern Recognition, 2012, pp [14] H. Yang and H. Zhang, Indoor structure understanding from single 360 cylindrical panorama image, Computer-Aided Design and Computer Graphics, pp , [15] C. Li, L. Creusen, L. Hazelhoff, and P. H. N. D. With, Detection and recognition of road markings in panorama images, in Proc. Asian Conference on Computer Vision, 2016, pp [16] Y. D. Zhang, S. R. Song, P. Tan, and J. X. Xiao, Pano-context: A whole-room 3d context model for panorama scene understanding, in Proc. European Conference on Computer Vision, 2014, pp [17] F. Deng, X. Zhu, and J. Ren, Object detection on panorama images based on deep learning, in Proc. Control, Automation and Robotics (ICCAR), 2017, pp [18] A. Iscen, G. Tolias, Y. Avrithis, T. Furon, and O. Churn, Panorama to panorama matching for location recognition, in Proc. ACM on International Conference on Multimedia Retrieval, 2017, pp [19] H. Fu, D. H. Xie, R. F. Zhong, Y. Wu, and Q. Wu, An improved ASIFT algorithm for indoor panorama image matching, in Proc. Ninth International Conference on Digital Image Processing, 2017, vol [20] X. Jin and J. W. Kim, Artwork identification for 360-degree panorama images using polyhedron-based rectilinear projection and keypoint shapes, Applied Sciences, vol. 7, no. 5, p. 528, [21] Y. W. Kim et al., Automatic content-aware projection for 360 videos, Computing Research Repository (CoRR), [22] T. K. Sharpless, B. Postle, and D. M. German, Pannini: A new projection for rendering wide angle perspective images, in Proc. the Sixth International Conference on Computational Aesthetics in Graphics, Visualization and Imaging, 2010, pp [23] C. Zhou, Z. Li, and Y. Liu, A measurement study of oculus 360, Degree Video Streaming, [24] C. Brown. Bring pixels front and center in VR video. [Online]. Available: d-center-vr-video, [25] VLFeat.org. [Online]. Available: Dayou Jiang He received his M.S. degree in Computer application Technology from YanBian University, China, in He is currently pursuing the Ph.D. degree in Copyright Protection, Sangmyung University, Korea. His research interests are Image retrieval, Image identification, music identification, digital forensics. Jongweon Kim received the Ph.D. degree from University of Seoul, major in signal processing in He is currently a professor of Dept. of Electronics Engineering and director of Creative Content Labs at Sangmyung University in Korea. He has a lot of practical experiences in the digital signal processing and copyright protection technology in the institutional, the industrial, and academic environments. His research interests are in the areas of copyright protection technology, digital rights management, digital watermarking, and digital forensic marking. 60

CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen

CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS Kuan-Chuan Peng and Tsuhan Chen Cornell University School of Electrical and Computer Engineering Ithaca, NY 14850

More information

MREAK : Morphological Retina Keypoint Descriptor

MREAK : Morphological Retina Keypoint Descriptor MREAK : Morphological Retina Keypoint Descriptor Himanshu Vaghela Department of Computer Engineering D. J. Sanghvi College of Engineering Mumbai, India himanshuvaghela1998@gmail.com Manan Oza Department

More information

Face detection, face alignment, and face image parsing

Face detection, face alignment, and face image parsing Lecture overview Face detection, face alignment, and face image parsing Brandon M. Smith Guest Lecturer, CS 534 Monday, October 21, 2013 Brief introduction to local features Face detection Face alignment

More information

Research on Hand Gesture Recognition Using Convolutional Neural Network

Research on Hand Gesture Recognition Using Convolutional Neural Network Research on Hand Gesture Recognition Using Convolutional Neural Network Tian Zhaoyang a, Cheng Lee Lung b a Department of Electronic Engineering, City University of Hong Kong, Hong Kong, China E-mail address:

More information

Improved SIFT Matching for Image Pairs with a Scale Difference

Improved SIFT Matching for Image Pairs with a Scale Difference Improved SIFT Matching for Image Pairs with a Scale Difference Y. Bastanlar, A. Temizel and Y. Yardımcı Informatics Institute, Middle East Technical University, Ankara, 06531, Turkey Published in IET Electronics,

More information

Content Based Image Retrieval Using Color Histogram

Content Based Image Retrieval Using Color Histogram Content Based Image Retrieval Using Color Histogram Nitin Jain Assistant Professor, Lokmanya Tilak College of Engineering, Navi Mumbai, India. Dr. S. S. Salankar Professor, G.H. Raisoni College of Engineering,

More information

Colour correction for panoramic imaging

Colour correction for panoramic imaging Colour correction for panoramic imaging Gui Yun Tian Duke Gledhill Dave Taylor The University of Huddersfield David Clarke Rotography Ltd Abstract: This paper reports the problem of colour distortion in

More information

Image Manipulation Detection using Convolutional Neural Network

Image Manipulation Detection using Convolutional Neural Network Image Manipulation Detection using Convolutional Neural Network Dong-Hyun Kim 1 and Hae-Yeoun Lee 2,* 1 Graduate Student, 2 PhD, Professor 1,2 Department of Computer Software Engineering, Kumoh National

More information

Video Synthesis System for Monitoring Closed Sections 1

Video Synthesis System for Monitoring Closed Sections 1 Video Synthesis System for Monitoring Closed Sections 1 Taehyeong Kim *, 2 Bum-Jin Park 1 Senior Researcher, Korea Institute of Construction Technology, Korea 2 Senior Researcher, Korea Institute of Construction

More information

DYNAMIC CONVOLUTIONAL NEURAL NETWORK FOR IMAGE SUPER- RESOLUTION

DYNAMIC CONVOLUTIONAL NEURAL NETWORK FOR IMAGE SUPER- RESOLUTION Journal of Advanced College of Engineering and Management, Vol. 3, 2017 DYNAMIC CONVOLUTIONAL NEURAL NETWORK FOR IMAGE SUPER- RESOLUTION Anil Bhujel 1, Dibakar Raj Pant 2 1 Ministry of Information and

More information

A Study on the control Method of 3-Dimensional Space Application using KINECT System Jong-wook Kang, Dong-jun Seo, and Dong-seok Jung,

A Study on the control Method of 3-Dimensional Space Application using KINECT System Jong-wook Kang, Dong-jun Seo, and Dong-seok Jung, IJCSNS International Journal of Computer Science and Network Security, VOL.11 No.9, September 2011 55 A Study on the control Method of 3-Dimensional Space Application using KINECT System Jong-wook Kang,

More information

Panoramic imaging. Ixyzϕθλt. 45 degrees FOV (normal view)

Panoramic imaging. Ixyzϕθλt. 45 degrees FOV (normal view) Camera projections Recall the plenoptic function: Panoramic imaging Ixyzϕθλt (,,,,,, ) At any point xyz,, in space, there is a full sphere of possible incidence directions ϕ, θ, covered by 0 ϕ 2π, 0 θ

More information

HMD based VR Service Framework. July Web3D Consortium Kwan-Hee Yoo Chungbuk National University

HMD based VR Service Framework. July Web3D Consortium Kwan-Hee Yoo Chungbuk National University HMD based VR Service Framework July 31 2017 Web3D Consortium Kwan-Hee Yoo Chungbuk National University khyoo@chungbuk.ac.kr What is Virtual Reality? Making an electronic world seem real and interactive

More information

Objective Quality Assessment Method for Stitched Images

Objective Quality Assessment Method for Stitched Images 1 : (Meer Sadeq Billah et al.: Objective Quality Assessment Method for Stitched Images) (Special Paper) 232, 2018 3 (JBE Vol. 23, No. 2, March 2018) https://doi.org/10.5909/jbe.2018.23.2.227 ISSN 2287-9137

More information

Recognizing Words in Scenes with a Head-Mounted Eye-Tracker

Recognizing Words in Scenes with a Head-Mounted Eye-Tracker Recognizing Words in Scenes with a Head-Mounted Eye-Tracker Takuya Kobayashi, Takumi Toyama, Faisal Shafait, Masakazu Iwamura, Koichi Kise and Andreas Dengel Graduate School of Engineering Osaka Prefecture

More information

Recognizing Panoramas

Recognizing Panoramas Recognizing Panoramas Kevin Luo Stanford University 450 Serra Mall, Stanford, CA 94305 kluo8128@stanford.edu Abstract This project concerns the topic of panorama stitching. Given a set of overlapping photos,

More information

Learning Pixel-Distribution Prior with Wider Convolution for Image Denoising

Learning Pixel-Distribution Prior with Wider Convolution for Image Denoising Learning Pixel-Distribution Prior with Wider Convolution for Image Denoising Peng Liu University of Florida pliu1@ufl.edu Ruogu Fang University of Florida ruogu.fang@bme.ufl.edu arxiv:177.9135v1 [cs.cv]

More information

Camera Model Identification With The Use of Deep Convolutional Neural Networks

Camera Model Identification With The Use of Deep Convolutional Neural Networks Camera Model Identification With The Use of Deep Convolutional Neural Networks Amel TUAMA 2,3, Frédéric COMBY 2,3, and Marc CHAUMONT 1,2,3 (1) University of Nîmes, France (2) University Montpellier, France

More information

Tan-Hsu Tan Dept. of Electrical Engineering National Taipei University of Technology Taipei, Taiwan (ROC)

Tan-Hsu Tan Dept. of Electrical Engineering National Taipei University of Technology Taipei, Taiwan (ROC) Munkhjargal Gochoo, Damdinsuren Bayanduuren, Uyangaa Khuchit, Galbadrakh Battur School of Information and Communications Technology, Mongolian University of Science and Technology Ulaanbaatar, Mongolia

More information

Introduction to Machine Learning

Introduction to Machine Learning Introduction to Machine Learning Deep Learning Barnabás Póczos Credits Many of the pictures, results, and other materials are taken from: Ruslan Salakhutdinov Joshua Bengio Geoffrey Hinton Yann LeCun 2

More information

Study Impact of Architectural Style and Partial View on Landmark Recognition

Study Impact of Architectural Style and Partial View on Landmark Recognition Study Impact of Architectural Style and Partial View on Landmark Recognition Ying Chen smileyc@stanford.edu 1. Introduction Landmark recognition in image processing is one of the important object recognition

More information

arxiv: v1 [cs.lg] 2 Jan 2018

arxiv: v1 [cs.lg] 2 Jan 2018 Deep Learning for Identifying Potential Conceptual Shifts for Co-creative Drawing arxiv:1801.00723v1 [cs.lg] 2 Jan 2018 Pegah Karimi pkarimi@uncc.edu Kazjon Grace The University of Sydney Sydney, NSW 2006

More information

Multimedia Forensics

Multimedia Forensics Multimedia Forensics Using Mathematics and Machine Learning to Determine an Image's Source and Authenticity Matthew C. Stamm Multimedia & Information Security Lab (MISL) Department of Electrical and Computer

More information

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition. ECE 289G: Paper Presentation #3 Philipp Gysel

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition. ECE 289G: Paper Presentation #3 Philipp Gysel DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition ECE 289G: Paper Presentation #3 Philipp Gysel Autonomous Car ECE 289G Paper Presentation, Philipp Gysel Slide 2 Source: maps.google.com

More information

Demosaicing Algorithm for Color Filter Arrays Based on SVMs

Demosaicing Algorithm for Color Filter Arrays Based on SVMs www.ijcsi.org 212 Demosaicing Algorithm for Color Filter Arrays Based on SVMs Xiao-fen JIA, Bai-ting Zhao School of Electrical and Information Engineering, Anhui University of Science & Technology Huainan

More information

LANDMARK recognition is an important feature for

LANDMARK recognition is an important feature for 1 NU-LiteNet: Mobile Landmark Recognition using Convolutional Neural Networks Chakkrit Termritthikun, Surachet Kanprachar, Paisarn Muneesawang arxiv:1810.01074v1 [cs.cv] 2 Oct 2018 Abstract The growth

More information

Deep Learning. Dr. Johan Hagelbäck.

Deep Learning. Dr. Johan Hagelbäck. Deep Learning Dr. Johan Hagelbäck johan.hagelback@lnu.se http://aiguy.org Image Classification Image classification can be a difficult task Some of the challenges we have to face are: Viewpoint variation:

More information

Near Infrared Face Image Quality Assessment System of Video Sequences

Near Infrared Face Image Quality Assessment System of Video Sequences 2011 Sixth International Conference on Image and Graphics Near Infrared Face Image Quality Assessment System of Video Sequences Jianfeng Long College of Electrical and Information Engineering Hunan University

More information

METHODS AND ALGORITHMS FOR STITCHING 360-DEGREE VIDEO

METHODS AND ALGORITHMS FOR STITCHING 360-DEGREE VIDEO International Journal of Civil Engineering and Technology (IJCIET) Volume 9, Issue 12, December 2018, pp. 77 85, Article ID: IJCIET_09_12_011 Available online at http://www.iaeme.com/ijciet/issues.asp?jtype=ijciet&vtype=9&itype=12

More information

Rectifying the Planet USING SPACE TO HELP LIFE ON EARTH

Rectifying the Planet USING SPACE TO HELP LIFE ON EARTH Rectifying the Planet USING SPACE TO HELP LIFE ON EARTH About Me Computer Science (BS) Ecology (PhD, almost ) I write programs that process satellite data Scientific Computing! Land Cover Classification

More information

An Audio Fingerprint Algorithm Based on Statistical Characteristics of db4 Wavelet

An Audio Fingerprint Algorithm Based on Statistical Characteristics of db4 Wavelet Journal of Information & Computational Science 8: 14 (2011) 3027 3034 Available at http://www.joics.com An Audio Fingerprint Algorithm Based on Statistical Characteristics of db4 Wavelet Jianguo JIANG

More information

Tiny ImageNet Challenge Investigating the Scaling of Inception Layers for Reduced Scale Classification Problems

Tiny ImageNet Challenge Investigating the Scaling of Inception Layers for Reduced Scale Classification Problems Tiny ImageNet Challenge Investigating the Scaling of Inception Layers for Reduced Scale Classification Problems Emeric Stéphane Boigné eboigne@stanford.edu Jan Felix Heyse heyse@stanford.edu Abstract Scaling

More information

360 Panorama Super-resolution using Deep Convolutional Networks

360 Panorama Super-resolution using Deep Convolutional Networks 360 Panorama Super-resolution using Deep Convolutional Networks Vida Fakour-Sevom 1,2, Esin Guldogan 1 and Joni-Kristian Kämäräinen 2 1 Nokia Technologies, Finland 2 Laboratory of Signal Processing, Tampere

More information

Number Plate Detection with a Multi-Convolutional Neural Network Approach with Optical Character Recognition for Mobile Devices

Number Plate Detection with a Multi-Convolutional Neural Network Approach with Optical Character Recognition for Mobile Devices J Inf Process Syst, Vol.12, No.1, pp.100~108, March 2016 http://dx.doi.org/10.3745/jips.04.0022 ISSN 1976-913X (Print) ISSN 2092-805X (Electronic) Number Plate Detection with a Multi-Convolutional Neural

More information

An Efficient Approach to Face Recognition Using a Modified Center-Symmetric Local Binary Pattern (MCS-LBP)

An Efficient Approach to Face Recognition Using a Modified Center-Symmetric Local Binary Pattern (MCS-LBP) , pp.13-22 http://dx.doi.org/10.14257/ijmue.2015.10.8.02 An Efficient Approach to Face Recognition Using a Modified Center-Symmetric Local Binary Pattern (MCS-LBP) Anusha Alapati 1 and Dae-Seong Kang 1

More information

Wavelet-based Image Splicing Forgery Detection

Wavelet-based Image Splicing Forgery Detection Wavelet-based Image Splicing Forgery Detection 1 Tulsi Thakur M.Tech (CSE) Student, Department of Computer Technology, basiltulsi@gmail.com 2 Dr. Kavita Singh Head & Associate Professor, Department of

More information

Simulated Programmable Apertures with Lytro

Simulated Programmable Apertures with Lytro Simulated Programmable Apertures with Lytro Yangyang Yu Stanford University yyu10@stanford.edu Abstract This paper presents a simulation method using the commercial light field camera Lytro, which allows

More information

Effects of the Unscented Kalman Filter Process for High Performance Face Detector

Effects of the Unscented Kalman Filter Process for High Performance Face Detector Effects of the Unscented Kalman Filter Process for High Performance Face Detector Bikash Lamsal and Naofumi Matsumoto Abstract This paper concerns with a high performance algorithm for human face detection

More information

Biologically Inspired Computation

Biologically Inspired Computation Biologically Inspired Computation Deep Learning & Convolutional Neural Networks Joe Marino biologically inspired computation biological intelligence flexible capable of detecting/ executing/reasoning about

More information

Efficient Construction of SIFT Multi-Scale Image Pyramids for Embedded Robot Vision

Efficient Construction of SIFT Multi-Scale Image Pyramids for Embedded Robot Vision Efficient Construction of SIFT Multi-Scale Image Pyramids for Embedded Robot Vision Peter Andreas Entschev and Hugo Vieira Neto Graduate School of Electrical Engineering and Applied Computer Science Federal

More information

TRANSFORMING PHOTOS TO COMICS USING CONVOLUTIONAL NEURAL NETWORKS. Tsinghua University, China Cardiff University, UK

TRANSFORMING PHOTOS TO COMICS USING CONVOLUTIONAL NEURAL NETWORKS. Tsinghua University, China Cardiff University, UK TRANSFORMING PHOTOS TO COMICS USING CONVOUTIONA NEURA NETWORKS Yang Chen Yu-Kun ai Yong-Jin iu Tsinghua University, China Cardiff University, UK ABSTRACT In this paper, inspired by Gatys s recent work,

More information

Beacon Island Report / Notes

Beacon Island Report / Notes Beacon Island Report / Notes Paul Bourke, ivec@uwa, 17 February 2014 During my 2013 and 2014 visits to Beacon Island four general digital asset categories were acquired, they were: high resolution panoramic

More information

LENSLESS IMAGING BY COMPRESSIVE SENSING

LENSLESS IMAGING BY COMPRESSIVE SENSING LENSLESS IMAGING BY COMPRESSIVE SENSING Gang Huang, Hong Jiang, Kim Matthews and Paul Wilford Bell Labs, Alcatel-Lucent, Murray Hill, NJ 07974 ABSTRACT In this paper, we propose a lensless compressive

More information

Book Cover Recognition Project

Book Cover Recognition Project Book Cover Recognition Project Carolina Galleguillos Department of Computer Science University of California San Diego La Jolla, CA 92093-0404 cgallegu@cs.ucsd.edu Abstract The purpose of this project

More information

ROAD RECOGNITION USING FULLY CONVOLUTIONAL NEURAL NETWORKS

ROAD RECOGNITION USING FULLY CONVOLUTIONAL NEURAL NETWORKS Bulletin of the Transilvania University of Braşov Vol. 10 (59) No. 2-2017 Series I: Engineering Sciences ROAD RECOGNITION USING FULLY CONVOLUTIONAL NEURAL NETWORKS E. HORVÁTH 1 C. POZNA 2 Á. BALLAGI 3

More information

Deep Neural Network Architectures for Modulation Classification

Deep Neural Network Architectures for Modulation Classification Deep Neural Network Architectures for Modulation Classification Xiaoyu Liu, Diyu Yang, and Aly El Gamal School of Electrical and Computer Engineering Purdue University Email: {liu1962, yang1467, elgamala}@purdue.edu

More information

HDR IMAGING FOR FEATURE DETECTION ON DETAILED ARCHITECTURAL SCENES

HDR IMAGING FOR FEATURE DETECTION ON DETAILED ARCHITECTURAL SCENES HDR IMAGING FOR FEATURE DETECTION ON DETAILED ARCHITECTURAL SCENES G. Kontogianni, E. K. Stathopoulou*, A. Georgopoulos, A. Doulamis Laboratory of Photogrammetry, School of Rural and Surveying Engineering,

More information

Panoramic Image Stitching based on Feature Extraction and Correlation

Panoramic Image Stitching based on Feature Extraction and Correlation Panoramic Image Stitching based on Feature Extraction and Correlation Arya Mary K J 1, Dr. Priya S 2 PG Student, Department of Computer Engineering, Model Engineering College, Ernakulam, Kerala, India

More information

A Comparison of Histogram and Template Matching for Face Verification

A Comparison of Histogram and Template Matching for Face Verification A Comparison of and Template Matching for Face Verification Chidambaram Chidambaram Universidade do Estado de Santa Catarina chidambaram@udesc.br Marlon Subtil Marçal, Leyza Baldo Dorini, Hugo Vieira Neto

More information

Sketch-a-Net that Beats Humans

Sketch-a-Net that Beats Humans Sketch-a-Net that Beats Humans Qian Yu SketchLab@QMUL Queen Mary University of London 1 Authors Qian Yu Yongxin Yang Yi-Zhe Song Tao Xiang Timothy Hospedales 2 Let s play a game! Round 1 Easy fish face

More information

Convolutional Neural Network-Based Infrared Image Super Resolution Under Low Light Environment

Convolutional Neural Network-Based Infrared Image Super Resolution Under Low Light Environment Convolutional Neural Network-Based Infrared Super Resolution Under Low Light Environment Tae Young Han, Yong Jun Kim, Byung Cheol Song Department of Electronic Engineering Inha University Incheon, Republic

More information

Research on Pupil Segmentation and Localization in Micro Operation Hu BinLiang1, a, Chen GuoLiang2, b, Ma Hui2, c

Research on Pupil Segmentation and Localization in Micro Operation Hu BinLiang1, a, Chen GuoLiang2, b, Ma Hui2, c 3rd International Conference on Machinery, Materials and Information Technology Applications (ICMMITA 2015) Research on Pupil Segmentation and Localization in Micro Operation Hu BinLiang1, a, Chen GuoLiang2,

More information

An Efficient Framework for Image Analysis using Mapreduce

An Efficient Framework for Image Analysis using Mapreduce An Efficient Framework for Image Analysis using Mapreduce S Vidya Sagar Appaji 1, P.V.Lakshmi 2 and P.Srinivasa Rao 3 1 CSE Department, MVGR College of Engineering, Vizianagaram 2 IT Department, GITAM,

More information

An energy-efficient coarse grained spatial architecture for convolutional neural networks AlexNet

An energy-efficient coarse grained spatial architecture for convolutional neural networks AlexNet LETTER IEICE Electronics Express, Vol.14, No.15, 1 12 An energy-efficient coarse grained spatial architecture for convolutional neural networks AlexNet Boya Zhao a), Mingjiang Wang b), and Ming Liu Harbin

More information

Robot Visual Mapper. Hung Dang, Jasdeep Hundal and Ramu Nachiappan. Fig. 1: A typical image of Rovio s environment

Robot Visual Mapper. Hung Dang, Jasdeep Hundal and Ramu Nachiappan. Fig. 1: A typical image of Rovio s environment Robot Visual Mapper Hung Dang, Jasdeep Hundal and Ramu Nachiappan Abstract Mapping is an essential component of autonomous robot path planning and navigation. The standard approach often employs laser

More information

Wadehra Kartik, Kathpalia Mukul, Bahl Vasudha, International Journal of Advance Research, Ideas and Innovations in Technology

Wadehra Kartik, Kathpalia Mukul, Bahl Vasudha, International Journal of Advance Research, Ideas and Innovations in Technology ISSN: 2454-132X Impact factor: 4.295 (Volume 4, Issue 1) Available online at www.ijariit.com Hand Detection and Gesture Recognition in Real-Time Using Haar-Classification and Convolutional Neural Networks

More information

Webcam Image Alignment

Webcam Image Alignment Washington University in St. Louis Washington University Open Scholarship All Computer Science and Engineering Research Computer Science and Engineering Report Number: WUCSE-2011-46 2011 Webcam Image Alignment

More information

RESEARCH AND DEVELOPMENT OF DSP-BASED FACE RECOGNITION SYSTEM FOR ROBOTIC REHABILITATION NURSING BEDS

RESEARCH AND DEVELOPMENT OF DSP-BASED FACE RECOGNITION SYSTEM FOR ROBOTIC REHABILITATION NURSING BEDS RESEARCH AND DEVELOPMENT OF DSP-BASED FACE RECOGNITION SYSTEM FOR ROBOTIC REHABILITATION NURSING BEDS Ming XING and Wushan CHENG College of Mechanical Engineering, Shanghai University of Engineering Science,

More information

Brief summary report of novel digital capture techniques

Brief summary report of novel digital capture techniques Brief summary report of novel digital capture techniques Paul Bourke, ivec@uwa, February 2014 The following briefly summarizes and gives examples of the various forms of novel digital photography and video

More information

Face Detection System on Ada boost Algorithm Using Haar Classifiers

Face Detection System on Ada boost Algorithm Using Haar Classifiers Vol.2, Issue.6, Nov-Dec. 2012 pp-3996-4000 ISSN: 2249-6645 Face Detection System on Ada boost Algorithm Using Haar Classifiers M. Gopi Krishna, A. Srinivasulu, Prof (Dr.) T.K.Basak 1, 2 Department of Electronics

More information

Dual-fisheye Lens Stitching for 360-degree Imaging & Video. Tuan Ho, PhD. Student Electrical Engineering Dept., UT Arlington

Dual-fisheye Lens Stitching for 360-degree Imaging & Video. Tuan Ho, PhD. Student Electrical Engineering Dept., UT Arlington Dual-fisheye Lens Stitching for 360-degree Imaging & Video Tuan Ho, PhD. Student Electrical Engineering Dept., UT Arlington Introduction 360-degree imaging: the process of taking multiple photographs and

More information

Landmark Recognition with Deep Learning

Landmark Recognition with Deep Learning Landmark Recognition with Deep Learning PROJECT LABORATORY submitted by Filippo Galli NEUROSCIENTIFIC SYSTEM THEORY Technische Universität München Prof. Dr Jörg Conradt Supervisor: Marcello Mulas, PhD

More information

Pose Invariant Face Recognition

Pose Invariant Face Recognition Pose Invariant Face Recognition Fu Jie Huang Zhihua Zhou Hong-Jiang Zhang Tsuhan Chen Electrical and Computer Engineering Department Carnegie Mellon University jhuangfu@cmu.edu State Key Lab for Novel

More information

Synthetic View Generation for Absolute Pose Regression and Image Synthesis: Supplementary material

Synthetic View Generation for Absolute Pose Regression and Image Synthesis: Supplementary material Synthetic View Generation for Absolute Pose Regression and Image Synthesis: Supplementary material Pulak Purkait 1 pulak.cv@gmail.com Cheng Zhao 2 irobotcheng@gmail.com Christopher Zach 1 christopher.m.zach@gmail.com

More information

Abstract. 1. Introduction and Motivation. 3. Methods. 2. Related Work Omni Directional Stereo Imaging

Abstract. 1. Introduction and Motivation. 3. Methods. 2. Related Work Omni Directional Stereo Imaging Abstract This project aims to create a camera system that captures stereoscopic 360 degree panoramas of the real world, and a viewer to render this content in a headset, with accurate spatial sound. 1.

More information

THERMAL DETECTION OF WATER SATURATION SPOTS FOR LANDSLIDE PREDICTION

THERMAL DETECTION OF WATER SATURATION SPOTS FOR LANDSLIDE PREDICTION THERMAL DETECTION OF WATER SATURATION SPOTS FOR LANDSLIDE PREDICTION Aufa Zin, Kamarul Hawari and Norliana Khamisan Faculty of Electrical and Electronics Engineering, Universiti Malaysia Pahang, Pekan,

More information

Image stitching. Image stitching. Video summarization. Applications of image stitching. Stitching = alignment + blending. geometrical registration

Image stitching. Image stitching. Video summarization. Applications of image stitching. Stitching = alignment + blending. geometrical registration Image stitching Stitching = alignment + blending Image stitching geometrical registration photometric registration Digital Visual Effects, Spring 2006 Yung-Yu Chuang 2005/3/22 with slides by Richard Szeliski,

More information

Multimodal Face Recognition using Hybrid Correlation Filters

Multimodal Face Recognition using Hybrid Correlation Filters Multimodal Face Recognition using Hybrid Correlation Filters Anamika Dubey, Abhishek Sharma Electrical Engineering Department, Indian Institute of Technology Roorkee, India {ana.iitr, abhisharayiya}@gmail.com

More information

Counterfeit Bill Detection Algorithm using Deep Learning

Counterfeit Bill Detection Algorithm using Deep Learning Counterfeit Bill Detection Algorithm using Deep Learning Soo-Hyeon Lee 1 and Hae-Yeoun Lee 2,* 1 Undergraduate Student, 2 Professor 1,2 Department of Computer Software Engineering, Kumoh National Institute

More information

Real Time Word to Picture Translation for Chinese Restaurant Menus

Real Time Word to Picture Translation for Chinese Restaurant Menus Real Time Word to Picture Translation for Chinese Restaurant Menus Michelle Jin, Ling Xiao Wang, Boyang Zhang Email: mzjin12, lx2wang, boyangz @stanford.edu EE268 Project Report, Spring 2014 Abstract--We

More information

COLOR IMAGE SEGMENTATION USING K-MEANS CLASSIFICATION ON RGB HISTOGRAM SADIA BASAR, AWAIS ADNAN, NAILA HABIB KHAN, SHAHAB HAIDER

COLOR IMAGE SEGMENTATION USING K-MEANS CLASSIFICATION ON RGB HISTOGRAM SADIA BASAR, AWAIS ADNAN, NAILA HABIB KHAN, SHAHAB HAIDER COLOR IMAGE SEGMENTATION USING K-MEANS CLASSIFICATION ON RGB HISTOGRAM SADIA BASAR, AWAIS ADNAN, NAILA HABIB KHAN, SHAHAB HAIDER Department of Computer Science, Institute of Management Sciences, 1-A, Sector

More information

Hand Gesture Recognition by Means of Region- Based Convolutional Neural Networks

Hand Gesture Recognition by Means of Region- Based Convolutional Neural Networks Contemporary Engineering Sciences, Vol. 10, 2017, no. 27, 1329-1342 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ces.2017.710154 Hand Gesture Recognition by Means of Region- Based Convolutional

More information

Semantic Segmentation in Red Relief Image Map by UX-Net

Semantic Segmentation in Red Relief Image Map by UX-Net Semantic Segmentation in Red Relief Image Map by UX-Net Tomoya Komiyama 1, Kazuhiro Hotta 1, Kazuo Oda 2, Satomi Kakuta 2 and Mikako Sano 2 1 Meijo University, Shiogamaguchi, 468-0073, Nagoya, Japan 2

More information

A Geometric Correction Method of Plane Image Based on OpenCV

A Geometric Correction Method of Plane Image Based on OpenCV Sensors & Transducers 204 by IFSA Publishing, S. L. http://www.sensorsportal.com A Geometric orrection Method of Plane Image ased on OpenV Li Xiaopeng, Sun Leilei, 2 Lou aiying, Liu Yonghong ollege of

More information

Subregion Mosaicking Applied to Nonideal Iris Recognition

Subregion Mosaicking Applied to Nonideal Iris Recognition Subregion Mosaicking Applied to Nonideal Iris Recognition Tao Yang, Joachim Stahl, Stephanie Schuckers, Fang Hua Department of Computer Science Department of Electrical Engineering Clarkson University

More information

Passive Image Forensic Method to detect Copy Move Forgery in Digital Images

Passive Image Forensic Method to detect Copy Move Forgery in Digital Images IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 16, Issue 2, Ver. XII (Mar-Apr. 2014), PP 96-104 Passive Image Forensic Method to detect Copy Move Forgery in

More information

GPU ACCELERATED DEEP LEARNING WITH CUDNN

GPU ACCELERATED DEEP LEARNING WITH CUDNN GPU ACCELERATED DEEP LEARNING WITH CUDNN Larry Brown Ph.D. March 2015 AGENDA 1 Introducing cudnn and GPUs 2 Deep Learning Context 3 cudnn V2 4 Using cudnn 2 Introducing cudnn and GPUs 3 HOW GPU ACCELERATION

More information

Convolutional Neural Network-based Steganalysis on Spatial Domain

Convolutional Neural Network-based Steganalysis on Spatial Domain Convolutional Neural Network-based Steganalysis on Spatial Domain Dong-Hyun Kim, and Hae-Yeoun Lee Abstract Steganalysis has been studied to detect the existence of hidden messages by steganography. However,

More information

Multi-Resolution Estimation of Optical Flow on Vehicle Tracking under Unpredictable Environments

Multi-Resolution Estimation of Optical Flow on Vehicle Tracking under Unpredictable Environments , pp.32-36 http://dx.doi.org/10.14257/astl.2016.129.07 Multi-Resolution Estimation of Optical Flow on Vehicle Tracking under Unpredictable Environments Viet Dung Do 1 and Dong-Min Woo 1 1 Department of

More information

Reversible data hiding based on histogram modification using S-type and Hilbert curve scanning

Reversible data hiding based on histogram modification using S-type and Hilbert curve scanning Advances in Engineering Research (AER), volume 116 International Conference on Communication and Electronic Information Engineering (CEIE 016) Reversible data hiding based on histogram modification using

More information

Scalable systems for early fault detection in wind turbines: A data driven approach

Scalable systems for early fault detection in wind turbines: A data driven approach Scalable systems for early fault detection in wind turbines: A data driven approach Martin Bach-Andersen 1,2, Bo Rømer-Odgaard 1, and Ole Winther 2 1 Siemens Diagnostic Center, Denmark 2 Cognitive Systems,

More information

Elemental Image Generation Method with the Correction of Mismatch Error by Sub-pixel Sampling between Lens and Pixel in Integral Imaging

Elemental Image Generation Method with the Correction of Mismatch Error by Sub-pixel Sampling between Lens and Pixel in Integral Imaging Journal of the Optical Society of Korea Vol. 16, No. 1, March 2012, pp. 29-35 DOI: http://dx.doi.org/10.3807/josk.2012.16.1.029 Elemental Image Generation Method with the Correction of Mismatch Error by

More information

Deep filter banks for texture recognition and segmentation

Deep filter banks for texture recognition and segmentation Deep filter banks for texture recognition and segmentation Mircea Cimpoi, University of Oxford Subhransu Maji, UMASS Amherst Andrea Vedaldi, University of Oxford Texture understanding 2 Indicator of materials

More information

Proposed Kumototo Site 10 Wellington

Proposed Kumototo Site 10 Wellington Proposed Kumototo Site 10 Wellington Visualisation Simulation Methodology - Buildmedia Limited Contents 1.0 Introduction 2.0 Process Methodology Kumototo Site 10 Visual Simulation 3.0 Conclusion 1.0 Introduction

More information

Colorful Image Colorizations Supplementary Material

Colorful Image Colorizations Supplementary Material Colorful Image Colorizations Supplementary Material Richard Zhang, Phillip Isola, Alexei A. Efros {rich.zhang, isola, efros}@eecs.berkeley.edu University of California, Berkeley 1 Overview This document

More information

Convolutional Networks Overview

Convolutional Networks Overview Convolutional Networks Overview Sargur Srihari 1 Topics Limitations of Conventional Neural Networks The convolution operation Convolutional Networks Pooling Convolutional Network Architecture Advantages

More information

SIMULATION-BASED MODEL CONTROL USING STATIC HAND GESTURES IN MATLAB

SIMULATION-BASED MODEL CONTROL USING STATIC HAND GESTURES IN MATLAB SIMULATION-BASED MODEL CONTROL USING STATIC HAND GESTURES IN MATLAB S. Kajan, J. Goga Institute of Robotics and Cybernetics, Faculty of Electrical Engineering and Information Technology, Slovak University

More information

Simple Impulse Noise Cancellation Based on Fuzzy Logic

Simple Impulse Noise Cancellation Based on Fuzzy Logic Simple Impulse Noise Cancellation Based on Fuzzy Logic Chung-Bin Wu, Bin-Da Liu, and Jar-Ferr Yang wcb@spic.ee.ncku.edu.tw, bdliu@cad.ee.ncku.edu.tw, fyang@ee.ncku.edu.tw Department of Electrical Engineering

More information

A Compact Miniaturized Frequency Selective Surface with Stable Resonant Frequency

A Compact Miniaturized Frequency Selective Surface with Stable Resonant Frequency Progress In Electromagnetics Research Letters, Vol. 62, 17 22, 2016 A Compact Miniaturized Frequency Selective Surface with Stable Resonant Frequency Ning Liu 1, *, Xian-Jun Sheng 2, and Jing-Jing Fan

More information

Recognition: Overview. Sanja Fidler CSC420: Intro to Image Understanding 1/ 83

Recognition: Overview. Sanja Fidler CSC420: Intro to Image Understanding 1/ 83 Recognition: Overview Sanja Fidler CSC420: Intro to Image Understanding 1/ 83 Textbook This book has a lot of material: K. Grauman and B. Leibe Visual Object Recognition Synthesis Lectures On Computer

More information

ON THE CREATION OF PANORAMIC IMAGES FROM IMAGE SEQUENCES

ON THE CREATION OF PANORAMIC IMAGES FROM IMAGE SEQUENCES ON THE CREATION OF PANORAMIC IMAGES FROM IMAGE SEQUENCES Petteri PÖNTINEN Helsinki University of Technology, Institute of Photogrammetry and Remote Sensing, Finland petteri.pontinen@hut.fi KEY WORDS: Cocentricity,

More information

Toward an Augmented Reality System for Violin Learning Support

Toward an Augmented Reality System for Violin Learning Support Toward an Augmented Reality System for Violin Learning Support Hiroyuki Shiino, François de Sorbier, and Hideo Saito Graduate School of Science and Technology, Keio University, Yokohama, Japan {shiino,fdesorbi,saito}@hvrl.ics.keio.ac.jp

More information

multiframe visual-inertial blur estimation and removal for unmodified smartphones

multiframe visual-inertial blur estimation and removal for unmodified smartphones multiframe visual-inertial blur estimation and removal for unmodified smartphones, Severin Münger, Carlo Beltrame, Luc Humair WSCG 2015, Plzen, Czech Republic images taken by non-professional photographers

More information

3D Face Recognition System in Time Critical Security Applications

3D Face Recognition System in Time Critical Security Applications Middle-East Journal of Scientific Research 25 (7): 1619-1623, 2017 ISSN 1990-9233 IDOSI Publications, 2017 DOI: 10.5829/idosi.mejsr.2017.1619.1623 3D Face Recognition System in Time Critical Security Applications

More information

TECHNICAL DOCUMENTATION

TECHNICAL DOCUMENTATION TECHNICAL DOCUMENTATION NEED HELP? Call us on +44 (0) 121 231 3215 TABLE OF CONTENTS Document Control and Authority...3 Introduction...4 Camera Image Creation Pipeline...5 Photo Metadata...6 Sensor Identification

More information

Stamp detection in scanned documents

Stamp detection in scanned documents Annales UMCS Informatica AI X, 1 (2010) 61-68 DOI: 10.2478/v10065-010-0036-6 Stamp detection in scanned documents Paweł Forczmański Chair of Multimedia Systems, West Pomeranian University of Technology,

More information

Panoramic human structure maintenance based on invariant features of video frames

Panoramic human structure maintenance based on invariant features of video frames Chang et al. Human-centric Computing and Information Sciences 2013, 3:14 RESEARCH Open Access Panoramic human structure maintenance based on invariant features of video frames Shih-Ming Chang 1*, Hon-Hang

More information

A Method of Multi-License Plate Location in Road Bayonet Image

A Method of Multi-License Plate Location in Road Bayonet Image A Method of Multi-License Plate Location in Road Bayonet Image Ying Qian The lab of Graphics and Multimedia Chongqing University of Posts and Telecommunications Chongqing, China Zhi Li The lab of Graphics

More information

Improvement of Accuracy in Remote Gaze Detection for User Wearing Eyeglasses Using Relative Position Between Centers of Pupil and Corneal Sphere

Improvement of Accuracy in Remote Gaze Detection for User Wearing Eyeglasses Using Relative Position Between Centers of Pupil and Corneal Sphere Improvement of Accuracy in Remote Gaze Detection for User Wearing Eyeglasses Using Relative Position Between Centers of Pupil and Corneal Sphere Kiyotaka Fukumoto (&), Takumi Tsuzuki, and Yoshinobu Ebisawa

More information

Digital Photographic Imaging Using MOEMS

Digital Photographic Imaging Using MOEMS Digital Photographic Imaging Using MOEMS Vasileios T. Nasis a, R. Andrew Hicks b and Timothy P. Kurzweg a a Department of Electrical and Computer Engineering, Drexel University, Philadelphia, USA b Department

More information