Computing Touristic Walking Routes using Geotagged Photographs from Flickr

Similar documents
Image Extraction using Image Mining Technique

TOURISM for several country is a primordial matter to

Vistradas: Visual Analytics for Urban Trajectory Data

An Approach to Semantic Processing of GPS Traces

Urban Traffic Bottleneck Identification Based on Congestion Propagation

International Journal of Advance Engineering and Research Development. Generating The Summary Of Geographic Area

A Study on the Accuracy of Flickr s Geotag Data

LOCATION PRIVACY & TRAJECTORY PRIVACY. Elham Naghizade COMP20008 Elements of Data Processing 20 rd May 2016

Where Do Tourists Go? Visualizing and Analyzing the Spatial Distribution of Geotagged Photography

City Research Online. Permanent City Research Online URL:

Understanding the city to make it smart

On-site Traffic Accident Detection with Both Social Media and Traffic Data

ORBIS via: A Situated Perspective of a Transportation Network Based on Computer Gaming Principles

The Pennsylvania State University The Graduate School A STATISTICS-BASED FRAMEWORK FOR BUS TRAVEL TIME PREDICTION

An Embedding Model for Mining Human Trajectory Data with Image Sharing

Exploring the New Trends of Chinese Tourists in Switzerland

Mining Social Data to Extract Intellectual Knowledge

A STUDY FOR CAUSE ESTIMATION OF FAULTS USING STATISTICAL ANALYSIS

Spatial Color Indexing using ACC Algorithm

I. INTRODUCTION II. LITERATURE SURVEY. International Journal of Advanced Networking & Applications (IJANA) ISSN:

Autocomplete Sketch Tool

COLOR IMAGE SEGMENTATION USING K-MEANS CLASSIFICATION ON RGB HISTOGRAM SADIA BASAR, AWAIS ADNAN, NAILA HABIB KHAN, SHAHAB HAIDER

AUTOMATED METHOD FOR STATISTIC PROCESSING OF AE TESTING DATA

Xuegang (Jeff) Ban, Xia Yang, Jeff Wojtowicz, Jose Holguin-Veras Rensselaer Polytechnic Institute

The UN Population Division Urban Database

Study Impact of Architectural Style and Partial View on Landmark Recognition

Wi-Fi Fingerprinting through Active Learning using Smartphones

Twitter Event Photo Detection Using both Geotagged Tweets and Non-geotagged Photo Tweets

Location and User Activity Preference Based Recommendation System

Why Google Result Positioning Matters

DISTINGUISHING URBAN BUILT-UP AND BARE SOIL FEATURES FROM LANDSAT 8 OLI IMAGERY USING DIFFERENT DEVELOPED BAND INDICES

BEST PRACTICES IN INNOVATIONS IN MICROPLANNING FOR POLIO ERADICATION

Enhanced wireless indoor tracking system in multi-floor buildings with location prediction

CONTEXT-BASED MEDIA GEOTAGGING OF PERSONAL PHOTOS. Ivan Tankoyeu, Julian Stöttinger, Fausto Giunchiglia

Locating the Query Block in a Source Document Image

Rhythmic Similarity -- a quick paper review. Presented by: Shi Yong March 15, 2007 Music Technology, McGill University

Segmentation using Saturation Thresholding and its Application in Content-Based Retrieval of Images

GPSView: A Scenic Driving Route Planner

INTELLIGENT APRIORI ALGORITHM FOR COMPLEX ACTIVITY MINING IN SUPERMARKET APPLICATIONS

Chitika Insights The Value of Google Result Positioning

Is Food Scenery? Generative Situations in Urban Networked Photography

A MOBILE SOLUTION TO HELP VISUALLY IMPAIRED PEOPLE IN PUBLIC TRANSPORTS AND IN PEDESTRIAN WALKS

Advanced Techniques for Mobile Robotics Location-Based Activity Recognition

Strategic and Tactical Reasoning with Waypoints Lars Lidén Valve Software

Local and Low-Cost White Space Detection

Tourism network analysis 1

Neural Labyrinth Robot Finding the Best Way in a Connectionist Fashion

The study of Fuzzy theory applied to cool guys looking for beautiful girl

Magnusson, Charlotte; Rassmus-Gröhn, Kirsten; Szymczak, Delphine

Retrieval of Large Scale Images and Camera Identification via Random Projections

Technology Roadmap using Patent Keyword

Advanced Analytics for Intelligent Society

Algorithm for wavelength assignment in optical networks

The purpose of this study is to show that this difference is crucial.

Analysis of Temporal Logarithmic Perspective Phenomenon Based on Changing Density of Information

A Solution for Identification of Bird s Nests on Transmission Lines with UAV Patrol. Qinghua Wang

Automatic Morphological Segmentation and Region Growing Method of Diagnosing Medical Images

AN IMPROVED NO-REFERENCE SHARPNESS METRIC BASED ON THE PROBABILITY OF BLUR DETECTION. Niranjan D. Narvekar and Lina J. Karam

Automatic Aesthetic Photo-Rating System

idocent: Indoor Digital Orientation Communication and Enabling Navigational Technology

Extracting user habits from Google maps history logs

Your Neighbors Affect Your Ratings: On Geographical Neighborhood Influence to Rating Prediction

A Vehicular Visual Tracking System Incorporating Global Positioning System

Innovative mobility data collection tools for sustainable planning

Learning and Using Models of Kicking Motions for Legged Robots

History and Perspective of Simulation in Manufacturing.

Voice Activity Detection

Uncertainty in CT Metrology: Visualizations for Exploration and Analysis of Geometric Tolerances

Travel Photo Album Summarization based on Aesthetic quality, Interestingness, and Memorableness

Integrated Digital System for Yarn Surface Quality Evaluation using Computer Vision and Artificial Intelligence

An Improved Event Detection Algorithm for Non- Intrusive Load Monitoring System for Low Frequency Smart Meters

SPTF: Smart Photo-Tagging Framework on Smart Phones

A Vehicular Visual Tracking System Incorporating Global Positioning System

A Vehicular Visual Tracking System Incorporating Global Positioning System

A Spatiotemporal Approach for Social Situation Recognition

A Gentle Introduction to Dynamic Programming and the Viterbi Algorithm

Thesis: Bio-Inspired Vision Model Implementation In Compressed Surveillance Videos by. Saman Poursoltan. Thesis submitted for the degree of

Natalia Vassilieva HP Labs Russia

II. MEASUREMENT OF THE CITY PERFORMANCE EFFICIENCY

FUTURE-PROOF INTERFACES: SYSTEMATIC IDENTIFICATION AND ANALYSIS

Colorful Image Colorizations Supplementary Material

An Integrated Expert User with End User in Technology Acceptance Model for Actual Evaluation


Extraction and Recognition of Text From Digital English Comic Image Using Median Filter

A Study of Optimal Spatial Partition Size and Field of View in Massively Multiplayer Online Game Server

Lots of Pervasive Devices and Web services producing data about us!

The multi-facets of building dependable applications over connected physical objects

Study of Location Management for Next Generation Personal Communication Networks

Maps for People Who Walk and Bike

Tour the World: building a web-scale landmark recognition engine

QS Spiral: Visualizing Periodic Quantified Self Data

Automatic Image Timestamp Correction

Epitome A Social Game for Photo Album Summarization

Neural Networks for Real-time Pathfinding in Computer Games

AUTOMATED MUSIC TRACK GENERATION

NJDEP GPS Data Collection Standards for GIS Data Development

Daniel R. Cahoy Smeal College of Business Penn State University VALGEN Workshop January 20-21, 2011

Detection of Compound Structures in Very High Spatial Resolution Images

CHAPTER 1 INTRODUCTION

Detection, Recognition, and Localization of Multiple Cyber/Physical Attacks through Event Unmixing

Transcription:

Research Collection Conference Paper Computing Touristic Walking Routes using Geotagged Photographs from Flickr Author(s): Mor, Matan; Dalyot, Sagi Publication Date: 2018-01-15 Permanent Link: https://doi.org/10.3929/ethz-b-000225591 Rights / License: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International This page was generated automatically upon download from the ETH Zurich Research Collection. For more information please consult the Terms of use. ETH Library

Computing Touristic Walking Routes using Geotagged Photographs from Flickr Matan Mor, Sagi Dalyot Mapping and Geoinformation Engineering, The Technion, Technion City 3200003, Haifa matan.mor@campus.technion.ac.il, dalyot@technion.ac.il Abstract. Tourism information is getting extensive, comprehensive and complex, thus tourists have to manage and mine large volumes of data and information to better plan their trip. Geotagged photographs uploaded by users to social photo-sharing online websites are today frequently used by tourists to describe their tourism experience, sometimes even replacing textual description. We focus on Flickr geotagged photograph database to automatically compute touristic walking routes. Instead of simply clustering photographs to extract places that might not be associated with tourism, we suggest using a set of spatio-temporal descriptors that are associated with the activities of touristic photographers, to mine and interpret photographs that have a tourism-context. Cell-based clustering is used on the retrieved photographs to find popular regions and places of interest traversed by trajectories made by the photographers that show tourism characteristics. A bi-directional constrained pathfinder Nearest Neighbor route calculation algorithm is developed to compute routes that visit the most popular touristic locations among photographers. Preliminary results for Manhattan are presented, proving reliable interpretation, mining and retrieving of valuable tourism information from social media Flickr geotagged photographs, computing interesting and realistic touristic walking routes. Keywords. Geotagged Big Data, Activity Interpretation, Route Calculation 1. Introduction Smart tourism solutions are developing fast, aspiring to keep paste with the growing number of dynamic volumes of big data and information sources that did not exist until recently, continuously updating and becoming more accessible and straightforward to use. As a result, paper maps and tour guides are becoming obsolete, being replaced with internet and mobile 63

guides, internet search engines, apps, and blogs. Dynamic crowdsourced user-generated geotagged data and information sources are increasing dramatically, and social media websites, such as Flickr, Twitter, and Facebook, are more commonly used to share travel and tourism experience. Since these dynamic social media websites serve as collective up-to-date knowledge, mining and extracting relevant and updated photograph trails of photographers can serve as a smart solution to touristic walking route planning, mainly in unfamiliar areas (e.g., De Choudhury et al. 2010, Chareyron et al. 2014). This requires two main challenges to deal with: 1) identify and understand the underlying tourism context by implementing data mining processes to interpret and discover popular places that should be included in the route; 2) compute an optimized walking route that should serve as the most touristic one. Current research aims to discover tourism routes extracted from user-generated geotagged photograph and Places of Interest (POI) databases. Li et al. (2015), for example, calculate POI geographic information to guide to locations resembling tourism features, while Sun et al. (2015) use the Dijkstra algorithm with weighted popularity road matrix to compute touristic routes. Becker et al. (2015) retrieve photograph trails for a single user regardless of time span by comparing the extracted trail to a weighted known POI geographic layer to explore tourists patterns. Wang et al. (2016) analyze photographers pattern by analyzing their photograph sequence to evaluate touristic routes. In this study, instead of relying and analyzing numerous discrete geotagged photographs that exist on Flickr that might not have a tourism context, we aspire to identify users (photographers) that can be considered as tourists based on their activity and analyze their trajectory patterns; thus, we filter less relevant (and noisy ) photograph data. Relying on these photographers activities we can interpret and discover popular touristic places visited by them according to the accumulated photographs that have a tourism-context. To this end, we have devised an array of spatio-temporal key indicators to classify touristic photographers and analyze their accumulated travel trajectories (touristic trail). We interpret popular cells traversed by their trajectories. These cells are used as input in a constrained bi-directional Nearest Neighbor (NN) route calculation algorithm to automatically seek and calculate the most touristic and comprehensive walking route, without having to rely on an external POI database. 2. Methodology Our methodology, depicted in Figure 1, relies on three consecutive stages with the aim of mine and interpret touristic photographs (stage 1), cluster and rank popular touristic places and POIs (stage 2), and construct the touristic route (stage 3), as follows: 64

Photos Extraction Geotagged Photos Data Structure Developed Algorithms Classifying Touristic Photographers Flickr API Developed Algorithms Cell-Grid Popularity Analysis Geographic Photos Query Constrained NN route calculation Automatic Computation Torusitic Route Google Maps API Figure 1. Methodology workflow. 2.1 Classifying Touristic Photographers To retrieve photographs that have a tourism context, we identify photographers (users) that show touristic activity and descriptors, aspiring to filter photographs uploaded by local residents or random photographers. This is achieved by implementing a set of spatio-temporal parameters and identifiers that characterizes a touristic activity, namely: the number of photographs taken on a trip; the time interval between consecutive photographs; the duration of a trip travel; the distance traveled; and, the traveling speed. By linking all geotagged photographs taken by these photographers, we can analyze the touristic setting of the area. 2.2 Cell-Grid Popularity Analysis The cell-based approach is implemented to geographically cluster the accumulated photographers activity patterns by analyzing their traversed trajectories to identify popular touristic places: cells with a higher number of visiting touristic photographers are considered to be more touristically popular and frequently traveled. The most attracting location for a cell relies on a centroid calculation of all geotagged photographs that fall in its extent, which presumably coincides with the position of the touristic and popular attraction (e.g., landmark, site), i.e., POI. 2.3 Route Calculation Searching popular cells is carried out via a constrained bi-directional NN pathfinder method. Using NN will not ascertain crossing all popular cells POIs, but will assure crossing the ones that are the most popular along the shortest route constraint. In our implementation, two routes are calculated, forward (origin-destination) and backward (destination-origin). Both routes 65

will not always pass through the same popular cells, such that a popularity test is implemented to quantify and compare both, recommending the one that is the most popular. To construct routes that are logical in relation to the existing road network, our preliminary route calculation algorithm relies solely on the road network arrangement without considering specific environmental constraints, such as the orienteering problem in networks. We use Google Maps Direction API by applying walking travel mode via the extracted waypoints (POIs) as input to compute a logical walking route. 3. Preliminary Results Manhattan, New-York, USA, with an approximated area of 210 sq. km. is chosen as a case study. Flickr database was downloaded in July 2016, having 22665 users, and a total of 358691 geotagged photographs. Preliminary parameters were defined for excluding erroneous data and outliers, while together with the touristic photographer descriptors, only 1846 users (photographers) were identified as having touristic characteristics - less than 10%. Statistics revealed that photographers visit time duration is below four days, traveling 13 Kilometers (values for CEP90), validating the tourism travel behavior indicators used for filtering. Origin point was defined at Grand Central Terminal, and destination point at Manhattan Cruise Terminal. Computing walking route via Google Maps API generated two options, where the optimal shortest route is approximately 7.2 km in length, depicted in grey in Figure 2 (left). Figure 2 (right) depicts the bi-directional routes computed by our algorithm: red (forward) and green (backward), where the forward one was considered as more touristic due to its higher popularity rate: 568 accumulated photographers, as opposed to 282. The resulting yellow route was computed by Google Maps Direction API using the retrieved waypoints (POIs) as input. The final touristic route, depicted in Figure 3, which is slightly longer than the shortest route, passes through main touristic landmarks and attractions, depicted as A to E, which were automatically retrieved without relying on an external POI database. When TripAdvisor s landmark recommendations and rankings of the area are compared to the retrieved POI locations, all are represented. Hell s kitchen, depicted as POI D in Figure 3, which is popular among tourists and photographers, is not listed as POI in TripAdvisor s database. This proves that our algorithms are capable of retrieving local popular attractions, which are not always listed in external POI databases, commonly visited by tourists and are attractive among photographers, and thus most obviously can be included in the planned touristic walking route. Moreover, the number of POIs retrieved is realistic: 5 landmarks for a walking distance of approximately 3.5 km, ascertaining not to communicate excessive information to the tourist. 66

Figure 2. Google Maps routes (left), and the automatically computed touristic walking routes in Manhattan (right), superimposed on the grid-cell heat map with popularity colors: forward route (red), backward route (green), and Google Maps Direction API route (yellow) that uses the retrieved waypoints of the forward route as input (background: Google Maps). D E C B A Figure 3. Automatically computed touristic route in Manhattan passing through main landmarks and attractions (with TripAdvisor ranking, where available): Bryant Park (A), New Amsterdam Theater (B), Time Square (C), Hell s kitchen (view of Hearst Tower from 9 th Avenue) (D), Intrepid Sea, Air & Space Museum (E) (background: Google Maps). 67

4. Conclusions and Future Work This research presents a methodology that relies on spatio-temporal descriptors, which are associated with the activities of touristic photographers, to mine and interpret photographs that have a tourismcontext to retrieve touristic popular attractions. By doing so, we filter with high certainty photographs that have no touristic significance, which is commonly the case when simply clustering all existing photographs. Preliminary experiments show very promising results, where the automatically computed touristic routes validate the premise of passing through touristic sights and landmarks. Moreover, routes do not significantly deviate from the general direction and distance between the origin and destination points, serving as interesting and realistic touristic walking routes, validating the optimization process suggested here. Small deviation segments exist, which are the result of relying on the centroid locations as waypoints in the routing; this is planned to be handled in future work, tuning and developing a more complex route calculation algorithm. Future work will also include the use of an adaptable clustering method to replace the cellbased one and the replacement of the hard-coded thresholds in the touristic photographers classification stage. Results show the potential that local, alternative and less known attractions - yet ones that attract photographers and tourists - are successfully identified. This proves that our algorithms retrieve important and updated (perhaps even temporal) attractions that do not exist in other external databases. These ensure the user has a touristic experience while walking in an unfamiliar area, proving the capacity to retrieve dynamic and up-to-date data and information from geotagged big data sources to computing comprehensive touristic routes. References Becker, M., Singer, P., Lemmerich, F., Hotho, A., Helic, D., & Strohmaier, M. (2015). Photowalking the city: Comparing hypotheses about urban photo trails on Flickr. In SocInfo (pp. 227-244). Chareyron, G., Da-Rugna, J., & Raimbault, T. (2014). Big data: A new challenge for tourism. In Big data (Big data), 2014 IEEE international conference on (pp. 5-7). IEEE. De Choudhury, M., Feldman, M., Amer-Yahia, S., Golbandi, N., Lempel, R., & Yu, C. (2010). Automatic construction of travel itineraries using social breadcrumbs. In Proceedings of the 21st ACM conference on Hypertext and hypermedia (pp. 35-44). ACM. Li, J., Yang, Y., and Liu, W. (2015). Exploring personalized travel route using POIs. International Journal of Computer Theory and Engineering, 7(2), 126. Sun, Y., Fan, H., Bakillah, M., & Zipf, A. (2015). Road-based travel recommendation using geo-tagged images. Computers, Environment and Urban Systems, 53, 110-122. Wang, S., Bao, Z., Culpepper, J. S., Sellis, T., Sanderson, M., & Yadamjav, M. E. (2016). Interactive trip planning using activity trajectories. In ADCS (pp. 77-80). 68