Understanding the city to make it smart

Similar documents
Sentiment Analysis of User-Generated Contents for Pharmaceutical Product Safety

Social Media Intelligence in Practice: The NEREUS Experimental Platform. Dimitris Gritzalis & Vasilis Stavrou June 2015

Latest trends in sentiment analysis - A survey

BIG DATA EUROPE TRANSPORT PILOT: INTRODUCING THESSALONIKI. Josep Maria Salanova Grau CERTH-HIT

Emotion analysis using text mining on social networks

Computing Touristic Walking Routes using Geotagged Photographs from Flickr

WHITE PAPER. NLP TOOL (Natural Language Processing) User Case: isocialcube (Social Networks Campaign Management)

THE CHALLENGES OF SENTIMENT ANALYSIS ON SOCIAL WEB COMMUNITIES

Vistradas: Visual Analytics for Urban Trajectory Data

MODULE 4 CREATING SOCIAL MEDIA CONTENT

Predicting Content Virality in Social Cascade

Hence analysing the sentiments of the people are more important. Sentiment analysis is particular to a topic. I.e.,

A Brief Overview of Facebook and NLP. Presented by Brian Groenke and Nabil Wadih

Opinion Mining and Emotional Intelligence: Techniques and Methodology

ISSN: (Online) Volume 2, Issue 4, April 2014 International Journal of Advance Research in Computer Science and Management Studies

ArkPSA Arkansas Political Science Association

Regionaal Platform. 19 oktober 2016

Social Events in a Time-Varying Mobile Phone Graph

WORKSHOP. Sara Bauer Ma, MSc. Computational Linguistics or "How your last tweet will be used against you" September 28, 2018

An Embedding Model for Mining Human Trajectory Data with Image Sharing

Analysis of Data Mining Methods for Social Media

Wi-Fi Fingerprinting through Active Learning using Smartphones

Moodify. A music search engine by. Rock, Saru, Vincent, Walter

Social media corpora, datasets and tools: An overview

I. INTRODUCTION II. LITERATURE SURVEY. International Journal of Advanced Networking & Applications (IJANA) ISSN:

Travel Photo Album Summarization based on Aesthetic quality, Interestingness, and Memorableness

Exploring Wearable Cameras for Educational Purposes

On the road to smarter German Cities Research and Dialogue

Retrieval of Large Scale Images and Camera Identification via Random Projections

Developing a Semantic Content Analyzer for L Aquila Social Urban Network

How to Blog: Best Practices 18 January 2017

Liangliang Cao *, Jiebo Luo +, Thomas S. Huang *

Xuegang (Jeff) Ban, Xia Yang, Jeff Wojtowicz, Jose Holguin-Veras Rensselaer Polytechnic Institute

Public Disclosure Authorized. Public Disclosure Authorized. Public Disclosure Authorized

SKIFT BRAND IDENTITY BRAND GUIDELINES Design Rocket

QS Spiral: Visualizing Periodic Quantified Self Data

DIGITALMEETSCULTURE.NET Interactive e-zine where digital technology and culture collide

Technologies Worth Watching. Case Study: Investigating Innovation Leader s

Set your business hours: Print off the Weekly Action Plan and begin to fill it out Update your contact list

Local and Low-Cost White Space Detection

Using Deep Learning for Sentiment Analysis and Opinion Mining

THE NO AGE SOCIETY Comfort Living and Meaningful Consumption

Polarization Analysis of Twitter Users Using Sentiment Analysis

Identifying Personality Trait using Social Media: A Data Mining Approach

MMORPGs And Women: An Investigative Study of the Appeal of Massively Multiplayer Online Roleplaying Games. and Female Gamers.

LOCATION PRIVACY & TRAJECTORY PRIVACY. Elham Naghizade COMP20008 Elements of Data Processing 20 rd May 2016

Exploring the New Trends of Chinese Tourists in Switzerland

A MOBILE SOLUTION TO HELP VISUALLY IMPAIRED PEOPLE IN PUBLIC TRANSPORTS AND IN PEDESTRIAN WALKS

Walking with Geo-Zombie: a Pervasive Game to Engage People in Urban Crowdsourcing

the role of mobile computing in daily life

Digitisation A Quantitative and Qualitative Market Research Elicitation

Social Network Analysis in HCI

Predicting Video Game Popularity With Tweets

Secrets of Social Media on Purpose

Context Aware Computing

Understanding Generation Z TREND REPORT

Image Finder Mobile Application Based on Neural Networks

Torino Living Lab & Open Incet

Playware Research Methodological Considerations

Copyright: Conference website: Date deposited:

POLICY SIMULATION AND E-GOVERNANCE

2016 Smart Cities Survey Summary Report of Survey Results

A Comparison Study of Image Descriptors on Low- Resolution Face Image Verification

QLectives: evolving software to support quality

TRENDS IN TECHNOLOGY: LOOKING AHEAD FOR OUTDOOR RECREATION

New Challenges of immersive Gaming Services

ENHANCING PHOTOWARE IN THE SOCIAL NETWORKS ENVIRONMENT

Participation, awareness and learning

Copyright 2015 Silicon Valley Digital Marketing Institute, All Rights Reserved

Emotion Extraction and Entrainment in Social Media: The Case of U.S. Immigration and Border Security

What is Big Data? Jaakko Hollmén. Aalto University School of Science Helsinki Institute for Information Technology (HIIT) Espoo, Finland

2nd ACM International Workshop on Mobile Systems for Computational Social Science

D4.1.2 Experiment progress report including intermediate results

Social Interaction Design (SIxD) and Social Media

30 Day Affirmation Challenge

«Digital transformation of Pharma and API Plants: a way to create value for long term sustainability» G. Burba

Findings of a User Study of Automatically Generated Personas

Lecture 3 - Regression

What We ll Cover In This Presentation:

encompass - an Integrative Approach to Behavioural Change for Energy Saving

Potential areas of industrial interest relevant for cross-cutting KETs in the Electronics and Communication Systems domain

Automatic Counterfeit Protection System Code Classification

Measuring and Analyzing the Scholarly Impact of Experimental Evaluation Initiatives

Analyzing the User Inactiveness in a Mobile Social Game

A Practical Approach to Understanding Robot Consciousness

Inside the black-box. children rights in the digital age. Conceição Costa José Rogado Carla Sousa Sara Henriques

MISB RP 1107 RECOMMENDED PRACTICE. 24 October Metric Geopositioning Metadata Set. 1 Scope. 2 References. 2.1 Normative Reference

MODULE 5 FACEBOOK PROMOTION AND MARKETING STRATEGIES

Jigsaw Puzzle Image Retrieval via Pairwise Compatibility Measurement

A Review of Related Work on Machine Learning in Semiconductor Manufacturing and Assembly Lines

Privacy, Technology and Economics in the 5G Environment

Analysing Emotional Sentiment in People s YouTube Channel Comments

Digging Deeper, Reaching Further. Module 5: Visualizing Textual Data An Introduction

MedLab. Mediterranean Living Lab for Territorial Innovation. Nicos Komninos, URENIO Research CENTRAL LAB Kick-off, Ptuj, Slovenia 30 May 2011

Understanding Social Computing: Challenges and Opportunities for Europe

A whitepaper by. Consumption of Communication

Understanding Real-World Mobile Network Experience

Understanding User Privacy in Internet of Things Environments IEEE WORLD FORUM ON INTERNET OF THINGS / 30

The effects of uncertainty in forest inventory plot locations. Ronald E. McRoberts, Geoffrey R. Holden, and Greg C. Liknes

Foreword_. Smart Santander Foreword

Transcription:

Understanding the city to make it smart Roberta De Michele and Marco Furini Communication and Economics Department Universty of Modena and Reggio Emilia, Reggio Emilia, 42121, Italy, marco.furini@unimore.it Abstract. Understanding the sentiment of people is a process that may be useful when transforming a city into a smart city. A recent trend is to exploit social media data to infer people sentiments. While many studies focused on textual data, few considered the visual contents. In this paper we investigate whether the images available in the Instagram platform can be useful to understand people sentiments. Through an experimental assessment and two different validation methods, we observed that although the use of images in sentiment analysis can be useful to have insights about people sentiments, the use of Instagram images may be slightly misleading. Key words: Smart city, Visual Sentiment Analysis, Plutchink model 1 Introduction A Smart city uses digital technology to better manage infrastructure and resources, to improve the services offered to citizens and to address challenges of development and sustainability more efficiently. To truly develop a smart city, citizen should be involved in the process [1], but most of the approaches to build a smart city does not consider the citizens while designing and implementing strategies to transform a city into its smart version [2, 3]. A recent approach towards citizens involvement is to exploit the large amount of data available in social media platforms to understand their feelings. Indeed, the idea is to use these information to make more informed decisions that will end in better use of resources, better organization, better citizen lifestyle, better human relations and, eventually, better society. In literature, many researches on sentiment analysis focused on textual contents (e.g., blogs, forum, tweets) and only recently, researches are trying to exploit the large amount of multimedia contents available in social media platforms to measure people feelings (e.g., [4] and [5]). The reasons behind this approach is that multimedia contents are more and more produced and shared in social media platforms (e.g., more than one billion of images are shared daily on Facebook, Instagram, Flickr and Whatsapp) and, moreover, these contents are usually produced with a mobile application that couples the pure multimedia content with additional information (e.g., GPS data, users information, etc.) that may be very useful in understanding people feelings.

2 Roberta De Michele and Marco Furini In this paper we focus on the Instagram application and we want to investigate whether images shared by people in the Instagram social platform can be useful in the decision making processes that aim at transforming a city into a smart city. Instagram is an online mobile photo-sharing application that enables users to take and share pictures in social platforms. The application is becoming more and more popular among mobile customers (as of December 2014 the platform had over 300 millions of users) and therefore it is possible to publicly access to a huge amount of data. We perform this investigation in two steps: first we define an area of interest and we access to the Instagram images taken in that area; then we use a psychological model to associate colors to emotions and we compute a low-level color analysis to determine whether the image represents a positive or negative emotion. By repeating the process for all the images located in the interested area, we can understand what is the people sentiment and how these emotions are distributed in the area. To validate our approach we perform two different validations: ground-truth and users tag. Results obtained showed that images can be useful in understanding the sentiment of people living in an area of interest, but also showed that Instagram images are slightly affected by the massive usage of filters that modify the original picture. The paper is organized as follows: in Section 2 we present related works in the area of image sentiment analysis; in Section 3 we present our study, whereas its evaluation is presented in Section 4. Conclusions are drawn in Section 5. 2 Background and Related Work The social web is characterized by an abundance of multimedia resources coupled with many additional information (i.e., metadata) [6]. For instance, an Instagram image is coupled with information about the user, about his/her friends, about its description, tags and location [7, 8]. Researchers are trying to exploit these information for many different reasons and one of these is to identify people sentiment. While there are several studies that focused on textual contents (e.g., the ones available in the Twitter platform), the visual sentiment analysis is in its early stage. Examples are the video/image classification and smile detection (e.g., [5, 9, 10]). When dealing with emotions, it is necessary to have a psychological model that classifies them. In this paper we focus on the widely used Plutchik model [11]. This model identifies eight basic bipolar emotions (joy, trust, fear, surprise, sadness, disgust, anger, anticipation) that may be extended to 24 emotions by varying the intensity of every basic emotion (e.g., ecstasy joy serenity) and associates a color to each of the 24 emotions as shown in Figure 1. Therefore, the model connects a color to its emotion (e.g., yellow joy). Looking at the model, we observe that light colors represent positive emotions, whereas dark colors represent negative emotions.

Smart city 3 Fig. 1. The Plutchik model: emotions are associated to specific colors. 3 Our Study Understanding the sentiments of people living in a city is becoming an important aspect in decision making process and, recently, attention is given to the large amount of multimedia contents available in social media platforms. The motivation behind our study is to investigate whether the huge amount of Instagram images can be useful to understand people sentiments. In particular, our study is composed of three steps: i) access to Instagram images located in the area one wants to investigate, ii) mapping the retrieved images into emotions, and iii) validate the obtained results. By repeating the process for all the images located in the interested area, one can understand the people sentiments and how these emotions are distributed in the area. This should provide useful insights to city administrator, politics, urban designer, etc. Accessing Instagram images. To access to Instagram images we develop a Python application that interacts with Instagram API. The application requires to specify a geographical location (latitude and longitude) and gets all the Instagram images located nearby that location (i.e., the distance from the point may vary from the default value of one km to five km). In addition to the image, the application retrieves many other information like comments and tags. Mapping images into emotions. According to the Plutchik model, dark colors correspond to negative emotions, whereas light colors correspond to positive emotions. To identify whether an image is dark or light, we consider images with 24-bits RGB colors where the lightest color (i.e., white) is defined by the triplet (255, 255,255) and the darkest color (i.e., black) is defined by the triplet (0,0,0). To determine whether an image is light or dark, we measure the color distance that separates each pixel from the white and black points. The distance is measured through the Euclidean distance (see Equation 1). Therefore, for each pixel (x,y,z) we know whether it is closer to the white or to the black point. When all the image pixels are analyzed we can call the image as dark or light.

4 Roberta De Michele and Marco Furini (x bin r ) 2 + (y bin g ) 2 + (z bin b ) 2 (1) Validation. One of the most difficult tasks in understanding the insights obtained from the analysis of the data available in social media platforms is the validation process. In this study, we consider two different validation methods: ground-truth (a methodology that requires a group of human beings to say whether an image represents positive or negative emotion) and users tag (a methodology that analyzes the textual description attached to the image to understand what people think about the picture). 4 Experimental Assessment To investigate whether Instagram images can be used to determine people sentiments, we set up an experimental scenario where we focus on three towns in the area of our department: Reggio Emilia (where our department is), Modena (30 km east of Reggio Emilia) and Parma (30 km west of Reggio Emilia). And we use the developed python application to build an image dataset for each town. In total, we dataset is composed of around 2,000 images. Looking at the collected images, we observed a first possible problem in using Instagram images: many images contain textual contents (e.g., logos, flyers or quotes) and/or colored borders (Instagram only handles square images and hence images with different formats are converted using colored borders). Examples of these images are reported in Figure 2. These images may negatively affect the sentiment results: the color of the borders and the presence of textual contents (which outweigh the colors) may affect the validity of the model. For these reasons, we manually filtered the dataset and we obtained a new dataset composed of 260 images (with no borders or textual contents) for each town. Fig. 2. Example of images excluded from the dataset: color borders (left) and textual content (right). We applied the color analysis (defined in the previous section) to get the sentiment of the analyzed images. Results show a predominance of images with a negative connotation for the town of Modena (63% vs. 37%), a slight predominance of images with a negative connotation for the town of Parma (56% vs. 44%) and an equal distribution between positive and negative images for the town of Reggio Emilia. By plotting the results on the city map, we can identify areas of the city where people post positive or negative images.

Smart city 5 Groud-thruth evaluation. We asked a group of five people to call either positive or negative every image of the dataset (260 images for each town). In particular, we considered as positive (negative) an image that receives at least three positive (negative) calls. Figure 3 shows that the ground-truth evaluation labeled as positive more images than the one labeled by the color analysis method. Needless to say, the color analysis method labeled as negative more images than the one labeled by the ground truth evaluation. The results show that the color-based analysis produce more negative calls. 0 50 100 150 200 Color Ground-truth 0 50 100 150 200 Color Ground-truth Fig. 3. Ground-truth evaluation: positive (left) negative (right) images. Users tag. We defined a list of positive tags (e.g., happy, beautiful ) and a list of negative tags (e.g., sad, ugly ) and then we filtered the dataset to get only images tagged as positive or negative. The resulting dataset are composed of 65 images for Reggio Emilia (60 positive and 5 negative), 58 for Modena (53 positive and 5 negative) and 55 for Parma (53 positive and 2 negative). Finally, we applied the color-based algorithm. Results presented in Figure 4 show that the tags evaluation labeled as positive more images than the one labeled by the color analysis method. Also in this case, the color-based analysis produces more negative calls. 0 10 20 30 40 50 60 70 Color Tag 0 10 20 30 40 50 60 70 Color Tag Fig. 4. Users tags evaluation: images called as positive (left) and negative (right).

6 Roberta De Michele and Marco Furini In general, results showed that Instagram images should be used with caution when investigating people sentiments. We found three different main reasons: i) the square format may introduce colored borders, thus affecting the appearance and the colors of the picture; ii) textual contents represent the emotions better than colors and therefore their presence nullifies color analysis results; iii) Instagram users make a large use of filters that change the original image. 5 Conclusions and Future Work In this paper we investigated whether the Instagram images may be useful to identify people sentiments. Through an experimental assessment and two different validation methods (ground-truth and users tags) we found that Instagram images may have characteristics that may affect the emotions analysis (e.g., colored borders, textual contents, dark filters). As a future work, we plan to investigate whether it is possible to correlate the used filter with emotions. References 1. Ben A. Amaba. Industrial and business systems for smart cities. In Proc. of EMASC, EMASC 14, pages 21 22, New York, NY, USA, 2014. ACM. 2. Nicos Komninos, Panagiotis Tsarchopoulos, and Christina Kakderi. New services design for smart cities: A planning roadmap for user-driven innovation. In Proc. of WiMobCity, 2014. 3. Marco Roccetti, Gustavo Marfia, and Claudio Enrico Palazzi. Entertainment beyond divertissment: Using computer games for city road accessibility. Comput. Entertain., 9(2):10:1 10:9, July 2011. 4. Stefan Siersdorfer, Enrico Minack and Fan Deng. Analyzing and predicting sentiment of images on the social web. In Int. Conf. on Multimedia, 2010. 5. Saeed Abdullah, Elizabeth L. Murnane, Jean M.R. Costa, and Tanzeem Choudhury. Collective smile: Measuring societal happiness from geolocated images. In Proc. of CSCW, 2015. 6. Manuela Montangero and Marco Furini. Trank: ranking twitter users according to specific topics. In Proceedings of the 12th International IEEE Consumer Communications and Networking Conference (CCNC2015), Jan 2015. 7. Marco Furini and Valentina Tamanini. Location privacy and public metadata in social media platforms: attitudes, behaviors and opinions. Multimedia Tools and Applications, pages 1 31, June 2014. 8. Marco Furini. Users behavior in location-aware services: Digital natives vs digital immigrants. Advances in Human-Computer Interaction, Vol. 2014, Article ID 678165, 23 pages, 2014. 9. Eva Oliveira, Pedro Martins, and Teresa Chambel. Accessing movies based on emotional impact. Multimedia Systems, 19(6):559 576, 2013. 10. Marco Furini. Vimood: using social emotions to improve video indexing. In Proceedings of the 12th International IEEE Consumer Communications and Networking Conference (CCNC2015), Jan 2015. 11. R. Plutchik. Emotion: A Psychoevolutionary Synthesis. Harper and Row, New York, 1980 1980.