Social Data Analytics Tool (SODATO)

Similar documents
Big Social Data in Public Health: A Mixed-Methods Case Study of Sundhed.dk s Facebook Strategy, Engagement, and Performance

Social Big Data. LauritzenConsulting. Content and applications. Key environments and star researchers. Potential for attracting investment

Towards a Software Engineering Research Framework: Extending Design Science Research

PREFACE. Introduction

A FORMAL METHOD FOR MAPPING SOFTWARE ENGINEERING PRACTICES TO ESSENCE

Social Network Analysis and Its Developments

Socio-cognitive Engineering

Transferring knowledge from operations to the design and optimization of work systems: bridging the offshore/onshore gap

The Smart Production Laboratory: A Learning Factory for Industry 4.0 Concepts

Foresight and Scenario Development

Violent Intent Modeling System

Roadmapping. Break-out Groups: Policy Planning Methods and How They Can Be Used in Policy-making. Ondřej Valenta Technology Centre CAS

Design Science Research Methodology: An Artefact-Centric Creation and Evaluation Approach

Bernarditas Muller Coordinator of G77 and China for the AWG-LCA

EBC Annex 79. Occupant behaviour-centric building design and operation

Exploring the Nature of the Smart Cities Research Landscape

EXECUTIVE BOARD MEETING METHODOLOGY FOR DEVELOPING STRATEGIC NARRATIVES

Issues and Challenges in Coupling Tropos with User-Centred Design

Exploring the New Trends of Chinese Tourists in Switzerland

Proposal for Workshop

The Industry 4.0 Journey: Start the Learning Journey with the Reference Architecture Model Industry 4.0

2. What is Text Mining? There is no single definition of text mining. In general, text mining is a subdomain of data mining that primarily deals with

Interoperable systems that are trusted and secure

Inter-enterprise Collaborative Management for Patent Resources Based on Multi-agent

Organisation designing though the practice of multi-method research in Information Systems

Current Challenges for Measuring Innovation, their Implications for Evidence-based Innovation Policy and the Opportunities of Big Data

Social Innovation and new pathways to social changefirst insights from the global mapping

Using Variability Modeling Principles to Capture Architectural Knowledge

Communication and dissemination strategy

AGILE USER EXPERIENCE

A Three Cycle View of Design Science Research

Analogy Engine. November Jay Ulfelder. Mark Pipes. Quantitative Geo-Analyst

Multi-channel Provisioning of Public Services

Assessment of Smart Machines and Manufacturing Competence Centre (SMACC) Scientific Advisory Board Site Visit April 2018.

SMART PLACES WHAT. WHY. HOW.

INTELLIGENT SOFTWARE QUALITY MODEL: THE THEORETICAL FRAMEWORK

Modelling and Mapping the Dynamics and Transfer of Knowledge. A Co-Creation Indicators Factory Design

DRAFT. February 21, Prepared for the Implementing Best Practices (IBP) in Reproductive Health Initiative by:

Colombia s Social Innovation Policy 1 July 15 th -2014

Analysis of the electrical disturbances in CERN power distribution network with pattern mining methods

Initial communication and dissemination plan. Elias Alevizos, Alexander Artikis, George Giannakopoulos. Scalable Data Analytics Scalable Algorithms,

MEDIA AND INFORMATION

Research strategy

Integrated Transformational and Open City Governance Rome May

Predicting Content Virality in Social Cascade

Initial draft of the technology framework. Contents. Informal document by the Chair

Pervasive Services Engineering for SOAs

Wireless B2B Mobile Commerce: A Study on the Usability, Acceptance, and Process Fit

IT ADOPTION MODEL FOR HIGHER EDUCATION

Office for Nuclear Regulation

Digital Preservation Planning: Principles, Examples and the future with Planets

Digital Medical Device Innovation: A Prescription for Business and IT Success

Context Sensitive Interactive Systems Design: A Framework for Representation of contexts

An Action Design Research Approach to Developing Emergency Management Systems

Defining analytics: a conceptual framework

Social Network Analysis in HCI

Baltic Sea Region Urban Forum for Smart Cities - a market place for smart solutions

Computer Control System Application for Electrical Engineering and Electrical Automation

A STUDY ON THE DOCUMENT INFORMATION SERVICE OF THE NATIONAL AGRICULTURAL LIBRARY FOR AGRICULTURAL SCI-TECH INNOVATION IN CHINA

Analysis of Temporal Logarithmic Perspective Phenomenon Based on Changing Density of Information

Invoice Financing for Small and Mediumsized Enterprises on an Online Platform

CREATIVE ECONOMY PROGRAMME. Development through Creativity

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Grand Challenges for Systems and Services Sciences

VIEW POINT CHANGING THE BUSINESS LANDSCAPE WITH COGNITIVE SERVICES

A SYSTEMIC APPROACH TO KNOWLEDGE SOCIETY FORESIGHT. THE ROMANIAN CASE

Contextual Integrity through the lens of computer science

TERMS OF REFERENCE FOR CONSULTANTS

Erwin Mlecnik 1,2. Keywords: Renovation, Supply Chain Collaboration, Innovation, One Stop Shop, Business models. 1. Introduction

I. INTRODUCTION II. LITERATURE SURVEY. International Journal of Advanced Networking & Applications (IJANA) ISSN:

How the analysis of structural holes in academic discussions helps in understanding genesis of advanced technology

What is Big Data? Jaakko Hollmén. Aalto University School of Science Helsinki Institute for Information Technology (HIIT) Espoo, Finland

Computer Supported Cooperative Work. Series Editor Richard Harper Cambridge, United Kingdom

Smartkarma FAQ. Smartkarma Innovations Pte Ltd Singapore Co. Reg. No G

Standardization and Innovation Management

Introduction to Planets. Hans Hofman Nationaal Archief Netherlands Barcelona, 27 March 2009

INTERNATIONAL CONFERENCE ON ENGINEERING DESIGN ICED 03 STOCKHOLM, AUGUST 19-21, 2003

DiMe4Heritage: Design Research for Museum Digital Media

Identifying Personality Trait using Social Media: A Data Mining Approach

IBM SPSS Neural Networks

ALCOTRA INNOVATION. Transnational Workshop July 8th 2011 Genova

Analysis of Data Mining Methods for Social Media

Building Collaborative Networks for Innovation

Special issue on behavior computing

SERBIA. National Development Plan. November

Faculty of Humanities and Social Sciences

Programme Curriculum for Master Programme in Economic History

Introduction to adoption of lean canvas in software test architecture design

Shipping information pipeline: initial design principles. Jensen, Thomas; Vatrapu, Ravi

INSPIRING A COLLECTIVE VISION: THE MANAGER AS MURAL ARTIST

Creating a Social Investment Fund in Israel: Policy and Financial Considerations

Knowledge Brokerage for Sustainable Development

Institute of Information Systems Hof University

Entrepreneurial Structural Dynamics in Dedicated Biotechnology Alliance and Institutional System Evolution

From Future Scenarios to Roadmapping A practical guide to explore innovation and strategy

Statistical Static Timing Analysis Technology

Cover Page. The handle holds various files of this Leiden University dissertation.

A Test Bed for Verifying and Comparing BIM-based Energy Analysis Tools

Design Science Research Methods. Prof. Dr. Roel Wieringa University of Twente, The Netherlands

UNIVERSIDAD CARLOS III DE MADRID

Transcription:

Social Data Analytics Tool (SODATO) Abid Hussain 1 and Ravi Vatrapu 1,2 1 CSSL, Department of IT Management, Copenhagen Business School, Denmark 2 MOTEL, Norwegian School of Information Technology (NITH), Norway {ah.itm,vatrapu}@cbs.dk Abstract. This paper presents the Social Data Analytics Tool (SODATO) that is designed, developed and evaluated to collect, store, analyze, and report big social data emanating from the social media engagement of and social media conversations about organizations. Keywords: social media, data science, computational social science, big data analytics. 1 Introduction Big social data that is generated by the social media engagement and social software utilization of a company results in both operational issues and managerial challenges [5, 8]. With the wide spread adoption of social media for organizational purposes, there is a clear need for developing concepts, methods, and tools that systematically analyze big social data. In recent years, researchers have emphasized that technical advancements are required to deal with the situation [10]. Such requirements include the modeling of social data on individual and collective levels as well as identification of unified methods to process social data [1, 4, 9]. In this paper, we report on the design, development and features provided by a theoretically informed and methodologically grounded IT artefact (SODATO) that addresses the diverse but interrelated issues associated with social data. The remainder of the paper is organized as follows. Section 2 describes the design of the IT artefact in terms of the problem statement, use cases, intended user groups, and technical architecture. Section 3 presents the Action Design Research (ADR) method [7] and describes how ADR informed the design, development and evaluation of SODATO. Sections 4 and 5 report on the significance of the IT artefact to researchers and practitioners respectively. Section 6 summarizes the ADR evaluation of SODATO. 2 Design of the IT Artefact 2.1 Problem Statement From an academic standpoint, Zeng [10] identified the unique technical challenges in social media analytics due to unstructured data across networks and observed that M.C. Tremblay et al. (Eds.): DESRIST 2014, LNCS 8463, pp. 368 372, 2014. Springer International Publishing Switzerland 2014

Social Data Analytics Tool (SODATO) 369 data has not been treated systematically in data and text mining literature. Wang [9] argued that social computing is changing the way we interact, identified a need for modeling of social data on both individual and collective level, and called for new analytical techniques and IT artefacts for social data analytics. Quite a few organizations have been falling behind in adopting the social media due to lack of understanding of its diverse scope [3]. Kietzmann et al. [4] urged organizations to take social media engagement seriously. They argued that since social media sites are diverse in functionality and scope, there is a need of identifying uniformed and generic methods to analyze social data from different sites. Finally, Chen et al. [1] describe the state of data analytics and state that social media analytics is essentially different from traditional Business Intelligence and more research is needed in order to design methods and techniques for drawing insights from social data. From an industry standpoint, many commercial software vendors (such as Radian6 1, IBM Cognos Customer Insight 2, SAS 3, Social Bakers 4 ) are providing software applications to monitor, measure, and manage social data. However, an important problem with existing commercial applications is that there is little-to-no empirical research on the data provenance, efficacy, effectiveness, and impact of the different social media metrics and key performance indicators employed. Further, there is no provision of raw data as such or transparency about the algorithms, formulas, and metrics from a data science technical perspective as well as an organizational perspective of business analytics. We term this lacuna in the current theoretical knowledge, empirical findings, and industry practice as the Gulf of Social Media Analytics. The primary objective of this paper to briefly present the design, development and evaluation of an IT artefact (Social Data Analytics Tool, SODATO) to bridge the Gulf of Social Media Analytics. To achieve this, we have developed a unified framework [6] consisting of a theory of social data, a conceptual model, a formal model, technological architecture and finally, the software tool itself which is the focus of this paper. 2.2 Use Cases One use case is for the campaign strategist of a political party can utilize SODATO in order to fetch Facebook walls. A second use case is that the social media manager at an organization can use SODATO to fetch their social data and extract meaningful and actionable insights on content performance (such as which post types are most popular amongst the users and explore correlations to sales and other in-house data from ERP and CRM systems). 2.3 Intended User Groups Analyzing the use cases mentioned, the target end users of SODATO can be listed as researchers, analysts, social media managers, chief listening officers, and trainee analysts (for example, students in social media management and social data analytics courses). 1 www.radian6.com 2 http://www-01.ibm.com/software/analytics/cognos/ 3 http://www.sas.com/software/customer-intelligence/ social-media-analytics/ 4 www.socialbakers.com

370 A. Hussain and R. Vatrapu 2.4 Technical Architecture Please refer to [6] for the unified framework of theory, conceptual model, and formal model of social data together with an illustrative example and a demonstrative case study. Figure 1 presents schematic of SODATO. SODATO can be accessed from http://cssl.cbs.dk/sodato Fig. 1. Schematic of the Social Data Analytics Tool (SODATO) Technically, SODATO utilizes the APIs provided by the social network vendors for example Facebook open source API named as Graph API. SODATO is a combination of web as well as windows based console applications that run in batches to fetch data and prepare data for analysis. The Web part of the tool is developed using HTML, JavaScript, Microsoft ASP.NET and C#. Console applications are developed using C#. Microsoft SQL Server is used for data storage and data pre-processing. SODATO provides a generic method for retrieving, storing and analyzing social data. SODATO can be utilized by practitioners as well as researchers in order to obtain a detailed understanding of trends, dynamics, and mechanisms in the domain of Facebook currently (and scalable to different online social media platforms). Specifically, SODATO supports descriptive, prescriptive and/or predictive analytics in terms of the social graph (actors involved, artefacts created, actions taken, activities engaged) and social text (sentiments expressed, topics discussed, pronouns addressed, and keywords mentioned) [8]. 3 Design Science Methodology SODATO has been developed based on the design principles defined by Action design Research methodology [7]. The development process started in 2011 when the alpha version of SODATO was released with the name SOGATO [2]. The development process started with the basic problem formulation jointly informed by the literature in the domain of social media analytics. Developers, researchers, trainee social media analysts and organizations constituted the ADR team of SODATO. There have been multiple iterations informed by the second phase of ADR, Building intervention and evaluation (BIE). Each iteration contributed towards the IT artefact as well as the knowledge body within the domain of social media

Social Data Analytics Tool (SODATO) 371 analytics. Finally, reflections and learnings from the ADR process are currently being actively reported through both academic publications and workshops.. Table 1 presents the First BIE cycle of ADR [7] for SODATO. SODATO: RELEASE ONE Table 1. First BIE Cycle for SODATO Actor: Type Researchers Evaluation: Examples Danish election 2011 Social media analysis Feedback: Examples Large walls cannot be fetched using web version Contribution: Ensemble Artefact Batch processing introduced Contribution: Science Trainee Analysts Practitioners Social media projects with realworld case companies Interaction with trainee analysts Statistics are performing slow Multiple requests for walls for multiple users User interface issues Statistics needed to be developed that suited the case company s industry sector and social media use Pre- Processing of fetched data built Authorization system developed User interface improved Custom metric module was developed Empirically informed modifications to the descriptive model of social data 4 Significance to Research SODATO incorporates innovative features that provide value to the researchers. The development of the tool in theoretically grounded [6] and follows industry standards in software engineering. SODATO addresses the commercial tools lack of attention to data fetch procedures by implementing systematic data collection procedures, logging, and error recovery options. Due to the complex data structure of Facebook when compared to Twitter, there has been far less research using Facebook data comparatively and SODATO enables researchers to fill this knowledge gap by provisioning facebook social data. As far as we know, there does not exist a social media analytics tool in the literature that is designed and developed using design science principles. Neither could we find any generic method that could uniformly be applied for building other application in the same domain of knowledge. Hence we believe that this artefact is a contribution for information systems in general and design science research in particular. At the operational level, SODATO provides unique (as of yet) features to researchers as stated below: SODATO can fetch and store historic data right from start of Facebook time (we are yet to find this in a commercial tool) SODATO provides very high level of transparency in data fetching and calculates data provenance

372 A. Hussain and R. Vatrapu Researchers are able to fetch the data using the tool and can export large sets of data and do analysis using modelling, statistical and/or coding tools of their choice such as Microsoft Excel, IBM SPSS, R, MatLab etc. Domain specific features for projects in fields such as Political Science, Marketing, Finance, and Sustainability. 5 Significance to Practice SODATO provides state-of-the-art functionality for descriptive, predictive, and prescriptive analytics of big social data for organizations. It differs from existing commercial tools in the sense that it can be used as a strategic tool for fetching the complete online social data record of an organization on the platform of Facebook. Different insights can be generated from different analysis methods provided by the tool such as sentiments analysis, keyword analysis, actor attribute analysis, content performance analysis, social influencer analysis and integrative analytics with in-house data from web analytics, ERP and CRM systems. 6 Evaluation As mentioned earlier, we adopted the ADR model [7] for design and development and the current state of the tool is informed by iterations over three years where trainee analysts, researchers and practitioners have been evaluating the tool (see Table 1 for the first BIE cycle). References 1. Chen, H., Chiang, R.H., Storey, V.C.: Business intelligence and analytics: from big data to big impact. MIS Quarterly 36, 1165 1188 (2012) 2. Hussain, A., Vatrapu, R.: SOGATO: A Social Graph Analytics Tool. In: The 12th European Conference on Computer Supported Cooperative Work 2011 (2011) 3. Kaplan, A.M., Haenlein, M.: Users of the world, unite! The challenges and opportunities of Social Media. Business Horizons 53, 59 68 (2010) 4. Kietzmann, J.H., Hermkens, K., Mccarthy, I.P., Silvestre, B.S.: Social media? Get serious! Understanding the functional building blocks of social media. Business Horizons 54, 241 251 (2011) 5. Lovett, J.: Social Media Metrics Secrets. Wiley (2011) 6. Mukkamala, R., Hussain, A., Vatrapu, R.: Towards a Formal Model of Social Data. IT University Technical Report Series TR-2013-169 (2013), https://pure.itu.dk/ ws/files/54477234/itu_tr_54472013_54477169.pdf 7. Sein, M., Henfridsson, O., Purao, S., Rossi, M., Lindgren, R.: Action design research. MIS Quarterly 35, 37 56 (2011) 8. Vatrapu, R.: Understanding Social Business. In: Akhilesh, K.B. (ed.) Emerging Dimensions of Technology Management, pp. 147 158. Springer, Heidelberg (2013) 9. Wang, F.-Y., Carley, K.M., Zeng, D., Mao, W.: Social computing: From social informatics to social intelligence. IEEE Intelligent Systems 22, 79 83 (2007) 10. Zeng, D., Chen, H., Lusch, R., Li, S.-H.: Social media analytics and intelligence. IEEE Intelligent Systems 25, 13 16 (2010)