Using Data Analytics and Machine Learning to Assess NATO s Information Environment

Similar documents
SYSTEM ANALYSIS & STUDIES (SAS) PANEL CALL FOR PAPERS

Challenging the Situational Awareness on the Sea from Sensors to Analytics. Programme Overview

HARNESSING TECHNOLOGY

CMRE La Spezia, Italy

Prototyping: Accelerating the Adoption of Transformative Capabilities

Expression Of Interest

Challenges and Opportunities in the Changing Science & Technology Landscape

SACT remarks at. Atlantic Council SFA Washington DC, George Washington University, Elliott School of International Affairs

MAY. Programme. NATO s DIGITAL ENDEAVOUR: Berlin Germany. Expanding the Ecosystem. AFCEA Europe Tel: +32 (0)

Executive Summary FUTURE SYSTEMS. Thriving in a world of constant change

RIS3-MCAT Platform: Monitoring smart specialization through open data

Vision. The Hague Declaration on Knowledge Discovery in the Digital Age

Panel I A Dynamic Context Panel Lead: Assistant Secretary General for Defence Investment, Mr Patrick Auroy

Telecoms and Tech Week

FOREST PRODUCTS: THE SHIFT TO DIGITAL ACCELERATES

)XWXUH FKDOOHQJHV IRU WKH WRXULVP VHFWRU

FP9 s ambitious aims for societal impact call for a step change in interdisciplinarity and citizen engagement.

Preparatory paper: food for thought

BI TRENDS FOR Data De-silofication: The Secret to Success in the Analytics Economy

Point of View. Establishing a Culture of Digital Change within Universities

An Intellectual Property Whitepaper by Katy Wood of Minesoft in association with Kogan Page

Tutorial: The Web of Things

Framework Programme 7

Content-Based Multimedia Analytics: Rethinking the Speed and Accuracy of Information Retrieval for Threat Detection

OSRA Overarching Strategic Research Agenda and CapTech SRAs Harmonisation. Connecting R&T and Capability Development

SACT s. State of the NIF. Brussels, 09 Nov 2016, Hr

PROJECT FACT SHEET GREEK-GERMANY CO-FUNDED PROJECT. project proposal to the funding measure

NICIS: Stepping stone to a SA Cyberinfrastructure Commons?

Artificial intelligence, made simple. Written by: Dale Benton Produced by: Danielle Harris

2018 Research Campaign Descriptions Additional Information Can Be Found at

THEFUTURERAILWAY THE INDUSTRY S RAIL TECHNICAL STRATEGY 2012 INNOVATION

Copyright: Conference website: Date deposited:

EU businesses go digital: Opportunities, outcomes and uptake

EUROPEAN COMMISSION Directorate-General for Communications Networks, Content and Technology CONCEPT NOTE

An Approach to Integrating Modeling & Simulation Interoperability

ENSURING READINESS WITH ANALYTIC INSIGHT

Front Digital page Strategy and Leadership

Policing in the 21 st Century. Response from the Royal Academy of Engineering to the Home Affairs Select Committee

Challenges and Innovations in Digital Systems Engineering

TOURISM INSIGHT FRAMEWORK GENERATING KNOWLEDGE TO SUPPORT SUSTAINABLE TOURISM. IMAGE CREDIT: Miles Holden

Aviation Data Symposium June 2018 Berlin, Germany

Human-Centric Trusted AI for Data-Driven Economy

Countering Capability A Model Driven Approach

Finland s drive to become a world leader in open science

COMMISSION OF THE EUROPEAN COMMUNITIES COMMISSION RECOMMENDATION

Knowledge Management for Command and Control

Read- Ahead Document

FOSS in Military Computing

RC-IED ECM Lessons Learned and Implementation

Joint Programming Initiative Healthy and Productive Seas and Oceans

Unmanned Ground Military and Construction Systems Technology Gaps Exploration

COMPREHENSIVE COMPETITIVE INTELLIGENCE MONITORING IN REAL TIME

The Role of the Communities of Interest (COIs) March 25, Dr. John Stubstad Director, Space & Sensor Systems, OASD (Research & Engineering)

National view on research infrastructure development work. RDA Finland Autumn 2017 Meeting Senior Advisor Sami Niinimäki

Bringing the revolution to SMEs. Report for stakeholders August 2018

School of Informatics Director of Commercialisation and Industry Engagement

MSC Project Workplan

Bridging law and technology

DIGITAL WITH PLYMOUTH UNIVERSITY DIGITAL STRATEGY

Methodology for Agent-Oriented Software

Digital Disruption Thrive or Survive. Devendra Dhawale, August 10, 2018

Industry 4.0: the new challenge for the Italian textile machinery industry

Plum Goes Orange Elsevier Acquires Plum Analytics - The Scho...

RECOMMENDATIONS. COMMISSION RECOMMENDATION (EU) 2018/790 of 25 April 2018 on access to and preservation of scientific information

Innovation Management & Technology Transfer Innovation Management & Technology Transfer

Opening Science & Scholarship

Mission Capability Packages

Use of Communications EW in a Network Centric Warfare Environment

Technology Roadmapping. Lesson 3

TRANSFORMING DISRUPTIVE TECHNOLOGY INTO OPPORTUNITY MARKET PLACE CHANGE & THE COOPERATIVE

IRAHSS Pre-symposium Report

Remuneration Report

Situation Awareness in Network Based Command & Control Systems

BIM FOR INFRASTRUCTURE THE IMPACT OF TODAY S TECHNOLOGY ON BIM

November 18, 2011 MEASURES TO IMPROVE THE OPERATIONS OF THE CLIMATE INVESTMENT FUNDS

DiMe4Heritage: Design Research for Museum Digital Media

A Reconfigurable Citizen Observatory Platform for the Brussels Capital Region. by Jesse Zaman

Spurring Big Data-Driven Innovation and Promoting Responsible Data Governance in a Privacy-Centred Europe

e-infrastructures for open science

Social Big Data. LauritzenConsulting. Content and applications. Key environments and star researchers. Potential for attracting investment

How to Keep a Reference Ontology Relevant to the Industry: a Case Study from the Smart Home

Assessment of Smart Machines and Manufacturing Competence Centre (SMACC) Scientific Advisory Board Site Visit April 2018.

MERIL MAPPING OF THE EUROPEAN RESEARCH INFRASTRUCTURE LANDSCAPE

The Human in Defense Systems

IDEaS INNOVATION FOR DEFENCE EXCELLENCE AND SECURITY PROTECTION SECURITE ENGAGEMENT STRONG SECURE ENGAGED

Executive Summary. The process. Intended use

ENGINEERS, TECHNICIANS, ICT EXPERTS

Taking Europe forward

TELLING STORIES OF VALUE WITH IOT DATA

WITH Woodside. The Woodside Innovation & Technology Hub. Building a community of world-class innovators to unlock tomorrow s growth opportunities.

Project Title. Missed Detections: From Data to Actionable Estimates. Your logo here

A New Platform for escience and data research into the European Ecosystem.

Our Corporate Strategy Digital

University of Queensland. Research Computing Centre. Strategic Plan. David Abramson

Strategy EXECUTIVE SUMMARY NATIONAL DOCUMENTATION CENTRE NHRF

THE DRIVING FORCE BEHIND THE FOURTH INDUSTRIAL REVOLUTION

SMART PLACES WHAT. WHY. HOW.

USING THE INDUSTRIAL INTERNET OF THINGS TO TRANSFORM HUMAN SAFETY AND ENERGY CONSUMPTION IN THE MINING INDUSTRY

Foresight and Scenario Development

Our position. ICDPPC declaration on ethics and data protection in artificial intelligence

Transcription:

Using Data Analytics and Machine Learning to Assess NATO s Information Environment Col Richard Blunt, CapDev JISR, SACT HQ Allied Command Transformation Blandy Road, Norfolk, VA UNITED STATES Richard.blunt@act.nato.int T: +1 757 747-4214 Chris Riley, Stratcom, Public Diplomacy Division, NATO HQ NATO HQ Boulevard Leopold III, Brussels BELGIUM riley.chris@hq.nato.int T: +32 2 707 1009 ABSTRACT Marc Richter, Michael Street, Daniel Drabkin 1 NATO Communications and Information Agency Marc.Richter; Michael.Street; Daniel.Drabkin @ncia.nato.int T: +31 70 374 3444 / +32 65 44 9465 The digital world means that NATO s strategic communications operate in a complex information environment where assessment and effect are operating continuously in a competitive space. Many Publicly Available Information (PAI) sources - including traditional and social media, originating within or beyond the Alliance - impact the success of NATO s mission and effects. The volume, variety, velocity and veracity of PAI, which may provide insights, makes this a big data problem. While the complex relationships between information streams are difficult to make sense of using only natural assessment and intelligence processes. To provide commanders, Intel analysts and strategic communicators with clear insights requires an information environment assessment (IEA) which cannot be provided in its entirety with traditional analytical approaches. This paper gives an overview of IEA activities across NATO, exploiting data analytics, visualisation and artificial intelligence to bring meaningful insight from the information environment to the operational space. The paper describes the evolution of the existing Alliance Open Source System (AOSS), to operate on a greater range of input data sources, how this open source data is analysed using big data tools which exploit artificial intelligence, and visualised to produce information of operational value. The paper will address technological and non-technical challenges. 1.0 THE CHALLENGE Many news reports in recent years have drawn attention to attempts to interfere with public opinion in several NATO nations. Such influencing operations have the potential to undermine NATO s strategic communications and therefore the mission of the Alliance. Publicly available information (PAI) be it 1 Daniel Drabkin is located at NCIA Mons, BELGIUM; Michael Street at NCIA The Hague, NETHERLANDS; Marc Richter has recently left NCIA The Hague for EUROPOL, The Hague, NETHERLANDS. STO-MP-IST-160 S2-2 - 1

mainstream media, social media or other information which influences populations and affect their support (or otherwise) for NATO and the Alliance is termed the information environment. The information environment assessment (IEA) is a first attempt to apply data-centric technologies of data science, data analytics and machine learning, to understand hostile attempts to influence public opinion about NATO activities. 1.1 Current assessment of the information environment Currently strategic communications (StratCom) across NATO, Commands and National elements of the NATO force structure assess the information environment using predominantly manual methods. These methods ensure significant domain knowledge from subject matter experts is used in the assessment. However the scale of the information environment (the volume of data) to be assessed, the range of information sources and the languages used (variety) all present a challenge to human analysts. Accurate assessments are made more challenging by the potential for rapidly changing messages and trends (velocity) and the potential for much misleading information (veracity). Datasets which exhibit one of more properties of volume, variety, velocity and veracity are exactly those on which data-centric technologies are designed to operate. In this paper we refer to data-centric technologies of data science, data analytics, machine learning and artificial intelligence. Thus these technologies have the potential to make a significant impact when assessing the information environment. 2.0 INFORMATION ENVIRONMENT ASSESSMENT CONCEPT The IEA, as seen in Figure 1, involves connecting sources of information on the left of the figure with users on the right. Information is a joint function in terms of Sensor, Assessor, Effector and C2 ; operating continuously for multiple communities of interest. The IEA recognises that given the volume, variety and velocity of data which comprise it, it must harness data centric technologies e.g. big data analytics and artificial intelligence, and any capability development in this field will require a data centric methodology. There is a recognition that requirements amongst disparate communities of interest (COIs) at the source and content aggregation function on the left hand side have a lot of commonality. Likewise, much of the central data science and analytics needed to aggregate content from data sources. Once the source information has been aggregated and prepared for analysis (or extract, transform and load of the data) a range of common tools and technologies will be used to analyse and present the data to various COIs. While the data lake and tools will be similar, the analysis and visualisation is likely to be specific for each different COIs. 2.1 Resourcing the IEA Data Literacy and Data Readiness within NATO is identified as a key capability gap. Steps are being taken to adjust NATO information systems (and system managers) to a data-centric paradigm. While a blend of contracted services, industry expertise and a core of in-house skills will be needed to deliver an effective balance of leading-edge skills and contextual understanding needed to assess information for NATO. S2-2 - 2 STO-MP-IST-160

Figure 1: Information environment assessment. 2.2 IEA development areas IEA development follows four lines of engagement. Industry input is being sought through the NATO Industry Advisory Group (NIAG). Recognising that industry possesses leading edge analytics technologies and expertise, NIAG study group 225, Big Data in support of IEA [1]. This provides a forum for industry to propose leading edge technology solutions. This study group is developing a proof of technology demonstration system, using public cloud services to work with large data sets and access sophisticated analytics tools and models. This IEA study builds on an earlier initial NIAG study on the use of big data by NATO [2]. A second line of engagement takes existing NATO capabilities and evolves them. Combining the existing Alliance Open Source System (AOSS) which is already used by analysts to access public information sources is being extended to incorporate sophisticated analytics and visualisation tools already used in NATO [3]. Applying some of NATO s existing data analytics and visualisation tools to AOSS, its existing open source information tool, will provide an initial capability as well as providing a testbed for experimentation and refinement of requirements, analytics processes and views. Experimentation provides third line of engagement. Experimentation, led by ACT but involving many stakeholders across the Alliance, plays a key role in development, with major experimentation campaigns being planned around suitable events during 2018. A final engagement strand is the collection of user requirements for IEA FOC, where all three lines above from NATO and industry will all assist stakeholders to define appropriate future requirements for a system which must employ data analytics and machine learning technologies which are comparatively new to STO-MP-IST-160 S2-2 - 3

NATO. These requirements will not only cover aspects of technology and materiel, but the full DOTMPLFI capability landscape e.g. Doctrine, Organisation, Training, Materiel, Personnel, Leadership, Facilities and Interoperability. 2.3 IEA integration into operations Figure 2 shows how the IEA capability will develop within the wider NATO landscape. Incorporating a functional IEA capability into NATO operations with effects synchronisation and assessment of effects (friendly and adversary). All of this will includes a feedback loop into the IEA, including into the data analytics. Figure 2: IEA interfaces to NATO operations and effects. 3.0 ALLIED OPEN SOURCE SYSTEM AOSS is a functional service developed for NATO intelligence analysts. It provides access to a range of open source information sources including mainstream news sources and specialist open source providers such as Janes IHS etc. 90% of the content is subscription based [4]. AOSS currently (in version 5.5.) supports some analysis tools for intel analysts, as shown in Figure 3. S2-2 - 4 STO-MP-IST-160

Figure 3: Data analytic functions currently supported in AOSS. 4.0 EXTENDING ANALYTICS CAPABILITY An IEA initial operating capability (IOC) is in development, drawing on existing capabilities and expertise within NATO. This will take the existing sources of PAI and the intelligence analysis system around them, and add data analytics and data visualisation capabilities already in use in PMAR, supplemented by additional code development to provide initial data analytic services. Since its formation in 2012, NCIA has used data analytics and a data-centric approach to harvesting data from a range of existing sources and systems into a data mart (or small data lake). This data was then analysed using a number of tools in order to glean meaningful data from these varied sources. Since this initial implementation the data extraction methods have been enhanced, the data mart has grown to a data lake (fed by over 50 tributary data sources) and the analytic toolset has been expanded and refined. This is described further in [3]. A key component of this existing toolset is the KNIME analytics platform. This is a leading open source data mining tool and provides a graphical framework to manage data preparation. The graphical framework significantly reduces the need for low-level programming, making the data mining process accessible to a larger group of data analysts. KNIME also supports PYTHON natively and allows leading open source deep learning libraries to be integrated e.g. TensorFlow. This provides access to a range of sophisticated deep learning resources which can be integrated with KNIME and fed by data sources prepared using conventional KNIME functionality for big data analytics. The combination of KNIME, machine learning and deep learning models, applied to NATO data sets is addressed in depth in [5]. These functions are now being integrated into AOSS in order to extend the functionality towards an IEA initial operating capability. The ability for machines to analyse and learn does not lead directly to insight and value from the data. It will be necessary to train and refine the machine learning / deep learning models. This training process is known to require large quantities of data, coupled with time and expertise from data scientists to refine the models, and domain experts to explain data structures, features and relationships. STO-MP-IST-160 S2-2 - 5

5.0 RECOMMENDATIONS It is clear that NATO is on a steep learning curve as it deploys data-centric technology to support its mission. However, it is not starting from scratch and has valuable experience in the analysis of open source intelligence (through AOSS), sophisticated tools for data analysis and visualisation (PMAR) and the application of data centric technology to support operations [3][5]. Industry will be a source of leading edge technology, and close cooperation exists between the work described in this paper, and the activities of NIAG study group 225 which is looking at architectures and technologies for IEA FOC. REFERENCES [1] NATO Industrial Advisory Group Study Order 225, DI(2017)0261, September 2017. [2] NATO Industrial Advisory Group Study Group 208, final report On adopting big data in NATO, AC/322-N(2017)0085, June 2017. [3] M. Richter, M. Street, P. Lenk, Lessons learned from initial exploitation of big data and AI to support NATO decision making, STO IST-160, May 2018. [4] NATO open source intelligence (OSINT) policy 3.1. [5] M. Richter, M. Street & P. Lenk, Deep Learning NATO document labels: a preliminary investigation, Int. Conf. on Military CIS, Warsaw, May 2018. S2-2 - 6 STO-MP-IST-160

STO-MP-IST-160 S2-2 - 7