Toward Improved Visualization of Unstructured Information

Toward Improved Visualization of Unstructured Information
March 4, 2005, National Academy of Sciences
Context 2
J. David Harris, National Security Agency

Context 2
The definition: visualization in the context of large, unstructured, changing data sets where the relevance, significance, and conceptual links among the data have yet to be discovered.
Preliminary thoughts:
  Effective visualization of structured data is challenging.
  Unstructured data requires some type of structural mapping.
  The data discovery and analysis will be imperfect.
  The mapping will be imperfect and task dependent.

An Example of Real Data
A real-world story of intrigue about a cell of conspiring individuals who set forth on a project, constrained by time and money and characterized by deception, to advance their own cause.
The plot is ultimately discovered, but not before the mission is accomplished.

The Players and Their Motivation
"Now this is very important to our future development. If we can get all the property here, we can do something imaginative with it. If we can't get it all, then we're stuck with something conventional."
-- Walt Disney, talking about the land acquisition for the Florida property, Project X

The Strategy
Discovering Project X
Exploratory analysis: is there some data that conforms to some pattern?
[Diagram: project timeline with stages START PROJECT, ORGANIZE, EXECUTE, END PROJECT]
Walt Disney World Company: Acquire Land for Florida Property

The Investigation
Discovering Project X
Refining the pattern, but only some of the pattern is observable in the unstructured, varied, uncertain and continuously updated data.
[Diagram: project timeline with stages START PROJECT, ORGANIZE, EXECUTE, END PROJECT and a marker labeled NOW]
Walt Disney World Company: Acquire Land for Florida Property
Steps shown in the pattern:
  Define Operation
  Establish Schedule
  Secure Finances
  Identify Purchase Agents
  Disguise Travel Patterns
  Establish Aliases for Agents
  Obfuscate Communications
  Establish Dummy Corporations
  Purchase Land
  Disguise Travel Patterns
  Acquire Mineral Rights
Retrospective analysis? Or prospective analysis?
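The idea of a partially observable pattern can be sketched in a few lines of Python. This is a minimal illustration, not the method used in the talk: the step names come from the slide, while the `OBSERVED` set and the `coverage` helper are assumptions introduced here, chosen to mirror the two observable streams discussed later (incorporations and land purchases).

```python
# Plan steps as listed on the slide (one step appears twice on the slide).
PLAN_TEMPLATE = [
    "Define Operation",
    "Establish Schedule",
    "Secure Finances",
    "Identify Purchase Agents",
    "Disguise Travel Patterns",
    "Establish Aliases for Agents",
    "Obfuscate Communications",
    "Establish Dummy Corporations",
    "Purchase Land",
    "Disguise Travel Patterns",
    "Acquire Mineral Rights",
]

# Assumption for illustration only: which steps leave traces in the data.
OBSERVED = {
    "Establish Aliases for Agents",
    "Establish Dummy Corporations",
    "Purchase Land",
}


def coverage(template, observed):
    """Split the distinct template steps into seen and missing ones."""
    distinct = list(dict.fromkeys(template))  # drop repeats, keep order
    seen = [step for step in distinct if step in observed]
    missing = [step for step in distinct if step not in observed]
    return seen, missing, len(seen) / len(distinct)


seen, missing, fraction = coverage(PLAN_TEMPLATE, OBSERVED)
print(f"Observed {len(seen)} of {len(seen) + len(missing)} distinct steps")
print("Not observable so far:", missing)
```

Even under this generous assumption most of the pattern stays hidden, which is the point the slide makes about refining a hypothesis from incomplete evidence.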

The Observables
Establish aliases for purchase agents, and incorporate
Relevant observations, (semi-)notional:

COMPANY NAME                      POC          STATE     DATE
Compass East Group                Roy Davis    Delaware  7 December 1964
Tomahawk Properties, Inc.         Bob Price    Florida   January 1965
Latin-American Dev and Mgmt Corp  Roy Davis    Florida   February 1965
Ayefour Corporation               Bob Foster   Florida   February 1965
Bay Lake Properties, Inc.         Bob Price    Florida   March 1965
Reedy Creek Ranch, Inc.           M. T. Lott   Florida   June 1965

BTW: these entities are uncertain!!
[Timeline: Incorporations over time]
Important information is dominated by irrelevant data.
There's very little evidence on which to base decisions.
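One simple way to look for structure in records like these is to group them by a shared field. The sketch below is illustrative only: it copies the (semi-)notional records from the slide and groups companies by point of contact; the function names are hypothetical and this is not presented as the analysis used in the talk.

```python
from collections import defaultdict

# (company, point of contact, state, date) as listed on the slide.
INCORPORATIONS = [
    ("Compass East Group", "Roy Davis", "Delaware", "7 December 1964"),
    ("Tomahawk Properties, Inc.", "Bob Price", "Florida", "January 1965"),
    ("Latin-American Dev and Mgmt Corp", "Roy Davis", "Florida", "February 1965"),
    ("Ayefour Corporation", "Bob Foster", "Florida", "February 1965"),
    ("Bay Lake Properties, Inc.", "Bob Price", "Florida", "March 1965"),
    ("Reedy Creek Ranch, Inc.", "M. T. Lott", "Florida", "June 1965"),
]


def companies_by_contact(records):
    """Map each point of contact to the companies that list it."""
    groups = defaultdict(list)
    for company, contact, _state, _date in records:
        groups[contact].append(company)
    return dict(groups)


def shared_contacts(records):
    """Keep only contacts that appear on more than one incorporation."""
    return {
        contact: companies
        for contact, companies in companies_by_contact(records).items()
        if len(companies) > 1
    }


links = shared_contacts(INCORPORATIONS)
for contact, companies in links.items():
    print(contact, "->", companies)
```

A contact shared across several incorporations is a weak signal on its own, and the slide warns that the entities themselves are uncertain, so a link like this is a prompt for further analysis rather than a conclusion.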

More Observables
Purchase land
Relevant observations (columns): PURCHASING AGENT, TRANSACTION DATE, LONGITUDE, LATITUDE, ACRES
[Timelines: Land Acquisition over time; Incorporations over time]
The plot begins to unfold.
Correlated events emerge (both within and across streams).

The Discovery
Orlando Sentinel
  Dateline May 4, 1965: reported that two real estate transactions totaling over $1.5 million had been made for nearly 9,000 acres of land near the small Florida farming town of Orlando.
  Dateline October 20, 1965: reported that Walt Disney was secretly behind the purchase of land.
Vagueness (Dynamism) of Hypotheses
Unknown Sources of Data and Information
Relevant Data Concealed by Noise
Uncertain and Erroneous Observations
(Causally) Incomplete Context
Missing Data
Logical and Physical Structures
  Physical and temporal proximity of transactions
  Logical community of interest

But It Was Too Late
Disney's Project X
  Began in the early 1960s
  The Florida site was selected on November 22, 1963
  Ayefour Corporation buys the first parcel of land on October 23, 1964
  An official announcement was made by Disney on November 15, 1965
  They had acquired 27,443 acres of land SW of Orlando
  And they had big plans
What did it cost?
  About $185/acre, on average
  The first acre: $80
  The final acre: $80,000
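The figures on this slide imply a rough total that the slide does not state. The short calculation below uses only the slide's numbers; because the average is given as "about $185/acre", the result is an approximation, not a reported figure.

```python
# Figures taken from the slide.
TOTAL_ACRES = 27_443
AVG_PRICE_PER_ACRE = 185      # dollars, "about ... on average"
FIRST_ACRE_PRICE = 80         # dollars
FINAL_ACRE_PRICE = 80_000     # dollars

# Approximate total outlay implied by the average price.
approx_total_cost = TOTAL_ACRES * AVG_PRICE_PER_ACRE

# How much the per-acre price rose between the first and final acre.
price_multiple = FINAL_ACRE_PRICE // FIRST_ACRE_PRICE

print(f"Approximate total outlay: ${approx_total_cost:,}")
print(f"Final acre cost {price_multiple}x the first acre")
```

The thousandfold rise in the per-acre price is what makes the timing of discovery matter: once the buyer is known, the remaining parcels are priced accordingly.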

Why is This Context Interesting?
The definition: visualization in the context of large, unstructured, changing data sets where the relevance, significance, and conceptual links among the data have yet to be discovered.
To enable understanding!
  Retrospective
  Forensics
  Prospective
  Investigative Reporting
  Business Intelligence
  Security

The Context 2 Agenda
Ronald Coifman, Yale University: Diffusion/Inference Geometries of Data Features, Situational Awareness and Visualization
Andre Skupin, University of New Orleans: A Different Kind of Map
Stephen Eick, University of Illinois at Chicago; SSS Research: DECIDE™ Hypothesis Visualization Tool
Dave Harris, National Security Agency: Reactions and discussion

Reactions and Discussion
Context 2, the definition: visualization in the context of large, unstructured, changing data sets where the relevance, significance, and conceptual links among the data have yet to be discovered.
  Perception and cognition of visualization
  Reasoning under uncertainty

Perception and Cognition of Visualization
Map-making
  Simplification: What's important? Who's the intended audience? How might we measure interpretability?
  Classification
  Symbolization
  Induction
Visualize
  Existence: notation on a map that a point or area exists
  Associative existence: added absolute or relative quantity to the identified points and areas
  Spatially associated existence: spatial relationships between points and areas
I'm willing to trade accuracy, resolution, completeness, etc. for improved perception.
This representation of the Orlando metropolitan area is targeted at tourists.
Maps are a specific type of diagram with which most people have experience.

Perception and Cognition of Visualization
How can we capture the dynamic nature of data?
  Maps are snapshots, but they require little additional training.
Can we place thematic overlays on top of term-document landscapes, as a means of creating different views of the same data?
How do we encourage interactivity?
What can't be represented using topography only?
For unstructured data:
  What kind of mappings can we impose?
  Some structure may be due to contact or context (not content).
  What might roads represent? What about rest areas? National Parks? Hospitals?
How is uncertainty represented?

Reasoning Under Uncertainty
A critical aspect of Context 2
Visualization of the hypotheses
  Capture the intent of the task and subject matter expertise
  Guide the exploration and analysis
  Customize the visualization
[Timeline marker: NOW]

Reasoning Under Uncertainty
Multiple (competing) hypotheses
Alternative models, at the onset or after improved/diminished understanding
Machine learning can (should?) offer data as...
  Supporting evidence
  Contradictory evidence
Change in the actual plan
Changing world events
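One common way to weigh competing hypotheses against supporting and contradictory evidence is a Bayesian update. The slides do not name a method, so the sketch below is a swapped-in illustration: the hypothesis names, the priors, and every likelihood value are hypothetical numbers chosen only to show the mechanics.

```python
def update(beliefs, likelihoods):
    """One Bayes step: posterior is proportional to prior times likelihood."""
    unnormalized = {h: beliefs[h] * likelihoods[h] for h in beliefs}
    total = sum(unnormalized.values())
    if total == 0:
        raise ValueError("evidence is impossible under every hypothesis")
    return {h: weight / total for h, weight in unnormalized.items()}


# Hypothetical starting point: two competing explanations, equally likely.
beliefs = {"single_hidden_buyer": 0.5, "unrelated_buyers": 0.5}

# Hypothetical P(observation | hypothesis) for a supporting observation,
# e.g. two companies sharing a point of contact.
supporting = {"single_hidden_buyer": 0.8, "unrelated_buyers": 0.2}

# Hypothetical likelihoods for an observation that cuts the other way.
contradicting = {"single_hidden_buyer": 0.3, "unrelated_buyers": 0.6}

after_support = update(beliefs, supporting)
after_contradiction = update(after_support, contradicting)

for label, state in [("after support", after_support),
                     ("after contradiction", after_contradiction)]:
    print(label, {h: round(p, 3) for h, p in state.items()})
```

Tracking beliefs this way keeps every hypothesis alive until the evidence rules it out, which fits the slide's point that models may need revising as understanding improves or diminishes.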

What to Visualize?
How do we decide what's important?
[Figure: event marks (X) plotted on two timelines, Land Acquisition over time and Incorporations over time]
We (probably) don't need all of these observations?

Final Thoughts
So:
  Visualization influences hypothesis generation
  Hypothesis generation influences analysis
  Analysis influences visualization

Reactions and Discussion
Select relevant information from assembled data
Impose some kind of structure
Apply graphic techniques
To enable understanding
Vagueness (Dynamism) of Hypotheses
Unknown Sources of Data and Information
Relevant Data Concealed by Noise
Uncertain and Erroneous Observations
(Causally) Incomplete Context
Missing Data
Logical and Physical Structures