Raw Data. Cleaned, Structured Data. Exploratory Data Analysis. Verify Hunches (stats) Data Product

Transcription:

Recap Overview

[Image of Schedule A-P, showing two contributions to Obama for America. It includes full name, date of contribution, and contribution amount.]

C00420224,"P80002983","Cox, John H","BROWN, CHARLENE","EAGLE RIVER","AK","99577","","STUDENT",25,01-MAR-07,"","","","SA17A",288757
C00420224,"P80002983","Cox, John H","KELLY, RAY","HUNTSVILLE","AL","35801","ARKTECH","RETIRED",25,25-JAN-07,"","","","SA17A",288757
C00420224,"P80002983","Cox, John H","CINGEL, KEITH","SEVERN","AL","20999","SANTA CLAUS","SNOWMAN",50,17-MAY-07,"","","","SA17A",305408
C00420224,"P80002983","Cox, John H","DUNAWAY, JONATHON","DEATSVILLE","AL","36022","CSC","TECHNICAL MANAGER",10,18-JAN-07,"","","","SA17A",288757
C00420224,"P80002983","Cox, John H","TERRY, R.S. MR. SR.","SHEFFIELD","AL","35660","RETIRED","",25,18-JAN-07,"","","","SA17A",288757
C00420224,"P80002983","Cox, John H","CANADY, DALE","PHOENIX","AZ","85051","RETIRED","",25,11-JAN-07,"","","","SA17A",288757
C00420224,"P80002983","Cox, John H","LORENZ, DWIGHT","SUN CITY","AZ","85351","NONE","RETIRED",20,12-JAN-07,"","","","SA17A",288757
C00420224,"P80002983","Cox, John H","STEWART, MICHAEL","CHANDLER","AZ","85224","DYNAMIC ENERGY","TECHNICIAN",5,11-JAN-07,"","","","SA17A",288757
C00420224,"P80002983","Cox, John H","ROSENTHAL, ARNOLD","CAREFREE","AZ","85277","RETIRED","",10,11-JAN-07,"","","","SA17A",288757
C00420224,"P80002983","Cox, John H","VADNAIS, DOROTHY","SAN DIEGO","CA","92116","","RETIRED",10,10-JAN-07,"","","","SA17A",288757
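Records like these can be read with Python's standard `csv` module, which handles the quoted fields that contain commas. The sketch below is illustrative only: the column positions for state and amount are assumptions inferred from the sample rows, not from any official file layout.

```python
import csv
import io
from collections import defaultdict

# Three records copied from the slide.
RAW = '''C00420224,"P80002983","Cox, John H","BROWN, CHARLENE","EAGLE RIVER","AK","99577","","STUDENT",25,01-MAR-07,"","","","SA17A",288757
C00420224,"P80002983","Cox, John H","KELLY, RAY","HUNTSVILLE","AL","35801","ARKTECH","RETIRED",25,25-JAN-07,"","","","SA17A",288757
C00420224,"P80002983","Cox, John H","CINGEL, KEITH","SEVERN","AL","20999","SANTA CLAUS","SNOWMAN",50,17-MAY-07,"","","","SA17A",305408
'''

# Assumed column positions, guessed from the sample rows only.
STATE, AMOUNT = 5, 9


def total_by_state(text):
    """Sum the amount column per state code."""
    totals = defaultdict(float)
    for row in csv.reader(io.StringIO(text)):
        totals[row[STATE]] += float(row[AMOUNT])
    return dict(totals)


totals = total_by_state(RAW)
print(totals)
```

Summaries of this kind are usually the first step of exploration: they make odd values easy to spot before any statistics are run.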

[The same records repeated, with the fields "SANTA CLAUS","SNOWMAN" called out.]

T-test. Create a model (linear regression). Significance.
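As a rough illustration of the t-test step, here is a minimal sketch of Welch's two-sample t statistic using only the standard library. The two groups are made-up numbers, and the choice of Welch's variant (which does not assume equal variances) is an assumption; the slides do not say which test was used.

```python
import math
import statistics


def welch_t(a, b):
    """Return Welch's t statistic and its approximate degrees of freedom."""
    va = statistics.variance(a) / len(a)  # squared standard error of mean(a)
    vb = statistics.variance(b) / len(b)  # squared standard error of mean(b)
    t = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(va + vb)
    # Welch-Satterthwaite approximation for the degrees of freedom.
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


# Hypothetical amounts for two groups of donors.
group_a = [25, 25, 50, 10, 25]
group_b = [25, 20, 5, 10, 10]

t, df = welch_t(group_a, group_b)
print(f"t = {t:.2f}, df = {df:.1f}")
```

Turning the statistic into a significance level requires the t distribution's CDF, which the standard library does not provide; a statistics package would normally supply it.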

New York Times, FlowingData. All rights reserved. This content is excluded from our Creative Commons license. For more information, see http://ocw.mit.edu/fairuse.

[Images removed due to copyright restrictions: suggested movies on Netflix, Facebook search, LinkedIn logo.]

Context: Yesterday and today, three companies kindly came to talk about their technologies. I personally found it awesome because it gives context to the material we've been teaching and learning.

Similar: What struck me was how similar their processes are to what we've done in this class, only on a different dataset or at a different scale.

Pipeline: crazy raw data, cleaned and structured data, exploratory data analysis, verify hunches, data product (tm hammer@cloudera). Different companies fit into different subsets of the pipeline. Locu is the first segment (100% accuracy). Visible Measures is the full pipeline, at huge scale. Hadapt makes exploring and verifying faster.

http://locu.com/ [logo removed due to copyright restrictions]

http://www.visiblemeasures.com/ Google Analytics. Takes structured Apache logs (access logs) and analyzes them to see how many people are viewing a particular internet video ad.
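To make the access-log idea concrete, here is a small sketch that counts distinct client hosts per requested path. It assumes the Apache Common Log Format; the log lines and the `/ads/...` paths are invented for illustration and say nothing about how Visible Measures actually does it.

```python
import re

# Common Log Format: host ident user [time] "method path protocol" status bytes
LINE = re.compile(r'^(\S+) \S+ \S+ \[[^\]]+\] "(\S+) (\S+) [^"]*" (\d{3}) \S+')

# Hypothetical log lines.
LOG = '''10.0.0.1 - - [01/Jan/2012:10:00:00 -0500] "GET /ads/video1.mp4 HTTP/1.1" 200 1024
10.0.0.2 - - [01/Jan/2012:10:00:05 -0500] "GET /ads/video1.mp4 HTTP/1.1" 200 1024
10.0.0.1 - - [01/Jan/2012:10:01:00 -0500] "GET /ads/video2.mp4 HTTP/1.1" 200 2048
10.0.0.3 - - [01/Jan/2012:10:02:00 -0500] "GET /ads/video2.mp4 HTTP/1.1" 404 0
'''


def viewers_per_video(log_text):
    """Count distinct hosts that successfully fetched each path."""
    viewers = {}
    for line in log_text.splitlines():
        match = LINE.match(line)
        if match is None:
            continue  # skip lines that do not parse
        host, method, path, status = match.groups()
        if method == "GET" and status == "200":
            viewers.setdefault(path, set()).add(host)
    return {path: len(hosts) for path, hosts in viewers.items()}


views = viewers_per_video(LOG)
print(views)
```

Counting hosts is only a crude proxy for people, since several viewers can share one address.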

http://www.vertica.com/ [logo removed due to copyright restrictions]

http://www.hadapt.com/ Hadapt doesn't actively perform data analysis. Instead, they create platforms that help other companies (like Visible Measures) perform their data analysis faster. You'll find companies focused on every part of this pipeline. It's what makes companies smarter.

Visible Measures, Locu: They gave us context about what companies centered on data analytics are doing. Much of it is very similar to what we did, at a huge scale.

Getting Data. Visualization. Statistics. Machine Learning. Graph Data. Text Data. Databases. Big Data.

Getting Data: surveys; web crawling/scraping (https://scraperwiki.com, http://nutch.apache.org); sensors.
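The core of a scraper is pulling structured pieces out of a page. The sketch below extracts links from an HTML string with the standard library's `html.parser`; the page content and the `example.com` base URL are placeholders, and a real crawler would also need to fetch pages and respect robots.txt.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collect absolute URLs from <a href="..."> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    # Resolve relative links against the page's URL.
                    self.links.append(urljoin(self.base_url, value))


# Placeholder page; no network access is needed for this sketch.
PAGE = (
    '<html><body><a href="/menu">Menu</a> '
    '<a href="http://example.com/about">About</a> '
    '<a name="top">no link</a></body></html>'
)

parser = LinkExtractor("http://example.com/")
parser.feed(PAGE)
links = parser.links
print(links)
```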

Visualizations: interactive visualizations with HTML5/CSS/JavaScript. Tools: processingjs, d3, prefuse. Blogs: http://flowingdata.com, http://infosthetics.com. Courses: Harvard http://cs171.org, MIT 6.831.

Statistics: Are they different? T-tests, ANOVA. Bayesian statistics. Correlation. Regressions: linear, non-linear. Courses: 16.470j, http://statistics.mit.edu.
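Correlation and linear regression share most of their arithmetic, so one small function can illustrate both. This is a sketch of ordinary least squares for a single predictor, written with the standard library only; the data points are invented.

```python
import math


def fit_line(xs, ys):
    """Return (slope, intercept, pearson_r) for a simple linear fit."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r


# Hypothetical measurements that roughly follow y = 2x.
xs = [1, 2, 3, 4, 5]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

slope, intercept, r = fit_line(xs, ys)
print(f"y = {slope:.2f}x + {intercept:.2f}, r = {r:.3f}")
```

A correlation close to 1 says the points lie near a line; it does not by itself say the relationship is causal or that a line is the right model.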

Machine Learning: classification, clustering. Courses: http://www.ml-class.org, MIT 6.867. Python: scikit-learn (sklearn).
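To show what clustering does, here is a bare-bones k-means sketch in plain Python. The points and the naive "first k points" initialisation are assumptions made for a deterministic example; in practice a library such as scikit-learn provides a more robust implementation.

```python
import math


def kmeans(points, k, iterations=10):
    """Very small k-means: returns (labels, centroids)."""
    centroids = list(points[:k])  # naive initialisation, for illustration only
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return labels, centroids


# Hypothetical 2-D points forming two obvious groups.
points = [(1, 1), (8, 8), (1.5, 2), (9, 9.5), (1, 0.5), (8.5, 9)]
labels, centroids = kmeans(points, k=2)
print(labels, centroids)
```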

Graph Data. Examples: web pages, friend graph, Twitter. Metrics: centrality, cohesion, importance (PageRank). Courses: Social Network Web data mining (MIT course, Sep Kamvar, Fall 2012), http://www.stats.ox.ac.uk/~snijders/sna_course.htm.
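The importance metric can be sketched as a power iteration over a small link graph. The graph below is made up, and the damping factor of 0.85 and the handling of pages without outgoing links are conventional choices assumed here, not details from the slides.

```python
def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank; graph maps each node to the nodes it links to."""
    nodes = list(graph)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        new_rank = {node: (1.0 - damping) / n for node in nodes}
        for node, targets in graph.items():
            if targets:
                share = damping * rank[node] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Dangling node: spread its rank evenly over all nodes.
                for target in nodes:
                    new_rank[target] += damping * rank[node] / n
        rank = new_rank
    return rank


# Hypothetical web graph: every link target also appears as a key.
web = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(web)
print(sorted(ranks, key=ranks.get, reverse=True))
```

Page "c" collects links from three pages and so ends up most important, while "d", which nobody links to, keeps only the baseline share.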

Text Data: natural language processing. Parsing sentences: extracting the grammar/structure. Similarity measures: cosine similarity, Jaccard. Identifying entities: OpenCalais. Courses: MIT 6.864/6.863J.
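Both similarity measures are short enough to write out. The sketch below assumes the simplest possible tokenisation (lowercase, split on whitespace), which is a simplification; the two example sentences are invented.

```python
import math
from collections import Counter


def tokens(text):
    """Naive tokeniser: lowercase and split on whitespace."""
    return text.lower().split()


def jaccard(a, b):
    """Size of the shared vocabulary divided by the size of the combined one."""
    set_a, set_b = set(tokens(a)), set(tokens(b))
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0


def cosine(a, b):
    """Cosine of the angle between the two word-count vectors."""
    count_a, count_b = Counter(tokens(a)), Counter(tokens(b))
    dot = sum(count_a[w] * count_b[w] for w in count_a)
    norm_a = math.sqrt(sum(v * v for v in count_a.values()))
    norm_b = math.sqrt(sum(v * v for v in count_b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


doc1 = "big data is big"
doc2 = "data is everywhere"
print(jaccard(doc1, doc2), cosine(doc1, doc2))
```

Jaccard ignores how often a word occurs, while cosine similarity takes the counts into account, so repeated words shift the two scores differently.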

Databases: SQL implements a lot of what we did: filtering, joining, grouping, summarizing. Specialized systems to do this: SQL databases, Hive, Pig. Courses: MIT 6.830, http://db-class.org.
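The four operations map directly onto SQL clauses. Here is a sketch using Python's built-in `sqlite3` module with an in-memory database; the table names, columns, and rows are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE contributions (donor TEXT, state TEXT, amount REAL);
CREATE TABLE states (code TEXT, name TEXT);
INSERT INTO contributions VALUES
  ('BROWN, CHARLENE', 'AK', 25), ('KELLY, RAY', 'AL', 25),
  ('CINGEL, KEITH', 'AL', 50), ('STEWART, MICHAEL', 'AZ', 5);
INSERT INTO states VALUES ('AK', 'Alaska'), ('AL', 'Alabama'), ('AZ', 'Arizona');
""")

rows = conn.execute("""
SELECT s.name, COUNT(*) AS n, SUM(c.amount) AS total   -- summarizing
FROM contributions AS c
JOIN states AS s ON s.code = c.state                    -- joining
WHERE c.amount >= 10                                    -- filtering
GROUP BY s.name                                         -- grouping
ORDER BY total DESC
""").fetchall()

for name, n, total in rows:
    print(name, n, total)
```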

Big Data: How to process data on 1000+ machines? Problems: managing machines; machines fail all the time; network problems; out-of-sync data (consistency). Courses: Distributed Systems, MIT 6.824 (6.830 a bit).
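One common way to cope with failing machines is to split the work into independent chunks and rerun any chunk whose worker dies. The sketch below simulates that on a single machine with a word count; the chunking, the retry policy, and the simulated failure counts are all assumptions made for illustration, not a description of any particular system.

```python
from collections import Counter


def map_chunk(lines):
    """Count the words in one chunk of input."""
    counts = Counter()
    for line in lines:
        counts.update(line.split())
    return counts


def run_with_retries(task, chunk, fail_first, max_attempts=3):
    """Run task(chunk), simulating a worker crash on the first `fail_first` attempts."""
    for attempt in range(max_attempts):
        try:
            if attempt < fail_first:
                raise RuntimeError("simulated machine failure")
            return task(chunk)
        except RuntimeError:
            continue  # reschedule the same chunk
    raise RuntimeError("task failed on every attempt")


chunks = [["a b a"], ["b c"], ["a c c"]]
simulated_failures = [0, 1, 2]  # hypothetical crashes per chunk

partials = [
    run_with_retries(map_chunk, chunk, failures)
    for chunk, failures in zip(chunks, simulated_failures)
]
total = sum(partials, Counter())  # combine the per-chunk results
print(total)
```

Retrying is safe here because each chunk's result depends only on its own input, so running it twice gives the same answer.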

Berkeley Also Has a Class! http://datascienc.es

Thank You! git pull

MIT OpenCourseWare http://ocw.mit.edu

Resource: How to Process, Analyze and Visualize Data. Adam Marcus and Eugene Wu.

The following may not correspond to a particular course on MIT OpenCourseWare, but has been provided by the author as an individual learning resource. For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.