JUMPSTARTING NEURAL NETWORK TRAINING FOR SEISMIC PROBLEMS

Fantine Huot (Stanford Geophysics)
Advised by Greg Beroza & Biondo Biondi (Stanford Geophysics & ICME)

LEARNING FROM DATA
Deep learning networks can learn very accurate mappings from inputs to outputs when given large amounts of labeled data. Example applications:
- Noise removal
- Object detection
- Event detection
- Image segmentation
However, these models are immensely data-hungry and rely on huge amounts of labeled data to achieve their performance.

CHALLENGES & LIMITATIONS
LIMITED LABELED DATA: the earth is intrinsically unlabeled
- Absence of ground truth
- Uncertain labels
- Fuzzy boundaries
- Unbalanced data sets with rare events
POOR GENERALIZATION: the real world is messy
- Infinite number of novel scenarios
- Many sources of noise
- Small signal-to-noise ratio
- Incomplete data, dropouts
The ability to transfer knowledge to new conditions is generally known as transfer learning.

TRADITIONAL SUPERVISED LEARNING
Training and evaluation on the same task or domain.
[Figure: Task / Domain A is used to train MODEL A; Task / Domain B is used to train MODEL B]

TRANSFER LEARNING
Storing knowledge gained solving one problem and applying it to a different but related problem.
[Figure: a MODEL trained on the source task / domain passes KNOWLEDGE to a MODEL for the target task / domain]

TRANSFER LEARNING SCENARIOS
[Figure: chart with axes "Size of data set" and "Problem similarity", showing four strategies]
- Train target model from scratch
- Fine-tune the lower layers of the pretrained model
- Fine-tune pretrained source model
- Fine-tune the output dense layer of the pretrained model

SMALL MAGNITUDE EARTHQUAKE DETECTION

DATASET
NCSN event catalog

DATA SAMPLES
- 1-10 Hz bandpass
- Downsampling
- Data cleaning using STA/LTA thresholding
- Normalization

DATA SAMPLES
- 1-10 Hz bandpass
- Downsample to 20 Hz
- Select a random noise window per day
- Normalize

WAVELET ATTRIBUTES
- CWT with Morlet wavelet
- 30 scales computed over time
- Time pseudo-frequencies: 1-10 Hz
- 15 s waveform windows
- Downsample to 0.5 s intervals
- Normalize
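A minimal sketch of how 30 scales could be chosen to cover 1-10 Hz pseudo-frequencies, using the usual relation pseudo-frequency = center frequency / (scale x sampling interval). The 20 Hz sampling rate comes from the data-preparation slides; the Morlet center frequency of 0.8125 cycles per sample and the logarithmic spacing are assumptions for illustration, not values stated in the talk. The function name `scales_for_band` is hypothetical.

```python
def scales_for_band(f_min, f_max, n_scales, sampling_rate, center_freq=0.8125):
    """Return (scale, pseudo_frequency) pairs, log-spaced from f_max down to f_min.

    center_freq is the wavelet's center frequency in cycles per sample
    (assumed value; it depends on the Morlet convention in use).
    """
    if not (0 < f_min < f_max) or n_scales < 2:
        raise ValueError("need 0 < f_min < f_max and at least two scales")
    dt = 1.0 / sampling_rate
    # Constant ratio between consecutive frequencies gives log spacing.
    ratio = (f_min / f_max) ** (1.0 / (n_scales - 1))
    pairs = []
    for k in range(n_scales):
        freq = f_max * ratio ** k
        scale = center_freq / (freq * dt)
        pairs.append((scale, freq))
    return pairs


pairs = scales_for_band(1.0, 10.0, 30, sampling_rate=20.0)
for scale, freq in (pairs[0], pairs[-1]):
    print(f"scale {scale:.3f} -> pseudo-frequency {freq:.2f} Hz")
```

Small scales correspond to the high end of the band and large scales to the low end, so the 30 rows of the resulting attribute image run from 10 Hz down to 1 Hz.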

NETWORK ARCHITECTURE

DATA VOLUMES
For each station:
- 3,000 noise samples
- 2,000 earthquake samples
- Training set : test set = 80 : 20
ACCURACY: 65%
- Insufficient labeled data
- Unbalanced dataset
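To put the 65% figure in context, the short sketch below works out the split sizes and the accuracy of a trivial classifier that always predicts the majority class. It uses only the per-station counts quoted on the slide; the helper names are hypothetical, and applying the 80:20 split per station is an assumption.

```python
def split_counts(n_samples, train_fraction=0.8):
    """Return (n_train, n_test) for a simple fractional split."""
    n_train = round(n_samples * train_fraction)
    return n_train, n_samples - n_train


def majority_baseline(class_counts):
    """Accuracy obtained by always predicting the most frequent class."""
    total = sum(class_counts.values())
    return max(class_counts.values()) / total


counts = {"noise": 3000, "earthquake": 2000}
n_train, n_test = split_counts(sum(counts.values()))
baseline = majority_baseline(counts)
print(n_train, n_test, baseline)  # 4000 1000 0.6
```

With 3,000 noise and 2,000 earthquake samples, always answering "noise" already scores 60%, so 65% is only five percentage points better than not learning anything about the waveforms.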

TRANSFER LEARNING
MNIST: 50,000 examples in the training set
[Figure: panels labeled 1 and 0]

RESULTS
- When using only one station: ACCURACY 99.5%
- When combining all 4 stations: ACCURACY 96.8% (out of 4,000 events, about a hundred were misclassified)
- Using a 7-layer CNN: ACCURACY 98.2%

TRAFFIC NOISE DETECTION USING A FIBER OPTIC SEISMIC NETWORK

AMBIENT NOISE MONITORING
- The ambient seismic noise field can be used for near-surface imaging or environmental monitoring
- Fiber optic cables can be used for recording seismic waves
- Changes in backscattered light give information about strain rate

FIBER OPTIC ARRAY UNDER STANFORD CAMPUS
- Continuous recording since September 2016
- 2.5 km loop
- 600 sensors
- 50 samples per second
[Figure: map of the fiber path showing buildings, a construction site, and a road with cars; cross-section of the installation with the fibre optic cable in a 10-15 cm PVC (or similar) conduit, buried 1-2 m deep in soil beneath a mix of surface materials (soil, concrete)]

BASICALLY NOISE
[Figure: strain rates, bandpass 0.5-24 Hz]

SELECTIVE FILTERING
[Figure: data before and after selective filtering]
Eileen R. Martin, Fantine Huot, Yinbin Ma, Robert Cieplicki, Steve Cole, Martin Karrenbach, Biondo L. Biondi. A Seismic Shift in Scalable Acquisition Demands New Processing: Fiber-Optic Seismic Signal Retrieval in Urban Areas with Unsupervised Learning for Coherent Noise Removal. IEEE Signal Processing Magazine.

CAR DETECTION WITH A NEURAL NETWORK
Classes: cars, synthetic cars, background noise
- Raw data
- Detection window: 10 channels x 10 seconds
- Downsampling along the time axis: 10 x 50 samples
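The window sizes above imply a tenfold reduction along time: 10 seconds at the array's 50 samples per second is 500 samples per channel, reduced to 50. A minimal sketch of that step follows; block averaging is an assumption made for illustration (the slides do not say how the downsampling was done), and `downsample_time` is a hypothetical helper.

```python
def downsample_time(window, factor):
    """Average consecutive blocks of `factor` samples along each channel."""
    out = []
    for channel in window:
        if len(channel) % factor != 0:
            raise ValueError("channel length must be a multiple of factor")
        out.append([
            sum(channel[i:i + factor]) / factor
            for i in range(0, len(channel), factor)
        ])
    return out


n_channels, seconds, rate = 10, 10, 50
# Synthetic ramp data standing in for a recorded detection window.
window = [[float(t) for t in range(seconds * rate)] for _ in range(n_channels)]
small = downsample_time(window, factor=10)
print(len(small), len(small[0]))  # 10 50
```

Averaging acts as a crude low-pass filter before decimation, which limits aliasing compared with simply keeping every tenth sample.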

DATA SET

             Real cars   Synthetic cars   Background noise    Total
Training         5,000           20,000             25,000   50,000
Validation       1,000            4,000              5,000   10,000
Test             5,000                0              5,000   10,000

ACCURACY: 99.4%

EXAMPLES OF EVENTS THAT WERE HARD TO CLASSIFY
Out of 5,000 cars:
- 1 was misclassified
- 38 obtained a normalized probability score less than 90%
- 59 obtained a normalized probability score less than 95%

CONCLUSIONS
SYNTHETIC DATA GENERATION
- Compensate for limited labeled data and unbalanced data sets
- Leverage domain knowledge
TRANSFER LEARNING
- Generalize to new conditions
- Transfer knowledge to problems with limited data

THANK YOU
Feedback? Questions? Please share! fantine@stanford.edu

BACKUP SLIDES

HOEFFDING INEQUALITY

THE IMPACT OF MISLABELED DATA

UNIVERSAL APPROXIMATION THEOREM
The universal approximation theorem states that a feedforward neural network with a linear output layer and at least one hidden layer with either a logistic sigmoid or a rectified linear unit activation function can approximate any continuous function from one finite-dimensional space to another, on a closed and bounded domain, to within any desired non-zero error, provided that the network is given enough hidden units.

OBJECTIVE FUNCTION

MAX POOL

DATA ATTRIBUTES USING CONTINUOUS WAVELET TRANSFORMS
- Morlet wavelet
- 30 scales computed over both time and space
- Time pseudo-frequencies: 0.5-24 Hz
- Spatial pseudo-wavenumbers: 1/(500 m) to 1/(8 m)
- Downsampling to 0.5 s intervals
- Normalization to zero mean and unit variance
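The last step, normalization to zero mean and unit variance, can be sketched as follows. This is an illustrative helper (the name `standardize` and the handling of constant inputs are assumptions), applied here to a flat list of attribute values.

```python
import math


def standardize(values):
    """Shift and scale values to zero mean and unit (population) variance."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / n
    std = math.sqrt(variance)
    if std == 0.0:
        # Constant input: nothing to scale, so return all zeros.
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]


z = standardize([1.0, 2.0, 3.0, 4.0])
print(z)
```

Standardizing each attribute keeps scales with large raw amplitudes from dominating distance-based methods such as clustering.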

TRAFFIC NOISE CLUSTER
K-means clustering over a week of data
[Figure: map of the fiber path showing buildings, a construction site, and a road with cars; time labels 8 AM and 5 PM]
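For readers unfamiliar with k-means, here is a minimal pure-Python sketch of the algorithm (Lloyd's iterations: assign each point to its nearest centroid, then move each centroid to the mean of its members). The two-dimensional points are made-up stand-ins for wavelet attribute vectors, and the function is an illustration rather than the implementation used in the talk.

```python
import random


def kmeans(points, k, n_iter=50, seed=0):
    """Cluster points (tuples of numbers) into k groups; return (labels, centroids)."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(n_iter):
        # Assignment step: nearest centroid by squared Euclidean distance.
        new_labels = [
            min(range(k),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centroids[j])))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stable, so centroids would not move either
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical feature vectors: a quiet group and a noisy group.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
labels, centroids = kmeans(points, k=2)
print(labels)
```

K-means needs the number of clusters up front and can settle in a poor local optimum on less clearly separated data, which is one reason to compare it with hierarchical clustering.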

SO WHAT TYPE OF NOISE DID WE IDENTIFY?
Hierarchical clustering over a month of data:
- Cars
- Coherent noise
- Laser noise
- Background noise

CLUSTERED EVENTS

INTERFEROMETRY EXTRACTS SIGNAL FROM AMBIENT NOISE
The cross-correlation C(v, r, τ) between the recordings d(r, t) at receiver r and d(v, t) at virtual source v is maximized at the lag τ equal to the time it takes a wave to travel from one receiver to the other.
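A minimal sketch of this idea, assuming a discrete cross-correlation C(v, r, lag) = sum over t of d(v, t) * d(r, t + lag). The synthetic traces below are seeded random noise in which the receiver records the same wavefield as the virtual source, delayed by 7 samples; the delay, trace length, and function names are illustrative assumptions.

```python
import random


def cross_correlation(d_v, d_r, max_lag):
    """Return {lag: sum_t d_v[t] * d_r[t + lag]} for lags in [-max_lag, max_lag]."""
    result = {}
    for lag in range(-max_lag, max_lag + 1):
        total = 0.0
        for t in range(len(d_v)):
            s = t + lag
            if 0 <= s < len(d_r):
                total += d_v[t] * d_r[s]
        result[lag] = total
    return result


def best_lag(d_v, d_r, max_lag):
    """Lag (in samples) at which the cross-correlation is largest."""
    corr = cross_correlation(d_v, d_r, max_lag)
    return max(corr, key=corr.get)


rng = random.Random(1)
noise = [rng.gauss(0.0, 1.0) for _ in range(400)]
delay = 7  # hypothetical travel time in samples
d_v = noise
d_r = [0.0] * delay + noise[:-delay]  # same wavefield, arriving later
lag = best_lag(d_v, d_r, max_lag=20)
print(lag)
```

Dividing the recovered lag by the sampling rate gives the travel time in seconds; with real data, correlations are typically stacked over many time windows to stabilize the peak.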

WE CAN EXTRACT COHERENT SIGNAL FROM THE NOISE
[Figure: map of the fiber path showing buildings, a construction site, a road with cars, and an array corner; annotation: hyperbola along nearest orthogonal line]
REQUIRES UNCORRELATED, UNIFORMLY DISTRIBUTED NOISE

TRAIN WITH SYNTHETICALLY GENERATED DATA

PROPOSED SOLUTION: SYNTHETIC DATA GENERATION
Synthetic data generation is commonly used in machine learning and has been successfully applied to character recognition in natural images, traffic sign recognition, handwriting recognition, face recognition, and protein interactions (Fanelli et al., 2011).

STRATEGY: TRAINING ON SYNTHETIC DATA
- Synthetic data generation is commonly used in machine learning and has been successfully applied to character recognition in natural images, traffic sign recognition, handwriting recognition, face recognition, and protein interactions
- Deep networks for physics modeling are entirely trained on synthetic data (Ling, A. Kurzawski, and J. Templeton. Journal of Fluid Mechanics, 807:155-166, 2016)
[Figure: Reynolds stress anisotropy tensor]
LIMITATIONS
- Synthetic data do not capture the complexity of the real world

MOTIVATION
- STA/LTA
- Efficient similarity search of seismic waveforms using template matching
C. E. Yoon, O. O'Reilly, K. J. Bergen, and G. C. Beroza. Earthquake detection through computationally efficient similarity search. Science Advances, 2015.
How can we find new earthquake templates?

DATASET
NCSN event catalog
- BK-SAO.HHZ, San Andreas Geophysical Observatory, Hollister
- BK-JRSC.HHZ, Jasper Ridge Biological Preserve, near Stanford
- BK-PKD.HHZ, Bear Valley Ranch, Parkfield
- BK-CVS.HHZ, Carmenet Vineyards, Sonoma

DATA WRANGLING
- Decimate to 20 Hz sampling rate and bandpass 1-10 Hz
- Normalize traces (original scale stored)
- Earthquakes: remove events that don't meet STA/LTA criteria or whose P-wave arrival time is unknown (STA window = 1, LTA window = 30, threshold = 5)
- Background: randomly selected 2-minute segments from each 24-hour period
- Background: remove segments that are likely to contain events (STA window = 3, LTA window = 45, threshold = 6)
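The STA/LTA criterion above can be sketched as the ratio of a short-term average to a long-term average of signal energy, with both windows ending at the same sample. This is an illustration under stated assumptions: the window lengths are taken to be in seconds (the slide gives no units), energy is the squared amplitude, and the function names and synthetic traces are hypothetical.

```python
def sta_lta(trace, n_sta, n_lta):
    """STA/LTA ratio of squared amplitude; zero until a full LTA window is available."""
    if not (0 < n_sta <= n_lta <= len(trace)):
        raise ValueError("need 0 < n_sta <= n_lta <= len(trace)")
    csum = [0.0]  # running sum of energy, csum[i] = sum of first i samples
    for x in trace:
        csum.append(csum[-1] + x * x)
    ratio = [0.0] * len(trace)
    for i in range(n_lta - 1, len(trace)):
        sta = (csum[i + 1] - csum[i + 1 - n_sta]) / n_sta
        lta = (csum[i + 1] - csum[i + 1 - n_lta]) / n_lta
        if lta > 0.0:
            ratio[i] = sta / lta
    return ratio


def passes_sta_lta(trace, n_sta, n_lta, threshold):
    """True if the ratio reaches the threshold anywhere in the trace."""
    return max(sta_lta(trace, n_sta, n_lta)) >= threshold


rate = 20  # Hz, after decimation
n_sta, n_lta = 1 * rate, 30 * rate  # assumes windows of 1 s and 30 s
background = [1.0 if i % 2 == 0 else -1.0 for i in range(1040)]
event = list(background)
for i in range(800, 840):  # 2 s burst with ten times the amplitude
    event[i] *= 10.0
print(passes_sta_lta(event, n_sta, n_lta, threshold=5.0))       # True
print(passes_sta_lta(background, n_sta, n_lta, threshold=5.0))  # False
```

The same helper covers both uses on the slide: keeping catalog events that exceed the threshold, and discarding background segments whose ratio suggests they contain an event.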