Reducing confounding factors in automatic acoustic recognition of individual birds

Dan Stowell, Machine Listening Lab, Centre for Digital Music

Machine listening and bird sounds: why?
Populations and migration patterns are changing, so monitoring is important.
Intrusive vs. passive monitoring: catching and ringing birds has a behavioural impact.
Many birds are most easily observed by sound.
Manual (volunteer) monitoring is common, but not scalable.
Contact: dan.stowell@qmul.ac.uk

In this talk...
Classification-based approaches to:
0. Bird species recognition
1. Bird sound detection (presence/absence)
2. Bird individual ID
(By the way: we do more than just classification!)

Species classification of bird sounds
In 2014: feature-learning approach to bird sound recognition.
[Table: dataset summary with columns Dataset, Location, Total duration, Num items, Num classes, Labelling. Recoverable row: lifeclef2014, Brazil, 77.8 hours (12M frames), single-label.]
[Figure: AUC (%) on lifeclef2014 (classifier: multilabel) for the feature sets mfcc-ms, mfcc-maxp, mfcc-modul, melspec-ms, melspec-maxp, melspec-modul, and the feature-learning variants melspec-kfl1-ms, melspec-kfl2-ms, melspec-kfl3-ms, melspec-kfl4-ms, melspec-kfl8-ms, c-kfl4pl8kfl4-ms.]

Bird species classification: Warblr
Warblr app for Android and iOS.

Bird species classification: Warblr
Over 45,000 recordings submitted to our database (80/day).
[Figure: submission geolocations]

Some of our users...

Part 1: Bird Audio Detection challenge
Many projects need reliable detection of bird sounds, e.g. in long unattended recordings.
But existing methods are not robust or general-purpose enough, and need a lot of manual tweaking and post-processing.

Bird Audio Detection challenge
We designed the Bird Audio Detection challenge around three datasets:
Dev set 1: 10k items, crowdsourced audio from around the UK (Warblr phone app)
Dev set 2: 7k items, crowdsourced audio from misc field recordings
Testing set: 10k items, remote monitoring, Chernobyl Exclusion Zone

Bird Audio Detection challenge
Training and testing sets differ in: location, recording equipment, species, class balance, background sounds, time of day, time of year, weather...
How is a classifier meant to work in such mismatched conditions?

Bird Audio Detection challenge: outcomes
30 teams submitted.
Strong results (up to 89% AUC).
Domain adaptation strategies (pseudo-labelling, test mixing) were used, though they were not always needed.
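The challenge results above are quoted as AUC. As a rough illustration only (this is not the challenge's scoring code, and the function name `auc` is a hypothetical choice), the area under the ROC curve can be computed from detector scores with the rank-sum identity: it is the fraction of (bird, no-bird) pairs in which the bird item receives the higher score, with ties counted as half.

```python
def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney identity).

    scores: detector outputs, higher means "bird present" (assumed convention)
    labels: 1 for bird present, 0 for bird absent
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative item")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive item correctly ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(pos) * len(neg))


# Toy example: three of the four (positive, negative) pairs are ranked correctly.
example = auc([0.9, 0.2, 0.8, 0.1], [1, 1, 0, 0])
```

The pairwise loop is quadratic in the number of items; for datasets of the size described here a sort-based version would be the practical choice.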

So why do we evaluate using matched conditions?
To study the classifier's behaviour.
Sometimes a practical application is in matched conditions.
Pragmatic reasons: only one dataset available; free choice of bootstrap/n-fold cross-validation.
...because our algorithms aren't good enough at avoiding confounds?

Machine learning workflow
train, validate, test, really test
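One possible reading of the extra "really test" stage, given the argument for mismatched-condition testing, is a split in which some recording conditions are withheld entirely from training, validation and ordinary testing. The sketch below assumes each item carries a `condition` field (for example a site or a year); the field name, the function name and the split fractions are all illustrative assumptions, not something specified in the talk.

```python
import random


def four_way_split(items, held_out_conditions, seed=0, fractions=(0.6, 0.2)):
    """Split items into train/validate/test (matched) and really_test (mismatched).

    items: list of dicts, each with a "condition" key (hypothetical field name)
    held_out_conditions: set of conditions reserved for the mismatched test
    fractions: share of matched items used for train and validate; the rest is test
    """
    matched = [it for it in items if it["condition"] not in held_out_conditions]
    really_test = [it for it in items if it["condition"] in held_out_conditions]

    rng = random.Random(seed)      # fixed seed so the split is reproducible
    rng.shuffle(matched)

    n_train = int(fractions[0] * len(matched))
    n_val = int(fractions[1] * len(matched))
    return {
        "train": matched[:n_train],
        "validate": matched[n_train:n_train + n_val],
        "test": matched[n_train + n_val:],
        "really_test": really_test,
    }


# Toy data: ten items from one condition, five from a condition never seen in training.
toy = [{"id": i, "condition": "uk"} for i in range(10)]
toy += [{"id": 100 + i, "condition": "chernobyl"} for i in range(5)]
splits = four_way_split(toy, {"chernobyl"})
```

Comparing the score on `test` with the score on `really_test` gives a direct view of how much performance depends on matched conditions.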

Part 2: Identifying individual bird ID
Motivation: reduce intrusive monitoring (capturing/tagging/ringing).
Many birds do have an individual signature.

Identifying individual bird ID
Data collection.
Bird ID: categorical label. Is this the same task as species classification?

Making use of silence (1)
Training set: [figure not recovered]
Testing set: [figure not recovered]

Analogy: the album effect in music artist ID
Training set: "Express Yourself", "Bad"
Testing set: "Like a Prayer", "Smooth Criminal"

Territorial birds: the territory is the album
[Figure: AUC (%) for mel spec features and skfl features, with standard and augmented ("aug") training, for little owl (cross-year), pipit (within-year, across-year) and chiffchaff (within-year, across-year).]

Making use of silence (2)
Data augmentation of the TESTING set (adversarial).
Measure the distractibility of the classifier when mismatched silence is added.
Measure RMSE in classifier decisions.
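The RMSE measurement described above can be sketched as follows, assuming the classifier outputs one probability per class for each test item, once for the original audio and once for the same audio with mismatched silence added. The function name `decision_rmse` and the list-of-lists layout are assumptions made for illustration; the talk does not specify the exact form of the "decisions".

```python
import math


def decision_rmse(original_probs, augmented_probs):
    """RMSE between classifier decisions before and after adding mismatched silence.

    Both arguments are lists of per-item lists of class probabilities, in the
    same item and class order. A value of 0.0 means the added silence did not
    change any decision, i.e. the classifier was not distracted at all.
    """
    if len(original_probs) != len(augmented_probs):
        raise ValueError("both inputs must cover the same items")
    squared_error = 0.0
    count = 0
    for orig_row, aug_row in zip(original_probs, augmented_probs):
        if len(orig_row) != len(aug_row):
            raise ValueError("each item needs the same number of classes")
        for p, q in zip(orig_row, aug_row):
            squared_error += (p - q) ** 2
            count += 1
    return math.sqrt(squared_error / count)


# Toy example: one item whose decision shifts from (0.8, 0.2) to (0.5, 0.5).
shift = decision_rmse([[0.8, 0.2]], [[0.5, 0.5]])
```

A larger value indicates a classifier that leans more heavily on the background, which is the confound this part of the talk aims to expose.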

Making use of silence (2)
[Figure: confusion matrix of true vs. estimated individual (labels such as PC1102, PC1104, PC1106, PC1107, PC1108, PC1110, PC1112) for run linhart2015marcutday1day2_melspec-kfl4pe8kfl4-ms_nr05_pk0_heq0_pool0_rfall_max.]

Making use of silence (3)
Data augmentation of the TRAINING set.
Each item gets new versions with added silence from each class...
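A minimal sketch of this training-set augmentation, under stated assumptions: audio is represented as plain lists of samples, "adding" silence means sample-wise mixing with the background looped to the item's length, and there is one background-only excerpt per class. None of these details come from the slides, and the names `mix` and `augment_training_set` are hypothetical.

```python
def mix(foreground, background, gain=1.0):
    """Add a background excerpt to a foreground item, looping the background."""
    return [x + gain * background[i % len(background)]
            for i, x in enumerate(foreground)]


def augment_training_set(items, silence_by_class):
    """Return the original items plus one mixed version per background class.

    items: list of (samples, label) pairs
    silence_by_class: dict mapping class label -> background-only samples
    The label of each new version is that of the foreground item, so the
    background no longer predicts the class.
    """
    augmented = list(items)
    for samples, label in items:
        for background in silence_by_class.values():
            augmented.append((mix(samples, background), label))
    return augmented


# Toy example: two items and two classes give 2 + 2 * 2 = 6 training items.
toy_items = [([1.0, 1.0, 1.0], "bird_a"), ([2.0, 2.0, 2.0], "bird_b")]
toy_silence = {"bird_a": [0.5, -0.5], "bird_b": [0.1]}
toy_augmented = augment_training_set(toy_items, toy_silence)
```

Because every class's background appears under every label, a classifier trained on the augmented set has less incentive to treat the recording environment as a cue.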

Making use of silence (4)
Finally we can add a new wastebasket class.
NB not using the a/b labels here...
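The wastebasket idea can be sketched in the same style: background-only excerpts are added to the training set under a single extra label, regardless of which individual's recordings they came from. The label string, the function name and the data layout are assumptions for illustration only.

```python
WASTEBASKET = "wastebasket"  # hypothetical name for the extra class


def add_wastebasket_class(items, silence_excerpts):
    """Append silence-only excerpts to the training set under one shared label.

    items: list of (samples, label) pairs for the real classes
    silence_excerpts: list of background-only sample lists, from any class
    """
    extra = [(excerpt, WASTEBASKET) for excerpt in silence_excerpts]
    return list(items) + extra


# Toy example: two labelled items plus three silence excerpts.
labelled = [([1.0, 0.5], "bird_a"), ([0.2, 0.9], "bird_b")]
with_wastebasket = add_wastebasket_class(labelled, [[0.0, 0.1], [0.1, 0.0], [0.0, 0.0]])
```

Giving the classifier an explicit place to put background-only input means it does not have to assign that input to one of the individuals.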

Results
Plus silence-test result: 50% AUC (chance level).

Conclusions
Outdoor bird sound recognition is tricky:
The sounds (classes) are highly variable.
There are many potential confounding factors for black-box ML.
1. Bird Audio Detection Challenge:
Good detection, even in strongly mismatched conditions.
Adaptation methods are useful, though not always needed?
2. Recognising individual bird ID:
Strong recognition is possible (depending on species).
Silence is surprisingly useful for sound recognition!
Generally: make more use of mismatched-condition testing.

Thank you
Collaborators:
1. Bird Audio Detection Challenge: Mike Wood (U of Salford), Yannis Stylianou (U of Crete), Hervé Glotin (U of Toulon), IEEE Signal Processing Society
2. Recognising individual bird ID: Pavel Linhart (Adam Mickiewicz U / Praha U)
Machine Listening Lab: dan.stowell@qmul.ac.uk


ATIS Briefing March 21, 2017 Economic Critical Infrastructure and its Dependence on GPS. ATIS Briefing March 21, 2017 Economic Critical Infrastructure and its Dependence on GPS. Briefing question: If it s critical, then why isn t it uniformly monitored to detect bad actor jamming and spoofing

More information

Emotion Recognition from Decision Level Fusion of Visual and Acoustic Features using Hausdorff Classifier

Emotion Recognition from Decision Level Fusion of Visual and Acoustic Features using Hausdorff Classifier Emotion Recognition from Decision Level Fusion of Visual and Acoustic Features using Hausdorff Classifier H.D.Vankayallapati 1, K.R.Anne 2, and K. Kyamakya 1 1 Institute of Smart System Technologies, Transportation

More information

Fig.1 AR as mixed reality[3]

Fig.1 AR as mixed reality[3] Marker Based Augmented Reality Application in Education: Teaching and Learning Gayathri D 1, Om Kumar S 2, Sunitha Ram C 3 1,3 Research Scholar, CSE Department, SCSVMV University 2 Associate Professor,

More information

Phase 1 US Compliance Report

Phase 1 US Compliance Report Implementation of Regulatory Information Submission Standards (IRISS) ectd Tool Interoperability Group (ETIG) ectd Tool Interoperability and Compliance Study 3 (ETICS 3) ETICS 15 April 2011 Implementation

More information

Interaction Design -ID. Unit 6

Interaction Design -ID. Unit 6 Interaction Design -ID Unit 6 Learning outcomes Understand what ID is Understand and apply PACT analysis Understand the basic step of the user-centred design 2012-2013 Human-Computer Interaction 2 What

More information

Convergence and coevolution Business Ecosystems. Digital Ecosystems

Convergence and coevolution Business Ecosystems. Digital Ecosystems CITY HALL OF PARIS - 9 & 10 November 2006 The Digital Convergence Towards a More Competitive, Mobile and Inclusive Knowledge-Based Society Convergence and coevolution Business Ecosystems and Digital Ecosystems

More information

An Improved Voice Activity Detection Based on Deep Belief Networks

An Improved Voice Activity Detection Based on Deep Belief Networks e-issn 2455 1392 Volume 2 Issue 4, April 2016 pp. 676-683 Scientific Journal Impact Factor : 3.468 http://www.ijcter.com An Improved Voice Activity Detection Based on Deep Belief Networks Shabeeba T. K.

More information

Technology in Action. Alan Evans Kendall Martin Mary Anne Poatsy. Eleventh Edition. Copyright 2015 Pearson Education, Inc.

Technology in Action. Alan Evans Kendall Martin Mary Anne Poatsy. Eleventh Edition. Copyright 2015 Pearson Education, Inc. Technology in Action Alan Evans Kendall Martin Mary Anne Poatsy Eleventh Edition Technology in Action Chapter 1 Using Technology to Change the World Topics Technology on the World Stage Technology and

More information

Separating Voiced Segments from Music File using MFCC, ZCR and GMM

Separating Voiced Segments from Music File using MFCC, ZCR and GMM Separating Voiced Segments from Music File using MFCC, ZCR and GMM Mr. Prashant P. Zirmite 1, Mr. Mahesh K. Patil 2, Mr. Santosh P. Salgar 3,Mr. Veeresh M. Metigoudar 4 1,2,3,4Assistant Professor, Dept.

More information

Pléiades potentialities :

Pléiades potentialities : GT2 Risque et Aide humanitaire Pléiades potentialities : Assessment of clearing levels for operational management of forest fires in the Maures massif Marechal D., Thierion V., Kabar B., Ayral P.-A., Salze

More information

Environmental Noise Management Sofia December Brüel & Kjær S&V Torben Munk

Environmental Noise Management Sofia December Brüel & Kjær S&V Torben Munk Environmental Noise Management Sofia December 2006 Brüel & Kjær S&V Torben Munk Environmental Noise Management Why? Filename Imagine a city today Increasing traffic flows Bad conditions of road surfaces

More information

INTRODUCTION TO DEEP LEARNING. Steve Tjoa June 2013

INTRODUCTION TO DEEP LEARNING. Steve Tjoa June 2013 INTRODUCTION TO DEEP LEARNING Steve Tjoa kiemyang@gmail.com June 2013 Acknowledgements http://ufldl.stanford.edu/wiki/index.php/ UFLDL_Tutorial http://youtu.be/ayzoubkuf3m http://youtu.be/zmnoatzigik 2

More information

Non-Data Aided Doppler Shift Estimation for Underwater Acoustic Communication

Non-Data Aided Doppler Shift Estimation for Underwater Acoustic Communication Non-Data Aided Doppler Shift Estimation for Underwater Acoustic Communication (Invited paper) Paul Cotae (Corresponding author) 1,*, Suresh Regmi 1, Ira S. Moskowitz 2 1 University of the District of Columbia,

More information

Open-source AR platform for the future

Open-source AR platform for the future DAQRI ARToolKit 6/Open Source Open-source AR platform for the future Phil Oxford Brookes University 2017-01 ARToolKit 6: Future AR platform Tools Frameworks Tracking and localisation Tangible user interaction

More information

Digital Birding Resources

Digital Birding Resources Digital Birding Resources Introduction It may feel a bit intimidating to see the word digital associated with birding. The term digital is so widely used and encompasses so many different kinds of evolving

More information

AFRL. Technology Directorates AFRL

AFRL. Technology Directorates AFRL Sensors Directorate and ATR Overview for Integrated Fusion, Performance Prediction, and Sensor Management for ATE MURI 21 July 2006 Lori Westerkamp Sensor ATR Technology Division Sensors Directorate Air

More information

Active and Passive Acoustic Detection, Classification and Recognition with the Hopkins Acoustic Surveillance Unit (HASU)

Active and Passive Acoustic Detection, Classification and Recognition with the Hopkins Acoustic Surveillance Unit (HASU) Active and Passive Acoustic Detection, Classification and Recognition with the Hopkins Acoustic Surveillance Unit (HASU) Andreas G. Andreou Electrical and Computer Engineering and Center for Language and

More information

Volume 2, Number 3 Technology, Economy, and Standards October 2009

Volume 2, Number 3 Technology, Economy, and Standards October 2009 Volume 2, Number 3 Technology, Economy, and Standards October 2009 Editor Jeremiah Spence Guest Editors Yesha Sivan J.H.A. (Jean) Gelissen Robert Bloomfield Reviewers Aki Harma Esko Dijk Ger van den Broek

More information

Designing the Smart Foot Mat and Its Applications: as a User Identification Sensor for Smart Home Scenarios

Designing the Smart Foot Mat and Its Applications: as a User Identification Sensor for Smart Home Scenarios Vol.87 (Art, Culture, Game, Graphics, Broadcasting and Digital Contents 2015), pp.1-5 http://dx.doi.org/10.14257/astl.2015.87.01 Designing the Smart Foot Mat and Its Applications: as a User Identification

More information

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Mel Spectrum Analysis of Speech Recognition using Single Microphone International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree

More information

FILE / GUITAR THEORY FOR DUMMIES

FILE / GUITAR THEORY FOR DUMMIES 02 December, 2018 FILE / GUITAR THEORY FOR DUMMIES Document Filetype: PDF 340.31 KB 0 FILE / GUITAR THEORY FOR DUMMIES Learn the fundamental concepts of Music Theory with this For Dummies guide. Get this

More information

Community-as-a-Service: Data Validation in Citizen Science

Community-as-a-Service: Data Validation in Citizen Science Community-as-a-Service: Data Validation in Citizen Science Yurong He and Andrea Wiggins College of Information Studies, University of Maryland College Park, MD, USA {yrhe,wiggins}@umd.edu Abstract. Currently,

More information

Black. LWECS Site Permit. Stearns County. Permit Section:

Black. LWECS Site Permit. Stearns County. Permit Section: PERMIT COMPLIANCE FILING Permittee: Permit Type: Project Location: Docket No: Permit Section: Date of Submission : Black Oak Wind,, LLC LWECS Site Permit Stearns County IP6853/WS-10-1240 and IP6866/WS-11-831

More information

AI Frontiers. Dr. Dario Gil Vice President IBM Research

AI Frontiers. Dr. Dario Gil Vice President IBM Research AI Frontiers Dr. Dario Gil Vice President IBM Research 1 AI is the new IT MIT Intro to Machine Learning course: 2013 138 students 2016 302 students 2017 700 students 2 What is AI? Artificial Intelligence

More information

Advanced audio analysis. Martin Gasser

Advanced audio analysis. Martin Gasser Advanced audio analysis Martin Gasser Motivation Which methods are common in MIR research? How can we parameterize audio signals? Interesting dimensions of audio: Spectral/ time/melody structure, high

More information

Tumbleweed Express: A Tale of 54 Game Jams

Tumbleweed Express: A Tale of 54 Game Jams Tumbleweed Express: A Tale of 54 Game Jams Matthew Louis Mauriello Project Manager (@mattm401) IGDA DC Chapter Meeting June 28 th, 2016 (@IGDA_DC) INTRODUCTION Matthew Louis Mauriello PhD Student, Comp

More information