Reducing confounding factors in automatic acoustic recognition of individual birds
|
|
- Jasmine Elfrieda Green
- 5 years ago
- Views:
Transcription
1 Reducing confounding factors in automatic acoustic recognition of individual birds Dan Stowell Machine Listening Lab Centre for Digital Music Acoustic recognition of birds 1 / 31
2 versus
3 Machine listening and bird sounds - why? dan.stowell@qmul.ac.uk Acoustic recognition of birds 3 / 31
4 Machine listening and bird sounds - why? Changes in populations, in migration patterns monitoring is important Intrusive vs. passive monitoring behavioural impact of catching/ringing birds Many birds are most easily observed by sound Manual (volunteer) monitoring common, but not scalable dan.stowell@qmul.ac.uk Acoustic recognition of birds 4 / 31
5 In this talk... Classification-based approaches to: 0. Bird species recognition 1. Bird sound detection (presence/absence) 2. Bird individual ID (By the way: we do more than just classification!) Acoustic recognition of birds 5 / 31
6 Species classification of bird sounds In 2014: feature-learning approach to bird sound recognition Dataset Location Total duration Num items Num classes Labelling lifeclef Brazil 77.8 hours (12M frames) singlelabel 100 lifeclef2014 Classifier: multilabel 90 AUC (%) mfcc-ms mfcc-maxp mfcc-modul melspec-ms melspec-maxp melspec-modul melspec-kfl1-ms melspec-kfl2-ms Feature learning melspec-kfl3-ms melspec-kfl4-ms Acoustic recognition of birds 6 / 31 melspec-kfl8-ms c-kfl4pl8kfl4-ms
7 Bird species classification: Warblr Warblr app for Android and ios Acoustic recognition of birds 7 / 31
8 Bird species classification: Warblr Over 45,000 recordings submitted to our database ( 80/day) Submission geolocations dan.stowell@qmul.ac.uk Acoustic recognition of birds 8 / 31
9 Some of our users... Acoustic recognition of birds 9 / 31
10 Some of our users... Acoustic recognition of birds 9 / 31
11
12 Part 1: Bird Audio Detection challenge Many projects need reliable detection of bird sounds e.g. in long unattended recordings But existing methods are not robust, not general-purpose enough, and need lots of manual tweaking/post-processing Acoustic recognition of birds 11 / 31
13 Bird Audio Detection challenge We designed the Bird Audio Detection challenge Dev set 1: 10k items, crowdsourced audio from around the UK (Warblr phone app) Dev set 2: 7k items, crowdsourced audio from misc field recordings Testing set: 10k items, remote monitoring, Chernobyl Exclusion Zone Acoustic recognition of birds 12 / 31
14 Bird Audio Detection challenge Training/testing sets differ in: location recording eqpt species class balance background sounds time of day time of year weather... How is a classifier meant to work in such mismatched conditions??? dan.stowell@qmul.ac.uk Acoustic recognition of birds 13 / 31
15 Bird Audio Detection challenge: outcomes 30 teams submitted Strong results (up to 89% AUC) Domain adaptation strategies Pseudo-labelling, test mixing Though not always needed Acoustic recognition of birds 14 / 31
16 So why do we evaluate using matched conditions? To study the classifier s behaviour Sometimes a practical application is in matched conditions Pragmatic reasons: only one dataset available; free choice of bootstrap/n-fold crossvalidation...because our algorithms aren t good enough at avoiding confounds? dan.stowell@qmul.ac.uk Acoustic recognition of birds 15 / 31
17 Machine learning workflow train validate test Acoustic recognition of birds 16 / 31
18 Machine learning workflow train validate test reallytest Acoustic recognition of birds 16 / 31
19
20 Part 2: Identifying individual bird ID Motivation: reduce intrusive monitoring (capturing/tagging/ringing) Many birds do have individual signature Acoustic recognition of birds 18 / 31
21 Identifying individual bird ID Data collection: Acoustic recognition of birds 19 / 31
22 Identifying individual bird ID Data collection: Acoustic recognition of birds 19 / 31
23 Identifying individual bird ID Data collection: Bird ID: categorical label. Is this the same task as species classification? Acoustic recognition of birds 19 / 31
24 Identifying individual bird ID Acoustic recognition of birds 20 / 31
25 Identifying individual bird ID Acoustic recognition of birds 20 / 31
26 Making use of silence (1) Training set: Acoustic recognition of birds 21 / 31
27 Making use of silence (1) Training set: Testing set: Acoustic recognition of birds 21 / 31
28 Making use of silence (1) Acoustic recognition of birds 21 / 31
29 Making use of silence (1) Training set: Testing set: Acoustic recognition of birds 21 / 31
30 Making use of silence (1) Training set: Testing set: Acoustic recognition of birds 21 / 31
31 Analogy: the album effect in music artist ID Training set: Express Yourself Bad Acoustic recognition of birds 22 / 31
32 Analogy: the album effect in music artist ID Training set: Express Yourself Bad Testing set: Like a Prayer Smooth Criminal dan.stowell@qmul.ac.uk Acoustic recognition of birds 22 / 31
33 Analogy: the album effect in music artist ID Training set: Express Yourself Bad Testing set: Like a Prayer Smooth Criminal dan.stowell@qmul.ac.uk Acoustic recognition of birds 22 / 31
34 Analogy: the album effect in music artist ID Training set: Express Yourself Bad Testing set: Like a Prayer Smooth Criminal dan.stowell@qmul.ac.uk Acoustic recognition of birds 22 / 31
35 Territorial birds: the territory is the album mel spec features 100 skfl features AUC (%) owl cross-year t within-year t across-year f chaff within-year f chaff across-year aug little owl cross-year 40 pipit within-year pipit across-year chiff chaff within-year chiff chaff across-year 30 standard aug dan.stowell@qmul.ac.uk Acoustic recognition of birds 23 / 31
36 Making use of silence (2) Data augmentation of the TESTING set (adversarial) Measure the distractability of the classifier when mismatched silence is added Measure RMSE in classifier decisions Acoustic recognition of birds 24 / 31
37 Making use of silence (2) Confusion matrix: linhart2015marcutday1day2_melspec-kfl4pe8kfl4-ms_nr05_pk0_heq0_pool0_rfall_max PC PC1102 PC PC1104 PC True PC1106 PC1107 PC1108 PC PC1110 PC PC1112 PC Estimated dan.stowell@qmul.ac.uk Acoustic recognition of birds 25 / 31
38 Making use of silence (3) Data augmentation of the TRAINING set Each item gets new versions with added silence from each class... Acoustic recognition of birds 26 / 31
39 Making use of silence (4) Finally we can add a new wastebasket class NB not using the a/b labels here... dan.stowell@qmul.ac.uk Acoustic recognition of birds 27 / 31
40 Results Plus silence-test result: 50% AUC Acoustic recognition of birds 28 / 31
41
42 Conclusions Outdoor bird sound recognition is tricky: The sounds (classes) are highly variable Many potential confounding factors for black-box ML 1. Bird Audio Detection Challenge: Good detection, even in strongly mismatched conditions Adaptation methods useful though, not always needed? 2. Recognising individual bird ID: Strong recognition possible (depending on species) Silence is surprisingly useful for sound recognition! Generally: make more use of mismatched-condition testing Acoustic recognition of birds 30 / 31
43 Thank you Collaborators: 1. Bird Audio Detection Challenge: Mike Wood (U of Salford), Yannis Stylianou (U of Crete), Herve Glotin (U of Toulon), IEEE Signal Processing Society 2. Recognising individual bird ID: Pavel Linhart (Adam Mickiewicz U / Praha U) Machine Listening Lab: dan.stowell@qmul.ac.uk Acoustic recognition of birds 31 / 31
LifeCLEF Bird Identification Task 2016
LifeCLEF Bird Identification Task 2016 The arrival of deep learning Alexis Joly, Inria Zenith Team, Montpellier, France Hervé Glotin, Univ. Toulon, UMR LSIS, Institut Universitaire de France Hervé Goëau,
More informationAudio Similarity. Mark Zadel MUMT 611 March 8, Audio Similarity p.1/23
Audio Similarity Mark Zadel MUMT 611 March 8, 2004 Audio Similarity p.1/23 Overview MFCCs Foote Content-Based Retrieval of Music and Audio (1997) Logan, Salomon A Music Similarity Function Based On Signal
More informationCampus Location Recognition using Audio Signals
1 Campus Location Recognition using Audio Signals James Sun,Reid Westwood SUNetID:jsun2015,rwestwoo Email: jsun2015@stanford.edu, rwestwoo@stanford.edu I. INTRODUCTION People use sound both consciously
More informationarxiv: v2 [eess.as] 11 Oct 2018
A MULTI-DEVICE DATASET FOR URBAN ACOUSTIC SCENE CLASSIFICATION Annamaria Mesaros, Toni Heittola, Tuomas Virtanen Tampere University of Technology, Laboratory of Signal Processing, Tampere, Finland {annamaria.mesaros,
More informationTwo Convolutional Neural Networks for Bird Detection in Audio Signals
th European Signal Processing Conference (EUSIPCO) Two Convolutional Neural Networks for Bird Detection in Audio Signals Thomas Grill and Jan Schlüter Austrian Research Institute for Artificial Intelligence
More informationThe design and calibration of low cost urban acoustic sensing devices. SONYC Sounds Of New York City
The design and calibration of low cost urban acoustic sensing devices SONYC Sounds Of New York City C. Mydlarz NYU CUSP C. Shamoon NYC DEP M. Baglione, M Pimpinella The Cooper Union cmydlarz@nyu.edu Sounds
More informationCómo estructurar un buen proyecto de Machine Learning? Anna Bosch Rue VP Data Launchmetrics
Cómo estructurar un buen proyecto de Machine Learning? Anna Bosch Rue VP Data Intelligence @ Launchmetrics annaboschrue@gmail.com Motivating example 90% Accuracy and you want to do better IDEAS: - Collect
More informationSound Recognition. ~ CSE 352 Team 3 ~ Jason Park Evan Glover. Kevin Lui Aman Rawat. Prof. Anita Wasilewska
Sound Recognition ~ CSE 352 Team 3 ~ Jason Park Evan Glover Kevin Lui Aman Rawat Prof. Anita Wasilewska What is Sound? Sound is a vibration that propagates as a typically audible mechanical wave of pressure
More informationThe Jigsaw Continuous Sensing Engine for Mobile Phone Applications!
The Jigsaw Continuous Sensing Engine for Mobile Phone Applications! Hong Lu, Jun Yang, Zhigang Liu, Nicholas D. Lane, Tanzeem Choudhury, Andrew T. Campbell" CS Department Dartmouth College Nokia Research
More informationRecommendations Worth a Million
Recommendations Worth a Million An Introduction to Clustering 15.071x The Analytics Edge Clapper image is in the public domain. Source: Pixabay. Netflix Online DVD rental and streaming video service More
More informationElectric Guitar Pickups Recognition
Electric Guitar Pickups Recognition Warren Jonhow Lee warrenjo@stanford.edu Yi-Chun Chen yichunc@stanford.edu Abstract Electric guitar pickups convert vibration of strings to eletric signals and thus direcly
More informationApProgXimate Audio: A Distributed Interactive Experiment in Sound Art and Live Coding
ApProgXimate Audio: A Distributed Interactive Experiment in Sound Art and Live Coding Chris Kiefer Department of Music & Sussex Humanities Lab, University of Sussex, Brighton, UK. School of Media, Film
More informationAutomatic Text-Independent. Speaker. Recognition Approaches Using Binaural Inputs
Automatic Text-Independent Speaker Recognition Approaches Using Binaural Inputs Karim Youssef, Sylvain Argentieri and Jean-Luc Zarader 1 Outline Automatic speaker recognition: introduction Designed systems
More informationI. Cocktail Party Experiment Daniel D.E. Wong, Enea Ceolini, Denis Drennan, Shih Chii Liu, Alain de Cheveigné
I. Cocktail Party Experiment Daniel D.E. Wong, Enea Ceolini, Denis Drennan, Shih Chii Liu, Alain de Cheveigné MOTIVATION In past years at the Telluride Neuromorphic Workshop, work has been done to develop
More informationClassification of ships using autocorrelation technique for feature extraction of the underwater acoustic noise
Classification of ships using autocorrelation technique for feature extraction of the underwater acoustic noise Noha KORANY 1 Alexandria University, Egypt ABSTRACT The paper applies spectral analysis to
More informationEnvironmental Sound Recognition using MP-based Features
Environmental Sound Recognition using MP-based Features Selina Chu, Shri Narayanan *, and C.-C. Jay Kuo * Speech Analysis and Interpretation Lab Signal & Image Processing Institute Department of Computer
More informationCover Song Recognition Based on MPEG-7 Audio Features
Cover Song Recognition Based on MPEG-7 Audio Features Mochammad Faris Ponighzwa R, Riyanarto Sarno, Dwi Sunaryono Department of Informatics Institut Teknologi Sepuluh Nopember Surabaya, Indonesia ponighzwa13@mhs.if.its.ac.id,
More informationLearning to Unlearn and Relearn Speech Signal Processing using Neural Networks: current and future perspectives
Learning to Unlearn and Relearn Speech Signal Processing using Neural Networks: current and future perspectives Mathew Magimai Doss Collaborators: Vinayak Abrol, Selen Hande Kabil, Hannah Muckenhirn, Dimitri
More informationKnowledge discovery & data mining Classification & fraud detection
Knowledge discovery & data mining Classification & fraud detection Knowledge discovery & data mining Classification & fraud detection 5/24/00 Click here to start Table of Contents Author: Dino Pedreschi
More informationMobile Sensing: Opportunities, Challenges, and Applications
Mobile Sensing: Opportunities, Challenges, and Applications Mini course on Advanced Mobile Sensing, November 2017 Dr Veljko Pejović Faculty of Computer and Information Science University of Ljubljana Veljko.Pejovic@fri.uni-lj.si
More informationIntroducing COVAREP: A collaborative voice analysis repository for speech technologies
Introducing COVAREP: A collaborative voice analysis repository for speech technologies John Kane Wednesday November 27th, 2013 SIGMEDIA-group TCD COVAREP - Open-source speech processing repository 1 Introduction
More informationDimension Reduction of the Modulation Spectrogram for Speaker Verification
Dimension Reduction of the Modulation Spectrogram for Speaker Verification Tomi Kinnunen Speech and Image Processing Unit Department of Computer Science University of Joensuu, Finland Kong Aik Lee and
More informationIdentification of Woodpecker Species through Drumming
Gerard Gorman Identification of Woodpecker Species through Drumming J. Florentin O. Verlinden, T. Dutoit, F. Moiny, G. Kouroussis and P. Rasmont Symposium on Ecology and Acoustics June 16-18 2014 - Musée
More informationDiscriminative Training for Automatic Speech Recognition
Discriminative Training for Automatic Speech Recognition 22 nd April 2013 Advanced Signal Processing Seminar Article Heigold, G.; Ney, H.; Schluter, R.; Wiesler, S. Signal Processing Magazine, IEEE, vol.29,
More informationA Correlation-Maximization Denoising Filter Used as An Enhancement Frontend for Noise Robust Bird Call Classification
A Correlation-Maximization Denoising Filter Used as An Enhancement Frontend for Noise Robust Bird Call Classification Wei Chu and Abeer Alwan Speech Processing and Auditory Perception Laboratory Department
More informationAutomatic classification of traffic noise
Automatic classification of traffic noise M.A. Sobreira-Seoane, A. Rodríguez Molares and J.L. Alba Castro University of Vigo, E.T.S.I de Telecomunicación, Rúa Maxwell s/n, 36310 Vigo, Spain msobre@gts.tsc.uvigo.es
More informationHuman-Centered DESIGN PROMPTS for Emerging Technologies. 20 deliberations, considerations, and provocations
Human-Centered DESIGN PROMPTS for Emerging Technologies 20 deliberations, considerations, and provocations + Today s emerging technologies promise exciting new ways of engaging with our world and with
More informationLong Range Acoustic Classification
Approved for public release; distribution is unlimited. Long Range Acoustic Classification Authors: Ned B. Thammakhoune, Stephen W. Lang Sanders a Lockheed Martin Company P. O. Box 868 Nashua, New Hampshire
More informationAudio Fingerprinting using Fractional Fourier Transform
Audio Fingerprinting using Fractional Fourier Transform Swati V. Sutar 1, D. G. Bhalke 2 1 (Department of Electronics & Telecommunication, JSPM s RSCOE college of Engineering Pune, India) 2 (Department,
More informationGE 113 REMOTE SENSING
GE 113 REMOTE SENSING Topic 8. Image Classification and Accuracy Assessment Lecturer: Engr. Jojene R. Santillan jrsantillan@carsu.edu.ph Division of Geodetic Engineering College of Engineering and Information
More informationCONTEXT-AWARE COMPUTING
CONTEXT-AWARE COMPUTING How Am I Feeling? Who Am I With? Why Am I Here? What Am I Doing? Where Am I Going? When Do I Need To Leave? A Personal VACATION ASSISTANT Tim Jarrell Vice President & Publisher
More informationPACIFIC MAMMAL RESEARCH. Marine Mammal Research & Education
PACIFIC MAMMAL RESEARCH Marine Mammal Research & Education www.pacmam.org 1 OUR STORY Harbor porpoises are one of the smallest marine mammal residents of the Salish Sea region, yet by the 1990s they were
More informationUnderstanding Advanced Bluetooth Angle Estimation Techniques for Real-Time Locationing
Understanding Advanced Bluetooth Angle Estimation Techniques for Real-Time Locationing EMBEDDED WORLD 2018 SAULI LEHTIMAKI, SILICON LABS Understanding Advanced Bluetooth Angle Estimation Techniques for
More informationNeural Networks The New Moore s Law
Neural Networks The New Moore s Law Chris Rowen, PhD, FIEEE CEO Cognite Ventures December 216 Outline Moore s Law Revisited: Efficiency Drives Productivity Embedded Neural Network Product Segments Efficiency
More informationMachine Learning and Decision Making for Sustainability
Machine Learning and Decision Making for Sustainability Stefano Ermon Department of Computer Science Stanford University April 12 Overview Stanford Artificial Intelligence Lab Fellow, Woods Institute for
More informationBag-of-Features Acoustic Event Detection for Sensor Networks
Bag-of-Features Acoustic Event Detection for Sensor Networks Julian Kürby, René Grzeszick, Axel Plinge, and Gernot A. Fink Pattern Recognition, Computer Science XII, TU Dortmund University September 3,
More informationOn Intelligence Jeff Hawkins
On Intelligence Jeff Hawkins Chapter 8: The Future of Intelligence April 27, 2006 Presented by: Melanie Swan, Futurist MS Futures Group 650-681-9482 m@melanieswan.com http://www.melanieswan.com Building
More informationwww.ixpug.org @IXPUG1 What is IXPUG? http://www.ixpug.org/ Now Intel extreme Performance Users Group Global community-driven organization (independently ran) Fosters technical collaboration around tuning
More informationPassive Localization of Multiple Sources Using Widely-Spaced Arrays with Application to Marine Mammals
Passive Localization of Multiple Sources Using Widely-Spaced Arrays with Application to Marine Mammals L. Neil Frazer Department of Geology and Geophysics University of Hawaii at Manoa 1680 East West Road,
More informationMusic Genre Classification using Improved Artificial Neural Network with Fixed Size Momentum
Music Genre Classification using Improved Artificial Neural Network with Fixed Size Momentum Nimesh Prabhu Ashvek Asnodkar Rohan Kenkre ABSTRACT Musical genres are defined as categorical labels that auditors
More informationA Two-step Technique for MRI Audio Enhancement Using Dictionary Learning and Wavelet Packet Analysis
A Two-step Technique for MRI Audio Enhancement Using Dictionary Learning and Wavelet Packet Analysis Colin Vaz, Vikram Ramanarayanan, and Shrikanth Narayanan USC SAIL Lab INTERSPEECH Articulatory Data
More informationVoice Activity Detection for Speech Enhancement Applications
Voice Activity Detection for Speech Enhancement Applications E. Verteletskaya, K. Sakhnov Abstract This paper describes a study of noise-robust voice activity detection (VAD) utilizing the periodicity
More informationPassive Steady State RF Fingerprinting: A Cognitive Technique for Scalable Deployment of Co-channel Femto Cell Underlays
Passive Steady State RF Fingerprinting: A Cognitive Technique for Scalable Deployment of Co-channel Femto Cell Underlays Presenter: Irwin O. Kennedy, Bell Labs Ireland Patricia Scanlon: Bell Labs Ireland
More informationSpeech Processing. Simon King University of Edinburgh. additional lecture slides for
Speech Processing Simon King University of Edinburgh additional lecture slides for 2018-19 assignment Q&A writing exercise Roadmap Modules 1-2: The basics Modules 3-5: Speech synthesis Modules 6-9: Speech
More informationRobustness (cont.); End-to-end systems
Robustness (cont.); End-to-end systems Steve Renals Automatic Speech Recognition ASR Lecture 18 27 March 2017 ASR Lecture 18 Robustness (cont.); End-to-end systems 1 Robust Speech Recognition ASR Lecture
More informationMikko Myllymäki and Tuomas Virtanen
NON-STATIONARY NOISE MODEL COMPENSATION IN VOICE ACTIVITY DETECTION Mikko Myllymäki and Tuomas Virtanen Department of Signal Processing, Tampere University of Technology Korkeakoulunkatu 1, 3370, Tampere,
More informationTu SRS3 06 Wavelet Estimation for Broadband Seismic Data
Tu SRS3 06 Wavelet Estimation for Broadband Seismic Data E. Zabihi Naeini* (Ikon Science), J. Gunning (CSIRO), R. White (Birkbeck University of London) & P. Spaans (Woodside) SUMMARY The volumes of broadband
More informationENF ANALYSIS ON RECAPTURED AUDIO RECORDINGS
ENF ANALYSIS ON RECAPTURED AUDIO RECORDINGS Hui Su, Ravi Garg, Adi Hajj-Ahmad, and Min Wu {hsu, ravig, adiha, minwu}@umd.edu University of Maryland, College Park ABSTRACT Electric Network (ENF) based forensic
More informationExample Report Station Community Engagement Survey
Station Community Engagement Survey Report Prepared for: EXAMPLE REPORT INTRODUCTION About this Research The results shown in this report are based on the responses to the questionnaire that your station
More informationPerception of low frequencies in small rooms
Perception of low frequencies in small rooms Fazenda, BM and Avis, MR Title Authors Type URL Published Date 24 Perception of low frequencies in small rooms Fazenda, BM and Avis, MR Conference or Workshop
More informationDetecting proximity from personal audio recordings
Detecting proximity from personal audio recordings dpwe@ee.columbia.edu Dan Ellis, Hiroyuki Satoh, Zhuo Chen LabROSA, Columbia Univ., NY USA ICSI, Berkeley, CA, USA Morikawa lab, University of Tokyo, Tokyo,
More informationThe ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection
The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection Tomi Kinnunen, University of Eastern Finland, FINLAND Md Sahidullah, University of Eastern Finland, FINLAND Héctor
More informationComparing CSI and PCA in Amalgamation with JPEG for Spectral Image Compression
Comparing CSI and PCA in Amalgamation with JPEG for Spectral Image Compression Muhammad SAFDAR, 1 Ming Ronnier LUO, 1,2 Xiaoyu LIU 1, 3 1 State Key Laboratory of Modern Optical Instrumentation, Zhejiang
More informationMachine Learning for Computational Sustainability
Machine Learning for Computational Sustainability Tom Dietterich Oregon State University In collaboration with Dan Sheldon, Sean McGregor, Majid Taleghan, Rachel Houtman, Claire Montgomery, Kim Hall, H.
More informationSONG RETRIEVAL SYSTEM USING HIDDEN MARKOV MODELS
SONG RETRIEVAL SYSTEM USING HIDDEN MARKOV MODELS AKSHAY CHANDRASHEKARAN ANOOP RAMAKRISHNA akshayc@cmu.edu anoopr@andrew.cmu.edu ABHISHEK JAIN GE YANG ajain2@andrew.cmu.edu younger@cmu.edu NIDHI KOHLI R
More informationWildlife Census via LSH-based animal tracking APOORV PATWARDHAN
1 Wildlife Census via LSH-based animal tracking APOORV PATWARDHAN National Parks and wildlife conservation 2 Jim Corbett National Park, India Amboseli National Park, Kenya And many more The Challenge 3
More informationThe Art of Neural Nets
The Art of Neural Nets Marco Tavora marcotav65@gmail.com Preamble The challenge of recognizing artists given their paintings has been, for a long time, far beyond the capability of algorithms. Recent advances
More informationApplications of Music Processing
Lecture Music Processing Applications of Music Processing Christian Dittmar International Audio Laboratories Erlangen christian.dittmar@audiolabs-erlangen.de Singing Voice Detection Important pre-requisite
More informationA multi-class method for detecting audio events in news broadcasts
A multi-class method for detecting audio events in news broadcasts Sergios Petridis, Theodoros Giannakopoulos, and Stavros Perantonis Computational Intelligence Laboratory, Institute of Informatics and
More informationOn Feature Selection, Bias-Variance, and Bagging
On Feature Selection, Bias-Variance, and Bagging Art Munson 1 Rich Caruana 2 1 Department of Computer Science Cornell University 2 Microsoft Corporation ECML-PKDD 2009 Munson; Caruana (Cornell; Microsoft)
More informationenhanced room geometry optimization analyze this
enhanced room geometry optimization analyze this Stop putting up and pick up an ERGO A great mix starts with a great recording room. But when it comes to achieving the most accurate mix, your room may
More informationDEEP LEARNING ON RF DATA. Adam Thompson Senior Solutions Architect March 29, 2018
DEEP LEARNING ON RF DATA Adam Thompson Senior Solutions Architect March 29, 2018 Background Information Signal Processing and Deep Learning Radio Frequency Data Nuances AGENDA Complex Domain Representations
More informationExtended Touch Mobile User Interfaces Through Sensor Fusion
Extended Touch Mobile User Interfaces Through Sensor Fusion Tusi Chowdhury, Parham Aarabi, Weijian Zhou, Yuan Zhonglin and Kai Zou Electrical and Computer Engineering University of Toronto, Toronto, Canada
More informationEnvironmental Data Science, and its Transformative Potential. 5 th September 2017 Gordon Blair and Graham Dean
Environmental Data Science, and its Transformative Potential 5 th September 2017 Gordon Blair and Graham Dean Structure An Introduction to Environmental Data Science [GSB] Overview of some statistical
More information30 Minute Quick Setup Guide
30 Minute Quick Setup Guide Introduction. Many thanks for choosing to trial Zahara, our innovative Purchase Order and Invoice Management system for accounting departments. Below you will find a quick start
More informationA Spatiotemporal Approach for Social Situation Recognition
A Spatiotemporal Approach for Social Situation Recognition Christian Meurisch, Tahir Hussain, Artur Gogel, Benedikt Schmidt, Immanuel Schweizer, Max Mühlhäuser Telecooperation Lab, TU Darmstadt MOTIVATION
More informationGPU ACCELERATED DEEP LEARNING WITH CUDNN
GPU ACCELERATED DEEP LEARNING WITH CUDNN Larry Brown Ph.D. March 2015 AGENDA 1 Introducing cudnn and GPUs 2 Deep Learning Context 3 cudnn V2 4 Using cudnn 2 Introducing cudnn and GPUs 3 HOW GPU ACCELERATION
More informationE-Safety Newsletter. Bowmandale Primary School. Apps for Primary Age Children. Scratch Jr. Tynker. Lightbot: Code Hour. Apps and Age Ratings
Bowmandale Primary School E-Safety Newsletter Apps and Age Ratings Apps for Primary Age Children We would like to suggest some age appropriate apps, including some we use in school. They are all available
More informationAn Optimization of Audio Classification and Segmentation using GASOM Algorithm
An Optimization of Audio Classification and Segmentation using GASOM Algorithm Dabbabi Karim, Cherif Adnen Research Unity of Processing and Analysis of Electrical and Energetic Systems Faculty of Sciences
More informationPerformance of Specific vs. Generic Feature Sets in Polyphonic Music Instrument Recognition
Performance of Specific vs. Generic Feature Sets in Polyphonic Music Instrument Recognition Igor Vatolkin 1, Anil Nagathil 2, Wolfgang Theimer 3, Rainer Martin 2 1 ChairofAlgorithmEngineering, TU Dortmund
More informationTechnical Writers Working with a Contract Staffing Agency
Technical Writers Working with a Contract Staffing Agency What s the Big Deal? Contract Technical Writing can be a wonderful and rewarding career. The opportunities for writers seem limitless since they
More informationDATA CHALLENGES AND RAMPS
DATA CHALLENGES AND RAMPS BALÁZS KÉGL LAL / CNRS ALEXANDRE GRAMFORT LTCI / Telecom ParisTech ISABELLE GUYON LRI / UPSud AKIN KAZAKCI Ecole des Mines CAMILLE MARINI LTCI / CNRS MEHDI CHERTI LAL / CNRS 1
More informationOutline. Tracking with Unreliable Node Sequences. Abstract. Outline. Outline. Abstract 10/20/2009
Tracking with Unreliable Node Sequences Ziguo Zhong, Ting Zhu, Dan Wang and Tian He Computer Science and Engineering, University of Minnesota Infocom 2009 Presenter: Jing He Abstract This paper proposes
More informationCROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS. Kuan-Chuan Peng and Tsuhan Chen
CROSS-LAYER FEATURES IN CONVOLUTIONAL NEURAL NETWORKS FOR GENERIC CLASSIFICATION TASKS Kuan-Chuan Peng and Tsuhan Chen Cornell University School of Electrical and Computer Engineering Ithaca, NY 14850
More informationATIS Briefing March 21, 2017 Economic Critical Infrastructure and its Dependence on GPS.
ATIS Briefing March 21, 2017 Economic Critical Infrastructure and its Dependence on GPS. Briefing question: If it s critical, then why isn t it uniformly monitored to detect bad actor jamming and spoofing
More informationEmotion Recognition from Decision Level Fusion of Visual and Acoustic Features using Hausdorff Classifier
Emotion Recognition from Decision Level Fusion of Visual and Acoustic Features using Hausdorff Classifier H.D.Vankayallapati 1, K.R.Anne 2, and K. Kyamakya 1 1 Institute of Smart System Technologies, Transportation
More informationFig.1 AR as mixed reality[3]
Marker Based Augmented Reality Application in Education: Teaching and Learning Gayathri D 1, Om Kumar S 2, Sunitha Ram C 3 1,3 Research Scholar, CSE Department, SCSVMV University 2 Associate Professor,
More informationPhase 1 US Compliance Report
Implementation of Regulatory Information Submission Standards (IRISS) ectd Tool Interoperability Group (ETIG) ectd Tool Interoperability and Compliance Study 3 (ETICS 3) ETICS 15 April 2011 Implementation
More informationInteraction Design -ID. Unit 6
Interaction Design -ID Unit 6 Learning outcomes Understand what ID is Understand and apply PACT analysis Understand the basic step of the user-centred design 2012-2013 Human-Computer Interaction 2 What
More informationConvergence and coevolution Business Ecosystems. Digital Ecosystems
CITY HALL OF PARIS - 9 & 10 November 2006 The Digital Convergence Towards a More Competitive, Mobile and Inclusive Knowledge-Based Society Convergence and coevolution Business Ecosystems and Digital Ecosystems
More informationAn Improved Voice Activity Detection Based on Deep Belief Networks
e-issn 2455 1392 Volume 2 Issue 4, April 2016 pp. 676-683 Scientific Journal Impact Factor : 3.468 http://www.ijcter.com An Improved Voice Activity Detection Based on Deep Belief Networks Shabeeba T. K.
More informationTechnology in Action. Alan Evans Kendall Martin Mary Anne Poatsy. Eleventh Edition. Copyright 2015 Pearson Education, Inc.
Technology in Action Alan Evans Kendall Martin Mary Anne Poatsy Eleventh Edition Technology in Action Chapter 1 Using Technology to Change the World Topics Technology on the World Stage Technology and
More informationSeparating Voiced Segments from Music File using MFCC, ZCR and GMM
Separating Voiced Segments from Music File using MFCC, ZCR and GMM Mr. Prashant P. Zirmite 1, Mr. Mahesh K. Patil 2, Mr. Santosh P. Salgar 3,Mr. Veeresh M. Metigoudar 4 1,2,3,4Assistant Professor, Dept.
More informationPléiades potentialities :
GT2 Risque et Aide humanitaire Pléiades potentialities : Assessment of clearing levels for operational management of forest fires in the Maures massif Marechal D., Thierion V., Kabar B., Ayral P.-A., Salze
More informationEnvironmental Noise Management Sofia December Brüel & Kjær S&V Torben Munk
Environmental Noise Management Sofia December 2006 Brüel & Kjær S&V Torben Munk Environmental Noise Management Why? Filename Imagine a city today Increasing traffic flows Bad conditions of road surfaces
More informationINTRODUCTION TO DEEP LEARNING. Steve Tjoa June 2013
INTRODUCTION TO DEEP LEARNING Steve Tjoa kiemyang@gmail.com June 2013 Acknowledgements http://ufldl.stanford.edu/wiki/index.php/ UFLDL_Tutorial http://youtu.be/ayzoubkuf3m http://youtu.be/zmnoatzigik 2
More informationNon-Data Aided Doppler Shift Estimation for Underwater Acoustic Communication
Non-Data Aided Doppler Shift Estimation for Underwater Acoustic Communication (Invited paper) Paul Cotae (Corresponding author) 1,*, Suresh Regmi 1, Ira S. Moskowitz 2 1 University of the District of Columbia,
More informationOpen-source AR platform for the future
DAQRI ARToolKit 6/Open Source Open-source AR platform for the future Phil Oxford Brookes University 2017-01 ARToolKit 6: Future AR platform Tools Frameworks Tracking and localisation Tangible user interaction
More informationDigital Birding Resources
Digital Birding Resources Introduction It may feel a bit intimidating to see the word digital associated with birding. The term digital is so widely used and encompasses so many different kinds of evolving
More informationAFRL. Technology Directorates AFRL
Sensors Directorate and ATR Overview for Integrated Fusion, Performance Prediction, and Sensor Management for ATE MURI 21 July 2006 Lori Westerkamp Sensor ATR Technology Division Sensors Directorate Air
More informationActive and Passive Acoustic Detection, Classification and Recognition with the Hopkins Acoustic Surveillance Unit (HASU)
Active and Passive Acoustic Detection, Classification and Recognition with the Hopkins Acoustic Surveillance Unit (HASU) Andreas G. Andreou Electrical and Computer Engineering and Center for Language and
More informationVolume 2, Number 3 Technology, Economy, and Standards October 2009
Volume 2, Number 3 Technology, Economy, and Standards October 2009 Editor Jeremiah Spence Guest Editors Yesha Sivan J.H.A. (Jean) Gelissen Robert Bloomfield Reviewers Aki Harma Esko Dijk Ger van den Broek
More informationDesigning the Smart Foot Mat and Its Applications: as a User Identification Sensor for Smart Home Scenarios
Vol.87 (Art, Culture, Game, Graphics, Broadcasting and Digital Contents 2015), pp.1-5 http://dx.doi.org/10.14257/astl.2015.87.01 Designing the Smart Foot Mat and Its Applications: as a User Identification
More informationMel Spectrum Analysis of Speech Recognition using Single Microphone
International Journal of Engineering Research in Electronics and Communication Mel Spectrum Analysis of Speech Recognition using Single Microphone [1] Lakshmi S.A, [2] Cholavendan M [1] PG Scholar, Sree
More informationFILE / GUITAR THEORY FOR DUMMIES
02 December, 2018 FILE / GUITAR THEORY FOR DUMMIES Document Filetype: PDF 340.31 KB 0 FILE / GUITAR THEORY FOR DUMMIES Learn the fundamental concepts of Music Theory with this For Dummies guide. Get this
More informationCommunity-as-a-Service: Data Validation in Citizen Science
Community-as-a-Service: Data Validation in Citizen Science Yurong He and Andrea Wiggins College of Information Studies, University of Maryland College Park, MD, USA {yrhe,wiggins}@umd.edu Abstract. Currently,
More informationBlack. LWECS Site Permit. Stearns County. Permit Section:
PERMIT COMPLIANCE FILING Permittee: Permit Type: Project Location: Docket No: Permit Section: Date of Submission : Black Oak Wind,, LLC LWECS Site Permit Stearns County IP6853/WS-10-1240 and IP6866/WS-11-831
More informationAI Frontiers. Dr. Dario Gil Vice President IBM Research
AI Frontiers Dr. Dario Gil Vice President IBM Research 1 AI is the new IT MIT Intro to Machine Learning course: 2013 138 students 2016 302 students 2017 700 students 2 What is AI? Artificial Intelligence
More informationAdvanced audio analysis. Martin Gasser
Advanced audio analysis Martin Gasser Motivation Which methods are common in MIR research? How can we parameterize audio signals? Interesting dimensions of audio: Spectral/ time/melody structure, high
More informationTumbleweed Express: A Tale of 54 Game Jams
Tumbleweed Express: A Tale of 54 Game Jams Matthew Louis Mauriello Project Manager (@mattm401) IGDA DC Chapter Meeting June 28 th, 2016 (@IGDA_DC) INTRODUCTION Matthew Louis Mauriello PhD Student, Comp
More information