MURDOCH RESEARCH REPOSITORY

Similar documents
MURDOCH RESEARCH REPOSITORY

Long Range Acoustic Classification

Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition

Sonar Signal Classification using Neural Networks

AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND TRANSFER FUNCTIONS

Electric Guitar Pickups Recognition

MATLAB/GUI Simulation Tool for Power System Fault Analysis with Neural Network Fault Classifier

IBM SPSS Neural Networks

CHAPTER 8: EXTENDED TETRACHORD CLASSIFICATION

Noise estimation and power spectrum analysis using different window techniques

Chapter 3 Data and Signals 3.1

Detection and Classification of Power Quality Event using Discrete Wavelet Transform and Support Vector Machine

Chapter 4 SPEECH ENHANCEMENT

Improved Detection by Peak Shape Recognition Using Artificial Neural Networks

A Spread Spectrum Network Analyser

CLASSIFICATION OF POWER QUALITY DISTURBANCES USING WAVELET TRANSFORM AND S-TRANSFORM BASED ARTIFICIAL NEURAL NETWORK

Keywords: Power System Computer Aided Design, Discrete Wavelet Transform, Artificial Neural Network, Multi- Resolution Analysis.

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Phase Jitter in MPSK Carrier Tracking Loops: Analytical, Simulation and Laboratory Results

Fibre Laser Doppler Vibrometry System for Target Recognition

COMPUTATIONAL RHYTHM AND BEAT ANALYSIS Nicholas Berkner. University of Rochester

SIGNAL PROCESSING OF POWER QUALITY DISTURBANCES

AN ANN BASED FAULT DETECTION ON ALTERNATOR

Tadeusz Stepinski and Bengt Vagnhammar, Uppsala University, Signals and Systems, Box 528, SE Uppsala, Sweden

280 IEEE TRANSACTIONS ON POWER DELIVERY, VOL. 23, NO. 1, JANUARY 2008

Waveform Multiplexing using Chirp Rate Diversity for Chirp-Sequence based MIMO Radar Systems

Partial Discharge Classification Using Acoustic Signals and Artificial Neural Networks

A Novel Technique for Automatic Modulation Classification and Time-Frequency Analysis of Digitally Modulated Signals

Roberto Togneri (Signal Processing and Recognition Lab)

Signals A Preliminary Discussion EE442 Analog & Digital Communication Systems Lecture 2

IDENTIFICATION OF POWER QUALITY PROBLEMS IN IEEE BUS SYSTEM BY USING NEURAL NETWORKS

Speech/Music Change Point Detection using Sonogram and AANN

Frequency Domain Representation of Signals

TRANSIENT STABILITY ENHANCEMENT OF POWER SYSTEM USING INTELLIGENT TECHNIQUE

Acoustic signal processing via neural network towards motion capture systems

Learning New Articulator Trajectories for a Speech Production Model using Artificial Neural Networks

Report 3. Kalman or Wiener Filters

High-speed Noise Cancellation with Microphone Array

Voice Activity Detection

Dwt-Ann Approach to Classify Power Quality Disturbances

Classification of Analog Modulated Communication Signals using Clustering Techniques: A Comparative Study

Wavelet Transform for Classification of Voltage Sag Causes using Probabilistic Neural Network

A Correlation-Maximization Denoising Filter Used as An Enhancement Frontend for Noise Robust Bird Call Classification

Seam position detection in pulsed gas metal arc welding

TCM-coded OFDM assisted by ANN in Wireless Channels

EXAMINATION FOR THE DEGREE OF B.E. Semester 1 June COMMUNICATIONS IV (ELEC ENG 4035)

IEEE Transactions on Power Delivery. 15(2) P.467-P

A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron

Industrial computer vision using undefined feature extraction

Chapter 2: Signal Representation

Acoustic resolution. photoacoustic Doppler velocimetry. in blood-mimicking fluids. Supplementary Information

A Parametric Model for Spectral Sound Synthesis of Musical Sounds

D DAVID PUBLISHING. 1. Introduction

DIAGNOSIS OF STATOR FAULT IN ASYNCHRONOUS MACHINE USING SOFT COMPUTING METHODS

ARTIFICIAL NEURAL NETWORK BASED CLASSIFICATION FOR MONOBLOCK CENTRIFUGAL PUMP USING WAVELET ANALYSIS

CHAPTER 1 INTRODUCTION

Real-time fundamental frequency estimation by least-square fitting. IEEE Transactions on Speech and Audio Processing, 1997, v. 5 n. 2, p.

Characterization of Voltage Dips due to Faults and Induction Motor Starting

Frequency Hopping Spread Spectrum Recognition Based on Discrete Fourier Transform and Skewness and Kurtosis

A COMPARISON OF ARTIFICIAL NEURAL NETWORKS AND OTHER STATISTICAL METHODS FOR ROTATING MACHINE

Wheel Health Monitoring Using Onboard Sensors

Advanced Signal Processing and Digital Noise Reduction

6/20/2012 ACORN ACORN ACORN ACORN ACORN ACORN. Arnstein Prytz. Australian Coastal Ocean Radar Network (ACORN)

NAVAL POSTGRADUATE SCHOOL THESIS

Signal Processing Toolbox

Modulation Classification of Satellite Communication Signals Using Cumulants and Neural Networks

Carrier Frequency Offset Estimation in WCDMA Systems Using a Modified FFT-Based Algorithm

Characterization of Voltage Sag due to Faults and Induction Motor Starting

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

HD Radio FM Transmission. System Specifications

AUTOMATIC MODULATION RECOGNITION OF COMMUNICATION SIGNALS

Gammatone Cepstral Coefficient for Speaker Identification

DYNAMIC BEHAVIOR MODELS OF ANALOG TO DIGITAL CONVERTERS AIMED FOR POST-CORRECTION IN WIDEBAND APPLICATIONS

Publication P IEEE. Reprinted with permission.

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Voice Recognition Technology Using Neural Networks

Electronic disguised voice identification based on Mel- Frequency Cepstral Coefficient analysis

Rhythmic Similarity -- a quick paper review. Presented by: Shi Yong March 15, 2007 Music Technology, McGill University

Speech Enhancement Using Spectral Flatness Measure Based Spectral Subtraction

Statistical Tests: More Complicated Discriminants

A multi-class method for detecting audio events in news broadcasts

Self Localization Using A Modulated Acoustic Chirp

A Simplified Extension of X-parameters to Describe Memory Effects for Wideband Modulated Signals

Discrete Fourier Transform (DFT)

Speech Enhancement in Presence of Noise using Spectral Subtraction and Wiener Filter

Automatic Fault Classification of Rolling Element Bearing using Wavelet Packet Decomposition and Artificial Neural Network

Decriminition between Magnetising Inrush from Interturn Fault Current in Transformer: Hilbert Transform Approach

INTRODUCTION TO RADAR SIGNAL PROCESSING

ECE 3500: Fundamentals of Signals and Systems (Fall 2014) Lab 4: Binary Phase-Shift Keying Modulation and Demodulation

Hand & Upper Body Based Hybrid Gesture Recognition

Sound pressure level calculation methodology investigation of corona noise in AC substations

A Spectral Conversion Approach to Single- Channel Speech Enhancement

Applications of Music Processing

Simplified, high performance transceiver for phase modulated RFID applications

International Journal of Digital Application & Contemporary research Website: (Volume 1, Issue 7, February 2013)

Human detection by neural networks using a low-cost short-range Doppler radar sensor

DETECTION AND CLASSIFICATION OF POWER QUALITY DISTURBANCES

A Novel Fuzzy Neural Network Based Distance Relaying Scheme

Prediction of airblast loads in complex environments using artificial neural networks

Band- Pass ΣΔ Architectures with Single and Two Parallel Paths

Transcription:

MURDOCH RESEARCH REPOSITORY http://dx.doi.org/10.1109/kes.1999.820143 Zaknich, A. and Attikiouzel, Y. (1999) The classification of sheep and goat feeding phases from acoustic signals of jaw sounds. In: Third International Conference on Knowledge-Based Intelligent Information Engineering Systems, 31 August - 1 September, Adelaide, Australia, pp 158 161. http://researchrepository.murdoch.edu.au/18048/ Copyright 1999 IEEE Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

1999 Third International Conference on Knowledge-Based Intelligent Information Engineeing Systems, 3 I" Aug-1' Sept 1999, Adelaide, Australia The Classification of Sheep and Goat Feeding Phases from Acoustic Signals of Jaw Sounds Anthony Zaknich and Yianni Attikiouzel Centre for Intelligent Information Processing Systems (CIIPS), Department of Electrical and Electronic Engineering, The University of Western Australia, Nedlands 6907, Western Australia Keywords - Time and frequency domain features, Rumination, Mastication, Probabilistic Neural Network, Multi-layer Perceptron. Abstract - This paper describes and documents investigatory work for the detection and measurement of sheep rumination and mastication time periods from jaw sounds transmitted through the skull. The rumination and mastication time periods were determined by a neural network classifier using a combination of time and frequency domain features extracted from successive 10 second acoustic signal lengths. It is shown that spectral features contain most of the information required for good classification. 1. Introduction Animal nutritionists need accurate data on the time spent by ruminants which include sheep, goats and cattle in mastication and rumination phases over long periods. This data can be gathered by humans manually by listening to recordings of jaw movements. However, this is a tedious task which is very time consuming and subject to error. Consequently, an accurate automated system is required to process the large amounts of acoustic data with minimum error. The main processing task is the ability to classify time periods according to the three main feeding phases of, mastication and rumination. A suitable classifier for this problem is an Artificial Neural Network (ANN) fed by relevant time domain and frequency domain features extracted from the jaw sounds. Investigations based on preliminary data gathered from a Tasmanian study [l] and other studies showed that it is feasible to achieve the required rumination and mastication classifications using frequency spectral information over ten-second blocks of time. An undergraduate student project ("Classification of Sheep Feeding Phases", Michael Dragojevic 1994, Department of Electrical and Electronics Engineering, The University of Westem Australia) showed that adequate classification was also achievable using the spectral information in a single chew. This work did eventually result in the development of a real-time system for the characterisation of sheep feeding phases [2]. 2. Frequency Domain Features The initial study was conducted with data supplied from [l]. The data consisted of nine minutes and 42 seconds of audio cassette recording of a sheep B ruminating (146 seconds), a goat R ruminating (155 seconds), a goat P (134 seconds) and a sheep A (147 seconds). After initial investigation it was found there was little significant signal energy in the recording above 1,680 Hz so the recordings were digitised at a sample rate of 3,360 Hz with 14 bit precision. Independent signal records of 33,792 points each were taken from the four categories to build up data for category samples. Each record represented 10.057 seconds of continuous recording and it was subdivided into 33 contiguous frames of 1,024 points each. Examples of sheep mastication and ruminating records are shown in Figures 1 and 2. F11.18.byr R,lIU., 20392 z.0-.i-, y.litnl F-YS 3.- m lrulul a m Figure 1. Sheep Mastication Time Plot Figure 2. Sheep Ruminating Time Plot Feature vectors for category classification were derived from the power spectral estimate (PSE) of each 33,792 point record. Firstly, 66 1,024 point frames were Blackman windowed to taper the values at the beginning and end of the frame to zero. The 66 frames were taken progressively through the record, each having a 5 12 point overlap with the previous frame. A 1,024 point 0-7803-5578-4/99/$10.0001999 IEEE 158

1999 Third International Conference on Knowledge-Based Intelligent Information Engineeing Systems, 31 Aug-IU Sept 1999, Adelaide, Australia Fast Fourier Transform (FFT) was performed on each frame. The magnitude squared values of each frame were averaged over the 66 frames to produce a PSE of 512 points spanning 0 to 1,680 Hz. Each point in the PSE represented a 3.28125 Hz frequency band. Next, the square root of the PSE was performed to produce the FFT magnitude which was divided into nine equally spaced frequency bands of 167.34375 Hz each. This choice was based on visual inspection of the PSE plots which indicated that most of these bands had differing values for the four categories. Finally, the frequency values in each band were averaged and used to create a normalised nine dimensional feature vector for category classification. The Probabilistic Neural Network (PNN) [3] was used to perform the classification task because it performs well with relatively few independent feature vectors and it is quick and easy to train. For initial testing the following numbers of feature vectors for each of the four categories were extracted from relatively noiseless parts of the recording: Class 1, Sheep B ruminating Class 2, Goat R ruminating Class 3, Sheep P Class 4, Goat A - 9-12 - 8-4 Due to the relatively small numbers of feature vectors (33) the PNN was used in the holdout test mode to maximise the results. In the holdout test mode each vector is taken out of the vector set in turn and tested against the remainder. The PNN achieved 100% classification accuracy for sigma between 0.014 and 0.07 1. In the next sample set some mostly noisy samples were added to the relatively noiseless samples above. The main type of noises were the banging of metal feed trays and occasional animal bleating during the mastication phases only. The rumination recordings had no loud noises in them except for occasional low level metal feed tray banging. The numbers of feature vectors were as follows: Class 1, Sheep B ruminating Class 2, Goat R ruminating Class 3, Sheep P Class 4, Goat A - 1 1-12 - 11-12 Once again the PNN was used in the holdout test mode and it achieved 100% classification for sigma between 0.028 and 0.049. These results showed that it was possible to positively identify sheep B ruminating, goat R ruminating, sheep P and. goat A from the audio recordings. The preprocessing and classification method described above was able to effectively perform the discrimination even when significant feed bin banging and bleating noises are mixed with feeding sounds. Dragojevic used an Autoregressive Power Spectral Density (APSD) estimation technique to extract the spectral features from each individual chew signal of 512 points sampled at a rate of 3,360 Hz. He used 12 spectral bins to create the feature vectors and tested various neural network classifiers. All data were taken from a single sheep fed clover. The training data set consisted of the following numbers of feature vectors: Class 1, Sheep ruminating chew Class 2, Sheep chew Class 3, Random Gaussian noise - 175-175 - 46 The testing data set consisted of samples of: Class 1, Sheep ruminating chew Class 2, Sheep chew Class 3, Random Gaussian noise After training he achieved a 93.0% accuracy on the testing set using a PNN classifier. Both these preliminary studies provided enough confidence in the belief that there is sufficient spectral information in the chewing sounds to achieve satisfactory discrimination between sheep mastication and rumination phases. 3. Bandwidth Filtering Following these preliminary classification tests more extensive field trials were conducted where a large amount of data were gathered. This data included jaw sounds from various sheep exposed to a number of different pastures including clover-dominant pasture and mixed pastures of clover and grasses. During the field trials there were a number of different noise signals that were mixed in with the jaw sound signals, including: 1) Sheep noises, eg. bleating, jumping, rubbing. 2) Environmental, eg. wind and rain, aeroplane, vehicle and machinery, human voices and movement, and wild life. 3) Electrical, eg. static, radio signal break in and break out, radio signal whistle and howl. Due to this large range of background noise it was decided to bandpass filter the signal in a band between seven Hz and 212 Hz. This bandwidth represents the frequency band where most of the information related to chewing is found and which excludes many of the background noises and interferences. After the bandpass filter the signals were digitised via a 12 bit Analog to Digital Converter (ADCs) sampling at 400 Hz. 159

1999 Third Intemational Conference on Knowledge-Based Intelligent Information Engineeing Systems, 31" Aug-I" Sept 1999, Adelaide, Australia 4. Time and Frequency Domain Features Although it seemed possible to develop an adequate classification system based totally on frequency domain information it was considered that adding time domain information may improve the performance. There are very clear and relevant time domain features related to the chewing intervals and chewing frequency as well as chewing strength. Therefore, it was decided to construct a classification feature vector which was composed of chewing period information, chewing energy and frequency spectrum information. There were a total of seven chewing period features, five chewing energy features and eight frequency spectrum features selected. These were used to form a twenty one coefficient vector x as follows: Chewing Period Features within 10 Second Blocks Chew pulse period statistics were computed only if there were more than 2 chew pulses in the 10 signal block. x[ 11 = (no. of chews / 30.0), if max chew period f 0 x[2] = (mean chew period / max chew period), if mean chew period f 0 x[3] = (median chew period / mean chew period ) x[4] = (0.1 SD of chew periods / mean chew period) x[5] = (min chew period / mean chew period ) x[6] = (mean chew period / 20.0), if max chew period f 0 x[7] = (mean chew periodmaximum chew period) Chewing Energy Features within 10 Second Blocks If standard deviation (SD) of chew energy z 0 then: x[8] = (10.0 x mean chew energy / SD of chew energy), if mean chew energy f 0 x[9] = (10.0 x minimum chew energy / mean chew energy) x[ 101 = (mean chew energy/2000.0), if maximum chew energy pulse height f 0 x[l1] = (mean chew energy pulse height / maximum chew energy pulse height), if SD of chew energy pulse heights f 0 4121 = mean chew energy pulse height / SD of chew energy pulse heights ), if mean chew energy pulse height f 0 x[13] = (minimum chew energy pulse height / mean chew energy pulse height ) Spectral Features within 10 Second Blocks x[ 141 = (band 1 (0-25 Hz) / max. band energy) x[15] = (band 2 (25-50 Hz) / max. band energy) x[ 161 = (band 3 (50-75 Hz) / max. band energy) x[ 171 = (band 4 (75-100 Hz) / max. band energy) x[ 181 = (band 5 (100-125 Hz) / max. band energy) x[ 191 = (band 6 (125-150 Hz) / max. band energy) 4201 = (band 7 (150-175 Hz) / max. band energy) x[2 11 = (band 8 (1 75-200 Hz) / max. band energy) 5. Classifier Test Results A number of classifier tests were performed using the 2 1 time and frequency domain features defined above. The first neural network classifier which was used to categorise the feeding phases was a Multi-layer Perceptron (MLP) with 21 input, 7 hidden and 3 output nodes. It required 50,000 training iterations at a gain setting of 0.001 and momentum setting of 0.0001 to achieve a satisfactory classifier performance. The MLP (21-7-3) neural network classifier was trained with a wide range of data taken from various field trials and different sheep fed clover-dominant pasture as well as mixed clover and grass pastures. The resulting category confusion matrices for independent training and testing data sets were as shown in Tables I and 11. 0 753 68 1 105 1169 I ruminating I I I I 94.19% training accuracy true ruminating TABLE I I I I ruminating I 900 I 0 I 0 TABLE I1 ruminating 900 0 0 0 742 79 2 117 1156 ~~~ 93.39% testing accuracy The same data with all 21 features was tested using a PNN classifier. Table I11 shows the resulting category confbsion matrix and classification accuracy. The ts specified in The tables 111, IV and V is the single optimising parameter which is chosen during PNN training. It represents the common bandwidth of Gaussian radial basis functions centred at each and every training vector in the network structure. The PNN works by performing nonparametric feature vector density estimation through the use of a common bandwidth (0) radial kernel. If there are enough training vectors for each class the network is able to provide a very good density estimate for each class. Thus the PNN approximates a Bayesian classifier as the number of training vectors increases and the bandwidth reduces. The only real disadvantage of this is that the network 160

1999 Third International Conference on Knowledge-Based Intelligent Information Engineeing Systems, 3 1' Aug- I" Sept 1999, Adelaide, Australia size becomes a direct function of the number of training vectors. TABLE VI 900 0 restin ruminating 0 masticatin ruminatin 93.39% testing accuracy, o= 0.124 The classification results achieved with the MLP and PNN classifiers for all 2 1 features were very similar. Next, the data with only the first seven chewing period features was tested using a PNN classifier. Table IV shows the resulting category confusion matrix and classification accuracy. ruminating 900 0 0 0 761 60 2 129 1144 ruminating I I I I 93.62% testing accuracy, o = 0.05 The results using only the chewing period features were comparable with that of all twenty one features. It was subsequently discovered that the chewing energy features, features eight to thirteen, were not overly effective. They alone produced a classification accuracy of only 80.94%. However, the spectral features alone, features fourteen to twenty one, produced very good results as shown in Table V. restin TABLE V restin masticatin ruminatin masticatin 95 1178 I ruminating I I I I 96.06% testing accuracy, o = 0.004 The accuracy, 96.06%, using spectral features in the ten second blocks was higher than that achieved by Dragojevic, 93.0%, using spectral features of individual chews only. Clearly, the frequency information related to the chewing sequences added vital classification information as one would expect. Combining the chewing period and spectral features produced the results shown in Table VI. 95.43% testing accuracy, o = 0.05 Adding the spectral features to the chewing period features strengthened the discrimination achieved with the chewing period features alone. For all combinations of features tested the accuracy of mastication detection was slightly better than rumination detection. 6. Conclusions The tests performed in this experiment indicate that spectral features, within a frequency band of seven Hz to 2 12 Hz in 10 second signal blocks, alone give very good discrimination between mastication and rumination. This is very convenient as spectral features are relatively easy to implement in a real-time system. However, in a real system it would also be necessary to identify the individual chews for adequate feeding statistics analysis. It is also possible to perform adequate discrimination between mastication and rumination using features extracted from chewing period features alone. Therefore it is possible to design a system based on either chewing period features or both chewing period and spectral features. Acknowledgments The data for this work was supplied by the Division of Animal Production, CSIRO Australia in Floreat Park, Western Australia. Special thanks to Barrie Purser, Sue Baker, Robyn Dynes and Louis Klein from CSIRO for their involvement and assistance. References Reynolds, Linda, "A Comparison of Rumination in Goats and Sheep", Honours Thesis for Bachelor of Agricultural Science, The University of Tasmania, February 1990. Zaknich, A. and Baker, S. K., "A real-time system for the characterisation of sheep feeding phases from acoustic signals of jaw sounds", Australian Journal of Intelligent Information Processing Systems (AJIIPS), Vol. 5, No. 2, Winter 1998, pp. 103-1 10. Specht, D. F., "Probabilistic neural networks and the polynomial ADALME as complementary techniques for classification", IEEE Transactions on Neural.Networks, Vol. I, No 1, March 1990, pp. 11 1-121. 161