IAENG International Journal of Computer Science, :, IJCS Examination of Single Wavelet-Based s of EHG Signals for Preterm Birth Classification Suparerk Janjarasjitt, Member, IAENG, Abstract In this study, wavelet-based features of electrohysterogram (EHG) quantifying the electrical activity of uterine muscles are applied for preterm birth classification. EHG has been shown to provide useful information for uterine contraction that leads to the anticipation of delivery. A waveletbased feature referred to as l in this study is determined from adifferencebetweenthelogarithmsofvariancesofdetailcoefficients of EHG data corresponding to two consecutive levels, i.e., l and l +.Performanceonpretermbirthclassifications using single wavelet-based features of EHG data is examined. Asimplethresholdingtechniqueisappliedforpretermbirth classifications. The leave-one-out cross validation is used to validate the performance on preterm birth classifications. From the computational results, it is shown that the wavelet-based features of EHG can provide a reasonable performance on preterm birth classification with the accuracy, the sensitivity and the specificity of.799,.68 and.7, respectively. Index Terms electrohysterogram, preterm birth, pregnancy, wavelet analysis, classification I. INTRODUCTION The discrete wavelet transform is one of the most significant computational tool that has been applied to various applications including biomedical signal processing. The discrete wavelet transform is a natural tool for characterizing self-similar signals [], []. The derivation of the discrete wavelet transform for the representation of /f processes [], [] makes the discrete wavelet transform can be further applied into the fields of fractals and complex systems analysis. Recently, computational tools and techniques applied for complex systems analysis have been widely applied to applications in biology and medicine []. One of those various applications in biology and medicine is cardiology where a number of measures derived from concepts of complex systems analysis applied to characterize heart rate variability (HRV). In Ref. [], the wavelet-based approach is applied to extract features of RR interval data for congestive heart failure classification. Such wavelet-based feature of RR interval data is demonstrated to be potentially a good feature for the congestive heart failure classification []. The similar wavelet-based approach is also applied for examining the characteristics of epileptic ECoG signals in Ref. []. In this study, the wavelet-based approach examined in Ref. [] is applied to the so-called electrohysterogram (EHG) data for preterm birth classification. The EHG data correspond to the activity of uterine muscles [6] whereas the activitiy of uterine muscles of pregnant women may be used to predict the This work is supported by the Higher Education Research Promotion and National Research University Project of Thailand, Office of the Higher Education Commission. S. Janjarasjitt is with the Department of Electrical and Electronic Engineering, Ubon Ratchathani University, Ubon Ratchathani, 9 Thailand e-mail: suparerk.j@ubu.ac.th premature onset of labor [7], [8]. Preterm birth has been one of the most important issues worldwide. Prematurity is the leading cause of newborn deaths [9]. An estimated million babies are born preterm every year [9]. In addition, this number is rising [9]. The preterm prediction and detection are therefore essential as they can help prevent preterm birth. Performance on preterm birth classifications using single wavelet-based features of EHG data extracted using the wavelet-based approach [] is examined. The rest of this paper is organized as follows. Section II presents data, wavelet-based feature extraction, and experimental setup. Section III details and discusses the computational results on preterm birth classifications. Finally, Section IV summarizes and concludes the paper. II. METHODS A. Data and Subjects The electrohysterogram (EHG) data examined in this study is obtained from the Term-Preterm EHG Database (TPEHGDB) on PhysioNet (available online at http://www.physionet.org/physiobank/database/tpehgdb/). The TPEHGDB contain recordings of EHG data collected from 997 until 6 at the Department of Obstetrics and Gynecology, Medical Centre Ljubljana, Ljubljana [6], []. The EHG data were recorded from a general population of pregnant women during regular check-ups either around the nd week of gestation or around the nd week of gestation [6], []. The EHG data were recorded using the sampling frequency of Hz. Each EHG recording is composed of three channels, referred to as,,and,whichwererecordedfromelectrodes placed around the navel [6], []. All three hundred EHG recordings are classified into four s according to their corresponding time of delivery (either term or preterm birth) and time of recordings (either early or later period of pregnancy). EHG recordings are classified as term birth if the delivery was after the 7th week of gestation; and preterm birth otherwise. EHG recordings are classified as early period if they were recorded before the 6th week of gestation; and later period otherwise. The EHG recordings that were recorded before the 6th week of gestation and on the the 6th week of gestation or after for the pregnancy with term birth are referred to as TE and TL s, respectively. Likewise, the EHG recordings that TABLE I THE NUMBER OF EHG RECORDINGS Early period Later period Term birth 9 Preterm birth 9 9 (Advance online publication: May 7)
IAENG International Journal of Computer Science, :, IJCS were recorded before the 6th week of gestation and on the the 6th week of gestation or after for the pregnancy with preterm birth are referred to as, respectively, PE and PL s. The numbers of EHG recordings corresponding to each are summarized in Table I. B. The Wavelet-Based s The discrete wavelet transform (DWT) is applied to decompose an EHG signal into subband components through halfband lowpass and highpass filters. Approximations and details are obtained from the discrete wavelet decomposition using the scaling function and the wavelet function that correspond to, respectively, halfband lowpass filter and halfband highpass filter. For the L-level discrete wavelet decomposition, there are L sets of detail coefficients, i.e., d l for l =,,...,L,andonesetofapproximationcoefficients, i.e., a L,obtained.Onlythedetailcoefficientsd l are however used. The wavelet-based features of EHG signals examined in this study can obtained by the following steps []: ) Apply the discrete wavelet transform to decompose an EHG signal into L levels to obtain the detail coefficients d l ; ) Compute the variance of detail coefficients d l corresponding to each level l, var(d l ); ) Take the logarithm to base of the corresponding variance of detail coefficients, log var(d l ); ) Subtract the logarithm of variance of detail coefficients corresponding to the level l from that corresponding to the lower level, l = log var(d l+ ) log var(d l ). C. Experimental Setup Asegmentwithlengthof89samplesisobtainedfrom the middle section of each channel of EHG recordings, i.e.,,, and. The th order Daubechies wavelets are applied to decompose EHG segments into 7 levels yielding 7detailcoefficientsd, d,..., d 7 and one approximation coefficient a 7.Thecoefficientsd, d, d, d, d, d 6, d 7, and a 7 approximately correspond to..-hz,..-hz,..-hz,.6.-hz,..6-hz,.6.-hz,.8.6-hz, -.8-Hz subbands, respectively. Accordingly, there are 6 wavelet-based features, i.e.,,,..., 6,extractedfromeachEHGsegment.Twoclassifications examined focus on discriminating between pregnancies with term birth and pregnancy with preterm birth. In the first preterm birth classification, single wavelet-based features of EHG segments associated with the TE and PE s are classified. Single wavelet-based features of EHG segments associated with the TL and PL s are classified in another preterm birth classification. A thresholding technique is applied for classifying single wavelet-based features into either a term birth pregnancy or a preterm birth pregnancy. Aleave-one-out(LOO)cross-validationisusedtovalidate the performance on the preterm birth classifications as the numbers of EHG recordings associated with the PE and PL s are small. A threshold τ is obtained from a training set composed of single wavelet-based features of both positive (EHG segments associated with preterm birth pregnancy) and negative (EHG segments associated with term birth pregnancy) classes except a wavelet-based feature of one EHG segment. The left-out wavelet-based feature is used as a testing sample. This process is repeated to include all samples of wavelet-based features as the testing sample. Athresholdτ is determined from the training set of waveletbased features l as follows []: { max MP +min M N τ = if MP < M N min M P +max M N () if M P > M N where M P and M N denote the wavelet-based features l corresponding to positive and negative classes, and M P and M N denote the means of wavelet-based features l corresponding to positive and negative classes, respectively. The classification is simply performed using the following rules []. In the first case, i.e., MP < M N,anEHGsegment is classified to belong to a positive class if the corresponding wavelet-based feature l is less than or equal to the threshold τ; andanegativeclass,otherwise.onthecontrary,inanother case, i.e., MP > M N, an EHG segment is classified to belong to a positive class if the corresponding wavelet-based feature l is greather than or equal to the threshold τ; and anegativeclass,otherwise. The performance of the preterm birth classifications using single wavelet-based features is evaluted using three conventional measures: accuracy (Ac), sensitivity (Se), and specificity (Sp) givenby,respectively, TP + TN Ac = TP + TN + FP + FN, TP Se = TP + FN, and TN Sp = TN + FP, where TP, TN, FP,andFN denote a number of true positives, a number of true negatives, a number of false positives, and a number of false negatives. In addition, the product of sensitivity and specificity, i.e., Se Sp, isalso determined as a performance measure that justifies both the true positive rate and the true negative rate. The performances on preterm birth classification using other quantitative features including root-mean-square (RMS), median frequency (f med ), peak frequency (f peak ) and sample entropy (SampEn) provided on the TPEHGDB are also examined and validated using the same procedure. The RMS, median frequency, peak frequency, and sample entropy [6] were determined from filtered EHG signals rather than the original EHG signals. The original EHG signals, i.e.,,,and,werefilteredusingthreedifferent- pole digital Butterworth bandpass filters with.8.-hz, TABLE II THE p-values OF t-tests DETERMINING THE SIGNIFICANT DIFFERENCES BETWEEN THE WAVELET-BASED FEATURES OF EHG SEGMENTS. PE vs. TE PL vs. TL.69.77.98...76.9.8...7.79.78.89.7...7...8.8996.66.999..6.9.99.9.7 6.9.89.9.78.6. (Advance online publication: May 7)
IAENG International Journal of Computer Science, :, IJCS (a) (b) (c) (d) 6 (e) Fig.. Box plots of wavelet-based features l of EHG segments. (f) 6..-Hz and..-hz passbands [6] referred to as, respectively, the spectral bands b, b and b. III. RESULTS A. Characteristics of the Wavelet-Based s The wavelet-based features,,..., 6 of EHG segments associated with all s, i.e., PE, PL, TE, and TL, are compared in box plots shown in Fig. (a) (f), respectively. The wavelet-based features of EHG segments vary corresponding to level l, channel, and also. Table II summarizes the p-values obtained from t-tests used to determine whether the wavelet-based features l of EHG segments associated with the preterm and term births are significantly different from each other. It is shown that there are statistically significant differences between the waveletbased features of EHG segments of channel associated with the PE and TE s, the wavelet-based features of EHG segments of channel associated with the PE and TE s, the wavelet-based features of EHG segments of channel associated with the PL and TL s, the wavelet-based features 6 of EHG segments of channel (Advance online publication: May 7)
IAENG International Journal of Computer Science, :, IJCS TABLE III PERFORMANCE ON PRETERM BIRTH CLASSIFICATIONS OF EHG SEGMENTS ASSOCIATED WITH THE PE AND TE GROUPS USING THE WAVELET-BASED FEATURES. channel channel channel.76.68.88..889.66.66..7..76.9.799..776.6.69..67.8.68.6.66.97.7..88..6.6.9.9.7.66...7.79.797.9.76..8..799.68.7.88.89..96..98.68.86.6.7.768.97.6 6.6.789.96.87.76..89.767.77.68.78.77 TABLE IV PERFORMANCE ON PRETERM BIRTH CLASSIFICATIONS OF EHG SEGMENTS ASSOCIATED WITH THE PL AND TL GROUPS USING THE WAVELET-BASED FEATURES. channel channel channel.77..798.68.79.77.78.7.6..76...768.9.9.768.79.86.67.9.66..6.69..687.866.986.768..9.79..87..79.6.8..6..68..69.6.9..77.6.7899.79...798..6667.6.7.9 6.6.789.687.697.786..8.8.797.68.86.89 associated with the PL and TL s, and the wavelet-based features 6 of EHG segments of channel associated with the PL and TL s with p<. as written in bold in Table II. B. Performance on Preterm Birth Classifications The accuracy (Ac), the sensitivity (Se), the specificity (Sp), and the product of sensitivity and specificity (Se Sp) of the preterm birth classifications of EHG segments associated with the PE and TE s using the leave-oneout cross-validation are summarized in Table III. The best accuracy, sensitivity, and specificity obtained for the preterm birth classifications are.89,.768, and.96, respectively, using the wavelet-based feature of channel,the wavelet-based feature of channel,andthewaveletbased feature of channel.thebestperformanceon the preterm birth classification of wavelet-based features of EHG segments associated with the PE and TE s with respect to the product of sensitivity and specificity is however obtained using the wavelet-based feature of channel with the product of sensitivity and specificity of.88. The corresponding accuracy, sensitivity, and specificity are.799,.68, and.7, respectively. Table IV summarizes the accuracy (Ac), the sensitivity (Se), the specificity (Sp), and the product of sensitivity and specificity (Se Sp) of the preterm birth classifications of EHG segments associated with the PL and TL s using the leave-one-out cross-validation. The best accuracy, sensitivity, and specificity obtained for the preterm birth classifications are.797,.768, and.86, respectively, using the wavelet-based feature 6 of channel,thewaveletbased features of channel and of channel, and the wavelet-based features of channel and 6 of channel.thebestperformanceonthepretermbirth classification of wavelet-based features of EHG segments associated with the PL and TL s with respect to the product of sensitivity and specificity is obtained using the wavelet-based feature of channel with the product of sensitivity and specificity of.7. The corresponding accuracy, sensitivity, and specificity are.79,.77, and.78, respectively. The quantitative measures, i.e., RMS, f med, f peak,and SampEn, of spectral bands b, b and b of EHG segments are compared in box plots shown in Fig. (a) (c), Fig. (a) (c), Fig. (a) (c), and Fig. (a) (c), respectively. The performances on the preterm birth classifications of various spectral bands, i.e., b, b and b,ofehgsegmentsassociatedwith the PE and TE s and those associated with the PL and TL s using quantitative measures, i.e., root-meansquare (RMS), median frequency (f med ), peak frequency (f peak )andsampleentropy(sampen),aresummarizedin Table V and Table VI, respectively. The best performances on preterm birth classification using the RMS, median frequency, peak frequency, and sample entropy of EHG segments associated with the PE and TE s with respect to the product of sensitivity and specificity are, respectively, obtained at the spectral band b of channel (Se Sp =.97), the spectral band b of channel (Se Sp =.78), the spectral band b of channel (Se Sp =.8), and the spectral band b of channel (Se Sp =.8). The accuracy, sensitivity, and specificity obtained for the preterm birth classification using the sample entropy of the spectral band b of channel are, respectively,.8,.77, and.86. The best performances on preterm birth classification using the RMS, median frequency, peak frequency, and sample entropy of EHG segments associated with the PL and TL s with respect to the product of sensitivity and specificity are obtained at the spectral band b of channel (Se Sp =.7), the spectral band b of channel (Se Sp =.967), the spectral bands b and b of channel (Se Sp =.8), and the spectral band b of channel (Se Sp =.7), respectively. The accuracy, sensitivity, and specificity obtained for the preterm birth classification (Advance online publication: May 7)
IAENG International Journal of Computer Science, :, IJCS 8 6 RMS RMS RMS (a) b (b) b (c) b Fig.. Box plots of RMS of spectral bands b, b and b of EHG segments.... f med f med. f med... (a) b (b) b (c) b Fig.. Box plots of median frequency (f med )ofspectralbandsb, b and b of EHG segments. f peak... f peak f peak.. (a) b (b) b (c) b Fig.. Box plots of peak frequency (f peak )ofspectralbandsb, b and b of EHG segments. SampEn.8.6.. SampEn..8.6.. SampEn.8.6.. (a) b (b) b (c) b Fig.. Box plots of sample entropy (SampEn) of spectral bands b, b and b of EHG segments. (Advance online publication: May 7)
IAENG International Journal of Computer Science, :, IJCS TABLE V PERFORMANCE ON PRETERM BIRTH CLASSIFICATIONS OF EHG SEGMENTS ASSOCIATED WITH THE PE AND TE GROUPS USING THE QUANTITATIVE MEASURES. channel channel channel.8. Hz RMS.88.8.9.97.8..99..86..979. f med.76.8.98.69.9.789.776.98.778.897.98.7 f peak.99..99.8.877.768..9.796..9. SampEn.776.8.8.68.7.66...8.77.86.8.. Hz RMS.8.789.7.7.78..888..9.79.6.79 f med.889.768.7..99.97.9.6.8.789.96.9 f peak..789.77..7.97.7.66..97.. SampEn.96.897.8..679.79.78.8.87.79.97.8.. Hz RMS.9.8.867..76..867..89.6.98. f med.6.8.7...97.9.9.679.68..78 f peak..789.77...68.6.8.9.97.8.9 SampEn.8.897.9.8.67..6.79.88.68.9. TABLE VI PERFORMANCE ON PRETERM BIRTH CLASSIFICATIONS OF EHG SEGMENTS ASSOCIATED WITH THE PL AND TL GROUPS USING THE QUANTITATIVE MEASURES channel channel channel.8. Hz RMS.87..986..89..96..8.79.9.7 f med.79..86..769.79.8..79..86. f peak.87.79.96.6.76..89.767.89..97.986 SampEn.769.77.78.7.9.68.96.679.69..77.6.. Hz RMS.77..78.8.8.6.9.86...8.7 f med.6.68.6.967.6..6..87.97.77.9 f peak.7.6.8.8..97...7.66.776.8 SampEn.678.8.7.7.6667.6.68.67.88.6.888.7.. Hz RMS.76..797.89.88.6.96.8....6 f med.69..69.6.76..8..9.97.77.8 f peak.7.6.8.8.7.97.9.99.68.66.6.78 SampEn.96.6.6.6.67.66..89.79.8.88.8 using the sample entropy of spectral band b of channel are.769,.77, and.78, respectively. IV. DISCUSSION AND CONCLUSIONS The wavelet-based feature l defined as the difference between the logarithms of variances of detail coefficients corresponding to consecutive levels, i.e., l + and l, ofehg signals is applied for preterm birth classification. For the EHG segments obtained during early period, i.e., the EHG segments recorded before the 6th week of gestation, the best performance on the preterm birth classification is obtained using the wavelet-based feature which corresponds to the..6-hz and.6.-hz subbands of EHG segments. The best performance on the preterm birth classification for the EHG segments obtained during later period, i.e., the EHG segments recorded on or after the 6th week of gestation, is obtained using the wavelet-based feature which corresponds to the..-hz and..-hz subbands of EHG segments. The wavelet-based feature of EHG segments provides a substantially better performance on the preterm birth classification using the EHG segments obtained during early period compared to the quantitative measures, i.e., root-mean-square (RMS), median frequency (f med ), peak frequency (f peak ), and sample entropy (SampEn). On the other hand, the performance on the preterm birth classification using the sample entropy of the.8.-hz subband of EHG segments obtained during later period is slightly superior to that using the wavelet-based features of EHG segments obtained during later period. The computation results suggest that the wavelet-based features of EHG signals can be reasonably used for preterm birth classification. REFERENCES [] G.W. Wornell, Wavelet-based representations for the /f family of fractal processes, Proceedings of the IEEE, vol. 8, pp. 8, 99. [] G.W. Wornell, Signal Processing with Fractals: A Wavelet-Based Approach. PrenticeHall:NewJersey,99. [] G. W. Wornell, Synthesis, analysis, and processing of fractal signals, Ph.D. Thesis, Massachusetts Institute of Technology, Massachusetts, U.S.A., 99. (Advance online publication: May 7)
IAENG International Journal of Computer Science, :, IJCS [] S. Janjarasjitt, Examination of wavelet-based features for congestive heart failure classification using SVM, Lecture Notes in Engineering and Computer Science: Proceedings of The World Congress on Engineering 6, WCE 6, 9 June July, 6, London, U.K., pp. 6. [] S. Janjarasjitt and K.A. Loparo, Examination of Multiple Spectral Exponents of Epileptic ECoG Signal, IAENG International Journal of Computer Science, vol.,no.,pp.7 8,. [6] G. Fele-Žorž, G. Kavšek, Ž. Novak-Antolič and F. Jager, A comparison of various linear and non-linear signal processing techniques to separate uterine EMG records of term and pre-term delivery s, Med. Biol. Eng. Comput., vol.6,pp.9 9,8. [7] H. Leman, C. Marque and J. Gondry, Use of the electrohysterogram signal for characterization of contractions during pregnancy, IEEE Trans. Biomed. Eng., vol.6,pp. 9,999. [8] W.L. Maner and R.E. Garfield, Identification of human term and preterm labor using artificial neural networks on uterine electromyography data, Ann. Biomed. Eng., vol., pp. 6 7, 7. [9] World Health Organization, Born too soon: the global action report on preterm birth, Geneva,. [] A. L. Goldberger, et al., PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals, Circulation, vol.,pp.e e,. [] S. Janjarasjitt, Performance of epileptic single-channel scalp EEG classifications using single wavelet-based features, Australas. Phys. Eng. Sci. Med.. DOI:.7/s6-6-- (Advance online publication: May 7)