The NU-NAIST voice conversion system for the Voice Conversion Challenge 2016

Size: px
Start display at page:

Download "The NU-NAIST voice conversion system for the Voice Conversion Challenge 2016"

Transcription

1 INTERSPEECH 16 Septeber 8 12, 16, San Francisco, USA The NU-NAIST voice syste for the Voice Conversion Challenge 16 Kazuhiro Kobayashi 1, Shinnosuke Takaichi 1, Satoshi Nakaura 1, Tooki Toda 2 1 Nara Institute of Science and Technology (NAIST), Japan 2 Inforation Technology Center, Nagoya University, Japan 1 {kazuhiro-k, shinnosuke-t, s-nakaura}@is.naist.jp, 2 tooki@icts.nagoya-u.ac.jp Abstract This paper presents the NU-NAIST voice (VC) syste for the Voice Conversion Challenge 16 (VCC 16) developed by a joint tea of Nagoya University and Nara Institute of Science and Technology. Statistical VC based on a Gaussian ixture odel akes it possible to convert speaker identity of a source speaker voice into that of a target speaker by converting several speech paraeters. However, various factors such as paraeterization errors and over-soothing effects usually cause speech quality degradation of the converted voice. To address this issue, we have proposed a direct wavefor odification technique based on spectral differential filtering and have successfully applied it to singing voice where excitation features are not necessary converted. In this paper, we propose a ethod to apply this technique to a standard voice task where excitation feature is needed. The result of VCC 16 deonstrates that the NU-NAIST VC syste developed by the proposed ethod yields the best accuracy for speaker identity (ore than 7% of the correct rate) and quite high naturalness score (ore than 3 of the ean opinion score). This paper presents detail descriptions of the NU-NAIST VC syste and additional results of its perforance evaluation. Index Ters: voice challenge 16, speaker identity, segental feature, Gaussian ixture odel, STRAIGHT analysis. 1. Introduction Varieties of voice characteristics, such as voice tibre and fundaental frequency (F ) patterns, produced by individual speakers are always restricted by their own physical constraint due to the speech production echanis. This constraint is helpful for aking it possible to produce a speech signal capable of siultaneously conveying not only linguistic inforation but also non-linguistic inforation such as speaker identity. However, it also causes various barriers in speech counication; e.g., severe vocal disorders are easily caused even if speech organs are partially daaged; and we hesitate to talk about soething private using a cell phone if we are surrounded by others. If the individual speakers freely produced various voice characteristics over their own physical constraints, it would break down these barriers and open up an entirely new speech counication style. Voice (VC) is a potential technique to ake it possible for us to produce speech sounds beyond our own physical constraints 1]. VC research was originally started to achieve speaker to ake it possible to transfor the voice identity of a source speaker into that of a target speaker while preserving the linguistic content 2]. A ainstrea of VC is a statistical approach to developing a function using a parallel data set consisting of utterances of the source and target speakers. As one of the ost popular statistical VC ethods, a regression ethod using a Gaussian ixture odel (GMM) was proposed 3]. To iprove perforance of the GMM-based VC ethod, various VC ethods have been proposed by ipleenting ore sophisticated techniques, such as Gaussian process regression 4, 5] deep neural networks 6, 7], non-negative atrix factorization 8, 9], and so on. We have also significantly iproved perforance of the standard GMMbased VC ethod by incorporating a trajectory-based algorith to ake it possible to consider teporal correlation in 1], odeling additional features to alleviate an over-soothing effect of the converted speech paraeters, such as global variance (GV) 1] and odulation spectru (MS) 11], and ipleenting STRAIGHT 12] with ixed excitation 13]. Furtherore, a real-tie process has also been successfully ipleented for state-of-the-art GMMbased VC 14]. However, the speech quality of the converted voices is still obviously degraded copared to that of the natural voices. One of the biggest factors causing this quality degradation is the wavefor generation process using a vocoder 15], which is still observed even when using high-quality vocoder systes 12, 16, 17]. In singing VC (SVC), to avoid the quality degradation caused by the vocoding process 15], we have proposed an intragender SVC ethod with direct wavefor odification based on spectru differential (DIFFSVC) 18] considering global variance (GV) 19], focusing on F transforation is not necessary in the intra-gender SVC. The DIFFSVC fraework can avoid using the vocoder by directly filtering an input singing voice wavefor with a tie sequence of spectral paraeter differentials estiated by a differential GMM (DIFFGMM) analytically derived fro the conventional GMM used in the standard ethod. Moreover, to apply this DIFFSVC fraework to cross-gender DIFFSVC as well, we have proposed an F transforation technique with direct residual signal odification ] based on tie-scaling with wavefor siilarity-based overlap-add 21] and resapling. In this paper, we develop a new VC syste for speaker based on the direct wavefor odification technique, which was subitted to the Voice Conversion Challenge 16 (VCC 16) 22] fro our joint tea of Nagoya University and Nara Institute of Science and Technology (NAIST) as the NU- NAIST VC syste (called new NAIST VC syste ). The following techniques are newly ipleented for our GMM-based VC syste: 1) voice with direct wavefor odification with spectral differential (DIFFVC), 2) speech paraeter trajectory soothing in the GMM training, 3) post-filtering process based on MS for DIFFVC, and 4) excitation conver- Copyright 16 ISCA

2 sion (EC) using STRAIGHT as preprocessing of spectral. The results of the VCC 16 have deonstrated that the NU-NAIST VC syste (syste J ) achieved the best accuracy on speaker identity and high naturalness (ore than 3 on the ean opinion score scale). In this paper, we also conduct subjective evaluations, deonstrating that the NU- NAIST VC syste achieves high speech quality and accuracy coparable to our conventional GMM-based VC syste. 2. VC based on GMM In the conventional VC, acoustic features such as spectral features and aperiodic coponents of a source speaker are converted into those of a target speaker based on previously trained GMMs. F is transfored to copensate for the difference in pitch between the source and target speakers based on fraeby-frae linear. Finally, the converted voice is generated by synthesizing these converted acoustic features using a vocoder Acoustic feature apping based on GMM Acoustic feature apping based on GMM consists of a training process and a process. In the training process, a joint probability density function of acoustic features of the source and target speaker voices are odeled with a GMM using a parallel data set. As the acoustic features of the source and target speakers, we eploy 2D-diensional joint static and dynaic feature vectors X t =x t, Δx t ] of the source and Y t =y t, Δy t ] of the target consisting of D-diensional static feature vectors x t and y t and their dynaic feature vectors Δx t and Δy t at frae t, respectively, where denotes the transposition of the vector. Their joint probability density odeled by the GMM is given by P (X t, Y t λ) M = α N =1 ( Xt Y t ] ; μ (X) μ (Y ) ], Σ (XX) Σ (XY ) Σ (YX) Σ (YY) ]), (1) where N ( ; μ, Σ) denotes the noral distribution with a ean vector μ and a covariance atrix Σ. The ixture coponent index is. The total nuber of ixture coponents is M. λ is a GMM paraeter set consisting of the ixture-coponent weight α, the ean vector μ, and the covariance atrix Σ of the -th ixture coponent. The GMM is trained using joint vectors of X t and Y t in the parallel data set, which are autoatically aligned to each other by dynaic tie warping. In the process, the acoustic features of the source speaker are converted into that of the target speaker using axiu likelihood estiation (MLE) of speech paraeter trajectories using the GMM and GV 1] F transforation In both intra- and cross-gender s, F is transfored frae-by-frae in order to line up pitch differences between source and target speakers. ŷ t = σ(y) σ (x) (xt μ(x) )+μ (y), (2) where x t and ŷ t are a log-scaled F of the source speaker and the converted one at frae t. μ (x) and σ (x) are the ean and standard deviation of log-scaled F of the source speaker and μ (y) and σ (y) are those of the target speaker. 3. The NU-NAIST VC syste for VCC 16 In this paper, we proposed the following techniques: 1) DIF- FVC, 2) GMM training with soothed speech paraeter trajectory, 3) post-filtering process based on odulation spectru (MS) for DIFFVC, and 4) excitation with F and aperiodic coponents transforations using a vocoder. Figure 1 indicates the flow of the NU-NAIST VC syste for the VCC 16. The NU-NAIST VC syste perfors excitation and spectral. During excitation, F values and aperiodic coponents extracted fro a source voice are transfored within an analysis/synthesis fraework using a vocoder. During spectral, spectral features of the source voice are converted into spectral feature differentials based on the DIFFGMM. Next, MS-based post-filtering is applied to the spectral feature differential. Finally, the converted speech wavefor is generated by directly filtering the analysis-synthesized speech wavefor generated during the excitation step using the post-filtered spectral feature differentials DIFFVC based on DIFFGMM As part of the odelling step, the DIFFGMM is analytically derived fro the traditional GMM (in Eq. (3)). Let D t = ] d t, Δd t denote the static and dynaic differential feature vector, where d t = y t x t, the DIFFGMM is derived by transforing odel paraeters in the sae anner as DIFFSVC 18] as follows: P (X t, D t λ) M = α N =1 ( Xt D t ] ; μ (X) μ (D) ], Σ (XX) Σ (DX) Σ (XD) Σ (DD) ]). (3) During the step, a tie sequence of the D- diensional converted spectral feature differentials, ˆd, is deterined using MLE of the speech paraeter trajectory using the DIFFGMM 18]. Then, the converted speech wavefor is generated by directly filtering an input speech wavefor with a tie-variant synthesis filter designed fro the spectral feature differential sequence. This filtering process odifies a spectral envelope sequence while basically preserving the natural excitation signals of the input speech wavefor Speech paraeter trajectory soothing Modulation Spectru (MS) 11] is defined as the log-scaled power spectru of the paraeter sequence; i.e., teporal fluctuation of the paraeter sequence is decoposed into individual odulation frequency coponents and their power values are represented as the MS. The MS, s (y), of the paraeter sequence y is defined as: s (y) = s 1 (y),, s d (y),, s D (y) ], (4) s d (y) = s d, (y),,s (y),,s d,ds 1 (y)], (5) where 2D s is the length of the discrete Fourier transfor, and s (y) is the f-th MS of the d-th diension of the paraeter sequence y 1 (d),, y T (d)]. f is the odulation frequency index. As reported in 23, 24], the higher odulation frequency coponents (ore fluctuating coponents of a teporal sequence) of spectral paraeter sequences are negligible for speech quality. By applying a low-pass filter (LPF) that reoves the higher odulation frequency coponents (e.g., ore than 5 Hz (f > D s/2)), we can iprove training accuracy 1668

3 Source voice STRAIGHT Analysis F Aperiodicity Band aperiodicity Linear transforation GMM for aperiodic coponents Aperiodicity odification Converted band aperiodicity Transfored F Excitation generation Synthesis filter F transfored source voice Converted voice (DIFFVC (EC)) Synthesis filter Spectru envelope Mel-cepstru DIFFGMM for el-cepstru Converted el-cepstru differential MSPF Figure 1: Conversion process of the NU-NAIST VC syste for VCC 16. of acoustic odels as done for hidden Markov odel-based speech synthesis 25]. Here, source and target speakers speech paraeter sequences, x and y, are LPFed, then the LPFed sequences, x (LPF) and y (LPF), are used to train the GMM. In, x (LPF) is used to generate the spectral differentials MS-based post-filter for VC with spectral differentials Statistical odeling tends to deteriorate MSs of the converted speech paraeters, and keeping natural MSs is strongly effective for iproving the quality of the converted speech. An MSbased post-filter (MSPF) 11], which is applied after speech paraeter in conventional GMM-based VC, odifies a converted speech paraeter sequence so that the sequence has the target speaker s natural MS. Here, we propose an MS-based post-filtering process that odifies spectral differentials, ˆd, such that the finally synthesized speech has the target speaker s natural MS. In training, we calculate MS statistics for target speaker s natural and converted speech paraeters fro the training data, y and ỹ =ˆd+x (LPF) ]. Here, let μ (y) s (y) and s (ỹ), and let σ (y) and μ(ỹ) and σ(ỹ) be the ean of be their variance. The ˆd is generated by converting x (LPF). In, x (LPF) is first added to the generated ˆd. Then, the MS, s (ỹ) is converted as follows: s (ỹ) = σ(y) ( σ (ỹ) s (ỹ) μ (ỹ) ) + μ (y). (6) The converted ỹ is deterined using the converted MS and the original phase coponents. The MSPFed spectral differentials, ˆd (MSPF) can be deterined by subtracting x (LPF) fro the converted ỹ 1. Note that, in this paper, we use ean-noralized MSs and adopt a segent-level post-filtering process 11] Excitation based on F and aperiodicity transforations using a vocoder Although we initially tried ipleenting the F transforation technique with direct residual signal odification ] for singer, we found that this technique was not effective for speaker. In speaker, we need to handle larger acoustic differences in excitation signals between the source and target speakers copared to singing voice. To address this issue, we ipleented excitation using STRAIGHT 26] as high-quality vocoder. For the F transforation, we perfor the global linear transforation as described in Sect 2.2. For the aperiodic coponents, band-averaged aperiodic coponents are extracted and converted with the GMM as in the conventional ethod 13]. Then, 1 Note that, because the MSPF process is non-linear to the speech paraeter sequence, the sequence that x (LPF) is subtracted fro the converted ỹ is not equal to ˆd. original aperiodic coponents at all frequency bins are shifted using aperiodic differentials between the extracted and converted band-averaged aperiodic coponents. Finally, analysissynthesized speech is generated fro these transfored excitation paraeters using STRAIGHT. Note that full STRAIGHT spectral representation is directly used in synthesis. This excitation ethod actually causes significant quality degradation because original phase inforation is discarded. Nevertheless, we have found that this ethod yields better speech quality as well as better accuracy than the direct residual signal odification ]. 4. Experiental evaluation In this section, we show results of the VCC 16 to deonstrate perforance of the NU-NAIST VC syste. Moreover, we copare the following three systes: DIFFVC (EC): The NU-NAIST VC syste subitted to the VCC 16, VC: Our conventional VC syste 13], DIFFVC: The NU-NAIST VC syste w/o excitation Experiental conditions We evaluated speech quality and speaker identity of the converted voices to copare perforance of the different VC systes in both intra-gender and cross-gender tasks. We used the English speech database used in the VCC 16. The nuber of source speakers was 5 including 3 feales and 2 ales, and that of the target speakers was 5 including 2 feales and 3 ales who were different fro the source feale and ale speakers. The nuber of sentences uttered by each speaker was 216. The sapling frequency was set to 16 khz. STRAIGHT 12] was used to extract spectral envelopes, which was paraeterized into the 1-24th el-cepstral coefficients as the spectral feature. The frae shift was 5 s. The el log spectru approxiation (MLSA) filter 27] was used as the synthesis filter. As the source excitation features, we used F and aperiodic coponents extracted with STRAIGHT 26]. The aperiodic coponents were averaged over five frequency bands, i.e., -1, 1-2, 2-4, 4-6, and 6-8 khz, to be odeled with the GMM. We used 162 sentences for training and the reaining 54 sentences were used for evaluation. The speaker-dependent GMMs were separately trained for all cobinations of source and target speaker pairs. The nuber of ixture coponents for the el-cepstral coefficients was 128 and for the aperiodic coponents was 64. Two preference tests were conducted. In the first test, speech quality of the converted voices was evaluated. The converted voice saples generated by two different VC systes for the sae sentences were presented to subjects in rando order. The subjects selected which saple had better speech quality. 1669

4 Preference score for accuracy on speaker identity %] 6 4 The NU-NAIST VC syste Target Source Subitted systes MOS score for speech quality Figure 2: Sound quality and accuracy on speaker identity in VCC 16. Preference score %] 6 4 DIFFVC(EC) w/ MSPF VC w/ GV DIFFVC w/ MSPF 95% confidence interval (a) Intra-gender (b) Cross-gender Figure 3: AB preference test for speech quality. Preference score %] 6 4 DIFFVC(EC) w/ MSPF VC w/ GV DIFFVC w/ MSPF 95% confidence interval (a) Intra-gender (b) Cross-gender Figure 4: XAB test for accuracy on speaker identity. In the second test, accuracy in speaker identity was evaluated. A natural voice saple of the target speaker was presented to the subjects first as a reference. Then, the converted voice saples generated by two different VC systes for the sae sentences were presented in rando order. The subjects selected which saple was ore siilar to the reference natural voice in ters of speaker identity. The nuber of subjects was 1 and each listener evaluated 54 saple pairs in each evaluation. They were allowed to replay each saple pair as any ties as necessary Results of the VCC 16 Figure 2 indicates an overall result of the VCC 16. The NU- NAIST VC syste achieved quite high speech quality over 3. of MOS and the best accuracy (about 74%) aong all subitted VC systes. In ters of the accuracy, our syste achieved successful perforance even though very siple prosodic was perfored. However, it is observed that there is still a large gap between the converted voices and the natural target voices. It is expected that further iproveents will be yielded by ipleenting a ethod of prosodic patterns or asking the source speakers to iic target prosodic patterns, which would be possible in several practical applications. In ters of speech quality, the NU-NAIST VC syste causes serious quality degradation copared to natural voices, i.e., fro 4.6 to 3. in MOS. This quality degradation is ainly caused by using a vocoder to perfor the excitation as shown in the next section. Therefore, it is expected that the converted speech quality will be significantly iproved by developing a better analysis/synthesis technique than STRAIGHT Results of subjective evaluation Figures 3 (a) and (b) indicate the results of the preference test for speech quality. DIFFVC (EC) achieves equivalent speech quality copared to VC in both intra/cross-gender s. On the other hand, DIFFVC achieves significantly higher speech quality copared to the other two ethods in the intra-gender. This is because DIFFVC can avoid using vocoding to generate converted speech wavefors, aking the process free fro various errors, such as F extraction errors and unvoiced/voiced decision errors. Note that DIFFVC in cross-gender condition does not result in any significant quality iproveents as it suffers fro isatches between spectral envelope and F in the cross-gender. Figures 4 (a) and (b) indicate the results of the preference test for speaker identity. Although DIFFVC (EC) has equivalent accuracy copared to VC in the intra-gender, it tends to be degraded in the cross-gender. It is expected that the residual spectral envelope preserved in the direct wavefor odification process still includes speakerdependent or gender-dependent features, and that this causes adverse effects on accuracy. These results suggest that 1) the NU-NAIST VC syste deonstrating the best accuracy and high speech quality in the VCC 16 has an alost equivalent perforance copared to the conventional VC syste in both intragender and cross-gender s, and 2) the direct wavefor odification technique achieves significantly higher converted speech quality copared to the conventional VC syste if the excitation is not necessary as in the intragender, and therefore, there is still large roo to iprove the converted speech quality of the NU-NAIST VC syste. 5. Conclusions This paper describes the details of the NU-NAIST voice (VC) syste for the Voice Conversion Challenge 16 (VCC 16) developed by a joint tea of Nagoya University and Nara Institute of Science and Technology. In order to iprove the quality of statistical VC based on Gaussian Mixture Model (GMM), we applied the following techniques: 1) voice with direct wavefor odification with spectral differential (DIFFVC), 2) speech paraeter trajectory soothing, 3) post-filtering based on odulation spectru for DIFFVC, and 4) preprocessing for excitation with F and aperiodic coponent transforations using high-quality vocoding. The experiental results deonstrated that the NU-NAIST VC syste was highly ranked in the VCC 16, its perforance was coparable to our conventional VC syste, and the DIF- FVC technique showed large potential to significantly iprove the converted speech quality of the NU-NAIST VC syste. In future work, we plan to ipleent high quality F and aperiodicity transforation for the DIFFVC technique. Acknowledgeents This work was supported in part by JSPS KAKENHI Grant Nuber 266 and Grant-in-Aid for JSPS Research Fellow Nuber 16J

5 6. References 1] T. Toda, Augented speech production based on real-tie statistical voice, Proc. GlobalSIP, pp , Dec ] M. Abe, S. Nakaura, K. Shikano, and H. Kuwabara, Voice through vector quantization, J. Acoust. Soc. Jpn (E), vol. 11, no. 2, pp , ] Y. Stylianou, O. Cappé, and E. Moulines, Continuous probabilistic transfor for voice, IEEE Trans. SAP, vol. 6, no. 2, pp , Mar ] N. Pilkington, H. Zen, and M. Gales, Gaussian process experts for voice, Proc. INTERSPEECH, pp , Aug ] N. Xu, Y. Tang, J. Bao, A. Jiang, X. Liu, and Z. Yang, Voice based on Gaussian processes by coherent and asyetric training with liited training data, Speech Counication, vol. 58, pp , Mar ] L.-H. Chen, Z.-H. Ling, L.-J. Liu, and L.-R. Dai, Voice using deep neural networks with layer-wise generative training, IEEE/ACM Trans. ASLP, vol. 22, no. 12, pp , Dec ] L. Sun, S. Kang, K. Li, and H. Meng, Voice using deep bidirectional long short-ter eory based recurrent neural networks, Proc. ICASSP, pp , Apr ] R. Takashia, T. Takiguchi, and Y. Ariki, Exeplar-based voice using sparse representation in noisy environents, IEICE Trans. on Inf. and Syst., vol. E96-A, no. 1, pp , Oct ] Z. Wu, T. Virtanen, E. Chng, and H. Li, Exeplar-based sparse representation with residual copensation for voice, IEEE/ACM Trans. ASLP, vol. 22, no. 1, pp , June 14. 1] T. Toda, A. W. Black, and K. Tokuda, Voice based on axiu likelihood estiation of spectral paraeter trajectory, IEEE Trans. ASLP, vol. 15, no. 8, pp , Nov ] S. Takaichi, T. Toda, A. W. Black, G. Neubig, S. Sakti, and S. Nakaura, Postfilters to odify the odulation spectru for statistical paraetric speech synthesis, IEEE Trans. ASLP, vol. 24, no. 4, pp , Jan ] H. Kawahara, I. Masuda-Katsuse, and A. Cheveigné, Restructuring speech representations using a pitch-adaptive tie-frequency soothing and an instantaneous-frequency-based f extraction: Possible role of a repetitive structure in sounds, Speech Counication, vol. 27, no. 3-4, pp , Apr ] Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, Maxiu likelihood voice based on GMM with STRAIGHT ixed excitation, Proc. INTERSPEECH, pp , Sept ] T. Toda, T. Muraatsu, and H. Banno, Ipleentation of coputationally efficient real-tie voice, Proc. INTER- SPEECH, Sept ] H. Dudley, Reaking speech, JASA, vol. 11, no. 2, pp , ] Y. Stylianou, Applying the haronic plus noise odel in concatenative speech synthesis, IEEE Trans. SAP, vol. 9, no. 1, pp , 1. 17] D. Erro, I. Sainz, E. Navas, and I. Hernaez, Haronics plus noise odel based vocoder for statistical paraetric speech synthesis, IEEE J-STSP, vol. 8, no. 2, pp , ] K. Kobayashi, T. Toda, G. Neubig, S. Sakti, and S. Nakaura, Statistical singing voice with direct wavefor odification based on the spectru differential, Proc. INTERSPEECH, pp , Sept ], Statistical singing voice based on direct wavefor odification with global variance, Proc. INTERSPEECH, pp , Sept. 15. ] K. Kobayashi, T. Toda, and S. Nakaura, Ipleentation of f transforation for statistical singing voice based on direct wavefor odification, Proc. ICASSP, pp , Mar ] W. Verhelst and M. Roelands, An overlap-add technique based on wavefor siilarity (WSOLA) for high quality tie-scale odification of speech, Proc. ICASSP, pp vol.2, Apr ] T. Toda, L.-H. Chen, D. Saito, F. Villavicencio, M. Wester, Z. Wu, and J. Yaagishi, The Voice Conversion Challenge 16, Proc. INTERSPEECH, Sept ] S. Takaichi, T. Toda, A. W. Black, and S. Nakaura, Paraeter generation algorith considering odulation spectru for HMMbased speech synthesis, Proc. ICASSP, Apr ], Modulation spectru-constrained trajectory training algorith for GMM-based voice, Proc. ICASSP, Apr ] S. Takaichi, K. Kobayashi, K. Tanaka, T. Toda, and S. Nakaura, The naist text-to-speech syste for the blizzard challenge 15, Proc. Blizzard Challenge workshop, Sept ] H. Kawahara, J. Estill, and O. Fujiura, Aperiodicity extraction and control using ixed ode excitation and group delay anipulation for a high quality speech analysis, odification and syste straight, Proc. MAVEBA, Sept ] K. Tokuda, T. Kobayashi, T. Masuko, and S. Iai, Melgeneralized cepstral analysis a unified approach to speech spectral estiation, Proc. ICSLP, pp ,

Statistical Singing Voice Conversion with Direct Waveform Modification based on the Spectrum Differential

Statistical Singing Voice Conversion with Direct Waveform Modification based on the Spectrum Differential INTERSPEECH 2014 Statistical Singing Voice Conversion with Direct Wavefor Modification based on the Spectru Differential Kazuhiro Kobayashi, Tooki Toda, Graha Neubig, Sakriani Sakti, Satoshi Nakaura Graduate

More information

Statistical Singing Voice Conversion based on Direct Waveform Modification with Global Variance

Statistical Singing Voice Conversion based on Direct Waveform Modification with Global Variance INTERSPEECH 15 Statistical Singing Voice Conversion based on Direct Wavefor Modification with Global Variance Kazuhiro Kobayashi, Tooki Toda, Graha Neubig, Sakriani Sakti, Satoshi Nakaura Graduate School

More information

Direct F 0 Control of an Electrolarynx based on Statistical Excitation Feature Prediction and its Evaluation through Simulation

Direct F 0 Control of an Electrolarynx based on Statistical Excitation Feature Prediction and its Evaluation through Simulation INTERSPEECH 2014 Direct F 0 Control of an Electrolarynx based on Statistical Excitation Prediction and its Evaluation through Siulation Kou Tanaka, Tooki Toda, Graha Neubig, Sakriani Sakti, Satoshi Nakaura

More information

A Digital Signal Processor Implementation of Silent/Electrolaryngeal Speech Enhancement based on Real-Time Statistical Voice Conversion

A Digital Signal Processor Implementation of Silent/Electrolaryngeal Speech Enhancement based on Real-Time Statistical Voice Conversion INTERSPEECH 03 A Digital Signal Processor Ipleentation of Silent/Electrolaryngeal Speech Enhanceent based on Real-Tie Statistical Voice Conversion Takuto Moriguchi, Tooki Toda, Motoaki Sano, Hiroshi Sato,

More information

Quality-enhanced Voice Morphing using Maximum Likelihood Transformations

Quality-enhanced Voice Morphing using Maximum Likelihood Transformations 1 Quality-enhanced Voice Morphing using Maxiu Likelihood Transforations Hui Ye, Student Meber, IEEE, and Steve Young, Meber, IEEE Abstract Voice orphing is a technique for odifying a source speaker s speech

More information

Nonaudible murmur enhancement based on statistical voice conversion and noise suppression with external noise monitoring

Nonaudible murmur enhancement based on statistical voice conversion and noise suppression with external noise monitoring Nonaudible murmur enhancement based on statistical voice conversion and noise suppression with external noise monitoring Yusuke Tajiri 1, Tomoki Toda 1 1 Graduate School of Information Science, Nagoya

More information

Applying Spectral Normalisation and Efficient Envelope Estimation and Statistical Transformation for the Voice Conversion Challenge 2016

Applying Spectral Normalisation and Efficient Envelope Estimation and Statistical Transformation for the Voice Conversion Challenge 2016 INTERSPEECH 1 September 8 1, 1, San Francisco, USA Applying Spectral Normalisation and Efficient Envelope Estimation and Statistical Transformation for the Voice Conversion Challenge 1 Fernando Villavicencio

More information

WaveNet Vocoder and its Applications in Voice Conversion

WaveNet Vocoder and its Applications in Voice Conversion The 2018 Conference on Computational Linguistics and Speech Processing ROCLING 2018, pp. 96-110 The Association for Computational Linguistics and Chinese Language Processing WaveNet WaveNet Vocoder and

More information

APPLICATION OF THE FAN-CHIRP TRANSFORM TO HYBRID SINUSOIDAL+NOISE MODELING OF POLYPHONIC AUDIO

APPLICATION OF THE FAN-CHIRP TRANSFORM TO HYBRID SINUSOIDAL+NOISE MODELING OF POLYPHONIC AUDIO 6th European Signal Processing Conference (EUSIPCO 8), Lausanne, Switzerland, August 5-9, 8, copyright by EURASIP APPLICATION OF THE FAN-CHIRP TRANSFORM TO HYBRID SINUSOIDAL+NOISE MODELING OF POLYPHONIC

More information

DSI3 Sensor to Master Current Threshold Adaptation for Pattern Recognition

DSI3 Sensor to Master Current Threshold Adaptation for Pattern Recognition International Journal of Signal Processing Systes Vol., No. Deceber 03 DSI3 Sensor to Master Current Threshold Adaptation for Pattern Recognition David Levy Infineon Austria AG, Autootive Power Train Systes,

More information

SYNTHETIC SPEECH DETECTION USING TEMPORAL MODULATION FEATURE

SYNTHETIC SPEECH DETECTION USING TEMPORAL MODULATION FEATURE SYNTHETIC SPEECH DETECTION USING TEMPORAL MODULATION FEATURE Zhizheng Wu 1,2, Xiong Xiao 2, Eng Siong Chng 1,2, Haizhou Li 1,2,3 1 School of Computer Engineering, Nanyang Technological University (NTU),

More information

TESTING OF ADCS BY FREQUENCY-DOMAIN ANALYSIS IN MULTI-TONE MODE

TESTING OF ADCS BY FREQUENCY-DOMAIN ANALYSIS IN MULTI-TONE MODE THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Series A, OF THE ROMANIAN ACADEMY Volue 5, Nuber /004, pp.000-000 TESTING OF ADCS BY FREQUENCY-DOMAIN ANALYSIS IN MULTI-TONE MODE Daniel BELEGA

More information

System Fusion for High-Performance Voice Conversion

System Fusion for High-Performance Voice Conversion System Fusion for High-Performance Voice Conversion Xiaohai Tian 1,2, Zhizheng Wu 3, Siu Wa Lee 4, Nguyen Quy Hy 1,2, Minghui Dong 4, and Eng Siong Chng 1,2 1 School of Computer Engineering, Nanyang Technological

More information

Adaptive Harmonic IIR Notch Filter with Varying Notch Bandwidth and Convergence Factor

Adaptive Harmonic IIR Notch Filter with Varying Notch Bandwidth and Convergence Factor Journal of Counication and Coputer (4 484-49 doi:.765/548-779/4.6. D DAVID PUBLISHING Adaptive Haronic IIR Notch Filter with Varying Notch Bandwidth and Convergence Factor Li Tan, Jean Jiang, and Liango

More information

ROBUST UNDERWATER LOCALISATION OF ULTRA LOW FREQUENCY SOURCES IN OPERATIONAL CONTEXT

ROBUST UNDERWATER LOCALISATION OF ULTRA LOW FREQUENCY SOURCES IN OPERATIONAL CONTEXT ROBUST UNDERWATER LOCALISATION OF ULTRA LOW FREQUENCY SOURCES IN OPERATIONAL CONTEXT M. Lopatka a, B. Nicolas a, G. Le Touzé a,b, X. Cristol c, B. Chalindar c, J. Mars a, D. Fattaccioli d a GIPSA-Lab /DIS/

More information

A Preprocessing Method to Increase High Frequency Response of A Parametric Loudspeaker

A Preprocessing Method to Increase High Frequency Response of A Parametric Loudspeaker A Preprocessing Method to Increase High Frequency Response of A Paraetric Loudspeaker Chuang Shi * and Woon-Seng Gan Digital Processing Laboratory School of Electrical and Electronic Engineering Nanyang

More information

FORWARD MASKING THRESHOLD ESTIMATION USING NEURAL NETWORKS AND ITS APPLICATION TO PARALLEL SPEECH ENHANCEMENT

FORWARD MASKING THRESHOLD ESTIMATION USING NEURAL NETWORKS AND ITS APPLICATION TO PARALLEL SPEECH ENHANCEMENT FORWARD MASKING THRESHOLD ESTIMATION USING NEURAL NETWORKS AND ITS APPLICATION TO PARALLEL SPEECH ENHANCEMENT T. S. GUNAWAN 1, O. O. KHALIFA 1, E. AMBIKAIRAJAH 2 1 Electrical and Coputer Engineering Departent,

More information

Alternative Encoding Techniques for Digital Loudspeaker Arrays

Alternative Encoding Techniques for Digital Loudspeaker Arrays Alternative Encoding Techniques for Digital Loudspeaer Arrays Fotios Kontoichos, Nicolas Alexander Tatlas, and John Mourjopoulos Audio and Acoustic Technology Group, Wire Counications Laboratory, Electrical

More information

Performance Analysis of an AMC System with an Iterative V-BLAST Decoding Algorithm

Performance Analysis of an AMC System with an Iterative V-BLAST Decoding Algorithm I. J. Counications, Network and Syste Sciences, 008,, 105-06 Published Online May 008 in SciRes (http://www.srpublishing.org/journal/ijcns/). Perforance Analysis of an AMC Syste with an Iterative V-BLAST

More information

New Adaptive Linear Combination Structure for Tracking/Estimating Phasor and Frequency of Power System

New Adaptive Linear Combination Structure for Tracking/Estimating Phasor and Frequency of Power System 28 Journal of Electrical Engineering & echnology Vol. 5, No., pp. 28~35, 2 New Adaptive Linear Cobination Structure for racking/estiating Phasor and Frequency of Power Syste Choowong-Wattanasakpubal and

More information

Speech Enhancement using Temporal Masking and Fractional Bark Gammatone Filters

Speech Enhancement using Temporal Masking and Fractional Bark Gammatone Filters PAGE 420 Speech Enhanceent using Teporal Masking and Fractional Bark Gaatone Filters Teddy Surya Gunawan, Eliathaby Abikairajah School of Electrical Engineering and Telecounications The University of New

More information

Characterization and Modeling of Underwater Acoustic Communications Channels for Frequency-Shift-Keying Signals

Characterization and Modeling of Underwater Acoustic Communications Channels for Frequency-Shift-Keying Signals Characterization and Modeling of Underwater Acoustic Counications Channels for Frequency-Shift-Keying Signals Wen-Bin Yang and T.C. Yang Naval Research Laboratory Washington, DC 375 USA Abstract In a fading

More information

SECURITY AND BER PERFORMANCE TRADE-OFF IN WIRELESS COMMUNICATION SYSTEMS APPLICATIONS

SECURITY AND BER PERFORMANCE TRADE-OFF IN WIRELESS COMMUNICATION SYSTEMS APPLICATIONS Latin Aerican Applied Research 39:187-192 (2009) SECURITY AND BER PERFORMANCE TRADE-OFF IN WIRELESS COMMUNICATION SYSTEMS APPLICATIONS L. ARNONE, C. GONZÁLEZ, C. GAYOSO, J. CASTIÑEIRA MOREIRA and M. LIBERATORI

More information

Additive Synthesis, Amplitude Modulation and Frequency Modulation

Additive Synthesis, Amplitude Modulation and Frequency Modulation Additive Synthesis, Aplitude Modulation and Frequency Modulation Pro Eduardo R Miranda Varèse-Gastproessor eduardo.iranda@btinternet.co Electronic Music Studio TU Berlin Institute o Counications Research

More information

Non-Linear Weighting Function for Non-stationary Signal Denoising

Non-Linear Weighting Function for Non-stationary Signal Denoising Non-Linear Weighting Function for Non-stationary Signal Denoising Farès Abda, David Brie, Radu Ranta To cite this version: Farès Abda, David Brie, Radu Ranta. Non-Linear Weighting Function for Non-stationary

More information

Department of Mechanical and Aerospace Engineering, Case Western Reserve University, Cleveland, OH, 2

Department of Mechanical and Aerospace Engineering, Case Western Reserve University, Cleveland, OH, 2 Subission International Conference on Acoustics, Speech, and Signal Processing (ICASSP ) PARAMETRIC AND NON-PARAMETRIC SIGNAL ANALYSIS FOR MAPPING AIR FLOW IN THE EAR-CANALTO TONGUE MOVEMENT: A NEW STRATEGY

More information

NONLINEAR WAVELET PACKET DENOISING OF IMPULSIVE VIBRATION SIGNALS NIKOLAOS G. NIKOLAOU, IOANNIS A. ANTONIADIS

NONLINEAR WAVELET PACKET DENOISING OF IMPULSIVE VIBRATION SIGNALS NIKOLAOS G. NIKOLAOU, IOANNIS A. ANTONIADIS NONLINEAR WAVELET PACKET DENOISING OF IMPULSIVE VIBRATION SIGNALS NIKOLAOS G. NIKOLAOU, IOANNIS A. ANTONIADIS Departent of Mechanical Engineering, Machine Design and Control Systes Section National Technical

More information

EQUALIZED ALGORITHM FOR A TRUCK CABIN ACTIVE NOISE CONTROL SYSTEM

EQUALIZED ALGORITHM FOR A TRUCK CABIN ACTIVE NOISE CONTROL SYSTEM EQUALIZED ALGORITHM FOR A TRUCK CABIN ACTIVE NOISE CONTROL SYSTEM Guangrong Zou, Maro Antila, Antti Lanila and Jari Kataja Sart Machines, VTT Technical Research Centre of Finland P.O. Box 00, FI-0 Tapere,

More information

Overlapping Signal Separation in DPX Spectrum Based on EM Algorithm. Chuandang Liu 1, a, Luxi Lu 1, b

Overlapping Signal Separation in DPX Spectrum Based on EM Algorithm. Chuandang Liu 1, a, Luxi Lu 1, b nd International Worshop on Materials Engineering and Coputer Sciences (IWMECS 015) Overlapping Signal Separation in DPX Spectru Based on EM Algorith Chuandang Liu 1, a, Luxi Lu 1, b 1 National Key Laboratory

More information

Comparison Between PLAXIS Output and Neural Network in the Guard Walls

Comparison Between PLAXIS Output and Neural Network in the Guard Walls Coparison Between PLAXIS Output and Neural Network in the Guard Walls Ali Mahbod 1, Abdolghafar Ghorbani Pour 2, Abdollah Tabaroei 3, Sina Mokhtar 2 1- Departent of Civil Engineering, Shahid Bahonar University,

More information

Efficient Non-linear Changed Mel-filter Bank VAD Algorithm

Efficient Non-linear Changed Mel-filter Bank VAD Algorithm Matheatical Models and Methods in Modern Science Efficient on-linear Changed Mel-filter Bank VAD Algorith DAMJA VLAJ, ZDRAVKO KAČIČ, MARKO KOS Faculty of Electrical Engineering and Coputer Science University

More information

High-quality Voice Conversion Using Spectrogram-Based WaveNet Vocoder

High-quality Voice Conversion Using Spectrogram-Based WaveNet Vocoder Interspeech 2018 2-6 September 2018, Hyderabad High-quality Voice Conversion Using Spectrogram-Based WaveNet Vocoder Kuan Chen, Bo Chen, Jiahao Lai, Kai Yu Key Lab. of Shanghai Education Commission for

More information

Iterative Receiver Signal Processing for Joint Mitigation of Transmitter and Receiver Phase Noise in OFDM-Based Cognitive Radio Link

Iterative Receiver Signal Processing for Joint Mitigation of Transmitter and Receiver Phase Noise in OFDM-Based Cognitive Radio Link Iterative Receiver Signal Processing for Joint Mitigation of Transitter and Receiver Phase Noise in OFDM-Based Cognitive Radio Link Ville Syrjälä and Mikko Valkaa Departent of Counications Engineering

More information

Recent Development of the HMM-based Singing Voice Synthesis System Sinsy

Recent Development of the HMM-based Singing Voice Synthesis System Sinsy ISCA Archive http://www.isca-speech.org/archive 7 th ISCAWorkshopon Speech Synthesis(SSW-7) Kyoto, Japan September 22-24, 200 Recent Development of the HMM-based Singing Voice Synthesis System Sinsy Keiichiro

More information

COMBINED FREQUENCY AND SPATIAL DOMAINS POWER DISTRIBUTION FOR MIMO-OFDM TRANSMISSION

COMBINED FREQUENCY AND SPATIAL DOMAINS POWER DISTRIBUTION FOR MIMO-OFDM TRANSMISSION The 8th nnual IEEE International Syposiu on Personal, Indoor and Mobile Radio Counications (PIMRC 07) COMINED FREQUENCY ND SPTIL DOMINS POWER DISTRIUTION FOR MIMO-OFDM TRNSMISSION Wladiir ocquet, Kazunori

More information

Kalman Filtering for NLOS Mitigation and Target Tracking in Indoor Wireless Environment

Kalman Filtering for NLOS Mitigation and Target Tracking in Indoor Wireless Environment 16 Kalan Filtering for NLOS Mitigation and Target Tracking in Indoor Wireless Environent Chin-Der Wann National Sun Yat-Sen University Taiwan 1. Introduction Kalan filter and its nonlinear extension, extended

More information

A HIGH POWER FACTOR THREE-PHASE RECTIFIER BASED ON ADAPTIVE CURRENT INJECTION APPLYING BUCK CONVERTER

A HIGH POWER FACTOR THREE-PHASE RECTIFIER BASED ON ADAPTIVE CURRENT INJECTION APPLYING BUCK CONVERTER 9th International onference on Power Electronics Motion ontrol - EPE-PEM Košice A HIGH POWER FATOR THREE-PHASE RETIFIER BASE ON AAPTIVE URRENT INJETION APPYING BUK ONVERTER Žarko Ja, Predrag Pejović EE

More information

A NEW APPROACH TO UNGROUNDED FAULT LOCATION IN A THREE-PHASE UNDERGROUND DISTRIBUTION SYSTEM USING COMBINED NEURAL NETWORKS & WAVELET ANALYSIS

A NEW APPROACH TO UNGROUNDED FAULT LOCATION IN A THREE-PHASE UNDERGROUND DISTRIBUTION SYSTEM USING COMBINED NEURAL NETWORKS & WAVELET ANALYSIS A NEW APPROACH TO UNGROUNDED FAULT LOCATION IN A THREE-PHASE UNDERGROUND DISTRIBUTION SYSTEM USING COMBINED NEURAL NETWORKS & WAVELET ANALYSIS Jaal Moshtagh University of Bath, UK oshtagh79@yahoo.co Abstract

More information

An orthogonal multi-beam based MIMO scheme. for multi-user wireless systems

An orthogonal multi-beam based MIMO scheme. for multi-user wireless systems An orthogonal ulti-bea based IO schee for ulti-user wireless systes Dong-chan Oh o and Yong-Hwan Lee School of Electrical Engineering and IC, Seoul ational University Kwana P.O. Box 34, Seoul, 151-600,

More information

Fundamental study for measuring microflow with Michelson interferometer enhanced by external random signal

Fundamental study for measuring microflow with Michelson interferometer enhanced by external random signal Bulletin of the JSME Journal of Advanced Mechanical Design, Systes, and Manufacturing Vol.8, No.4, 2014 Fundaental study for easuring icroflow with Michelson interferoeter enhanced by external rando signal

More information

RAKE Receiver. Tommi Heikkilä S Postgraduate Course in Radio Communications, Autumn II.

RAKE Receiver. Tommi Heikkilä S Postgraduate Course in Radio Communications, Autumn II. S-72333 Postgraduate Course in Radio Counications, Autun 2004 1 RAKE Receiver Toi Heikkilä toiheikkila@teliasoneraco Abstract RAKE receiver is used in CDMA-based (Code Division Multiple Access) systes

More information

A New Localization and Tracking Algorithm for Wireless Sensor Networks Based on Internet of Things

A New Localization and Tracking Algorithm for Wireless Sensor Networks Based on Internet of Things Sensors & Transducers 203 by IFSA http://www.sensorsportal.co A New Localization and Tracking Algorith for Wireless Sensor Networks Based on Internet of Things, 2 Zhang Feng, Xue Hui-Feng, 2 Zhang Yong-Heng,

More information

Power Improvement in 64-Bit Full Adder Using Embedded Technologies Er. Arun Gandhi 1, Dr. Rahul Malhotra 2, Er. Kulbhushan Singla 3

Power Improvement in 64-Bit Full Adder Using Embedded Technologies Er. Arun Gandhi 1, Dr. Rahul Malhotra 2, Er. Kulbhushan Singla 3 Power Iproveent in 64-Bit Full Adder Using Ebedded Technologies Er. Arun Gandhi 1, Dr. Rahul Malhotra 2, Er. Kulbhushan Singla 3 1 Departent of ECE, GTBKIET, Chhapianwali Malout, Punjab 2 Director, Principal,

More information

Overlapped frequency-time division multiplexing

Overlapped frequency-time division multiplexing April 29, 16(2): 8 13 www.sciencedirect.co/science/journal/158885 he Journal of China Universities of Posts and elecounications www.buptjournal.cn/xben Overlapped frequency-tie division ultiplexing JIANG

More information

Detection of Faults in Power System Using Wavelet Transform and Independent Component Analysis

Detection of Faults in Power System Using Wavelet Transform and Independent Component Analysis Detection of Faults in Power Syste Using Wavelet Transfor and Independent Coponent Analysis 1 Prakash K. Ray, 2 B. K. Panigrahi, 2 P. K. Rout 1 Dept. of Electrical and Electronics Engineering, IIIT, Bhubaneswar,

More information

Waveform Design and Receive Processing for Nonrecurrent Nonlinear FMCW Radar

Waveform Design and Receive Processing for Nonrecurrent Nonlinear FMCW Radar Wavefor Design and Receive Processing for Nonrecurrent Nonlinear FMCW Radar John Jakabosky and Shannon D. Blunt Radar Systes Lab University of Kansas Lawrence, KS Braha Hied Sensors Directorate Air Force

More information

A soft decision decoding of product BCH and Reed-Müller codes for error control and peak-factor reduction in OFDM

A soft decision decoding of product BCH and Reed-Müller codes for error control and peak-factor reduction in OFDM A soft decision decoding of product BCH and Reed-Müller codes for error control and pea-factor reduction in OFDM Yves LOUET *, Annic LE GLAUNEC ** and Pierre LERAY ** * PhD Student and ** Professors, Departent

More information

Comparing structural airframe maintenance strategies based on probabilistic estimates of the remaining useful service life

Comparing structural airframe maintenance strategies based on probabilistic estimates of the remaining useful service life 22 èe Congrès Français de Mécanique Lyon, 24 au 28 Août 2015 Coparing structural airfrae aintenance strategies based on probabilistic estiates of the reaining useful service life. WAG a, C.GOGU b,.biaud

More information

Multicarrier Interleave-Division Multiple Access Communication in Multipath Channels

Multicarrier Interleave-Division Multiple Access Communication in Multipath Channels Multicarrier Interleave-Division Multiple Access Counication in Multipath Channels Habib ur Rehan *, Muhaad Naee **, Iran Zaa *, Syed Isail Shah ** * Center for Advanced Studies in Engineering (CASE) Islaabad

More information

Track-Before-Detect for an Active Towed Array Sonar

Track-Before-Detect for an Active Towed Array Sonar 17-20 Noveber 2013, Victor Harbor, Australia Track-Before-Detect for an Active Towed Array Sonar Han X. Vu (1,2), Sauel J. Davey (1,2), Fiona K. Fletcher (1), Sanjeev Arulapala (1,2), Richard Elle (1)

More information

Keywords Frequency-domain equalization, antenna diversity, multicode DS-CDMA, frequency-selective fading

Keywords Frequency-domain equalization, antenna diversity, multicode DS-CDMA, frequency-selective fading Joint Frequency-doain Equalization and Antenna Diversity Cobining for Orthogonal Multicode DS-CDMA Signal Transissions in A Frequency-selective Fading Channel Taeshi ITAGAKI *1 and Fuiyui ADACHI *2 Dept.

More information

ELEC2202 Communications Engineering Laboratory Frequency Modulation (FM)

ELEC2202 Communications Engineering Laboratory Frequency Modulation (FM) ELEC Counications Engineering Laboratory ---- Frequency Modulation (FM) 1. Objectives On copletion of this laboratory you will be failiar with: Frequency odulators (FM), Modulation index, Bandwidth, FM

More information

Speech Synthesis using Mel-Cepstral Coefficient Feature

Speech Synthesis using Mel-Cepstral Coefficient Feature Speech Synthesis using Mel-Cepstral Coefficient Feature By Lu Wang Senior Thesis in Electrical Engineering University of Illinois at Urbana-Champaign Advisor: Professor Mark Hasegawa-Johnson May 2018 Abstract

More information

Relation between C/N Ratio and S/N Ratio

Relation between C/N Ratio and S/N Ratio Relation between C/N Ratio and S/N Ratio In our discussion in the past few lectures, we have coputed the C/N ratio of the received signals at different points of the satellite transission syste. The C/N

More information

Relative phase information for detecting human speech and spoofed speech

Relative phase information for detecting human speech and spoofed speech Relative phase information for detecting human speech and spoofed speech Longbiao Wang 1, Yohei Yoshida 1, Yuta Kawakami 1 and Seiichi Nakagawa 2 1 Nagaoka University of Technology, Japan 2 Toyohashi University

More information

Incorporating Performance Degradation in Fault Tolerant Control System Design with Multiple Actuator Failures

Incorporating Performance Degradation in Fault Tolerant Control System Design with Multiple Actuator Failures International Incorporating Journal Perforance of Control, Degradation Autoation, in and ault Systes, Tolerant vol. Control, no. Syste, pp. 7-, Design with June Multiple Actuator ailures 7 Incorporating

More information

Ruohua Zhou, Josh D Reiss ABSTRACT KEYWORDS INTRODUCTION

Ruohua Zhou, Josh D Reiss ABSTRACT KEYWORDS INTRODUCTION Subitted for; Algoriths and Systes, Edited by W. Wang, Published by IGI Global, ISBN-13: 978-1615209194, July, Music Onset Detection Ruohua Zhou, Josh D Reiss Center for Digital Music, Electronic Engineering

More information

Selective Harmonic Elimination for Multilevel Inverters with Unbalanced DC Inputs

Selective Harmonic Elimination for Multilevel Inverters with Unbalanced DC Inputs Selective Haronic Eliination for Multilevel Inverters with Unbalanced DC Inputs Abstract- Selective haronics eliination for the staircase voltage wavefor generated by ultilevel inverters has been widely

More information

Robust Acceleration Control of Electrodynamic Shaker Using µ Synthesis

Robust Acceleration Control of Electrodynamic Shaker Using µ Synthesis Proceedings of the 44th IEEE Conference on Decision and Control, and the European Control Conference 5 Seville, Spain, Deceber -5, 5 WeIC8. Robust Acceleration Control of Electrodynaic Shaker Using µ Synthesis

More information

Emotional Voice Conversion Using Deep Neural Networks with MCC and F0 Features

Emotional Voice Conversion Using Deep Neural Networks with MCC and F0 Features Emotional Voice Conversion Using Deep Neural Networks with MCC and F Features Zhaojie Luo, Tetsuya Takiguchi, Yasuo Ariki Graduate School of System Informatics, Kobe University, Japan 657 851 Email: luozhaojie@me.cs.scitec.kobe-u.ac.jp,

More information

Transmit Power and Bit Allocations for OFDM Systems in a Fading Channel

Transmit Power and Bit Allocations for OFDM Systems in a Fading Channel Transit Power and Bit Allocations for OFD Systes in a Fading Channel Jiho Jang *, Kwang Bok Lee, and Yong-Hwan Lee * Sasung Electronics Co. Ltd., Suwon P.O.Box, Suwon-si, Gyeonggi-do 44-74, Korea School

More information

High Impedance Fault Detection in Electrical Power Feeder by Wavelet and GNN

High Impedance Fault Detection in Electrical Power Feeder by Wavelet and GNN International Journal of Engineering and Applied Sciences (IJEAS) ISSN: 2394-3661, Volue-2, Issue-3, March 2015 High Ipedance Fault Detection in Electrical Power Feeder by Wavelet and GNN Majid Jail, Rajveer

More information

A comparison of LSF and ISP representations for wideband LPC parameter coding using the switched split vector quantiser

A comparison of LSF and ISP representations for wideband LPC parameter coding using the switched split vector quantiser A coparison of LSF and ISP representations for wideband LPC paraeter coding using the switched split vector quantiser Author So, Stephen, Paliwal, Kuldip Published 2005 Conference Title The 8th International

More information

Enhanced Algorithm for MIESM

Enhanced Algorithm for MIESM Recent Patents on Signal Processing, 9,, -7 Enhanced Algorith for MIESM R. Sandanalakshi *, Shahid Mutaz * and Kazi Saidul * Open Access University of Aveiro, Aveiro, Portugal Abstract: The link adaptation

More information

Keywords: Equivalent Instantaneous Inductance, Finite Element, Inrush Current.

Keywords: Equivalent Instantaneous Inductance, Finite Element, Inrush Current. Discriination of Inrush fro Fault Currents in Power Transforers Based on Equivalent Instantaneous Inductance Technique Coupled with Finite Eleent Method Downloaded fro ijeee.iust.ac.ir at 5:47 IRST on

More information

Optical fiber beamformer for processing two independent simultaneous RF beams

Optical fiber beamformer for processing two independent simultaneous RF beams Optical fiber beaforer for processing two independent siultaneous RF beas M. Jaeger, S. Granieri *, and A. Siahakoun Departent of Physics and Optical Engineering, Rose-Hulan Institute of Technology Terre

More information

NINTH INTERNATIONAL CONGRESS ON SOUND AND VIBRATION, ICSV9 PASSIVE CONTROL OF LAUNCH NOISE IN ROCKET PAYLOAD BAYS

NINTH INTERNATIONAL CONGRESS ON SOUND AND VIBRATION, ICSV9 PASSIVE CONTROL OF LAUNCH NOISE IN ROCKET PAYLOAD BAYS first nae & faily nae: Rick Morgans Page nuber: 1 NINTH INTERNATIONAL CONGRESS ON SOUND AND VIBRATION, ICSV9 PASSIVE CONTROL OF LAUNCH NOISE IN ROCKET PAYLOAD BAYS Rick Morgans, Ben Cazzolato, Anthony

More information

Comparison of Fourier Bessel (FB) and EMD-FB Based Noise Removal Techniques for Underwater Acoustic Signals

Comparison of Fourier Bessel (FB) and EMD-FB Based Noise Removal Techniques for Underwater Acoustic Signals Journal of Scientific & Industrial Research Vol. 73, Deceber 214, pp. 756-762 Coparison of Fourier Bessel (FB) and EMD-FB Based Noise Reoval Techniques for Underwater Acoustic Signals V Vijaya Baskar*,

More information

Precise Indoor Localization System For a Mobile Robot Using Auto Calibration Algorithm

Precise Indoor Localization System For a Mobile Robot Using Auto Calibration Algorithm Precise Indoor Localization Syste For a Mobile Robot Using Auto Calibration Algorith Sung-Bu Ki, JangMyung Lee, and I.O. Lee : Pusan National University, http://robotics.ee.pusan.ac.r, : Ninety syste Abstract:

More information

Model Development for the Wideband Vehicle-to-vehicle 2.4 GHz Channel

Model Development for the Wideband Vehicle-to-vehicle 2.4 GHz Channel Model Developent for the Wideband Vehicle-to-vehicle.4 GHz Channel Guillero Acosta and Mary Ann Ingra School of ECE, Georgia Institute of Technology, Atlanta, GA 333-5, USA gte437k@ail.gatech.edu, ai@ece.gatech.edu

More information

Research Article Novel Design for Reduction of Transformer Size in Dynamic Voltage Restorer

Research Article Novel Design for Reduction of Transformer Size in Dynamic Voltage Restorer Research Journal of Applied Sciences, Engineering and Technology 8(19): 057-063, 014 DOI:10.1906/rjaset.8.1198 ISSN: 040-7459; e-issn: 040-7467 014 Maxwell Scientific Publication Corp. Subitted: April

More information

Secondary-side-only Simultaneous Power and Efficiency Control in Dynamic Wireless Power Transfer System

Secondary-side-only Simultaneous Power and Efficiency Control in Dynamic Wireless Power Transfer System 069060 Secondary-side-only Siultaneous Power and Efficiency Control in Dynaic Wireless Power Transfer Syste 6 Giorgio ovison ) Daita Kobayashi ) Takehiro Iura ) Yoichi Hori ) ) The University of Tokyo,

More information

Radar Imaging of Non-Uniformly Rotating Targets via a Novel Approach for Multi-Component AM-FM Signal Parameter Estimation

Radar Imaging of Non-Uniformly Rotating Targets via a Novel Approach for Multi-Component AM-FM Signal Parameter Estimation Sensors 5, 5, 695-693; doi:.339/s53695 Article OPEN ACCESS sensors ISSN 44-8 www.dpi.co/journal/sensors Radar Iaging of Non-Uniforly Rotating Targets via a Novel Approach for Multi-Coponent AM-FM Signal

More information

Emotional Voice Conversion Using Neural Networks with Different Temporal Scales of F0 based on Wavelet Transform

Emotional Voice Conversion Using Neural Networks with Different Temporal Scales of F0 based on Wavelet Transform 9th ISCA Speech Synthesis Workshop 13-15 Sep 216, Sunnyvale, USA Emotional Voice Conversion Using Neural Networks with Different Temporal Scales of F based on Wavelet Transform Zhaojie Luo 1, Jinhui Chen

More information

POWER QUALITY ASSESSMENT USING TWO STAGE NONLINEAR ESTIMATION NUMERICAL ALGORITHM

POWER QUALITY ASSESSMENT USING TWO STAGE NONLINEAR ESTIMATION NUMERICAL ALGORITHM POWER QUALITY ASSESSENT USING TWO STAGE NONLINEAR ESTIATION NUERICAL ALGORITH Vladiir Terzia ABB Gerany vadiir.terzia@de.abb.co Vladiir Stanoevic EPS Yugoslavia vla_sta@hotail.co artin axiini ABB Gerany

More information

ABNORMAL SOUND EVENT DETECTION USING TEMPORAL TRAJECTORIES MIXTURES

ABNORMAL SOUND EVENT DETECTION USING TEMPORAL TRAJECTORIES MIXTURES ABNORMAL SOUND EVENT DETECTION USING TEMPORAL TRAJECTORIES MIXTURES Debalya Chakrabarty Mounya Elhilali Departent of Electrical and Coputer Engineering Johns Hopkins University, Baltiore, MD, USA. ABSTRACT

More information

Improved Codebook-based Speech Enhancement based on MBE Model

Improved Codebook-based Speech Enhancement based on MBE Model INTERSPEECH 7 August 4, 7, Stochol, Sweden Iproved Codeboo-based Speech Enhanceent based on MBE Model Qizheng Huang, Changchun Bao, Xianun Wang Speech Audio Signal Processing Laborator, Facult of Inforation

More information

A Pulse Model in Log-domain for a Uniform Synthesizer

A Pulse Model in Log-domain for a Uniform Synthesizer G. Degottex, P. Lanchantin, M. Gales A Pulse Model in Log-domain for a Uniform Synthesizer Gilles Degottex 1, Pierre Lanchantin 1, Mark Gales 1 1 Cambridge University Engineering Department, Cambridge,

More information

Waveform generation based on signal reshaping. statistical parametric speech synthesis

Waveform generation based on signal reshaping. statistical parametric speech synthesis INTERSPEECH 2016 September 8 12, 2016, San Francisco, USA Waveform generation based on signal reshaping for statistical parametric speech synthesis Felipe Espic, Cassia Valentini-Botinhao, Zhizheng Wu,

More information

Using Adaptive Modulation in a LEO Satellite Communication System

Using Adaptive Modulation in a LEO Satellite Communication System Proceedings of the 11th WSEAS International Conference on COMMUNICATIONS, Agios Nikolaos, Crete Island, Greece, July 26-28, 27 255 Using Adaptive Modulation in a LEO Satellite Counication Syste L. HADJ

More information

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists 3,900 116,000 10M Open access books available International authors and editors Downloads Our authors

More information

A Novel Control Scheme to Reduce Storage Capacitor of Flyback PFC Converter

A Novel Control Scheme to Reduce Storage Capacitor of Flyback PFC Converter International Journal of Electronics and Electrical Engineering Vol. 4, No., April 6 A Novel Control Schee to Reduce Storage Capacitor of Flyback PFC Converter Boyang Chen and Lei Li College of Autoation,

More information

Design and Implementation of Block Based Transpose Form FIR Filter

Design and Implementation of Block Based Transpose Form FIR Filter Design and Ipleentation of Bloc Based Transpose For FIR Filter O. Venata rishna 1, Dr. C. Venata Narasihulu 2, Dr.. Satya Prasad 3 1 (ECE, CVR College of Engineering, Hyderabad, India) 2 (ECE, Geethanjali

More information

Performance Analysis of Atmospheric Field Conjugation Adaptive Arrays

Performance Analysis of Atmospheric Field Conjugation Adaptive Arrays Perforance Analysis of Atospheric Field Conjugation Adaptive Arrays Aniceto Belonte* a, Joseph M. Kahn b a Technical Univ. of Catalonia, Dept. of Signal Theory and Coun., 08034 Barcelona, Spain; b Stanford

More information

Fiber Bragg grating based four-bit optical beamformer

Fiber Bragg grating based four-bit optical beamformer Fiber Bragg grating based four-bit optical beaforer Sean Durrant a, Sergio Granieri a, Azad Siahakoun a, Bruce Black b a Departent of Physics and Optical Engineering b Departent of Electrical and Coputer

More information

Interference Management in LTE Femtocell Systems Using Fractional Frequency Reuse

Interference Management in LTE Femtocell Systems Using Fractional Frequency Reuse Interference Manageent in LTE Fetocell Systes Using Fractional Frequency Reuse Poongup Lee and Jitae Shin School of Inforation and Counication Engineering Sungyunwan University, Suwon, 440-746, Korea {poongup,

More information

REPORT ITU-R SA Telecommunication characteristics and requirements for space VLBI systems

REPORT ITU-R SA Telecommunication characteristics and requirements for space VLBI systems Rep. ITU-R SA.2132 1 REPORT ITU-R SA.2132 Telecounication characteristics and requireents for space VLBI systes (2008) This Report describes the characteristics of the space VLBI systes. These characteristics

More information

Intermediate-Node Initiated Reservation (IIR): A New Signaling Scheme for Wavelength-Routed Networks with Sparse Conversion

Intermediate-Node Initiated Reservation (IIR): A New Signaling Scheme for Wavelength-Routed Networks with Sparse Conversion Interediate-Node Initiated Reservation IIR): A New Signaling Schee for Wavelength-Routed Networks with Sparse Conversion Kejie Lu, Jason P. Jue, Tiucin Ozugur, Gaoxi Xiao, and Irich Chlatac The Center

More information

UNIT - II CONTROLLED RECTIFIERS (Line Commutated AC to DC converters) Line Commutated Converter

UNIT - II CONTROLLED RECTIFIERS (Line Commutated AC to DC converters) Line Commutated Converter UNIT - II CONTROLLED RECTIFIERS (Line Coutated AC to DC converters) INTRODUCTION TO CONTROLLED RECTIFIERS Controlled rectifiers are line coutated ac to power converters which are used to convert a fixed

More information

UWB System for Time-Domain Near-Field Antenna Measurement

UWB System for Time-Domain Near-Field Antenna Measurement UWB Syste for Tie-Doain Near-Field Antenna Measureent B. Levitas #, M. Drozdov #, I. Naidionova #, S. Jefreov #, S. Malyshev *2, A. Chizh *3 www.geozondas.co # Geozondas Ltd., 6, Shevchenko Str., LT-3,

More information

Optimal Modulation Index of the Mach-Zehnder Modulator in a Coherent Optical OFDM System Employing Digital Predistortion

Optimal Modulation Index of the Mach-Zehnder Modulator in a Coherent Optical OFDM System Employing Digital Predistortion Optial Modulation Index of the Mach-Zehnder Modulator in a Coherent Optical OFDM yste Eploying Digital redistortion David Rörich, Xiaojie Wang, Michael Bernhard, Joachi peidel Universität tuttgart, Institut

More information

Keywords: International Mobile Telecommunication (IMT) Systems, evaluating the usage of frequency bands, evaluation indicators

Keywords: International Mobile Telecommunication (IMT) Systems, evaluating the usage of frequency bands, evaluation indicators 2nd International Conference on Advances in Mechanical Engineering and Industrial Inforatics (AMEII 206) Entropy Method based Evaluation for Spectru Usage Efficiency of International Mobile Telecounication

More information

EFFECTS OF MASKING ANGLE AND MULTIPATH ON GALILEO PERFORMANCES IN DIFFERENT ENVIRONMENTS

EFFECTS OF MASKING ANGLE AND MULTIPATH ON GALILEO PERFORMANCES IN DIFFERENT ENVIRONMENTS 1 EFFECTS OF MASKING ANGLE AND MULTIPATH ON GALILEO PERFORMANCES IN DIFFERENT ENVIRONMENTS M. Malicorne*, M. Bousquet**, V. Calettes*** SUPAERO, 1 avenue Edouard Belin BP 43, 3155 Toulouse Cedex, France.

More information

Evaluation of clipping-noise suppression of stationary-noisy speech based on spectral compensation

Evaluation of clipping-noise suppression of stationary-noisy speech based on spectral compensation Evaluation of clipping-noise suppression of stationary-noisy speech based on spectral compensation Takahiro FUKUMORI ; Makoto HAYAKAWA ; Masato NAKAYAMA 2 ; Takanobu NISHIURA 2 ; Yoichi YAMASHITA 2 Graduate

More information

A Novel NLOS Mitigation Approach for Wireless Positioning System

A Novel NLOS Mitigation Approach for Wireless Positioning System 2 3rd International Conference on Coputer and Electrical Engineering (ICCEE 2) IPCSIT vol. 53 (22) (22) IACSIT Press, Singapore DOI:.7763/IPCSIT.22.V53.No..54 A Novel NLOS Mitigation Approach for Wireless

More information

Cross-correlation tracking for Maximum Length Sequence based acoustic localisation

Cross-correlation tracking for Maximum Length Sequence based acoustic localisation Cross-correlation tracking for Maxiu Length Sequence based acoustic localisation Navinda Kottege Research School of Inforation Sciences and Engineering The Australian National University, ACT, Australia

More information

Evaluation of Steady-State and Dynamic Performance of a Synchronized Phasor Measurement Unit

Evaluation of Steady-State and Dynamic Performance of a Synchronized Phasor Measurement Unit 01 IEEE Electrical Power and Energy Conference Evaluation of Steady-State and Dynaic Perforance of a Synchronized Phasor Measureent Unit Dinesh Rangana Gurusinghe, Graduate Student Meber, IEEE, Athula

More information

Application of velvet noise and its variants for synthetic speech and singing (Revised and extended version with appendices)

Application of velvet noise and its variants for synthetic speech and singing (Revised and extended version with appendices) Application of velvet noise and its variants for synthetic speech and singing (Revised and extended version with appendices) (Compiled: 1:3 A.M., February, 18) Hideki Kawahara 1,a) Abstract: The Velvet

More information

LETTER Adaptive Multi-Stage Parallel Interference Cancellation Receiver for Multi-Rate DS-CDMA System

LETTER Adaptive Multi-Stage Parallel Interference Cancellation Receiver for Multi-Rate DS-CDMA System IEICE TRANS. COMMUN., VOL.E87 B, NO.8 AUGUST 2004 2401 LETTER Adaptive Multi-Stage Parallel Interference Cancellation Receiver for Multi-Rate DS-CDMA Syste Seung Hee HAN a), Student Meber and Jae Hong

More information

OTC Statistics of High- and Low-Frequency Motions of a Moored Tanker. sensitive to lateral loading such as the SAL5 and

OTC Statistics of High- and Low-Frequency Motions of a Moored Tanker. sensitive to lateral loading such as the SAL5 and OTC 61 78 Statistics of High- and Low-Frequency Motions of a Moored Tanker by J.A..Pinkster, Maritie Research Inst. Netherlands Copyright 1989, Offshore Technology Conference This paper was presented at

More information