Wideband Speech Encryption Based Arnold Cat Map for AMR-WB G Codec

Similar documents
Proceedings of Meetings on Acoustics

An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec

Transcoding of Narrowband to Wideband Speech

3GPP TS V5.0.0 ( )

Flexible and Scalable Transform-Domain Codebook for High Bit Rate CELP Coders

Transcoding free voice transmission in GSM and UMTS networks

Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP

Wideband Speech Coding & Its Application

ON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY

A Fast Image Encryption Scheme based on Chaotic Standard Map

CHAPTER 7 ROLE OF ADAPTIVE MULTIRATE ON WCDMA CAPACITY ENHANCEMENT

Chapter IV THEORY OF CELP CODING

Quality comparison of wideband coders including tandeming and transcoding

ETSI TS V ( )

Perceptual wideband speech and audio quality measurement. Dr Antony Rix Psytechnics Limited

Final draft ETSI EN V1.2.0 ( )

An Introduction to Compressive Sensing and its Applications

NOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC

Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder

Synchronous Overlap and Add of Spectra for Enhancement of Excitation in Artificial Bandwidth Extension of Speech

A Novel Color Image Cryptosystem Using Chaotic Cat and Chebyshev Map

MODIFIED DCT BASED SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS

SOURCE CONTROLLED CHANNEL DECODING FOR GSM-AMR SPEECH TRANSMISSION WITH VOICE ACTIVITY DETECTION (VAD) C. Murali Mohan R. Aravind

Overview of Code Excited Linear Predictive Coder

22. Konferenz Elektronische Sprachsignalverarbeitung (ESSV), September 2011, Aachen, Germany (TuDPress, ISBN )

EFFICIENT SUPER-WIDE BANDWIDTH EXTENSION USING LINEAR PREDICTION BASED ANALYSIS-SYNTHESIS. Pramod Bachhav, Massimiliano Todisco and Nicholas Evans

IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM

Ninad Bhatt Yogeshwar Kosta

Super-Wideband Fine Spectrum Quantization for Low-rate High-Quality MDCT Coding Mode of The 3GPP EVS Codec

Chaotically Modulated RSA/SHIFT Secured IFFT/FFT Based OFDM Wireless System

ENHANCED TIME DOMAIN PACKET LOSS CONCEALMENT IN SWITCHED SPEECH/AUDIO CODEC.

Spatial Audio Transmission Technology for Multi-point Mobile Voice Chat

Information. LSP (Line Spectrum Pair): Essential Technology for High-compression Speech Coding. Takehiro Moriya. Abstract

EUROPEAN pr ETS TELECOMMUNICATION November 1996 STANDARD

Improving Sound Quality by Bandwidth Extension

Speech Quality Evaluation of Artificial Bandwidth Extension: Comparing Subjective Judgments and Instrumental Predictions

Data Transmission at 16.8kb/s Over 32kb/s ADPCM Channel

Perceptual Speech Enhancement Using Multi_band Spectral Attenuation Filter

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

EUROPEAN pr ETS TELECOMMUNICATION March 1996 STANDARD

Open Access Improved Frame Error Concealment Algorithm Based on Transform- Domain Mobile Audio Codec

ETSI TS V8.0.0 ( ) Technical Specification

Adaptive time scale modification of speech for graceful degrading voice quality in congested networks

Chaos based Communication System Using Reed Solomon (RS) Coding for AWGN & Rayleigh Fading Channels

Communications Theory and Engineering

Call Quality Measurement for Telecommunication Network and Proposition of Tariff Rates

An Improved Version of Algebraic Codebook Search Algorithm for an AMR-WB Speech Coder

Non-Uniform Speech/Audio Coding Exploiting Predictability of Temporal Evolution of Spectral Envelopes

Speech Coding using Linear Prediction

High-Capacity Reversible Data Hiding in Encrypted Images using MSB Prediction

SNR Scalability, Multiple Descriptions, and Perceptual Distortion Measures

Digital Audio Watermarking With Discrete Wavelet Transform Using Fibonacci Numbers

6/29 Vol.7, No.2, February 2012

Artificial Bandwidth Extension Using Deep Neural Networks for Spectral Envelope Estimation

Scalable Speech Coding for IP Networks

ETSI EN V7.0.2 ( )

Cellular systems & GSM Wireless Systems, a.a. 2014/2015

Automatic Speech Recognition (ASR) Over VoIP and Wireless Networks

ITU-T P.863. Amendment 1 (11/2011)

Audio Signal Compression using DCT and LPC Techniques

A NOVEL FREQUENCY-MODULATED DIFFERENTIAL CHAOS SHIFT KEYING MODULATION SCHEME BASED ON PHASE SEPARATION

techniques are means of reducing the bandwidth needed to represent the human voice. In mobile

Blind Dereverberation of Single-Channel Speech Signals Using an ICA-Based Generative Model

Copyright S. K. Mitra

Voice Activity Detection for Speech Enhancement Applications

Technical Specification Group Services and System Aspects Meeting #7, Madrid, Spain, March 15-17, 2000 Agenda Item: 5.4.3

Keywords-component: Secure Data Transmission, GSM voice channel, lower bound on Capacity, Adaptive Multi Rate

Error Protection: Detection and Correction

Dynamic Collage Steganography on Images

sensors ISSN

A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder

MATHEMATICS IN COMMUNICATIONS: INTRODUCTION TO CODING. A Public Lecture to the Uganda Mathematics Society

Efficient Statistics-Based Algebraic Codebook Search Algorithms Derived from RCM for an ACELP Speech Coder

Acoustics of wideband terminals: a 3GPP perspective

Chapter 3 LEAST SIGNIFICANT BIT STEGANOGRAPHY TECHNIQUE FOR HIDING COMPRESSED ENCRYPTED DATA USING VARIOUS FILE FORMATS

Keywords Arnold transforms; chaotic logistic mapping; discrete wavelet transform; encryption; mean error.

Journal of American Science 2015;11(7)

Study of Turbo Coded OFDM over Fading Channel

Accurate Delay Measurement of Coded Speech Signals with Subsample Resolution

Enhancement of Speech Signal Based on Improved Minima Controlled Recursive Averaging and Independent Component Analysis

IMPROVED CODING OF TONAL COMPONENTS IN MPEG-4 AAC WITH SBR

SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Methods for objective and subjective assessment of quality

APPLICATIONS OF DSP OBJECTIVES

Cryptography CS 555. Topic 20: Other Public Key Encryption Schemes. CS555 Topic 20 1

Bandwidth Extension of Speech Signals: A Catalyst for the Introduction of Wideband Speech Coding?

Speech Coding in the Frequency Domain

INTERNATIONAL TELECOMMUNICATION UNION

Improved signal analysis and time-synchronous reconstruction in waveform interpolation coding

The Channel Vocoder (analyzer):

Image Encryption Based on New One-Dimensional Chaotic Map

New binary image encryption algorithm based on combination of confusion and diffusion

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Pattern Recognition. Part 6: Bandwidth Extension. Gerhard Schmidt

Speech Signal Encryption Using Chaotic Symmetric Cryptography

RECOMMENDATION ITU-R BS User requirements for audio coding systems for digital broadcasting

A NEW FEATURE VECTOR FOR HMM-BASED PACKET LOSS CONCEALMENT

Joint effect of channel coding and AMR compression on speech quality

Different Approaches of Spectral Subtraction Method for Speech Enhancement

Golomb-Rice Coding Optimized via LPC for Frequency Domain Audio Coder

3GPP TS V8.0.0 ( )

Transcription:

Wideband Speech Encryption Based Arnold Cat Map for AMR-WB G.722.2 Codec Fatiha Merazka Telecommunications Department USTHB, University of science & technology Houari Boumediene P.O.Box 32 El Alia 6 Bab Ezzouar, Algiers Algeria fmerazka@usthb.dz Abstract. Speech encryption is becoming more and more essential as the increasing importance of multimedia applications and mobile telecommunications. However, multimedia encryption and decryption are often computationally demanding and unpractical for power-constrained devices and narrow bandwidth environments. In this paper an encryption scheme for AM-WB ITU-T G. 722.2 speech based Arnold cat Map is presented analyzed and evaluated using objective and subjective tests for the 8 modes of the AMR-WB ITU-T G.722.2. Simulation results show that AMR-WB ITU-T G.722.2 based Arnold cat Map encryption is very efficient since the encrypted speech is similar to a white noise. The perceptual evaluation of speech quality (PESQ) and enhanced modified bark spectral distortion (EMBSD) tests for speech speech extracted from TIMIT database confirm the efficiency of the presented scheme. Keywords: Speech encryption, ITU-T G.722.2, Arnold cat map, EMBSD, PESQ. Introduction Nowadays, interactive multimedia services such as Voice over IP (VoIP) and video conferencing have changed from promising new applications to reality. The increasing demand for multimedia services in the Internet has produced a number of commercial. However, content protection and customer privacy are becoming more and more significant. Since the encryption can effectively prevent eavesdropping, its use is widely advocated in many areas [-6]. Traditional encryption techniques such as RSA and DES are not efficient for speech and multimedia data in general, because of the large data size, high correlation among data and high redundancy. In recent years, there is a large amount of work utilizing chaos in various algorithms and systems for communication, cryptography and watermarking [7]. Encryption by chaotic maps is generally used in image processing due to its random-like behavior and its sensitivity to initial conditions in addition to its high confusion property [8]. In this paper, Arnold Cat Map encryption scheme for the AMR-WB G.722.2 standard [9] is presented. A. Elmoataz et al. (Eds.): ICISP 24, LNCS 859, pp. 658 664, 24. Springer International Publishing Switzerland 24

Wideband Speech Encryption based Arnold Cat Map for AMR-WB G.722.2 Codec 659 The rest of the paper is organized as follows. In Section 2, AMR-WB ITU-T G.722.2 is introduced. Section 3 presents the Arnold Cat Map encryption scheme for AMR-WB ITU-T G.722.2. Section 4 analyzes the scheme s security, and gives contrast experiments to evaluate our scheme s performance. Finally, some conclusions are drawn, in Section 6. 2 Overview of the Standard ITU-T G.722.2 The standard ITU-T G722.2 is a coder / decoder for adaptation to high-quality multirate wideband (AMR-WB), which are primarily intended to process the speech signals of a bandwidth of 7 khz. Adaptation AMR-WB operates at a variety of bit rates between 6.6 kbit/s and 23.85 kbit/s. The bit rate may be changed at any frame boundary of 2 ms. The AMR -WB G.722.2 codec is the same as the 3GPP AMR-WB codec. The corresponding 3GPP specifications are TS 26.9 standards to the speech codec [] and TS 26.94 for the voice activity detector []. The AMR- WB G.722.2 codec consists of nine source codecs with bit rates of 23.85, 23.5, 9.85, 8.25, 5.85, 4.25, 2.65, 8.85 and 6.6 kbit/s. In practice, these rates are represented by modes 8, 7, 5, 4, 3, 2, and respectively. This codec is based on Code Excited Linear Prediction (CELP)[2]. AMR-WB encoder uses the ACELP (Algebraic CELP) technology that relies on a system modeling speech production. It also has mechanisms discontinuous transmission (DTX) to optimize the radio resource consumption by not transmitting signal during periods of non-voice activity. To do this, the encoder, a voice activity detector (VAD for "Voice Activity Detection ) discriminates the word of those moments of silence or noise. At the decoder, a comfort noise generator (CNG) regenerates the closest possible to the original sound signal. At the decoder, the correction devices corrupted frames can reduce the effect of errors occurring on the radio channel. The decoder is informed of the status of each frame (fully preserved, partially corrupted, completely corrupted) using information provided by the network layer. 3 The Adopted Encryption Method Cat map, introduced by Arnold and Avez [3], is a well-known chaotic map which is generally used in chaos image encryption, watermarking and public-key cryptosystem. The Arnold cat map is often employed as it possesses nice ergodic and mixing properties. This map is an area-preserving chaotic map having the form. x y = x 2 y mod where x, y [ ] and det =. In addition, it can be generalized and 2 discretized by using control parameters, p and q, as follows: () ()

66 F. Merazka x y where, y {,, N } = q p x pq + y mod ( N ) x and p, q are positive integers. Thus, the confusion key of cat map is composed of the parameters p and q. To perform the encryption, the parameters p and q are elected randomly between and 256. In fact, the Cat Map performs a permutation. The coordinates ( x, y ) of a given bit in the original signal become ( x ( n + ), y( n + ) ) in the encrypted signal according to eq. 2. The matrix obtained from encryption equation is transformed into encrypted vector to generate the encrypted message. Decryption is done by the same p equation except that the matrix is replaced by its inverse. q pq + (2) 4 Experiment Results In this section we present the results. Several experiments are carried out to test the encryption efficiency of the presented wideband speech cryptosystem. The quality of both the encrypted and reconstructed signals is assessed for the standard AMR-WB G.722.2. Simulations and results of our implemented method are given. The speech used is extracted from TIMIT database [4]. The speech file was encoded using AMR-WB G.722.2 CS-ACELP. The resulting bitstreams were encrypted by Arnold Cat Map encryption scheme. Its performance was evaluated: ) by signal inspection, in both time and frequency domains; 2) by means of objective distortion measures. We have conducted our simulations in 9 modes which correspond to 6.6, 8.85, 2.65, 4.25, 5.85, 8.25, 9.85, 23.5 and 23.85 kbit/s for the AMR-WB ITU-T G.722.2. The original speech and its spectrogram are given in Fig. and respectively for comparison later with the encrypted and reconstructed speech..3.2 Original speech Spectrogram of original speech. -. -.2.5 -.3 -.4.5.5 2 x 5 2 3 4 5 6 7 x 4 Fig.. Original speech, Spectrogram of original speech Fig. 2, 3 and 4 present the encrypted speech and their spectrograms in mode, 4, and 8 respectively. We have selected three modes for representing simulation results (mode, 4 and 8) because of space.

Wideband Speech Encryption based Arnold Cat Map for AMR-WB G.722.2 Codec 66 Arnol Cat Map Encryption mode 5 Spectrogram Arnold Cat Map encryption mode.5 4 -.5-2 4 6 8 x 4 3 2 2 3 4 5 6 7 8 Spectrogram of reconstructed speech mode.5 2 3 4 5 6 7 x 4 Fig. 2. Original speech encrypted with Arnold Cat Map, Spectrogram of encrypted with Arnold Cat Map, (c) Spectrogram of reconstructed speech. (mode AMR-WB ITU-T G.722.2) (c) Arnold Cat Map encryption mode 4 Spectrogram Arnold Cat Map encryption mode 4 5 4 -.5.5 2 x 5 3 2 2 4 6 8 2 4 Spectrogram of reconstructed speech mode 4.8.6.4.2 2 3 4 5 6 7 x 4 (c) Fig. 3. Original speech encrypted with Arnold Cat Map, Spectrogram of encrypted with Arnold Cat Map, (c) Spectrogram of reconstructed speech. (mode 4 AMR-WB ITU-T G.722.2) We can see from these figures that encrypted speech signals obviously are similar to the white noise which indicates that no residual intelligibility can be useful for eavesdroppers at the communication channel.

662 F. Merazka.5 -.5 Arnold Cat Map encryption mode 8 Spectrogram Arnold Cat Map encryption mode 8 4 2 -.5.5 2 2.5 x 5 5 5 2 Spectrogram of reconstructed speech mode 8.8.6.4.2 (c) Fig. 4. Original speech encrypted with Arnold Cat Map, Spectrogram of encrypted with Arnold Cat Map, (c) Spectrogram of reconstructed speech. (mode 8 AMR-WB ITU-T G.722.2) Comparing Fig with Figs. 2 (c), 3 (c) and 4 (c), we can see clearly that the reconstructed speech signals are the same as the original one with hardly noticeable differences. PESQ is an objective measurement tool, defined according to [5], that predicts the results of subjective listening tests on narrowband telephony systems and speech codecs. This quality measure method uses a perceptual model to compare the original, unprocessed signal, with the degraded or processed signal. The resulting quality score, though an objective measure, is more closely related to the subjective Mean Opinion Score (MOS) defined according to [6]. We also performed EMBSD (Enhanced Modified Bark Spectral Distortion) which was developed by Temple University in USA [7]. The obtained results from tests with EMBSD and PESQ are given in Tables and 2 respectively. mode 2 3 4 5 6 7 x 4 Table. EMBSD Tests Original speech Without encryption Reconstructed speech 3.27 3.27 2.546 2.546 2 2.632 2.632 3 2.737 2.737 4 2.88 2.88 5 2.768 2.768 6 2.679 2.679 7 2.8 2.8 8 2.95 2.95

Wideband Speech Encryption based Arnold Cat Map for AMR-WB G.722.2 Codec 663 mode Table 2. PESQ Tests Original speech without encryption Reconstructed speech 2.79 2.79 3.28 3.28 2 3.248 3.248 3 3.39 3.39 4 3.356 3.356 5 3.45 3.45 6 3.433 3.433 7 3.59 3.59 8 3.487 3.487 Results from Tables and 2 confirm the efficiency of the chaotic cat map based algorithm for the standard AMR-WB ITU-T G.722.2 since the same values are obtained with and without encryption with the Arnold cat map algorithm. 5 Conclusion In this paper, a wideband speech encryption based Arnold Cat Map algorithm for AMR-WD ITU-T G.722.2 is presented. From our results, it is obvious that even with insignificant differences in speech quality; the presented method performs well with the standard AMR-WD ITU-T G.722.2 for the encryption and reconstruction of speech. References. Beker, H., Piper, F.C.: Secure Speech Communications. Academic Press, London (985) 2. Lian, S.: Multimedia Content Encryption: Techniques and Applications. CRC Press, Boca Raton (28) 3. Gemmill, J., Srinivasan, A., Lynn, J., Chatterjee, S., Tulu, B., Abhichandani, T.: Middleware for Scalable Real-Time Multimedia Cyberinfrastructure. Journal of Internet Technology 5(4), 99 4 (24) 4. Jorstad, I., Dustdar, S., Do, T.V.: An Analysis of Cur-rent Mobile Services and Enabling Technologies. Int. J. Ad Hoc and Ubiquitous Computing (/2), 92 2 (25) 5. Kamel, I., Juma, H.: Simplified Watermarking Scheme for Sensor Networks. International Journal of Internet Protocol Technology 5(/2), (2) 6. Sobhi Afshar, A.A., Eghlidos, T., Aref, M.R.: Efficient Secure Channel Coding Based on Quasi-Cyclic Low-Density-Parity-Check Codes. IET Communications 3(2), 279 292 (29) 7. Chen, F., Wong, K.-W., Liao, X., Xiang, T.: Period Distribution of the Generalized Discrete Arnold Cat Map for N= 2 e IEEE Transactions on Information Theory. IEEE Transactions on Information Theory 59(5), 3249 (23)

664 F. Merazka 8. Fridrich, J.: Symmetric ciphers based on two-dimensional chaotic maps. International Journal of Bifurcation and Chaos 8(6), 259 284 (998) 9. 3GPP TS 26.7: AMR Wideband Speech Codec; General description. 3GPP TS 26.9 Adaptive Multi-Rate wideband speech transcoding, 3GPP Technical Specification. 3GPP TS 26.94: AMR Wideband speech codec; Voice Activity Detector (VAD), 3GPP Technical Specification 2. Schroeder, M.R.: B.S.: Code-Excited Linear Prediction (CELP): High quality speech very low bit rates. In: Proc. ICASSP, pp. 937 94 (985) 3. Arnold, E.A., Avez, A.: Ergodic Problems of Classical Mechanics Benjamin, W. A., New Jersey. ch., p. 6 (968) 4. NIST,Timit Speech Corpus, NIST (99) 5. ITU-T Recommendation P.862, Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs. International Telecommunication Union, Geneva (2) 6. ITU-T Recommendation P.8, Methods for subjective determination of transmission quality. International Telecommunication Union, Geneva (996) 7. Yang, W.: Enhanced Modified Bark Spectral Distortion (EMBSD): An Objective Speech Quality Measurement Based on Audible Distortion and Cognition Model, PhD Dissertation. Temple University, USA (999)