Telephone Speech Quality Standards. for. Wideband IP Phone Terminals (handsets) CES-Q March 30, 2009

Similar documents
3GPP TS V ( )

INTERIM EUROPEAN I-ETS TELECOMMUNICATION December 1994 STANDARD

SERIES P: TERMINALS AND SUBJECTIVE AND OBJECTIVE ASSESSMENT METHODS Voice terminal characteristics

EUROPEAN pr I-ETS TELECOMMUNICATION June 1996 STANDARD

3GPP TS V4.2.0 ( )

3GPP TS V ( )

3GPP TS V ( )

ETSI TS V5.2.0 ( )

ETSI TS V ( )

TECHNICAL TBR 8 BASIS for October 1998 REGULATION

ETSI TS V ( )

ETSI TS V ( )

INTERIM EUROPEAN I-ETS TELECOMMUNICATION January 1996 STANDARD

Acoustics of wideband terminals: a 3GPP perspective

INTERNATIONAL TELECOMMUNICATION UNION

3GPP TS V ( )

Conversational Speech Quality - The Dominating Parameters in VoIP Systems

3GPP TS V ( )

Part V: Requirements and Test Methods for Magnetic Output From Handset Telephones for Hearing Aid Coupling and for Receive Volume Control

-/$5,!4%$./)3% 2%&%2%.#% 5.)4 -.25

3GPP TS V ( )

ITU-T P.863. Amendment 1 (11/2011)

Part V: Requirements and Test Methods for Magnetic Output from Handset Telephones for Hearing Aid Coupling and for Receive Volume Control

ETSI EN V1.8.1 ( )

ETSI ES V1.3.1 ( ) ETSI Standard

SPECIFICATION of the GENERIC TRANSMISSION INTERFACE. Interconnection Specification 1. Page 1 of 9

ETSI ES V1.2.1 ( )

CTIA Speech Performance Recommendations

ETSI ES V1.1.1 ( ) ETSI Standard

SERIES P: TERMINALS AND SUBJECTIVE AND OBJECTIVE ASSESSMENT METHODS Voice terminal characteristics

ETSI TS V1.2.1 ( )

ETSI TS V1.1.1 ( )

ETSI EN V2.2.1 ( ) European Standard (Telecommunications series)

Factors impacting the speech quality in VoIP scenarios and how to assess them

ETSI TS V ( )

ETSI TS V (201

SERIES K: PROTECTION AGAINST INTERFERENCE

European Standard Digital Enhanced Cordless Telecommunications (DECT); Common Interface (CI); Part 8: Speech and audio coding and transmission

Meetings Attended (C= Contribution, A= Attended Meeting) Organization Represented. Ken Macdonald (Chair) Roger Britt (Editor) Ron Magnuson

ing. Vasile Petrică, Drd. ing. Sorin Soviany*

Practical Limitations of Wideband Terminals

INTERNATIONAL TELECOMMUNICATION UNION

INTERNATIONAL TELECOMMUNICATION UNION

INTERNATIONAL TELECOMMUNICATION UNION

INFLUENCE OF NATIONAL SYSTEMS ON STABILITY, TALKER ECHO, AND LISTENER ECHO IN INTERNATIONAL CONNECTIONS

ETSI TS V ( )

ETSI TS V1.1.2 ( ) Technical Specification

GSM GSM TECHNICAL September 1996 SPECIFICATION Version 5.0.0

ETSI EN V6.2.1 ( )

ETSI TS V1.1.1 ( ) Technical Specification

Final draft ETSI ES V1.5.1 ( )

ETSI ES V1.4.1 ( )

EUROPEAN STANDARD Digital Enhanced Cordless Telecommunications (DECT); Common Interface (CI); Part 8: Speech and audio coding and transmission

End-to-End Speech Quality Testing in a Complex Transmission Scenario

Speech Quality Assessment for Wideband Communication Scenarios

Test Plan for Hearing Aid Compatibility

SERIES P: TELEPHONE TRANSMISSION QUALITY, TELEPHONE INSTALLATIONS, LOCAL LINE NETWORKS Voice terminal characteristics

Audio Quality Terminology

ETSI TS V1.3.1 ( )

Test Report. 4 th ITU Test Event on Compatibility of Mobile Phones and Vehicle Hands-free Terminals th September 2017

Draft ES V1.2.1 ( )

INTERNATIONAL STANDARD

RECOMMENDATION ITU-R M

TR V1.1.1 ( )

Draft Recommendation P.emergency. Speech communication requirements for emergency calls originating from vehicles V0.43. Summary.

Part V: Requirements and Test Methods for Magnetic Output from Handset Telephones for Hearing Aid Coupling

INTERNATIONAL TELECOMMUNICATION UNION

ETSI TS V ( )

ETSI ES V1.7.1 ( )

COM 12 C 288 E October 2011 English only Original: English

Perceptual wideband speech and audio quality measurement. Dr Antony Rix Psytechnics Limited

SERIES P: TERMINALS AND SUBJECTIVE AND OBJECTIVE ASSESSMENT METHODS Communications involving vehicles

Technical Report Speech and multimedia Transmission Quality (STQ); Speech samples and their usage for QoS testing

ETSI ES V1.7.1 ( )

SERIES P: TERMINALS AND SUBJECTIVE AND OBJECTIVE ASSESSMENT METHODS Communications involving vehicles

European ETS Telecommunication July 1999 Standard

Technical Requirements for Resistibility of Telecommunications Equipment to. Overvoltage and Overcurrent

Technical Requirements for Resistibility of Telecommunications Equipment to. Overvoltage and Overcurrent

Speech Quality in modern Network-Terminal Configurations

DX-Altus. Reaching New Heights in Radio Dispatch & Interoperability

Speech communication in cars goes wideband the new ITU-T T Focus Group CarCom

Telecommunications equipment Subscriber equipment Attachment requirements for analogue connection to a public switched telephone network Amendment 2

TECHNICAL REQUIREMENTS FOR ELECTROMAGNETIC DISTURBANCES EMITTED FROM LIGHTING EQUIPMENT INSTALLED IN TELECOMMUNICATION CENTERS

Part VI: Requirements for ISDN Terminal Equipment

Telecommunications equipment Subscriber equipment Attachment requirements for analogue connection to a public switched telephone network Amendment 2

3GPP TS V5.0.0 ( )

ARIB STD-T64-C.S0018-D v1.0

ETSI TS V1.2.1 ( )

ETSI TR V1.1.1 ( )

Part IV: Glossary of Terms

Part VI: Requirements for Integrated Services Digital Network Terminal Equipment

INTERNATIONAL TELECOMMUNICATION UNION. Timing requirements of slave clocks suitable for use as node clocks in synchronization networks

Public Address Systems

QUALITY OF SERVICE AND TELECOMMUNICATION IMPAIRMENTS

ETSI EG V1.1.2 ( )

INTERNATIONAL STANDARD

PTC220 Requirements for Private Voice Networks connected to the PSTN/ISDN. 9 Voice Mail system Requirements 72

ENGINEERING COMMITTEE Interface Practices Subcommittee AMERICAN NATIONAL STANDARD ANSI/SCTE

SECTION 3 TRANSMISSION STANDARDS SPECIFICATION FOR AN INTERMEDIATE REFERENCE SYSTEM. (Geneva, 1976; amended at Geneva, 1980,

Deriving Equipment Impairment Factors for Wideband Speech Codecs

Transcription:

Telephone Speech Quality Standards for Wideband IP Phone Terminals (handsets) CES-Q004-1 1. V.0 March 30, 2007 2. V.0 March 30, 2008 3. V.0 November 10, 2008 March 30, 2009 Communications and Information network Association of Japan (CIAJ)

Table of contents Introduction...3 1. Scope of application...5 1.1. Devices to which the Standards are to be applied... 5 1.2. Interfaces... 5 1.3. Principles for specified values... 5 2. Normative references...6 2.1. Overload point for digital signals for IP packet... 6 3. Send speech characteristics...6 3.1. Send loudness rating (SLR)... 6 3.2. Send frequency response... 6 3.3. Send noise... 7 3.4. Distortion (Reference values)... 7 4. Receive speech characteristics...8 4.1. Receive loudness rating (RLR)... 8 4.2. Receive frequency response... 8 4.3. Receive noise... 9 4.4. Distortion (Reference values)... 9 5. Sidetone characteristics...9 5.1. Sidetone masking rating (STMR)... 9 6. Echo...9 6.1. Weighted terminal coupling loss... 10 6.2. Stability loss (Reference values)... 10 7. Network performance loss due to traffic congestion...10 7.1 Network performance objectives... 10 7.2 Terminal voice latency... 11 7.3 Transmission rating factor (Reference values)...11 2

Introduction [Objectives for establishing these Standards] It is the responsibility of manufacturers or service providers in the information and communication network industry to give users clear information about the speech quality performance of the devices or services they provide. For this reason, when the provision of communication terminals was deregulated more than ten years ago, the Communications and Information network Association of Japan (CIAJ) established the speech quality standards for Standard Telephones and Multi-functional Telephones, and began to provide a compliance test and certification service for speech devices. Manufacturers and vendors have displayed the C mark on compliant products as proof of satisfactory speech quality performance for users. CIAJ drafted the speech quality standards for Digital Cordless and Mobile Phones in 2001 and released it to CIAJ members in the hope of early adoption. Recently, as the speeds of both Internet access connections and terminals have become faster, IP phones are beginning to be widely used. IP phone service has the advantages of being more economical and more compatible with multimedia than the conventional circuit-switching based telephone services. In particular, IP phones supporting wideband communications have been released in an effort to improve speech quality. In order to accommodate such new services, the CIAJ Telecommunications Quality Committee formed a Working Group to study widedband speech quality problems associated with providing VoIP telephone service and to develop speech quality standards for such services. In order that the standards so developed might be widely accepted by CIAJ member companies, it was necessary to make the standards reasonably achievable with currently available technology. Therefore, the WG surveyed the current situation by conducting speech quality tests, mainly using products from member companies as samples. In the International Telecommunications Union, the Telecommunication Standardization Sector (ITU-T) had been studying wideband speech quality of VoIP. As a result, ITU-T Recommendation P. 311 was approved in June 2005. Taking these activities in various standardization organizations into consideration, CIAJ has established the Telephone Speech Quality Standards for Wideband IP Phone Terminals as follows. 3

[Notes on the Standards] These standards were developed by referring to ITU-T Rec. 311 (June 2005). Although the scope of this Recommendation covers headsets and hands-free devices, the scope of application of the CIAJ Standards has placed priority on handsets, on account of the urgency of the need to issue standards and taking into consideration the effort to promote wideband IP telephones by not only terminal manufacturers, but network providers. Among the items specified in the ITU-T Recommendation, receive comfort noise, maximum acoustic pressure, input/output linearity, distortion ratio and stability loss have been classified as items for further study by the CIAJ Telecommunications Quality Committee because the Committee was unable to determine the applicability of these items in the ITU-T Recommendation due to the insufficient time spent on its study within the Committee and on the survey of CIAJ members products. Therefore, P. 311 figures have been listed for reference. Although these Standards address terminals, some items may be affected by the conditions of the IP network used. To cope with such network factors, end-to-end quality has been specified while setting several sets of typical conditions on the network. [Use of the Standards] At present, a compliance mark (the so-called C Mark ) is defined for CES-Q001 Standards for analog telephones. The application of a similar mark for the present Standards is being considered, and the application procedure, the evaluation procedure and any other procedures shall be conducted according to operational rules. 4

1. Scope of application 1.1. Devices to which the Standards are to be applied IP phone terminals and PC soft phones with handsets (150 7000Hz bandwidth). 1.2. Interfaces On the phone terminal side, the acoustic interfaces are the mouthpiece and earpiece of the handset. On the network side, the interface is Ethernet (IEEE802.3) and is converted into analog by using reference codec ( 1 ). Acoustic connection point Electrical connection point Send Receive Handset CODEC Signal Ethernet processing interface Send Receive IP phone type USB handset, etc. PC, etc. Send Send Receive AUDIO Signal Ethernet interface processing interface Receive PC soft phone type 1.3. Principles for specified values Standard values are design target values not taking the variance of products into consideration. 1 : Includes ITU-T Rec. G. 711.1, which is being considered by carriers, with the exception of G. 722 series. 5

2. Normative references The standards from which this CES-Q004-1 has been derived are: ITU-T Rec. P. 311, P. 1010 Regarding loudness rating ITU-T Recs. P. 48, P. 64 and P. 79 2.1. Overload point for digital signals for IP packet In the commonly used ITU-T Rec. G. 722 series for wideband voice encoding, the overload point is +9 dbm0, with standard signal level at -15 dbm0. ITU-T Rec. G. 711.1, the extension for G. 711 for encoding voice-frequency signals in narrowband, uses +3.17 dbm0 as the overload point and -15 dbm0 as the standard signal level when using the same overload point as G.711. 3. Send speech characteristics 3.1. Send loudness rating (SLR) The send loudness rating (SLR) shall be 4±4 db. The sensitivity shall be based on signal level calculations defined in 2.1 above. LR calculations shall be made according to ITU- T Rec. P. 79 Annex G. If the send sensitivity is variable, the measurement shall be made using the sensitivity recommended by the manufacturer. It is desirable that the physical shape and the dimensions of the handset are close to those defined in ITU-T Rec. P.350. 3.2. Send frequency response Table 1 and Figure 1 define the mask pattern. The measuring frequency points shall be ISO 1/3 octave band central frequencies. These points shall be within the mask pattern. ( 2 ) Table 1 ( 2 ) Frequency(Hz) Upper Limit(dB) Lower Limit(dB) 100 6 125 6 9 200 6 6 1000 6 6 5000 ( 3 ) 9 6300 11 13 8000 11 2 Sensitivity is a relative figure, not an absolute figure. Therefore, the points shall be acceptable as long as the shape of the speech frequency characteristics is within the mask pattern when the points are raised or lowered in the direction of sensitivity. 3 This upper limit will not be designated, since it is the point of change between 1000 Hz and 6300 Hz. 6

感度 (db) 30 送話感度周波数特性 Send Frequency Response 20 Sensitivity (db) 10 0-10 -20-30 100 1000 10000 周波数 (Hz) Frequency (Hz) Figure 1 ( 2 ) 3.3. Send noise The send noise shall be -68 dbm0a or lower. The measurement shall be made in accordance with CES-Q004M-1 Measuring method. Any noise reduction capabilities, such as a noise canceller, an echo canceller or AGC, shall be enabled. If the send sensitivity is variable, the measurement shall be made using the sensitivity recommended by the manufacturer. 3.4. Distortion (Reference values) For sine wave acoustical input signals of 200 Hz, 1 khz and 6 khz, the ratio of distortion of the digital signal output for IP packet payload must be equivalent or above the numbers in the table below. Sending level(relative) Lower limit of distribution(db) (Reference is -4.7dBPa) 200 Hz 1 khz 6 khz 18(+12( 4 )) to 20 29.0 35.0 29.0 30 25.0 26.5 25.0 46 11.0 12.5 11.0 2 Sensitivity is a relative figure, not an absolute figure. Therefore, the points shall be acceptable as long as the shape of the speech frequency characteristics is within the mask pattern when the points are raised or lowered in the direction of sensitivity. 4 The overload point will be encoding format +3.17 dbmo, the same as G. 711 7

感度 (db) 4. Receive speech characteristics 4.1. Receive loudness rating (RLR) The receive loudness rating (RLR) shall be 2±4 db. If a receive sound level setting function is provided, the nominal setting designated by the manufacturer shall be selected. The sensitivity shall be based on signal level calculations defined in section 3 above. LR calculations shall be made according to ITU-T Rec. P. 79 Annex G. It is desirable that the physical shape and the dimensions of the handset are close to those defined in ITU- T Rec. P. 350. 4.2. Receive frequency response Table 2 and Figure 2 define the mask pattern. The measuring frequency points shall be ISO 1/3 octave band central frequencies. These points shall be within the mask pattern. ( 2 ) Table 2 ( 2 ) Frequency(Hz) Upper Limit(dB) Lower Limit(dB) 100 9 160 9 12 200 9 9 1000 9 9 5000 9 9 6300 9 12 8000 9 30 Receive 受話感度周波数特性 Frequency Response Sensitivity (db) 20 10 0-10 -20-30 100 1000 10000 Frequency 周波数 (Hz) (Hz) 2 Sensitivity is a relative figure, not an absolute figure. There fore, the points shall be acceptable as long as the shape of the speech frequency characteristics is within the mask pattern when the points are raised or lowered in the direction of sensitivity. 8

Figure 2 ITU-T Rec. P. 311 sets the upper and lower limits at within 5 db, but the rule shall be relaxed for the time being, taking into consideration the technology currently available. ( 2A ) 4.3. Receive noise The receive noise shall be -59 dbpa(a) or lower. If a receive sound level adjustment function is provided, the nominal position shall be selected. The measurement shall be made in accordance with CES-Q004M-1 Measuring method. Any noise reduction capabilities, such as a noise canceller, an echo canceller or AGC, shall be enabled. 4.4. Distortion (Reference values) For sine wave acoustical input signals of 200 Hz, 1 khz and 6 khz, the ratio of distortion of the output sound signal must be equivalent or above the numbers in the table below. Digital data on IP packet payload (dbm0) Reference is -15dBm0 Lower limit of distribution(db) 200 Hz 1 khz 6 khz 8(-3( 4 )) to 30 29.0 35.0 29.0 40 25.0 26.5 25.0 56 11.0 12.5 11.0 5. Sidetone characteristics 5.1. Sidetone masking rating (STMR) The sidetone masking rating (STMR) shall be 13-22 db. If a receiving sound level adjustment function is provided, the nominal position shall be selected. The measurement shall be made in accordance with CES-Q004M-1 Measuring method. If a sending sensitivity setting function is provided, the standard setting shall be selected for the measurement. 6. Echo The return that causes talker echo is specified as the talker echo loudness rating (TELR). 2A ITU-T Rec. 311 also states: It is recognized that it may be difficult for telephone sets equipped with receivers designed according to the current technology to meet the recommended requirement, and still comply with the recommended TCLw limit. 4 The overload point will be encoding format +3.17 dbmo, the same as G. 711 9

6.1. Weighted terminal coupling loss Since the IP phone type and the PC soft phone type terminals use four wires, the only coupling that arises is acoustic coupling from the earpiece to the microphone. The acoustic coupling level is expressed as the weighted terminal coupling loss (TCLw). TELR is the sum of the TCLw, and SLR and RLR. The TELR of the IP phones shall be 58 db or higher at the normative recommended value of receiving sound level adjustment. 6.2. Stability loss (Reference values) The stability loss from the digital input to the digital output shall be determined for all frequencies in the range of 100 Hz to 8000 Hz. The stability loss shall be measured with the handset lying on a hard surface. Rules concerning these measuring methodology shall be discussed in the future. Stability loss for IP phones shall be no less than 6 db at all receive loudness settings. 7. Network performance loss due to traffic congestion Types of distortion that arise in a VoIP-based telephone service include those generated within the terminal, such as speech coding distortion, and those that vary depending on the network performance objectives, such as distortion due to packet loss. The delay time within a terminal is dependent on the relationship between the delay variation within the network and the terminal s jitter buffer performance. Therefore, a requirement for these quality factors cannot be defined in a terminal in isolation of a connected network. 7.1. Network performance objectives Network load conditions are defined in Table 3. 10

Element Table 3 IP network performance objectives Average delay time, T (ms) 70 Maximum delay time, Ta (ms) 67. 10 Maximum delay variation, ΔT: Δtmax (ms) 20 Average delay variation: Δtave (ms) 2. 90 Packet loss ratio (%): Ppl 0. 1 The delay variation is a change in the instantaneous delay time of the network from Ta to Ta+ΔT. The probability of ΔT is assumed to follow an exponential distribution. The event probability of to the maximum Δt max for delay variation ΔT shall be 99.9%. The probability of Ppl is assumed to follow a uniform distribution (random loss). Busty loss is left as an item for further study. 7.2. Terminal voice latency The terminal voice latency is the average end-to-end voice latency, under the performance on the IP network shown in Table 3, minus the average delay time of the IP network. If the terminal voice latency fluctuates, the average value is used. The average delay time of the IP network is the sum of the absolute delay time and the average delay variation of the IP network. Send delay time should be 35 ms or less when there is no network load. (target: 30 ms or less) Receive delay time, when there is network loads described in Table 3, shall be 65 ms or less (target: 50 ms or less) Although it is possible to realize these targets, the requirement is provisionally relaxed in consideration of the actual performance of the products available on the market. 7.3. Transmission rating factor (Reference values) In existing telephone bandwidths, the R Value is adopted as the index to quantify the transmission rating factor, but for wideband voice communications, there is no ITU-T recommendation. Thus, the ITU-T Rec. P. 862.2 Wide Band PESQ (WB-PESQ) shall be used. The WB-PESQ for end-to-end acoustic input/output measurement conditions for both sending and receiving shall be 3.5 or above. 11

Telecommunications Quality Committee Chair Hiroshi Asada Business Communication Systems Department1, PC Development center Toshiba Corporation PC&Network Company Working Group on Operation Requirements Chair Shinji Naito Enterprise System Department, Development Promotion headquarters, Nakayo Telecommunications, Inc. Members Yoshimasa Kamogashira Engineering Division, Iwatsu Electric Co., Ltd. Ad-hoc member Hiroshi Asada Hiroshi Kuboki Kunihiko Kimura Ogiyama Tetsuya Katsuichi Osakabe Manabu Okamoto Business Communication Systems Department1, PC Development center Toshiba Corporation PC&Network Company esound Business Depertment Telecom Systems Division Oki Electric Industry Co., Ltd. Research & Development Div Hardware Development Dept SAXA,Inc. Network Development Div. NEC Infrontia Corp. Sound Network Division YAMAHA CORPORATION NTT Cyber Space Laboratories NIPPON TELEGRAPH AND TELEPHONE CORPORATION Secretary Tadahiro Higuchi Communications and Information Network Association of Japan 12