LMR Codecs Why codecs? Which ones? Why care? Joseph Rothweiler Sensicomm LLC Hudson NH

Similar documents
EE482: Digital Signal Processing Applications

Surveillance Transmitter of the Future. Abstract

Chapter IV THEORY OF CELP CODING

Digital Speech Processing and Coding

Wideband Speech Coding & Its Application

APPLICATIONS OF DSP OBJECTIVES

Communications Theory and Engineering

Overview of Code Excited Linear Predictive Coder

Ap A ril F RRL RRL P ro r gra r m By Dick AH6EZ/W9

Page 0 of 23. MELP Vocoder

Cellular systems & GSM Wireless Systems, a.a. 2014/2015

Speech Compression Using Voice Excited Linear Predictive Coding

Vocoder (LPC) Analysis by Variation of Input Parameters and Signals

Transcoding free voice transmission in GSM and UMTS networks

Speech Coding using Linear Prediction

Voice Excited Lpc for Speech Compression by V/Uv Classification

techniques are means of reducing the bandwidth needed to represent the human voice. In mobile

NOISE SHAPING IN AN ITU-T G.711-INTEROPERABLE EMBEDDED CODEC

Analysis/synthesis coding

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

Waveform Coding Algorithms: An Overview

A 600 BPS MELP VOCODER FOR USE ON HF CHANNELS

TELECOMMUNICATION SYSTEMS

Speech Synthesis using Mel-Cepstral Coefficient Feature

Speech Synthesis; Pitch Detection and Vocoders

Practical Limitations of Wideband Terminals

Adaptive time scale modification of speech for graceful degrading voice quality in congested networks

EE 225D LECTURE ON MEDIUM AND HIGH RATE CODING. University of California Berkeley

MX800 BASE STATION SPECIFICATIONS

E : Lecture 8 Source-Filter Processing. E : Lecture 8 Source-Filter Processing / 21

Telecommunication Electronics

Speech Enhancement using Wiener filtering

Comparison of Low-Rate Speech Transcoders in Electronic Warfare Situations: Ambe-3000 to G.711, G.726, CVSD

General outline of HF digital radiotelephone systems

Digital Audio. Lecture-6

Analog and Telecommunication Electronics

Speech Coding Technique And Analysis Of Speech Codec Using CS-ACELP

Universal Vocoder Using Variable Data Rate Vocoding

Audio Signal Compression using DCT and LPC Techniques

Transcoding of Narrowband to Wideband Speech

The Emergence, Introduction and Challenges of Wideband Choice Codecs in the VoIP Market

An objective method for evaluating data hiding in pitch gain and pitch delay parameters of the AMR codec

Distributed Speech Recognition Standardization Activity

The Channel Vocoder (analyzer):

Improving Sound Quality by Bandwidth Extension

3.6. Cell-Site Equipment. Traffic and Cell Splitting Microcells, Picocelles and Repeaters

Bandwidth Extension of Speech Signals: A Catalyst for the Introduction of Wideband Speech Coding?

Speech Enhancement Based On Spectral Subtraction For Speech Recognition System With Dpcm

DEPARTMENT OF DEFENSE TELECOMMUNICATIONS SYSTEMS STANDARD

Audio Quality Terminology

Datenkommunikation SS L03 - TDM Techniques. Time Division Multiplexing (synchronous, statistical) Digital Voice Transmission, PDH, SDH

SILK Speech Codec. TDP 10/11 Xavier Anguera I Ciro Gracia

2: Audio Basics. Audio Basics. Mark Handley

SGN Audio and Speech Processing

Bandwidth Extension for Speech Enhancement

Information. LSP (Line Spectrum Pair): Essential Technology for High-compression Speech Coding. Takehiro Moriya. Abstract

International Journal of Advanced Engineering Technology E-ISSN

Mel Spectrum Analysis of Speech Recognition using Single Microphone

Pulse Code Modulation

Pattern Recognition. Part 6: Bandwidth Extension. Gerhard Schmidt

NOVEL PITCH DETECTION ALGORITHM WITH APPLICATION TO SPEECH CODING

Linguistic Phonetics. Spectral Analysis

3GPP TS V5.0.0 ( )

Comparison of CELP speech coder with a wavelet method

Speech Quality Assessment for Wideband Communication Scenarios

ECE 556 BASICS OF DIGITAL SPEECH PROCESSING. Assıst.Prof.Dr. Selma ÖZAYDIN Spring Term-2017 Lecture 2

6/29 Vol.7, No.2, February 2012

White Paper. Whitepaper. 4 Level FSK/FDMA 6.25 khz Technology. New dpmr

10 Speech and Audio Signals

Ninad Bhatt Yogeshwar Kosta

Different Approaches of Spectral Subtraction Method for Speech Enhancement

ENRICHING YOUR DIGITAL EXPERIENCE NEW MEMBER TO WORLD'S MOST COMPLETE DMR PORTFOLIO PD98X

CDMA Principle and Measurement

Enhanced Waveform Interpolative Coding at 4 kbps

CHAPTER 7 ROLE OF ADAPTIVE MULTIRATE ON WCDMA CAPACITY ENHANCEMENT

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

MASTER'S THESIS. Speech Compression and Tone Detection in a Real-Time System. Kristina Berglund. MSc Programmes in Engineering

Voice Coding, PCM Voice, Voice Quality, E-model

Pulse Code Modulation (PCM)

SPEECH AND SPECTRAL ANALYSIS

IMPROVED SPEECH QUALITY FOR VMR - WB SPEECH CODING USING EFFICIENT NOISE ESTIMATION ALGORITHM

ON THE PERFORMANCE OF WTIMIT FOR WIDE BAND TELEPHONY

Lesson 8 Speech coding

EE 225D LECTURE ON SPEECH SYNTHESIS. University of California Berkeley

Voice Transmission --Basic Concepts--

Aspiration Noise during Phonation: Synthesis, Analysis, and Pitch-Scale Modification. Daryush Mehta

Spatial Audio Transmission Technology for Multi-point Mobile Voice Chat

Final Exam Study Guide: Introduction to Computer Music Course Staff April 24, 2015

Introduction to Communications Part Two: Physical Layer Ch3: Data & Signals

Speech Processing. Undergraduate course code: LASC10061 Postgraduate course code: LASC11065

Data Encoding g(p (part 2)

Introduction to DMR and the DMR Association

35"*%#4)6% 0%2&/2-!.#%!33%33-%.4 /& 4%,%0(/.%"!.$!.$ 7)$%"!.$ $)')4!, #/$%#3

CSCD 433 Network Programming Fall Lecture 5 Physical Layer Continued

Simulation of Conjugate Structure Algebraic Code Excited Linear Prediction Speech Coder

CIRCULAR NO. O-14. Q & As on the Conversion of Railway Radio Communication to Narrowband Technology

Sound/Audio. Slides courtesy of Tay Vaughan Making Multimedia Work

Low Bit Rate Speech Coding

CHAPTER 4. PULSE MODULATION Part 2

INTERNATIONAL STANDARD

Transcription:

Enhanced Digital LMR Seminar 19th Aug 2016 Wentworth by the Sea New Castle NH LMR Codecs Why codecs? Which ones? Why care? Joseph Rothweiler Sensicomm LLC Hudson NH http://sensicomm.com Presentation available at: http://rothweiler.us Rev. Aug. 22, 2016

1 Why go Digital? Clearer voice, easier communication (potentially). Range or Power efficiency Longer radio range for the same TX power. Example: P25 >10dB better than Analog FM. Resistant to RF adjacent-channel or co-channel interference. Digital encryption much better than analog scrambling. Easy to mux multiple audio channels or other nonvoice data (eg, GPS location tracking). Other features: "voicemail", store and forward, archival recording, etc. Digital Voice requires about 10dB less transmit power here.

2 Why not use Waveform Coding (PCM)? Waveform coding Pulse Code Modulation(PCM). Microphone Analog-to-Digital converter; Digitalto-Analog converter speaker. Sample at a fixed rate, with a fixed number of bits per sample. How fast should we sample? Nyquist's Theorem: Sampling rate must be at least 2x the highest frequency. Speech goes to 6 to 7 khz, or more. Telephones cut off at 4 khz. Number of bits must be high enough to minimize quantization noise. 16 excellent, 12 good, 8 pretty good with nonlinear (mu-law) coding. Sampling Bits BPS Quality Rate 16000 16 256k Near Transparent 8000 8 μ 64k As good as Bell's 1880's carbon microphones. Simple compression (eg, predictive coding) can get us to 32 or even 16 kbps. LMR needs 8kbps or below to fix existing channel spacings and bandwidth allocations. Vocoders to the rescue!

3 What's a Vocoder? Definitions CODEC: Convert analog signal to digital (encoder) and vice versa (DECoder). Vocoder: a codec built on a model of human speech. Vocoders transmit a description of the signal rather than the signal itself. The decoder synthesizes a waveform that "sounds like" the input. Vocoders use an approximation of the human vocal tract: Excitation source: Periodic pulses from the vibrating vocal chords ("voiced"), or noise from friction turbulence ("unvoiced") y k = α x k + 10 a k p y k p p = 1 Mouth and throat form resonant cavities that shape the spectrum (like a pipe organ). Simplified mechanical model is a pulse or noise signal feeding cascaded tubes. Math equivalent is Linear Prediction model (or autoregressive or all-pole). Vocoders send small set of parameters (filter coefficients, pitch, voicing, gain,...) at a low rate (20-30ms) so the bit rate can be very low

4 A Somewhat Oversimplified Vocoder Grouping LPC: the original buzz-hiss vocoder model. SIGSALY(1943) or DoD LPC-10 (1970's). Simple 7-parameter model: amplitude, pitch, voiced/unvoiced bit, 10 coefficients. Highly intelligible, but very synthetic machine-like quality. Problematic in noise, especially the voiced/unvoiced decision. Good approach when you desperately need the lowest possible bit rates. *MBE* - Multiband-Excited synthesis. IMBE, AMBE, AMBE+2, etc. P25, satellite systems, etc. Replace voiced/unvoiced bit with a multiband mix of buzz and hiss excitations. Plus other enhancements. Better than classical LPC, without much increase in bit rate. *ELP* - (something)-excited Linear Prediction. CELP, ACELP, etc. TETRA, cellphones,... Use a stored or computed library of excitation functions. Search algorithm to select the optimal excitation. Can reproduce the input waveform if given enough bits.

5 Some LMR Standards and their codecs Standard Past, Present and Future Codecs P25 IMBE, AMBE+2 TETRA ACELP, AMR 4.75(opt); Future TBD, MELPe? NXDN AMBE+2 dpmr AMBE+2,Chinese TBD, RALCWI, Manufacturer-Specified TETRAPOL RPCELP DMR AMBE+2 FirstNet AMR? AMR-WB? Other LTE? MPT-1327 Analog FM LTE(cellular) AMR-NB, AMR-WB, etc D-STAR(HAM) AMBE, CODEC2

6 Some Codec Characteristics Codec Year Voice Rate IMBE FR 1995 4.4 TIA-102.BABA; DVSI AMBE FR ~1998 4.4 TIA-102.BABA; DVSI AMBE HR 2007 2.45 TIA-102.BABA-1; DVSI Notes ACELP 1996 4.567 ETSI EN 300 395-2; Reference and test vectors. MELPe 0.6,1.2,2.4 NATO STANAG 4591 RALCWI <2011 2.4,2.74 Proprietary cmlmicro.com RPCELP 1997? 6.0 TETRAPOL specs CODEC2? 0.7-3.2 freedv.org - Ham radio project New codecs and improved versions of old codecs appear regularly.

7 A Note on Evaluation Conventional audio measurements (SNR, bandwidth, distortion) don't really apply to vocoders. We don't transmit the waveform, so we can't use waveform measurements. Subjective testing: Rating by a panel of trained listeners. Quality: Mean Opinion Score (MOS) - Does it sound good? Rating as: 5:Excellent 4:good 3:fair 2:poor 1:bad Intelligibility: Modified Rhyme Test (MRT) Can you distinguish consonants? Listener hears: "Please select: red" and selects from: led shed red bed fed wed Sounding good is good, but a communication system is for communicating. Intelligibility is what you need. Tests aren't reality: If the users are complaining, pay attention. Testing is a really involved process, best left to the experts. Data collection, listening environment and equipment, listener training, habituation* and fatigue. *"Vocoders sound a lot better on Friday than they do on Monday." In English, consonants carry most of the information. _n _ngl_sh, c_ns_n_nts c_rry m_st _f th nf_rm_t n. I_ E i, _o o_a a o o e i o a_io_.

8 Background Noise Problems Effects Theoretical: no longer fits the model. Practical: harder to calculate the filter coefficients, and especially the pitch. Particular issue for firefighters. Results Lowered intelligibility. Mitigation Avoid noise Noise-canceling mic, throat mic, dual-microphone setups, etc. Noise reduction algorithms Hard to improve intelligibility this way. Robust codec Test for it. Bit rate tradeoff.

9 Voice Model Limitations Model approximation Nasal sounds (/m/,/n/) don't follow the lossless-tube model. Vocal chord vibration isn't really all that regular for many speakers. Breathy voices: simultaneous mix of voiced and unvoiced excitation. Unusual situations Whispered speech (covert situation?) Distorted speech (Firefighter breathing mask - SCBA). Other talkers. Result: unnatural sound, difficulty recognizing speakers, loss of inflection (stress, urgency, fear, etc).

10 Background Sounds Background noises may be important for situation awareness. Firefighter alarms: man-down, air tank empty, etc. Sirens, etc. Crowd noise. Vocoders are for voice, so nonspeech sounds may be heavily distorted. Which alarm just went off? Is it a happy crowd or a riot? Test and tell users about any issues.

11 Delay Underappreciated issue: Analog FM delay is essentially zero. Vocoders analyze 50-100ms segment of audio to extract parameters, so delay is unavoidable. Add in processing time, RF synchronization, and error-correction interleaving. In full-duplex telephony delays above 500ms become very annoying. Throws off the speak-response rhythm, so perceived as rude behavour by the other speaker. Can't jump in the gap, so end up interrupting while speaking. Issue for precise coordination?

12 Tandeming Connection of 2 different vocoders in series. P25 to TETRA via some sort of gateway. Range extension: radio satellite link radio. Old radio to new radio with improved codec, via gateway? Avoid tandeming if possible. End-to-end transport of codec bitstream is best. Consider specifying a mandatory codec for interoperation / backward compatibility? Tandem connection is worse then either alone.

13 Looking to the Future: Wideband Future systems (FirstNet?) may use wider audio bandwidth. AMR-WB codecs support 16kHz audio sampling rate. DVSI advertises a wideband codec. Others exist or are in development. Wideband does improve operation in challenging acoustic conditions. Nice to have when you can get it. Bitrate and Audio Bandwidth improve intelligibility.

14 Summary Recommendations Use the highest bit rate you can get away with. Test under realistic conditions. Test for usability (intelligibility), not just "quality". Think about non-speech signals. Consider compatibility: Need to communicate with other organizations' radios. Future buys to expand the system.

15 References Images on 3, 8, 9 are from Wikipedia. Charts on slides 1, 8, 10, 12, 13 were generated by the author from data published by NTIA. See these NTIA Publications for the full studies: http://www.its.bldrdoc.gov/publications/download/tr-01-386.pdf http://www.its.bldrdoc.gov/publications/download/tr-08-453.pdf http://www.its.bldrdoc.gov/publications/download/tr-09-459.pdf http://www.its.bldrdoc.gov/publications/download/tr-13-495.pdf http://www.its.bldrdoc.gov/publications/download/tr-15-520.pdf