AN547 - Why you need high performance, ultra-high SNR MEMS microphones

Similar documents
Digital encoding requirements for high dynamic range microphones

IM69D120. Description. Features. Typical applications. High performance digital XENSIVTM MEMS microphone

EMC output filter recommendations for MA120XX(P)

VM2000. Low-Noise Bottom Port Piezoelectric MEMS Microphone Data Sheet Vesper Technologies Inc. Differential Analog Output

TLS10xB0MB Demoboard. Preface Z8F Table of contents

IPM Motor Drive Simulator User Manual

VM1010. Low-Noise Bottom Port Piezoelectric MEMS Microphone Data Sheet Vesper Technologies Inc. With Wake on Sound Feature

CIPOS IPM Motor Drive Simulator User Manual

F4-(A)HDMOE-J098R3627-5P

F2-(A)HCDMO-B125T26-6CP

For broadband amplifiers up to 1 GHz at collector currents from 1 ma to 20 ma For mixers and oscillators in sub-ghz applications

DESIGN OF VOICE ALARM SYSTEMS FOR TRAFFIC TUNNELS: OPTIMISATION OF SPEECH INTELLIGIBILITY

Applications of 1EDNx550 single-channel lowside EiceDRIVER with truly differential inputs

Qualified for industrial applications according to the relevant tests of JEDEC47/20/22.

Guidelines for CoolSiC MOSFET gate drive voltage window

Data Sheet, V1.0, Aug SMM310. Silicon MEMS Microphone. Small Signal Discretes

2018 Data Sheet Vesper Technologies Inc. VM1000. Low-Noise Bottom Port

BGA5L1BN6 BGA5L1BN6. 18dB High Gain Low Noise Amplifier for LTE Lowband VCC GND. Features

WITHIN GENERATOR APPLICATIONS

Earl R. Geddes, Ph.D. Audio Intelligence

ICS Ultra-Low Noise Microphone with Differential Output

[Q] DEFINE AUDIO AMPLIFIER. STATE ITS TYPE. DRAW ITS FREQUENCY RESPONSE CURVE.

Week 1. Signals & Systems for Speech & Hearing. Sound is a SIGNAL 3. You may find this course demanding! How to get through it:

PCB layout guidelines for MOSFET gate driver

CDM10V programming user manual describes the COOLDIM_PRG_BOARD burner board usage, the UART protocol handling and the fusing details.

Evaluation Board for CoolSiC Easy1B half-bridge modules

Qualified for industrial applications according to the relevant tests of JEDEC47/20/22.

3 RD GENERATION BE HEARD AND HEAR, LOUD AND CLEAR

Case study for voice amplification in a highly absorptive conference room using negative absorption tuning by the YAMAHA Active Field Control system

Restoration Performance Report

ICS Ultra-Low Noise Microphone with Differential Output

Qualified for industrial applications according to the relevant tests of JEDEC47/20/22

Production Noise Immunity

Improving PFC efficiency using the CoolSiC Schottky diode 650 V G6

Week I AUDL Signals & Systems for Speech & Hearing. Sound is a SIGNAL. You may find this course demanding! How to get through it: What is sound?

Contents. Welcome To K-Meter. System Requirements. Compatibility. Installation and Authorization. K-Meter User Interface.

Replacement of HITFET devices

ICS Analog Microphone with Low Power Mode GENERAL DESCRIPTION APPLICATIONS FEATURES FUNCTIONAL BLOCK DIAGRAM ORDERING INFORMATION

PROFET ITS716G Ω Ω. Green Product (RoHS compliant) Data Sheet 1 Rev. 1.1,

Qualified for industrial applications according to the relevant tests of JEDEC47/20/22

Combining Subjective and Objective Assessment of Loudspeaker Distortion Marian Liebig Wolfgang Klippel

Preliminary. Wake on Sound Piezoelectric MEMS Microphone Evaluation Module

Qualified for Automotive Applications. Product Validation according to AEC-Q100/101

FREQUENCY RESPONSE AND LATENCY OF MEMS MICROPHONES: THEORY AND PRACTICE

Schottky diode mixer for 5.8 GHz radar sensor

Internally matched general purpose LNA MMIC for 50 MHz- 3.5 GHz applications

Amplifier Performance Report

Audio Testing. application note. Arrakis Systems inc.

TVS (transient voltage suppressor) Bi-directional, 5.5 V, 0.1 pf, 0201, 0402, RoHS and halogen free compliant

Thermal behavior of the new high-current PROFET

The new OptiMOS V

Sound Design and Technology. ROP Stagehand Technician

OPTIREG Linear TLE4262

ICS Ultra-Low Noise Microphone with Differential Output

How To... Commission an Installed Sound Environment

The BFP840FESD is a discrete RF heterojunction bipolar transistor (HBT) with an integrated ESD protection suitable for 5 GHz band applications.

Qualified for industrial applications according to the relevant tests of JEDEC47/20/22.

Qualified for industrial applications according to the relevant tests of JEDEC47/20/22

High voltage CoolMOS CE in SOT-223 package

ALTERNATING CURRENT (AC)

SPL3220. Stereo Multiband Sound Processor / Loudness Maximizer / Limiter. Features

Robust low noise broadband pre-matched RF bipolar transistor

What you Need: Exel Acoustic Set with XL2 Analyzer M4260 Measurement Microphone Minirator MR-PRO

Qualified for industrial applications according to the relevant tests of JEDEC47/20/22.

Design of an Optimal Analogue Microphone System for Best Possible Capture of Incoming Acoustic Signals

TVS (transient voltage suppressor) Bi-directional, 18 V (AC), 13 V (DC), 0.3 pf, 0201, RoHS and halogen free compliant

Qualified for industrial applications according to the relevant tests of JEDEC47/20/22

LME49710 High Performance, High Fidelity Audio Operational Amplifier

Amplifier Performance Report

Dynamic thermal behavior of MOSFETs

ILD2035. MR16 3 W Control Board with ILD2035. Application Note AN214. Industrial and Multimarket. Revision: 1.0 Date:

Analysis on Acoustic Attenuation by Periodic Array Structure EH KWEE DOE 1, WIN PA PA MYO 2

CAN TRANSISTORS SOUND LIKE VALVES? ABSTRACT

BGA855N6 BGA855N6. Low Noise Amplifier for Lower L-Band GNSS Applications GND. Features

USBPRO User Manual. Contents. Cardioid Condenser USB Microphone

Latest fast diode technology tailored to soft switching applications

Chapter 6. Meeting 6, Controlling Gain and Processing Signals

AP08023 C504. Important application hints for dead time generation with the Capture/Compare Unit. Microcontrollers. Application Note, V 1.0, Feb.

Fundamentals of Digital Audio *

Audio Video Production Audio Basics

Qualified for industrial applications according to the relevant tests of JEDEC47/20/22

OBSOLETE. Microphone Preamplifier with Variable Compression and Noise Gating SSM2165

Bass Extension Comparison: Waves MaxxBass and SRS TruBass TM

Audio Quality Terminology

Meta-Hearing Defect Detection

Revision 1.1 May Front End DSP Audio Technologies for In-Car Applications ROADMAP 2016

Since the advent of the sine wave oscillator

Application Note V

THIEL OWNER INFORMATION. Coherent Source loudspeaker system. Model SCS3

APPLICATION NOTE. Atmel AVR127: Understanding ADC Parameters. Atmel 8-bit Microcontroller. Features. Introduction

ODEON APPLICATION NOTE Calculation of Speech Transmission Index in rooms

AXIHORN CP5TB: HF module for the high definition active loudspeaker system "NIDA Mk1"

Using the VM1010 Wake-on-Sound Microphone and ZeroPower Listening TM Technology

Qualified for industrial applications according to the relevant tests of JEDEC47/20/22.

LM4935 Automatic Gain Control (AGC) Guide

Audio Measurements Workshop

Small Footprint Ultra Low Current Low Noise Amplifier for Global Navigation Satellite Systems (GNSS)

ICS High SPL Analog Microphone with Extended Low Frequency Response

3GPP TS V4.2.0 ( )

TRITON - USB Audio Interface

Transcription:

AN547 AN547 - Why you need high performance, ultra-high SNR MEMS Table of contents 1 Abstract................................................................................1 2 Signal to Noise Ratio (SNR)..............................................................2 3 Acoustic Overload Point (AOP).......................................................... 2 4 4.1 4.2 Importance of microphone performance for recordings...................................4 Signal to Noise Ratio importance for recordings............................................ 4 Acoustic Overload Point importance for recordings......................................... 5 5 Importance of microphone performance for speech recognition.......................... 5 6 Importance of microphone performance for noise cancellation algorithms................ 6 7 7.1 Summary.............................................................................. 6 List of abbreviations..................................................................... 7 Disclaimer............................................................................. 8 1 Abstract The popularity of automatic speech recognition systems and the use of video content to share information and experiences is increasing dramatically. The performance and quality of the used to capture sound must be high to ensure great user experiences. Critical factors include noise, distortion, frequency response and component matching. This application note concentrates on signal to noise ratio (SNR) and acoustic overload point (AOP) and explains the benefits of having high microphone performance in speech recognition and audio / video capturing systems. Application Note www.infineon.com Please read the Important Notice and Warnings at the end of this document 1.1

2 Signal to Noise Ratio (SNR) 2 Signal to Noise Ratio (SNR) Noise in the output of a microphone can be defined as any signals which are not the intended input source and is generally regarded to be an undesired element of the output signal. The higher the noise level, the more it reduces the audio signal quality. Noise can be external to the microphone or it can originate in the microphone itself. People usually hear microphone self-noise as a hiss that affects perceived sound quality. For algorithms, noise deteriorates the fidelity of the signal, thereby reducing system performance. The noise of a microphone can be expressed in different ways:. Self-noise (Vrms, dbv, dbfs) is the rms noise voltage generated by the microphone itself when it is not excited by an external sound. Signal to Noise Ratio, SNR (db), describes the self-noise of the microphone relative to the intended input signal. SNR is usually measured using a standardized acoustic input signal to represent the wanted sound, a 94 dbspl (1 Pa) sine wave. Equivalent Input Noise, EIN (dbspl), is the (imaginary) acoustic noise level coming into the microphone that is equivalent to the electrical noise level at the output of the microphone. Electrical SNR (dbv) Electrical output Self-noise Acoustical SNR (dbspl) Acoustical input EIN 3 Acoustic Overload Point (AOP) All real-life audio transducers are non-linear systems in that they add content to the signal that passes through them. In the case of distortion, the added content lies at the harmonics of the frequencies that are present in the original signal. Distortion is typically measured as Total Harmonic Distortion, THD (THD+N if self-noise is included). It is the ratio of the energy in the signal harmonics (typically second through fifth) to the energy in the fundamental frequency when the microphone is excited by a sine wave. The test signal is typically a 1 khz sine signal at a relatively high sound pressure level (SPL), often 94 dbspl or higher. THD is given as a percentage (%). Acoustic Overload Point, AOP is commonly defined as the sound pressure level at which the THD exceeds 10%. The unit of AOP is dbspl. In most cases it is beneficial and important to preserve the original form and content of the sound incoming to the microphone(s). Adding content, such as distortion, to the original signal is likely to sound unpleasant to the person listening to the captured sound. The more added energy there is (i.e. the higher the THD), the worse the perceived audio quality will be. Distortion is also likely to confuse algorithms such as speech recognition system which carry out very detailed analysis of the contents of the incoming signal. Application Note 2 1.1

3 Acoustic Overload Point (AOP) 140 dbspl 130 dbspl 120 dbspl 110 dbspl 100 dbspl Jet engine during take-off Rock concert (Peak) Rock concert (RMS) Car horn Shouting (1M) 135dBSPL 130dBSPL 120dBSPL 94 dbspl Mic. Calibration Level 90 dbspl 80 dbspl 70 dbspl 60 dbspl 50 dbspl 40 dbspl 30 dbspl 20 dbspl 10 dbspl 0 dbspl Lawn mower Vacuum cleaner Conversational speech (1m) Quiet office Refrigerator hum Rustling leaves Whisper (1m) Silent room Threshold of hearing IM70A135 24dBSPL IM69D130 25dBSPL Standard MEMS mic SNR = 64dB, AOP = 120dBSPL 30dBSPL Figure 1 Dynamic range of IM69D130 and IM70A135 vs standard MEMS Application Note 3 1.1

4 Importance of microphone performance for recordings 4 Importance of microphone performance for recordings 4.1 Signal to Noise Ratio importance for recordings The goal of audio / video recording is to capture the incoming sound from the subject and to reproduce it in the output of the microphone system. When the recording is intended for human ears, it is desirable for the electrical output signal to match the acoustic signal as closely as possible, providing a "natural" sounding recording. The microphone and its SNR are critical parts of the sound capturing signal chain which affects the quality of audio recordings. Some typical use cases are presented in the table below. Use Case Home video Children Social media Professional videos Music Performances Nature Surveillance Details and Challenges Typically the home is a quiet environment where microphone noise can easily become dominant. Varying capturing and playback conditions and equipment. Filmed objects are mobile and have soft (quiet) voices. High video quality requirements to maximize viewer engagement. Job applications, job interviews, talent introductions, presentations, etc. High video quality is crucial to differentiate an applicant or business from others. High sound quality is important to ensure a natural sounding recording. Varying capturing and playback conditions are challenging. E.g. school plays can be challenging: quiet voices, long distances, ambient noise. Recorded sounds can be at low or very low sound pressure levels. The captured sounds can be quiet and coming in from long distances. In free field, sound pressure halves (reduces by 6 db) for every doubling of distance. The further the captured sound source is, the quieter the acoustic signal that reaches the microphone. As the self-noise of a microphone is practically constant, a reduction in incoming signal level causes a reduction in the SNR of the output signal of the microphone. Typically, a weak signal has to be amplified to bring it up to an appropriate level for the device signal path. Amplifying the signal also amplifies the noise present in the output. The more amplification there is, the higher the risk is that the noise will rise to a level at which it degrades the quality of the captured signal significantly. A high microphone SNR helps keep the noise floor inaudible even when the signal is amplified. The longer the capturing distance, the lower the microphone self-noise should be to avoid problems. This is especially critical when the distance is long and the sound source itself is quiet. As sound pressure attenuates by 6 db per doubling of distance, using a microphone with a 6 db higher SNR can enable doubling the capturing distance without degradation in signal quality. POLQA (Perceptual Objective Listening Quality Assessment) is an ITU-T standard model that uses digital speech analysis to objectively determine the quality and intelligibility of a recorded speech signal. Microphones with high SNR perform clearly better in POLQA tests and result in superior speech intelligibility. Signals of the same level are more intelligible when recorded with a higher SNR microphone. Playback conditions and video picture quality affect the perceived noise level. Ambient noise level in the playback environment Playback volume Quality of listening equipment (e.g. noise and frequency response) High video quality demands high sound quality to avoid degrading the overall audio / video quality Application Note 4 1.1

5 Importance of microphone performance for speech recognition 4.2 Acoustic Overload Point importance for recordings Just like SNR, AOP is an important audio / video quality factor. Distortion can very easily render a video recording useless. There are many smartphone videos online which have been shot in pop/rock concerts and are unwatchable due to badly distorted audio. High AOP improves sound quality if the incoming sound pressure level of the intended sound (or of disturbances) is high or very high. High AOP helps a microphone system handle very high signal peaks that may appear in the incoming acoustic signal even if the average sound pressure level is not very high. See some typical use cases in the table below. Use Case Pop/rock music concerts Sports events Traffic Wind Details and Challenges Concerts are typically loud. High sound quality is a key enabler for good and natural sounding performance recordings. Either the sport (e.g. motorsports) or the crowd (e.g. ice hockey arena) is very loud. Lots of low frequency noise. Wind is a common cause for poor sound quality in audio / video recordings shot outdoors. High AOP can help with certain kinds of wind conditions. Up until a few years ago the standard level for consumer electronics device microphone AOP was between 110 and 120 dbspl. In the recent past, the requirements for AOP have moved up. In order to ensure sound quality and speech recognition performance which satisfy customers, a device designer should choose significantly better that have AOPs closer to the 130 dbspl mark, or higher. At lower sound pressure levels, it makes more sense to look at lower THD levels than the 10% specified for AOP. In addition to having high AOP, it is also important that the THD stays low, below 2%, up to high enough sound pressure levels for the intended applications (for example, up to 120 dbspl). 5 Importance of microphone performance for speech recognition In the case of systems where the captured sound is intended for algorithms, the sound quality goals may be different to when the signal is for human ears. The signal does not necessarily have to sound natural as long as it is optimized for the algorithms. Regardless of the use case, it is always important that the signal stays clean of disturbances, artifacts, distortion and noise. Automatic speech recognition (ASR) is the task of automatically transcribing a speech signal into written words. Transcription accuracies are getting closer to the human level, which is at approximately 95%. However, so far achieving this level has been possible only in laboratories where the ambient conditions are favorable. Speech recognition in real-life environments and at a distance involves some significant acoustic challenges such as background noise, reverberations, echo cancellation and microphone positioning. It is not enough to just have a good speech recognition engine. Every element in the system should be performing at a high standard to prevent a quality bottleneck. The microphone s job is to provide the speech recognition system with the best possible input signal. High input signal quality helps the ASR system analyze the incoming sound and find the characteristics in it that enable recognizing the speech content. Critical parameters are noise, distortion, frequency response and phase. Application Note 5 1.1

6 Importance of microphone performance for noise cancellation algorithms High AOP can help speech recognition systems in loud environments. Sometimes the speech signal itself is not loud but there are other disturbances present. For example, there are speakers close to the in speech controlled home entertainment systems and digital assistants which may output loud music or spoken information. High AOP helps keep distortion low and improve the cancellation of noise and echoes. The longer the distance to the speech source, the lower the signal to noise ratio of the signal being fed to the ASR algorithm. Therefore, microphone SNR should be the higher when the intended capturing distance is longer. 6 Importance of microphone performance for noise cancellation algorithms A key function for speech recognition systems is being able to ignore the sounds and noises which are not the speech to be transcribed. Audio / video capturing and human-to-human communication quality can also be improved by excluding unwanted sounds from the signal. The goal is to increase SNR, which in this case is the ratio of the wanted sound (signal) to the unwanted ambient sounds (noise). Noise cancellation and directionality can be achieved by using multiple in combination with algorithms. Directional microphone systems, such as beam forming, can concentrate the sensitivity of the towards the desired direction and highlight the desired sound sources. Unwanted sounds can also be canceled based on parameters such as level differences between two. Blind source separation is a more sophisticated noise reduction system. It enables canceling noise independent of orientation, distance, and location. All these noise cancellation methods benefit from the accuracy and high quality of the signal they receive. The microphone should have high SNR, low distortion, flat frequency response (also improves phase response) and low group delay. In order to optimize the functionality of noise cancellation algorithms, the used in the system should have identical properties. The role of microphone to microphone matching is critical. The less variance there is in sensitivity, phase behavior and latency from microphone to microphone, the better. 7 Summary From 2005 to 2015 the SNRs of state-of-the-art in mass market consumer electronics devices improved from below 60 db up to about 65 db. With the requirements set by new high-performance speech recognition systems and other capturing use cases, even 65 db is no longer enough. Current high-end are approaching 70 db SNR. High microphone performance is a key enabler for high speech recognition and audio capturing quality. The performance of technologies such as automatic speech recognition algorithms and cameras are improving rapidly and the user experience expectations of device buyers are rising. It is important to avoid becoming improvement bottlenecks. Luckily there are high performance available. Noise performance has improved significantly in the last few years. SNR is rising beyond the 70 db level and quality degrading distortion is becoming a thing of the past with AOP reaching the 130 dbspl mark. This level of microphone performance helps devices give satisfying user experiences to even the most demanding customers. Application Note 6 1.1

7 Summary 7.1 List of abbreviations SNR: signal to noise ratio EIN: equivalent input noise THD: total harmonic distortion AOP: acoustic overload point ASR: automatic speech recognition SPL: sound pressure level db: decibel db(a): decibel, A-weighted dbv: decibels relative to 1 volt dbspl: decibels, sound pressure level Pa: Pascal, unit of pressure CE: consumer electronics Application Note 7 1.1

Trademarks All referenced product or service names and trademarks are the property of their respective owners. Edition Published by Infineon Technologies AG 81726 Munich, Germany 2017 Infineon Technologies AG All Rights Reserved. Do you have a question about any aspect of this document? Email: erratum@infineon.com Document reference IFX-ovy1506346816951 IMPORTANT NOTICE The information contained in this application note is given as a hint for the implementation of the product only and shall in no event be regarded as a description or warranty of a certain functionality, condition or quality of the product. Before implementation of the product, the recipient of this application note must verify any function and other technical information given herein in the real application. Infineon Technologies hereby disclaims any and all warranties and liabilities of any kind (including without limitation warranties of non-infringement of intellectual property rights of any third party) with respect to any and all information given in this application note. The data contained in this document is exclusively intended for technically trained staff. It is the responsibility of customer s technical departments to evaluate the suitability of the product for the intended application and the completeness of the product information given in this document with respect to such application. WARNINGS Due to technical requirements products may contain dangerous substances. For information on the types in question please contact your nearest Infineon Technologies office. Except as otherwise explicitly approved by Infineon Technologies in a written document signed by authorized representatives of Infineon Technologies, Infineon Technologies products may not be used in any applications where a failure of the product or any consequences of the use thereof can reasonably be expected to result in personal injury