Sound Synthesis Methods

Similar documents
Synthesis Techniques. Juan P Bello

MUS421/EE367B Applications Lecture 9C: Time Scale Modification (TSM) and Frequency Scaling/Shifting

A Parametric Model for Spectral Sound Synthesis of Musical Sounds

ADDITIVE SYNTHESIS BASED ON THE CONTINUOUS WAVELET TRANSFORM: A SINUSOIDAL PLUS TRANSIENT MODEL

Advanced audio analysis. Martin Gasser

Plaits. Macro-oscillator

Computer Audio. An Overview. (Material freely adapted from sources far too numerous to mention )

L19: Prosodic modification of speech

VIBRATO DETECTING ALGORITHM IN REAL TIME. Minhao Zhang, Xinzhao Liu. University of Rochester Department of Electrical and Computer Engineering

Combining granular synthesis with frequency modulation.

Non-stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase and Reassignment

Digitalising sound. Sound Design for Moving Images. Overview of the audio digital recording and playback chain

THE BEATING EQUALIZER AND ITS APPLICATION TO THE SYNTHESIS AND MODIFICATION OF PIANO TONES

Linear Frequency Modulation (FM) Chirp Signal. Chirp Signal cont. CMPT 468: Lecture 7 Frequency Modulation (FM) Synthesis

Audio Engineering Society Convention Paper Presented at the 110th Convention 2001 May Amsterdam, The Netherlands

CS 591 S1 Midterm Exam

TIME DOMAIN ATTACK AND RELEASE MODELING Applied to Spectral Domain Sound Synthesis

Lecture 6: Nonspeech and Music

Lecture 5: Sinusoidal Modeling

Spectrum. Additive Synthesis. Additive Synthesis Caveat. Music 270a: Modulation

Music 270a: Modulation

CMPT 468: Frequency Modulation (FM) Synthesis

Converting Speaking Voice into Singing Voice

SINOLA: A New Analysis/Synthesis Method using Spectrum Peak Shape Distortion, Phase and Reassigned Spectrum

Between physics and perception signal models for high level audio processing. Axel Röbel. Analysis / synthesis team, IRCAM. DAFx 2010 iem Graz

FIR/Convolution. Visulalizing the convolution sum. Convolution

EE482: Digital Signal Processing Applications

INTRODUCTION TO COMPUTER MUSIC SAMPLING SYNTHESIS AND FILTERS. Professor of Computer Science, Art, and Music

Waveshaping Synthesis. Indexing. Waveshaper. CMPT 468: Waveshaping Synthesis

Audio Signal Compression using DCT and LPC Techniques

INTRODUCTION TO COMPUTER MUSIC PHYSICAL MODELS. Professor of Computer Science, Art, and Music. Copyright by Roger B.

Lecture 6: Nonspeech and Music. Music & nonspeech

Lecture 6: Nonspeech and Music

applications John Glover Philosophy Supervisor: Dr. Victor Lazzarini Head of Department: Prof. Fiona Palmer Department of Music

Formant Synthesis of Haegeum: A Sound Analysis/Synthesis System using Cpestral Envelope

Many powerful new options were added to the MetaSynth instrument architecture in version 5.0.

Sound Modeling from the Analysis of Real Sounds

Outline. Communications Engineering 1

Developing a Versatile Audio Synthesizer TJHSST Senior Research Project Computer Systems Lab

Music 171: Amplitude Modulation

Class Overview. tracking mixing mastering encoding. Figure 1: Audio Production Process

Timbral Distortion in Inverse FFT Synthesis

ALTERNATING CURRENT (AC)

Quantification of glottal and voiced speech harmonicsto-noise ratios using cepstral-based estimation

Musical Acoustics, C. Bertulani. Musical Acoustics. Lecture 13 Timbre / Tone quality I

YAMAHA. Modifying Preset Voices. IlU FD/D SUPPLEMENTAL BOOKLET DIGITAL PROGRAMMABLE ALGORITHM SYNTHESIZER

Preeti Rao 2 nd CompMusicWorkshop, Istanbul 2012

Qäf) Newnes f-s^j^s. Digital Signal Processing. A Practical Guide for Engineers and Scientists. by Steven W. Smith

Lecture 9: Time & Pitch Scaling

Chapter 5: Music Synthesis Technologies

Interpolation Error in Waveform Table Lookup

Overview of Code Excited Linear Predictive Coder

Signal Processing for Speech Applications - Part 2-1. Signal Processing For Speech Applications - Part 2

CSE481i: Digital Sound Capstone

VOICE QUALITY SYNTHESIS WITH THE BANDWIDTH ENHANCED SINUSOIDAL MODEL

I-Hao Hsiao, Chun-Tang Chao*, and Chi-Jo Wang (2016). A HHT-Based Music Synthesizer. Intelligent Technologies and Engineering Systems, Lecture Notes

EC 6501 DIGITAL COMMUNICATION UNIT - II PART A

Modeling of Tension Modulation Nonlinearity in Plucked Strings

FFT analysis in practice

Complex Sounds. Reading: Yost Ch. 4

Final Exam Study Guide: Introduction to Computer Music Course Staff April 24, 2015

Three Modeling Approaches to Instrument Design

HIGH ACCURACY FRAME-BY-FRAME NON-STATIONARY SINUSOIDAL MODELLING

Identification of Nonstationary Audio Signals Using the FFT, with Application to Analysis-based Synthesis of Sound

SPEECH TO SINGING SYNTHESIS SYSTEM. Mingqing Yun, Yoon mo Yang, Yufei Zhang. Department of Electrical and Computer Engineering University of Rochester

What is Sound? Simple Harmonic Motion -- a Pendulum

AUDL GS08/GAV1 Auditory Perception. Envelope and temporal fine structure (TFS)

Musical Acoustics, C. Bertulani. Musical Acoustics. Lecture 14 Timbre / Tone quality II

IMPROVING QUALITY OF SPEECH SYNTHESIS IN INDIAN LANGUAGES. P. K. Lehana and P. C. Pandey

APPLICATIONS OF DSP OBJECTIVES

Subtractive Synthesis & Formant Synthesis

Sound Synthesis. A review of some techniques. Synthesis

Advanced Audiovisual Processing Expected Background

DAFX - Digital Audio Effects

PARSHL: An Analysis/Synthesis Program for Non-Harmonic Sounds Based on a Sinusoidal Representation

PROJECT NOTES/ENGINEERING BRIEFS

E : Lecture 8 Source-Filter Processing. E : Lecture 8 Source-Filter Processing / 21

Hungarian Speech Synthesis Using a Phase Exact HNM Approach

GEN/MDM INTERFACE USER GUIDE 1.00

Analysis and Design of Autonomous Microwave Circuits

speech signal S(n). This involves a transformation of S(n) into another signal or a set of signals

FIR/Convolution. Visulalizing the convolution sum. Frequency-Domain (Fast) Convolution

An Interactive Multimedia Introduction to Signal Processing

Speech Coding in the Frequency Domain

Chapter 4. Digital Audio Representation CS 3570

CMPT 468: Delay Effects

SAMPLING THEORY. Representing continuous signals with discrete numbers

INFLUENCE OF FREQUENCY DISTRIBUTION ON INTENSITY FLUCTUATIONS OF NOISE

Laboratory Assignment 4. Fourier Sound Synthesis

INTRODUCTION TO COMPUTER MUSIC. Roger B. Dannenberg Professor of Computer Science, Art, and Music. Copyright by Roger B.

SGN Audio and Speech Processing

Speech Synthesis; Pitch Detection and Vocoders

Structure of Speech. Physical acoustics Time-domain representation Frequency domain representation Sound shaping

Sound synthesis and physical modeling

Analysis and Synthesis of Expressive Guitar Performance. AThesis. Submitted to the Faculty. Drexel University. Raymond Vincent Migneco

Lecture Schedule: Week Date Lecture Title

WARPED FILTER DESIGN FOR THE BODY MODELING AND SOUND SYNTHESIS OF STRING INSTRUMENTS

Photone Sound Design Tutorial

Vocoder (LPC) Analysis by Variation of Input Parameters and Signals

Chapter 2: Digitization of Sound

Transcription:

Sound Synthesis Methods Matti Vihola, mvihola@cs.tut.fi 23rd August 2001 1 Objectives The objective of sound synthesis is to create sounds that are Musically interesting Preferably realistic (sounds like some instrument) Produced in real time A taxonomy of digital sound synthesis 1. Abstract algorithms 2. Processed recordings (sampling) 3. Spectral models 4. Physical models 2 Abstract algorithms Typical features of abstract algorithms are the simplicity and ease of implementation. The sound is more or less artificial compared to more sophisticated methods 1. 2.1 Frequency Modulation (FM) ( 70s) Early digital synthesizers and sound card synthesizer chips are based on FM. 2 Very simple, easy to implement, only couple of voltage controlled oscillators (VCOs) Time-variant structure like in natural sounds, fast vibrato Bell-like and metallic timbres Harmonic sound when integer ratio between carrier and modulator Feedback systems add stability to frequency behaviour 1 Which may not be always a drawback... 2 Take a look at http://www.obsolete.com/120_years/ 1

Figure 1: Left: simple FM synthesis, right: one-oscillator feedback system (y FD1 ) and two-oscillator feedback system (y FD1 ) Figure 2: Frequency-domain presentation of FM synthesis. 2

Figure 3: Spectra of (top to down) one-oscillator feedback, two-oscillator feedback and simple FM with different modulation indices and feedback gains. 2.2 Waveshaping Synthesis (Nonlinear Distortion) (late 60s) Nonlinear shaping function applied to the input (excitation) signal Most fundamental case: excitation signal sinusoidal By using a linear combination of Chebyshev polynomials the ratios of harmonics can be controlled can be made to correspond harmonic structure of a real instrument Post-processing can be applied, e.g. amplitude modulation (AM) Exitation signal can be something else than sinusoid 2.3 Karplus-Strong algorithm ( 83) Very simple and computationally effective algorithm A short sound buffer, initialized with a random data Looping of the buffer, which is filtered with a simple low-pass filter 3 after every read Plucked string tones or percussion tones Can be implemented using only shift and add operations 3 Processed recordings (sampling) Manipulation of recorded sounds dates back to 1920s. The memory requirements has been a problem with digital sampling synthesis. The idea is in recording relatively short samples of sounds, which are then played back. 3 H z 1 2 1 z 1 3

Figure 4: Karplus-Strong algorithm. (c) Simple filtering operation, results in a plucked string tone. (d) Flip sign of every filtered bit with certain probability, produces sound resembling percussion tone. 3.1 Digital Wavetable Synthesis Looping Typical parts of instrument sound: attack, steady state, release Waveform of steady-state sound of most of the instruments approx. periodical short sample of the steady state looped Loop points determined: length of the looped sample should correspond to fundamental frequency Pitch shifting every 3th or 4th semitone stored, pitch shifting applied to nearest to obtain the rest Data reduction Lossless or lossy compression. Alteration of sampling frequency, quantization step or more sophisticated method. 3.2 Multiple Wavetable Synthesis More than one wavetable, or sample played at once. Wavetable Cross-Fading Several sections of tone stored, samples are cross-faded from one to another multiplying with amplitude envelope. Wavetable Stacking Several (arbitrary) sound signals multiplied with amplitude envelope and summed together synthetic sound signal. Problem: Find single wavetables (spectra) and amplitude envelopes to produce natural sound. 3.3 Granular Synthesis ( 40s) Idea is representing sound signals with sound atoms or grains. Waveform shape of grains and their temporal distribution determine the type of sound. 4

3.3.1 Asynchronous (AGS) ( 90s) Scatter sound grains in a statistical manner over a region in time-frequency plane Region called a sound cloud Several parameters: Start time, duration, bandwidth of clouds; duration, density, amplitude envelope, waveforms and spatial distribution of grains Effective in generating new sound events, simulations of existing instruments hard 3.3.2 Pitch Synchronous (PSGS) ( 90s) Better performance in simulation of realistic sounds Periodicity corresponding to fundamental frequency Analysis grains using STFT or LPC analysis impulse response estimation Resynthesis using parallel FIRs driven with train of impulses Transformations that add variation to the produced signal All methods that use overlap-add technique can be viewed as granular synthesis. 4 Spectral Models The idea is that the properties of sound that are perceived are stored (time-varying spectra). 4.1 Additive Synthesis Summing sinusoidal components of different (phase,) amplitude and frequency y n A k n sin k 2πn Fs F k n The control functions of single sinusoids (A k n, F k n ) are slowly-varying Drawback: Large amount of data (control function parameters) and large number of oscillators Data reduction that preserves intuitively appealing data and original sound (e.g. line segment approximation of control parameters) 5

4.2 Phase Vocoder Can be viewed as bank of filters or STFT analyzer Resynthesis: FT 1 and overlap-add Time-scaling and pitch transposition easily: Time-scaling accomplished by modifying the hop size (the time difference of consecutive synthesized frames) Pitch modification by modifying time scale, and then changing the sampling rate of the signal Works well for harmonic, slowly varying tones, blurs transient type of sounds (the time resolution of the STFT-analysis) 4.3 Source-Filter Synthesis Exitation signal filtered with time-varying filter Titled also as subtractive synthesis: input signal with harmonic rich spectrum filtered Not very robust representation of realistic sounds Used in analog synthesizers: some exitation waveform generators and filters 4.4 McAulay-Quatieri (MQ) ( 86) Original signal decomposed to a set of sinusoids A l k ω l k ψ l k Trajectories for all components Analysis: STFT, peak detection, sinusoidal trajectory detection and noise thresholding Synthesis: Trajectory interpolation, additive synthesis 4.5 Spectral Modeling Synthesis (SMS) (late 80s) Sinusoidal analysis with MQ deterministic part of the signal Residual x res n x n x sin n modeled as noise (stochastic component) Duration (tempo) and frequency (key) modification easily Transient sounds a problem 6

4.6 Transient Modeling Synthesis (TMS) ( 97) Extension of SMS The residual part of SMS presented as transients and noise, x res n x tra n x noi n The noise part is then only the steady noisy components Time frequency domain duality: impulsive signals in time domain sinusoidal in frequency domain DCT produces real-valued sinusoid when the time domain signal is an impulse Transients detected and parametrized, resynthesized steady noisy components left 4.7 FFT 1 Synthesis ( 92) Can be viewed as additive synthesis in frequency domain All the signal components added together as spectral envelopes Synthesized using overlap-add of consecutive frames Complex sounds possible 4.8 Formant Synthesis Formant is a concentration of energy in energy/power spectrum envelope Originally in speech processing Used in speech synthesis (and singing) 5 Exercise Work 1. Implement the simple FM synthesis. Search suitable values for f c and f m in order to make the signal sound funny. Test with different values of modulation index I. 2. Implement the Karplus-Strong algorithm. Find suitable buffer size, and try to create both plucked string and percussion type of sounds You can find the more detailed description of the methods e.g. in pages 4 10 of [1]. References [1] Tolonen T., Välimäki V. and Karjalainen M., Evaluation of Modern Sound Synthesis Methods, Report 48, HUT, Department of Electrical and Communications Engineering, Laboratory of Acoustics and Audio Signal Processing, Espoo, Mar 1998. Available at http://www.acoustics.hut.fi:7077/ ttolonen/sound_synth_report.html 7

[2] Välimäki V. Lecture notes on Digital Sound Synthesis, HUT, Laboratory of Acoustics and Audio Signal Processing. Available at /share/argh/klap/hut_asp/asp-8-synthesis.pdf 8