lpc10 2.4kbps federal standard in speech coding soo hyun bae school of electrical & computer...

LPC10 LPC10 2.4kbps federal standard in 2.4kbps federal standard in

speech codingspeech coding

LPC10 LPC10 2.4kbps federal standard in 2.4kbps federal standard in

speech codingspeech coding

Soo Hyun Bae

School of Electrical & Computer Engineering

Georgia Institute of Technology<soohyun@ece.gatech.edu>

ECE 8873 Data Compression & Modeling

03/17/2004

AgendaAgendaAgendaAgenda

1. Taxonomy of Speech Coders

2. LPC10 Properties

3. Voicing Classification

4. Levinson-Durbin Recursion

5. Pitch Detection

6. Synthesize Speech

7. Speech Coder Comparision

Linear PredictionLinear PredictionLinear PredictionLinear Prediction

Speech Coder Standard

FS1015-LPC10 Coefficient 10

FS1016-CELP Code Excitation

MELP Mixed Excitation

IS-54 VCELP Vector Sum Excited

IS-96 QCELP QualComm Code Excited

LD-CELP G.728 Low-Delay Code-Excited

G.729 CS-ACELP Conjugate-structure Algebraic-Code-Excited

Where is LPC10?Where is LPC10?Where is LPC10?Where is LPC10?

• Taxonomy of Speech Coders

Speech Coders

Waveform Coders Vocoders

Time Domain : PCM. ADPCM

Frequency Domain : Sub-band coders,

Adaptive transform coder

Linear Predictive Coder Formant Coders

Waveform Coders : Preserve the signal waveform not speech

Vocoders : Analyze speech, extract parameters, use parameters to synthesize speech

Properties (1)Properties (1)Properties (1)Properties (1)

• So called LPC10 because 10 LP coefficients are used

• Bandwidth: 2.4kbps• Samples/frame : 180 samples• Bits/frame: 54 bits• Frame Size: 22.5ms = 44.44 frames/sec• Target stream : 8khz sampling rate, 16bit

quantization

Properties (2)Properties (2)Properties (2)Properties (2)

• “Buzzy” since noise through parameter updates

• Regularly voiced excitation is unnatural, makes some jitter

• Voicing error produce significant distortions

• Only models speech, doesn’t work if backgound noise. Not suitable to mobile phone application

Encoded streamEncoded streamEncoded streamEncoded stream

LP Coefficients Pitch&Voicing Energy

0 41 48 53- The remaining 1 bit is for synchronization

• LP Coefficients: Levinson-Durbin Recursion

• Pitch & Voicing : Causal & Noncausal Prediction Gain

• Energy : Low-Band Speech Energy

VocoderVocoderVocoderVocoder

Original Speech

Analysis:• Voiced/Unvoiced decision• Pitch Period (voiced only)• Signal power (Gain)

Pulse Train

Random Noise

Vocal TractModel

Synthesized Speech

DecoderSignal Power

PitchPeriod

Encoder

Voicing Classification(1)Voicing Classification(1)Voicing Classification(1)Voicing Classification(1)

Voiced Source– Generated by vocal cords’ vibrations– Periodic, spacing is the pitch,

Unvoiced Source– Generated without vibrations– Excitation is modeled by a White Gaussian Noise source– No pitch

How to discriminate?

Fisher’s Method

Voice Classification (2)Voice Classification (2)Voice Classification (2)Voice Classification (2)

Compute R(0)

R(0) > R(0) for noise ?Compute LPC and

Pitch Detection

Silence PeriodNo

Pitch & Voicing (1)Pitch & Voicing (1)Pitch & Voicing (1)Pitch & Voicing (1)

• If x(n) is periodic in N, R(k) is also periodic in N• Hard to compute

)()()(kN

kmxmxkR

)()()(kN

cc kmxmxkR

otherwise

Pitch & Voicing (2)Pitch & Voicing (2)Pitch & Voicing (2)Pitch & Voicing (2)

Reflection Coefficient (1)Reflection Coefficient (1)Reflection Coefficient (1)Reflection Coefficient (1)

• Human auditory system is more sensitive to poles then to zeros

Where G is the gain, p is the order, a’s are poles

iii zaza

*1 )1)(1(

Reflection Coefficient (2)Reflection Coefficient (2)Reflection Coefficient (2)Reflection Coefficient (2)

• Levinson-Durbin Recursion for all-pole model

)0()3()2()1(

)3()0()1()2(

)2()1()0()1(

)1()2()1()0(

RpRpRpR

Toeplitz

Energy – Gain CoefficientEnergy – Gain CoefficientEnergy – Gain CoefficientEnergy – Gain Coefficient

• From autocorrelation matching property, G is calculated from MSE given by Levinson-Durbin Revursion

• Transmit the coefficient G• Recall

kPk kRaRG

2 )()0(

iii zaza

*1 )1)(1(

Synthesize speechSynthesize speechSynthesize speechSynthesize speech

Pulse Train

Random Noise

Synthesized Speech

DecoderSignal Power

PitchPeriod

• Recall the Encoder/Decoder structure

Speech Coder ComparisonSpeech Coder ComparisonSpeech Coder ComparisonSpeech Coder Comparison

Original

ReferencesReferencesReferencesReferences

• Welch V.C., Tremain T.E., Campbell J. P. Jr., “A comparison of US Government standard voice coders”, MILCOM’89, Vol. 1, pp269-273, 1989.

• Cox R. V., “Three New Speech Coders from the ITU Cover a Range of Applications”, Comm. Magazine of IEEE, Vol. 35, pp40-47, 1997

• Campbell J. P. Jr., Tremain T.E., “Voiced/Unvoiced Classification of Speech with Applications to the U.S. Government LPC-10E Algorithm”, ICASSP86, Vol. 11, pp473-476, 1986

• http://www.ee.ucla.edu/~ingrid/ee213a/speech/speech.html

• http://mia.ece.uic.edu/~papers/WWW/MultimediaStandards/

• http://www.ecse.rpi.edu/Homepages/shivkuma/

• http://www.eee.strath.ac.uk/r.w.stewart/index2.htm

• http://web.syr.edu/~gsriniva/tech/docs/

• http://www.speech.cs.cmu.edu/comp.speech/Section3/Software/celp-3.2a.html

• http://www.arl.wustl.edu/~jaf/lpc/• http://www.ecsl.cs.sunysb.edu/cse660/speech.html

lpc10 2.4kbps federal standard in speech coding soo hyun bae school of electrical & computer...

Documents

subtorsubtor.com.sg/wp-content/uploads/2019/01/subtor-sanitary...•...

2ª série publicação quadrimestral 2183-8873 (em linha)...

8873 8874 8875 - surplus sales of nebraska · title: 8873...

commercial property - john dixon &...

evaluation and modeling of the properties and antioxidant...

technical reference manual abb-free@home compact ... ·...

new 8873 8874 8875 - surplus sales of nebraska · 2015. 3....

overview - georgia institute of...

180309 alpha awards announcement youth category final ·...

4.1: list of accounts for payment– march...

performance analysis of different types of decoders for...

chippewa chatter · and programming including title ix and...

amazing clouds-8873-1920x1080

8873 y20 sy chemistry h1 for 2020 - seab · other...

ic-7112w user...

operation and maintenance manualbidonequipment.info/s/arpac...

conference meeting menus - club colonial · cairns colonial...

lomeditor 2006 -...

access to justice: a deconstructionist approach to...

an alternative means of spectroscopic...