QoS Measurement and Management for VoIP

Wenyu Jiang

IRT Lab, March 5, 2003

Introduction to VoIP & IP Telephony

Transport of voice packets over IP networks
Cost savings
– Consolidates voice and data networks
– Avoids leased lines, long-distance toll calls
Smart and new services
– Call management (filtering, TOD forwarding): CPL
– Better than PSTN quality: wide-band codecs
Protocols and Standards
– Signaling: SIP (IETF), H.323 (ITU-T)
– Transport: RTP/RTCP (IETF)

Practical Issues in VoIP

Quality of Service (QoS)
– Internet is a best-effort network
  Loss, delay and jitter
  Users expect at least PSTN quality for VoIP!
Ease of deployment
– Requires seamless integration with legacy networks (PSTN/PBX)
– Security is a must
High yardstick of service availability
– Can your network achieve 99.999% up time?
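To put the 99.999% figure in perspective, the downtime it allows can be worked out directly. This is only an illustrative sketch: the function name and the 365-day year are assumptions, not part of the slides.

```python
def downtime_minutes_per_year(availability, days_per_year=365):
    """Minutes of allowed downtime per year for an availability in [0, 1].

    The 365-day year is an assumption made for this example.
    """
    minutes_per_year = days_per_year * 24 * 60
    return (1.0 - availability) * minutes_per_year


# "Five nines" leaves only a few minutes of outage per year.
five_nines_budget = downtime_minutes_per_year(0.99999)
```

Under these assumptions, five nines corresponds to roughly five minutes of outage per year, compared with almost nine hours for 99.9%.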

Outline

QoS measurement
– Objective vs. subjective metrics
– Automated measurement of subjective quality
QoS management: improving your quality
– End-to-End: FEC, LBR, PLC
– Network provisioning: voice traffic aggregation
Reality check
– Performance of end-points (IP phones, …)
– Deployment issues in VoIP
– Evaluation of VoIP service availability through Internet measurement

Workings of a VoIP Client

Audio is packetized, encoded and transmitted
Forward error correction (FEC) may be used to recover lost packets
Playout control smoothes out jitter to minimize late losses; coupled with FEC
Packet loss concealment (PLC)
– Last line of “defense” after FEC and playout

FEC affects playout control

[Figure: block diagram. Labels: multimedia packets with FEC; Internet (added loss, jitter); playout delay control (added late losses); FEC recovery & decoding; losses unrecoverable by FEC; loss concealment]

LBR: An Alternative to FEC

An (n,k) block FEC code can recover up to n-k losses per block of n packets
Low Bit-rate Redundancy (LBR)
– Transmit a lower bit-rate version of original audio
– No notion of “blocks”
– Not bit-exact recovery

[Figure: packets A, B, C, D along a transmission time axis. FEC: FEC block 1 and FEC block 2, each with FEC data. LBR: LBR data (e.g., b') along the same axis]
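The (n,k) recovery property can be turned into a residual loss estimate. The sketch below assumes independent (Bernoulli) packet loss and an ideal block code in which any k received packets out of n are enough to rebuild the block. The function name is hypothetical.

```python
from math import comb


def residual_loss_bernoulli(p, n, k):
    """Probability that a data packet stays lost after ideal (n,k) FEC.

    Assumes independent loss with probability p. The packet is unrecoverable
    when it is lost AND at least n-k of the other n-1 packets in its block
    are lost too, i.e. the block has more than n-k losses in total.
    """
    others = n - 1
    p_too_many = sum(
        comb(others, j) * p**j * (1 - p) ** (others - j)
        for j in range(n - k, others + 1)
    )
    return p * p_too_many
```

For example, `residual_loss_bernoulli(0.1, 3, 2)` evaluates the case of a (3,2) code at 10% loss. Setting n equal to k (no redundancy) returns p unchanged.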

Objective QoS Metrics: Loss

Internet packet loss is often bursty
– May worsen voice quality more than random (Bernoulli) loss
Characterization of packet loss
– 2-state Markov (Gilbert) model: conditional loss prob.
– More detailed models, but more states!
  Extended Gilbert model, nth order Markov model
  Hidden Markov model, Gilbert-Elliot model, inter-loss distance
– More states → larger test set, loss of big picture
  Adaptive applications can trade off model accuracy for fast feedback
  Gilbert model provides an acceptable compromise

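[Figure: Gilbert model. State 0 (non-loss) and state 1 (loss). From state 0: stay with probability 1-p, move to state 1 with probability p. From state 1: stay with probability 1-q = p_c, move to state 0 with probability q]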
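The two Gilbert parameters can be estimated from a packet trace by counting transitions. This is a minimal sketch; the function name and the 0/1 trace encoding are assumptions made for illustration. It also reports the stationary loss rate p / (p + q) of the 2-state chain.

```python
def fit_gilbert(trace):
    """Estimate Gilbert parameters from a trace of 0 (received) / 1 (lost).

    Returns (p, p_c, p_u):
      p   = P(loss | previous packet received)
      p_c = 1 - q = P(loss | previous packet lost)
      p_u = p / (p + q), the stationary (overall) loss rate of the chain
    """
    from_ok = from_ok_to_loss = 0
    from_loss = from_loss_to_loss = 0
    for prev, cur in zip(trace, trace[1:]):
        if prev == 0:
            from_ok += 1
            from_ok_to_loss += cur
        else:
            from_loss += 1
            from_loss_to_loss += cur
    p = from_ok_to_loss / from_ok if from_ok else 0.0
    p_c = from_loss_to_loss / from_loss if from_loss else 0.0
    q = 1.0 - p_c
    p_u = p / (p + q) if (p + q) > 0 else 0.0
    return p, p_c, p_u
```

When p_c equals p_u the chain has no memory and the model reduces to Bernoulli loss. A p_c above p_u indicates bursty loss.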

Effect of Gilbert Loss Model

Loss burst distribution of a packet trace
– Roughly, though not exactly exponential
Loss burstiness on FEC performance
– FEC less efficient under bursty loss

[Figure: loss burst distribution; x-axis: loss burst length; curves: packet trace, Gilbert model]
[Figure: FEC performance; x-axis: conditional loss p_c (%); curves: Gilbert, Bernoulli]
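The claim that FEC is less efficient under bursty loss can be checked numerically by enumerating every loss pattern of one FEC block under a Gilbert chain. The sketch assumes an ideal (n,k) code, data carried in the first k packets of each block, and a chain started in its stationary distribution. The function name is hypothetical.

```python
from itertools import product


def residual_loss_gilbert(p_u, p_c, n, k):
    """Expected fraction of data packets still lost after ideal (n,k) FEC.

    Losses follow a 2-state Gilbert chain with overall loss rate p_u and
    conditional loss probability p_c; p_c == p_u gives Bernoulli loss.
    Assumes the first k packets of each block carry data.
    """
    q = 1.0 - p_c                  # P(received | previous lost)
    p = p_u * q / (1.0 - p_u)      # P(lost | previous received), from p_u = p/(p+q)
    step = {(0, 0): 1.0 - p, (0, 1): p, (1, 0): q, (1, 1): p_c}
    start = {0: 1.0 - p_u, 1: p_u}  # stationary distribution
    lost_data = 0.0
    for pattern in product((0, 1), repeat=n):
        prob = start[pattern[0]]
        for prev, cur in zip(pattern, pattern[1:]):
            prob *= step[(prev, cur)]
        if sum(pattern) > n - k:   # too many losses: block cannot be repaired
            lost_data += prob * sum(pattern[:k])
    return lost_data / k


bernoulli = residual_loss_gilbert(0.10, 0.10, 3, 2)
bursty = residual_loss_gilbert(0.10, 0.30, 3, 2)
```

With the same overall loss rate, the bursty case leaves a larger residual loss than the Bernoulli case, because consecutive losses are more likely to fall inside one block.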

Objective QoS Metrics: Delay

Complementary Conditional CDF (C3DF)
– More descriptive than auto-correlation function (ACF)
– Delay correlation rises rapidly beyond a threshold
– Approximates conditional late loss probability

[Figure: C3DF curves for lag=10, lag=20, and unconditional; x-axis: delay (sec)]

$f(t) = P[\,d_i > t \mid d_{i-l} > t\,]$, lag $l = 1, 2, 3, \ldots$; $d_i$: delay of packet $i$
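An empirical C3DF can be computed from a delay trace. The sketch assumes the definition f(t) = P[d_i > t | d_{i-l} > t] for lag l and a plain list of per-packet delays in seconds. Both function names are hypothetical.

```python
def c3df(delays, t, lag):
    """Empirical f(t) = P[d_i > t | d_{i-lag} > t] over a list of delays.

    Returns None when no packet `lag` positions earlier exceeded t.
    """
    conditioned = [
        delays[i] for i in range(lag, len(delays)) if delays[i - lag] > t
    ]
    if not conditioned:
        return None
    return sum(1 for d in conditioned if d > t) / len(conditioned)


def ccdf(delays, t):
    """Unconditional P[d_i > t], for comparison with the conditional curve."""
    return sum(1 for d in delays if d > t) / len(delays)
```

If t is read as a playout deadline, `c3df(delays, t, 1)` approximates the probability that a packet is late given that the previous one was late, which is the conditional late loss probability mentioned on the slide.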

Subjective QoS Metrics

Perceived quality
– Mean Opinion Score (MOS)
  ITU-T P.800/830
  Obtained via listening tests
– MOS variations
  DMOS (Degradation)
  CMOS (Comparison)
  MOSc (Conversational): considers delay
  A/B preference
Pros: more meaningful to end users
Cons: time consuming, labor intensive

MOS Grade    Score
Excellent    5
Good         4
Fair         3
Poor         2
Bad          1

Effect of Loss Model on Perceived Quality

Codec: G.729 (8 kb/s ITU std)
Random (Bernoulli) vs. bursty (Gilbert) loss
– Bursty → lower MOS
– True even when FEC or LBR is used

[Figure: Effect of random vs. bursty loss on MOS quality; x-axis: loss probability; curves: random (Bernoulli) loss, bursty (Gilbert) loss]
[Figure: random vs. bursty loss on FEC (G.723.1) quality; x-axis: loss probability; curves: FEC (3,2) (Gilbert), FEC (3,2) (Bernoulli)]

Going Further: Bridging Objective and Subjective Metrics

The E-model (ITU-T G.107/108)
– Originally for telephone network planning
– Considers various impairments
– Reduces to delay and loss impairment when adapted for VoIP
Objective quality estimation algorithms
– Suitable when network stats are not available, e.g., phone-to-phone service with IP in between.
– Speech recognition performance may be used as a quality predictor, by comparing with original text

The E-model

Map from loss and delay to impairment scores (Ie, Id)
Compute a gross score (R value) and map to MOSc
Limited number of codec loss impairment mappings

[Figure: curve for G.729, T=20ms, random loss; x-axis: average loss probability]
[Figure: R to MOS mapping; x-axis: R value]
[Figure: E-model Id; x-axis: delay (ms)]
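The last step of the E-model, mapping R to MOS, can be sketched with the conversion formula from ITU-T G.107. The simplified R = R0 - Id - Ie below and its default R0 = 93.2 are assumptions for illustration: the slides give neither a base value nor the Id and Ie curves, so the caller must supply those. Function names are hypothetical.

```python
def r_to_mos(r):
    """Map an E-model R value to MOS using the ITU-T G.107 conversion."""
    if r <= 0:
        return 1.0
    if r >= 100:
        return 4.5
    mos = 1.0 + 0.035 * r + r * (r - 60.0) * (100.0 - r) * 7e-6
    return max(1.0, mos)  # the cubic dips slightly below 1 for very small R


def r_value(i_d, i_e, r0=93.2):
    """Simplified R = r0 - Id - Ie; r0 = 93.2 is an assumed base value."""
    return r0 - i_d - i_e
```

A call such as `r_to_mos(r_value(i_d, i_e))` then yields a conversational MOS estimate once Id (from delay) and Ie (from loss, per codec) are known.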

Using Speech Recognition to Predict MOS

Evaluation of automatic speech recognition (ASR) based MOS prediction
– IBM ViaVoice Linux version
– Codec used: G.729
– Performance metric

absolute word recognition ratio:
$R_{abs} = \dfrac{\text{\# of correctly recognized words}}{\text{total \# of spoken words}}$

relative word recognition ratio:
$R_{rel}(p) = \dfrac{R_{abs}(p)}{R_{abs}(0\%)}$, where $p$ is the loss probability
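The two ratios can be computed from a reference transcript and the recognizer output. The slides do not say how recognized words were aligned with the spoken text, so the use of `difflib.SequenceMatcher` here is an assumption, as are the function names.

```python
from difflib import SequenceMatcher


def r_abs(spoken, recognized):
    """Absolute word recognition ratio: correctly recognized / total spoken.

    `spoken` and `recognized` are lists of words. Words count as correct
    when they fall in an in-order matching block (an assumed alignment rule).
    """
    matcher = SequenceMatcher(None, spoken, recognized, autojunk=False)
    correct = sum(block.size for block in matcher.get_matching_blocks())
    return correct / len(spoken)


def r_rel(r_abs_at_p, r_abs_at_zero):
    """Relative word recognition ratio R_rel(p) = R_abs(p) / R_abs(0%)."""
    return r_abs_at_p / r_abs_at_zero
```

Dividing by the loss-free ratio removes the recognizer's baseline accuracy, so that only the degradation caused by packet loss remains.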

Recognition Ratio vs. MOS

Both MOS and R_abs decrease w.r.t. loss
Then, eliminate middle variable p

[Figure: Impact of packet loss on audio quality, G.729 codec; x-axis: loss rate (%)]
[Figure: Impact of packet loss on automatic speech recognition, G.729 codec; x-axis: loss rate (%)]
[Figure: mapping from speech recognition performance to MOS; x-axis: word recognition ratio (%)]

Speaker Dependency

Absolute performance is speaker-dependent
But relative word recognition ratio is not speaker-dependent
– Suitable for MOS prediction

[Figure: curves for Speaker A, Speaker B, Speaker C; x-axis: packet loss probability p (%)]
[Figure: relative word recognition ratio R_rel; x-axis: packet loss probability p (%)]

Summary of QoS Measurement

Loss burstiness:
– Affects (generally worsens) perceived quality as well as FEC performance
– May be described with, e.g., a Gilbert model
Delay correlation:
– Increases rapidly beyond a threshold, revealed through Complementary Conditional CDF (C3DF)
– Late losses are also bursty
Perceived quality (MOS) estimation
– Analytical: the E-model
– If network statistics N/A: relative word recognition ratio can provide speaker-independent MOS prediction


Quality of FEC vs. LBR

FEC is substantially and consistently better
– At comparable bandwidth overhead
– Across all codec configurations tested

[Figure: FEC vs. LBR based on G.723.1; x-axis: loss probability; curves: J: FEC (2,1), I: G.723.1 LBR]
[Figure: FEC vs. LBR based on AMR; x-axis: loss probability; curves: N: AMR12.2+FEC (3,2), M: AMR12.2+6.7 LBR]
[Panel labels: G.729+G.723.1 LBR, AMR LBR]

Quality of FEC under Bursty Loss

Packet interval T has a stronger effect on MOS with FEC than without FEC

[Figure: conditional loss probability p_c = 30%; x-axis: p_u (overall loss rate); curves: T=20ms; T=40ms; T=20ms, FEC; T=40ms, FEC; annotation: 0.5-0.6 MOS]

FEC MOS Optimization Considering Delay Effect

Larger T → higher FEC efficiency, but more delay
Optimizing T with the E-model
– Calculate final loss probability after FEC, apply delay impairment of FEC, map to MOSc
Prediction close to FEC MOS test results
– Suitable for analytical perceived quality prediction

[Figure: FEC MOS optimization, Id != 0, d=3*T; x-axis: packet interval T (ms); curves: p_u=4%, p_u=8%, p_u=12%, p_u=16%]
[Figure: FEC MOS prediction, p_c=30%; x-axis: original loss rate (%); curves: E-model prediction T=40ms, real MOS test T=40ms]

Trade-off Analysis between Codec Robustness and FEC

3 loss repair options
– FEC, LBR, PLC
Loss-resilient codec
– Better PLC
  iLBC (IETF)
– But higher bit-rate
– Better than FEC?

[Figure: curves for iLBC 14 kb/s, G.729 8 kb/s, G.723.1 6.3 kb/s; x-axis from 0 to 0.15]

Observations and Results

When considering delay:
– iLBC is usually preferred in low loss conditions
– G.729 or G.723.1 + FEC better for high loss
Example: max bandwidth 14 kb/s
– Consider delay impairment (use MOSc)

[Figure: Max BW: 14 kb/s; curves: iLBC, no FEC; G.729+(5,3); G.723.1+(2,1), T=60ms; x-axis from 0 to 0.15]
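The three configurations in this example can be checked against the 14 kb/s budget with simple arithmetic. The sketch assumes that FEC packets are the same size as data packets and ignores packet header overhead. The codec rates are the ones quoted on the slides, and the names are hypothetical.

```python
def fec_bitrate(codec_kbps, n, k):
    """Payload bit rate with (n,k) FEC: n packets are sent per k data packets.

    Assumes equal-size packets and ignores IP/UDP/RTP header overhead.
    """
    return codec_kbps * n / k


candidates = {
    "iLBC, no FEC": fec_bitrate(14.0, 1, 1),
    "G.729 + (5,3)": fec_bitrate(8.0, 5, 3),
    "G.723.1 + (2,1)": fec_bitrate(6.3, 2, 1),
}
max_bw_kbps = 14.0
fits = {name: rate for name, rate in candidates.items() if rate <= max_bw_kbps}
```

Under these assumptions all three candidates fit within 14 kb/s, so the comparison is between spending the budget on a more robust codec or on a lower-rate codec plus FEC.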

Effect of Max Bandwidth on Achievable Quality

From 14 to 21 kb/s: significant improvement in MOSc
From 21 to 28 kb/s: marginal change due to increasing delay impairment by FEC

[Figure: curves for Max BW: 14 kb/s, Max BW: 21 kb/s, Max BW: 28 kb/s; x-axis from 0 to 0.15]

Provisioning a VoIP Network

Silence detection/suppression
– Transmit only during On period, saves bandwidth
– Allows traffic aggregation through statistical multiplexing
Characteristics of On/Off patterns in VoIP
– Traditionally found to be exponentially distributed
– Modern silence detectors (G.729B VAD, NeVoT SD) produce different patterns

[Figure: talk-spurt/gap distribution, G.729B VAD; x-axis: spurt/gap duration (in 10 ms frames); curves: real spurt CDF, exponential spurt CDF, real gap CDF, exponential gap CDF]
[Figure: talk-spurt/gap distribution, NeVoT SD (default setting); x-axis: spurt/gap duration (in 10 ms frames); curves: real spurt CDF, exponential spurt CDF, real gap CDF, exponential gap CDF]
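The comparison in these plots, a measured spurt or gap CDF against an exponential CDF, can be sketched as follows. It assumes the exponential curve is fitted by matching the sample mean, which the slides do not state. The function names are hypothetical and the sample durations are made up purely to exercise the code.

```python
from math import exp


def empirical_cdf(durations, x):
    """Fraction of observed spurt/gap durations that are <= x."""
    return sum(1 for d in durations if d <= x) / len(durations)


def exponential_cdf(durations, x):
    """CDF at x of an exponential distribution with the same mean as the sample."""
    mean = sum(durations) / len(durations)
    return 1.0 - exp(-x / mean)


# Made-up durations in 10 ms frames, for illustration only (not measured data).
toy_spurts = [10, 20, 30, 40]
gap_at_20 = empirical_cdf(toy_spurts, 20) - exponential_cdf(toy_spurts, 20)
```

Evaluating both functions over a range of x gives the pairs of "real" and "exponential" curves shown in the figures.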

Traffic Aggregation Simulation

Token bucket filter with N sources, R: reserved to peak BW ratio
CDF model resembles trace model in most cases
Exponential (traditional) model
– Under-predicts out-of-profile packet probability
– Under-prediction ratio increases as token buffer size B increases
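The out-of-profile packet probability can be measured with a small token bucket filter. This is a simplified sketch, not the simulator behind these results: it assumes discrete ticks, one token per packet, tokens credited before arrivals in each tick, and a bucket that starts full. Here `arrivals` is the aggregate packet count per tick from the N sources, `rate` is the token rate per tick (R times the aggregate peak rate), and `bucket` is the token buffer size B.

```python
def out_of_profile_fraction(arrivals, rate, bucket):
    """Fraction of packets that find no token in a token bucket filter.

    arrivals: aggregate packets per tick (list of ints)
    rate:     tokens added per tick (may be fractional)
    bucket:   token buffer size B; the bucket is assumed to start full
    """
    tokens = bucket
    total = out_of_profile = 0
    for count in arrivals:
        tokens = min(bucket, tokens + rate)  # credit tokens, capped at B
        for _ in range(count):
            total += 1
            if tokens >= 1:
                tokens -= 1                  # packet is in profile
            else:
                out_of_profile += 1          # no token left for this packet
    return out_of_profile / total if total else 0.0
```

Feeding the same filter with arrivals generated from an exponential On/Off model and from a measured trace allows the two out-of-profile probabilities to be compared for different values of B.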
