al-jobouri, fleury, ghanbari - 2014 - engineering wireless broadband access to iptv

Upload: shaun-neville

Post on 05-Jul-2018

216 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    1/14

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    2/14

    The underlying transport protocol for HTTP is TCP, which

    retransmits packets when acknowledgments fail. This implies that

    there is no need for the channel coding used in our paper. How-

    ever, over a wireless channel, TCP has difficulty distinguishing

    between packet loss due to congestion and packet loss due to

    transmission errors, implying that packets will be retransmitted

    even if they may be again lost through a persistently poor wireless

    channel. DASH compensates for TCP’s behavior by allowing the cli-ent to control the bit-rate of the stream being downloaded (by

    selection between a set of streams with differing bit-rates). Unfor-

    tunately, during periods of heavy network congestion, DASH

    defaults to predominantly TCP control rather than client control.

    Study of a Netflix application [8]  showed service interruptions of 

    around 300 s, which is larger than a typical Windows wired device

    buffer of 240 s. The risk to display interruption on an Android

    device is still greater, as the typical buffer size is only 30 s. From

    the evidence of the performance studies referred to in [8], the per-

    formance of DASH has until recently been relatively poorly under-

    stood. The main reason   [9]   that commercial companies have

    implemented such multi-stream systems is not delivered video

    quality, which is difficult to monitor in a DASH-based system,

    but the compatibility of DASH with off-the-shelf web servers and

    existing content-delivery networks. The simpler UDP transport

    system, which is the subject of this paper: reduces the risk of video

    service interruption; does not rely on network over-provisioning

    [9]; and as a result potentially represents a greener solution.

    The authors’ robust scheme for streaming video over wireless

    broadband access has been simulated for IEEE 802.16 (WiMAX)

    systems  [10]. WiMAX may have recently lost the technological

    competition with Long Term Evolution (LTE)   [11]   in developed

    Western countries but it is still being deployed in rural areas

    [12]  and in countries where 3G cellular phone coverage is poor,

    including Africa and the Middle East. WiMAX is also attractive

    [13]   for backhauling from local IEEE 802.11 networks. Though

    the authors’ scheme as a whole has been analysed in previous

    works such as in  [14,15], the individual codec settings have not

    been analysed in isolation. In fact, it has been suggested to the firstauthor that a very helpful service to the system developer commu-

    nity would be to analyze codec settings independently of each

    other, which is what this paper sets out to do.

    The authors’ scheme combines data-partitioning as a form of 

    error resilience [16] with the addition of Forward Error Correction

    (FEC) using rateless channel coding   [17]   at the application layer,

    together with retransmission of extra redundant data when

    required. Retransmission is limited to one round to avoid accumu-

    lating latencies. The current authors have introduced into their

    video streaming scheme various error resilience measures  [18]

    that exist in the literature: different rates and types of Intra-

    Refresh (IR) Macroblocks (MBs) [19], Constrained Intra Prediction

    (CIP)  [20]   and where necessary, redundant data-partitioned Net-

    work Abstraction Layer (NAL) units (codec level packets)  [21].The impact of each of these measures is analyzed in isolation from

    each other, for Constant Bit Rate (CBR) as well as Variable Bit Rate

    (VBR) streaming. CBR allows storage and bandwidth capacity to be

    planned in advance, at a cost in video quality fluctuations. VBR 

    enables greater compression efficiency relative to CBR, which is

    why it is generally used for disc storage. VBR can benefit from

    two or even three-pass encoders, which are unsuitable for live

    video compression. The relative merits of CBR and VBR are further

    discussed in [22]. In general, we have noticed that many research-

    ers on similar topics have not explored the effect of video settings

    on their schemes and the authors believe that it is important to

    investigate this aspect of any protection scheme in order to under-

    stand the overall outcome of the scheme. In particular, it is impor-

    tant to determine how critical the video codec settings are to the

    success of a protection scheme. Is the scheme robust to changes

    or should certain settings be set within a given range?

    The remainder of this paper is organized as follows. The follow-

    ing Section adds further information on how IPTV can be delivered

    to the consumer, as well as supplying background technical con-

    text. Section   3   selects research from the literature that reveals

    the motivation behind the codec-based approach to video stream-

    ing used in this paper. Particular attention is paid to video stream-ing over WiMAX in that section. Section   4   then outlines the

    approach taken to protect IPTV streams. As the main focus of this

    paper is the configuration of the scheme rather than the scheme

    itself, the description is necessarily brief. Section  5   continues by

    detailing how the video codec settings were modelled in order to

    provide the evaluation that appears in Section 6. Section 7 summa-

    rizes the findings and rounds off with some observations about

    video streaming in this environment.

    2. Context

    Readers requiring further information on: IPTV, technical con-

    text of H.264/AVC data-partitioning, and/or Raptor codes can con-

    sult this Section.

     2.1. IPTV delivery

    There are two basic forms of IPTV: (1) That delivered over

    closed or proprietary managed networks, including cable TV, often

    to set-top boxes (STBs) that perform decoding and possibly decryp-

    tion; and (2) That delivered over the unmanaged public Internet,

    often displayed directly on desktop or laptop computer screens.

    The latter may be called Web or Internet TV   [23]   or latterly

    Over-the-Top (OTT) TV, as a way of distinguishing it from the ser-

    vice that the telecommunications companies hoped to challenge

    terrestrial broadcasters with by means of a high-quality service

    delivered over closed networks. Increasingly the industry trend,

    apart from cable TV, is towards the latter model of IPTV, i.e. OTTTV. There is no place for STBs when mobile IPTV  [24] is delivered,

    hence, our interest in OTT TV. The two forms of IPTV may differ in:

    the picture quality of the stream offered (lower for unmanaged);

    the amount of content available (higher for unmanaged); the for-

    mats that the video is compressed to (the need to update the STB

    is an issue); the way content is secured (unmanaged delivery

    may not be encrypted or employ selective encryption   [25]

    instead); and the frame resolutions presented (lower resolutions

    for unmanaged delivery). A flexible way to deliver IPTV is to set-

    up (and tear-down) connections [26] using the Real-Time Stream-

    ing Protocol (RTSP) (over reliable TCP transport), with subsequent

    data delivery using the Real-time Transport Protocol (RTP) over

    UDP transport. The Real-time Control Protocol (RTCP) can also

    allow end-to-end feedback to the IPTV server to control the bitrate. RTSP also can support so-called trick mode functionality such

    as rewind and pause.

    To allow unmanaged delivery, OTT TV, to compete with Stan-

    dard TV (SDTV) Quality-of-Experience (QoE), TV and video material

    can be locally cached  [27], reducing latency to the access link,

    which in our paper is represented by a WiMAX base to subscriber

    station link. Whether streaming over managed or unmanaged net-

    works, packets will be lost at the access network. This is the case,

    whether over broadband wireless or xDSL (Digital Subscriber Line),

    as the latter also suffers from burst errors  [28]. Prior to that,

    whether managed or unmanaged networks are in use, video

    streams are aggregated over high-capacity optical networks, such

    as in Swisscom’s Bluewin IPTV service   [29]. Only Passive Optical

    Networks (PONs) can reduce the error rates at the access link but

    1494   L. Al-Jobouri et al./ J. Vis. Commun. Image R. 25 (2014) 1493–1506 

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    3/14

    currently PONs are not widely deployed. The approach adopted in

    this paper is potentially applicable across a range of access link

    types, as it provides protection against error bursts and causes only

    moderate delays. If a multicast service is required, it can also be

    readily adapted from its unicast form by dropping acknowledg-

    ments to the base station and increasing the level of FEC. This pos-

    sibility was tested by us in  [30] and, therefore, this paper confines

    itself to unicast delivery.

     2.2. H.264 codec 

    Features of the H.264 codec are used in this work. H.264/

    Advanced Video Coding (AVC) standardization was initiated by

    the Video Coding Experts Group (VCEG), which is a working group

    of the International Telecommunication Union (ITU-T). The Joint

    Video Team carried out the final task of developing H.264/AVC

    [3], a co-operative effort of both Moving Picture Expert Group

    MPEG and VCEG in 2003.  Fig. 1 shows the video frame structure.

    A video-frame in H.264/AVC is divided into MBs   [31],   which are

    the basic unit for motion prediction. Each MB can be further sub-

    divided into blocks that are transform-coded in order to de-corre-

    late the data. For transmission, a video frame can be split into

    slices, each of which occupies a network packet. When data-parti-

    tioning is enabled, each slice can be further sub-divided in up to

    three partitions, data-partitions-A, -B and -C, each of which can

    occupy a separate network packet. It will be necessary to split a

    network packet if its size exceeds the Maximum Transport Unit

    (MTU) of the network.

     2.3. Rateless channel coding 

    We selected Raptor codes [32]  as our rateless code.  Fig. 2 is a

    simple diagram of how a Raptor encoder works.   k   represents

    source symbols and  m  represents received symbols. A systematic

    Raptor code is arrived at [32] by first applying the inverse of the

    inner code to the first  k  symbols before the outer pre-coding step.

    Systematic channel codes separate the redundant coding data fromthe information data, allowing the information data to be passed

    directly to the source code decoder without channel decoding if 

    no errors are detected (usually by means of checksums).

    For a rateless code, even if there were no channel errors, there is

    a very small probability that the decoding will fail (the probability

    converges to zero in polynomial time in the number of input sym-

    bols), which failure can be modeled statistically by the scheme

    introduced in   [28]. In compensation, these codes are completely

    flexible, have a theoretical linear decode computational complex-

    ity, and generally their overhead is considerably reduced compared

    to fixed erasure codes. Furthermore, if the packets are pre-encoded

    with an inner code, as in Raptor codes to achieve a systematic code,

    a weakened Luby Transform (LT) [33], with reduced computational

    complexity compared to the original LT  [28], can be applied to thesymbols and their redundant symbols.

    In [32], an implementation of Raptor code is reported decoding

    at several gigabits per second on a 2.4 GHz Intel Xeon processor. By

    way of comparison, in 2013 smartphones generally had a four-core

    processor running at around 1.5 GHz, though high-end smart-

    phones had 64-bit processor clock speeds of around 2.2 GHz. As a

    measure of its acceptability for mobile multimedia, Raptor coding

    has been adopted for two wireless standards, Digital Video Broad-

    casting (DVB) for handheld devices (DVB-H) [34] and Third Gener-

    ation Partnership Project (3GPP) Multimedia Broadcast/Multicast

    Services (MBMSs) [35], though neither uses the codes for unicastor accepts repair packet requests, as occurs in this paper. Impor-

    tantly though, using Raptor code in these standards at the block-

    level rather than the byte-level, as herein, can significantly impede

    its throughput.

    Notice that in these standards [34,35], application-layer Raptor

    coding is applied at the packet-erasure correction level, while

    physical-layer channel coding occurs at the bit-level. If physical-

    layer correction fails according to checksum detection [36] then a

    packet is marked as an erasure to be corrected at the application

    layer. However in our paper, upon physical-layer correction failure,

    packets are not marked as erased but their data are passed to the

    application layer for byte-level Raptor code correction. A related

    procedure is proposed for mobile phones in [37], when, after phys-

    ical-layer correction failure, partially decoded packets are passed

    to the application layer for block-level correction.

    In a Raptor code, a belief-propagation (BP) algorithm can be

    employed to decode the inner LT code. A BP implementation in

    [38] treated decoding as a soft decoding task rather than correction

    of erasures. The alternative is a hardware implementation of Rap-

    tor’s inner code using Gaussian elimination. Gaussian elimination

    has a total complexity of order O(nk2) to solve a linear equation

    system with n  output symbols and  k  unknowns (the original data

    symbols), resulting in a linear per data symbol complexity of 

    O(nk). The outer code in [28] is selected as a Low-Density Parity-

    check Code (LDPC), which is also an optional physical-layer code

    in IEEE 802.11n, either using Gaussian elimination or in  [39] a BP

    algorithm in hardware. As Gaussian elimination is at the heart of 

    Raptor decoding, optimizations of the algorithm intended for hard-

    ware implementation on mobile phones [40] are available.

    3. Related research

    A number of research papers have considered some aspects of 

    the video streaming method used by us. In   [41]   a packetization

    method was presented for robust H.264 video transmission over

    an IEEE 802.11 wireless local area network (WLAN) configured as

    home network. Video robustness was enhanced by using small

    NAL units and by retrieving possible error-free IP packets from

    the received MAC frame. An aggregation scheme with a recovery

    mechanism was deployed and evaluated via simulation. For fixed

    physical-layer resources, the system provided a 2.5-dB gain in

    video quality (PSNR) compared to making no NAL packetizationadjustments for similar throughput efficiency. Equally an 80%

    Macro

    block 

    Frame

    Frame

    slicesDP-A DP-B DP-C

    DP-A DP-B DP-C

    DP-A DP-B DP-C

    Fig. 1.   Video frame structure, DP-A = data-partition-A and similarly for DP-B andDP-C.

    + +   +   +   +   +   ++   +++

    codingOuter 

    LTcoding

    Redundantnodes

    m

    Fig. 2.   Raptor encoding scheme.

    L. Al-Jobouri et al./ J. Vis. Commun. Image R. 25 (2014) 1493–1506    1495

    http://-/?-http://-/?-http://-/?-

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    4/14

    improvement in throughput was achieved for a similar video qual-

    ity. However, data partitioning as a way of varying NAL unit sizes

    was not used. The work in [41] was tailored to IEEE 802.11 WLANs

    using the Distributed Coordination Function (DCF) for negotiation

    of access to the wireless channel. As a result, data throughput

    was strongly influenced by the data frame size. Broadband wireless

    systems tend to employ centralized packet scheduling, typically

    through Time Division Multiple Access (TDMA) with pre-config-ured frame sizes into which packets from each node (subscriber

    station) are packed. Consequently, data partitioning for such sys-

    tems is a more appropriate way of managing packetization issues.

    Research in [42] used Forward Error Correction (FEC) and Auto-

    matic Repeat re-Quest (ARQ) to support streaming over WiMAX,

    exploiting features of the WiMAX standard. In particular, channel

    state information held at WiMAX stations served to dynamically

    construct the MAC Protocol Data Unit (MPDU). The size of these

    units was thus determined such that the packet dropping probabil-

    ity was minimized without compromising goodput. Simulation

    results showed that the ARQ-enabled adaptive algorithm was

    always better than the non-adaptive algorithm. The scheme in

    [42] anticipated our work but no particular application was catered

    for in [42] other than it should be a real-time streaming one and,

    indeed, the authors of  [42] were particularly concerned with voice

    over IP (refer to   [43]). Because our work is specialized for video

    streaming it can apply aspects of video source coding to streaming.

    We also utilize adaptive channel coding rather than the block

    codes of  [42]. While we utilize ARQ to request additional FEC data

    according to channel conditions, ARQ in   [42]   was employed to

    change the MPDU size, as the possibility of altering the channel

    coding rate was not available to the authors of  [42].

    In   [16]   the researchers compared non-scalable video coding

    with data partitioning using H.264/AVC under similar application

    and channel constructions for conversational applications over

    mobile channels. The experimental results showed that by using

    the data-partitioning scheme the number of entirely lost frames

    can be lowered and the probability of poor-quality decoded video

    can be reduced. In the data-partitioning scheme of  [16], differentialprotection was achieved by selecting from a set of discrete channel

    coding rates, through punctured convolutional codes. However, in

    order to determine the protection level, an optimization procedure

    became necessary to minimize potential distortion. This procedure

    depended on the quantization parameter (QP) and the coding rate

    for each partition. The wireless channel characteristics also had to

    be known in advance by the encoder. However, leaving aside the

    computational complexity of the optimization search in [16], there

    is another key difference between the scheme of   [16]   and this

    paper. In [16] no feedback occurred, so that it was not possible to

    request additional redundant data. In fact, when using punctured

    convolutional codes in  [16]  (rather than the rateless codes used

    herein) it was not possible to generate additional redundant data.

    Data-partitioning can be viewed as a simplified form of SNR orquality layering. Extended quality layering can also be applied to

    video streaming across WiMAX. In [44], adaptive multicast stream-

    ing was proposed for WiMAX using the Scalable Video Coding

    (SVC) extension for H.264. WiMAX channel conditions were mon-

    itored in order to vary the bitrate accordingly. Unfortunately, the

    subsequent decision of the JVT standardization body for H.264/

    AVC   not  to support fine-grained scalability (FGS) implied it will

    be harder to respond to channel volatility in the way proposed in

    [44], which is one reason why we do not employ SVC and a number

    of other issues surrounding SVC are mentioned in the rest of this

    paragraph. Other work concerned with video streaming over

    WiMAX links has also investigated: combining scalable video with

    multi-connections in   [45]; and comparing   [46]   H.264/SVC with

    H.264/AVC for WiMAX. However, the data-dependencies betweenlayers in H.264/SVC medium-grained scalability remain a concern.

    Unlike in FGS, enhancement layer packets may successfully arrive

    but not be able to be reconstructed if key pictures also fail to arrive.

    Besides, for commercial one-way streaming, simulcast is now

    likely to be preferred to H.264/SVC for the reasons outlined in

    [47]. In  [47], it was found that the extra overhead from sending

    an SVC stream compared to an H.264/AVC stream meant that the

    cost of bandwidth consumption outweighed the reduced storage

    cost of SVC once more than 64 sessions had occurred (assuming16 simulcast streams or 16 video layers per session). In another

    comparison [48], it was proposed that scalable video with unequal

    error protection cannot provide any advantage over H.264/AVC

    with equal error protection in a wireless environment, due to the

    overhead of scalable video coding compared to that of single-layer

    coding.

    4. Outline of streaming system

    This Section outlines an effective video streaming system for

    WiMAX that provides error resilience through source-code data

    partitioning  [16] and which works without the need to apply priv-

    ileged protection to the high-priority partitions. As already men-

    tion in Section   1, the main point of this paper is to draw

    attention to the importance of correctly configuring the video

    codec parameters. Therefore, this Section provides a basic outline

    only and the interested reader is referred to the authors’ other

    works, such as [14] and  [15], for more details and variants of our

    streaming system.

    The H.264/AVC QP is set by us in such a way that temporally-

    coded texture data occupies a larger part of a frame’s data than

    that occupied by data in each of the other two partitions. If this

    texture data should be dropped, it can be replaced more easily

    through error concealment at the decoder than data in the other

    two partitions. Error concealment (backward error correction)

    [49] is a non-normative feature [50] of H.26/AVC that nevertheless

    is present in the H.264/AVC JM reference code (found at  http://

    iphome.hhi.de/suehring/tml/). The texture data in partition-C ispacketized in a WiMAX MAC Service Data Unit (MSDU), within a

    MAC Protocol Data Unit (MPDU) [10,51].

    In contrast to the authors’ counter-intuitive approach, the intu-

    itive approach is to give special protection to the partition-A,

    which for predictively-coded frames includes motion vectors as

    well as other important parameters. Partition-B, bearing intra-

    coded MBs, may also be given special protection, as it contains spa-

    tially-coded data whenever suitable references in other video

    frames are not available. As an example of the Unequal Error Pro-

    tection (UEP) approach, in   [52]   hierarchical modulation was

    employed to favor those partitions with more important data for

    the reconstruction of the video frame. For example, partition-A

    motion vectors can be used for motion copy error concealment

    when partition-C data are lost and, therefore, the intuitiveapproach is to protect partition-A (possibly combined with parti-

    tion-B). Readers should consider for themselves from the evalua-

    tion of this paper whether UEP measures are necessary.

    Though no special protection is given, it is still necessary to pro-

    tect the bit-stream (without privileging partitions A and B) against

    the risk of packet loss. This was achieved through Equal Error Pro-

    tection (EEP) of FEC provision. Application-layer rateless coding

    [53]   was selected for its flexibility and its linear computational

    complexity at both the decoder and the encoder. To avoid long

    latencies, which would occur if packet-level FEC were to be

    applied, redundant data were added to the packets themselves,

    treating the bytes within each packet as the data symbols. Again

    to reduce latency, just a single Automatic Repeat reQuest (ARQ)

    was made if the available data were insufficient to reconstruct acorrupted packet. Additional rateless redundant data were added

    1496   L. Al-Jobouri et al./ J. Vis. Commun. Image R. 25 (2014) 1493–1506 

    http://iphome.hhi.de/suehring/tml/http://iphome.hhi.de/suehring/tml/http://iphome.hhi.de/suehring/tml/http://iphome.hhi.de/suehring/tml/

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    5/14

    to the next available packet to be transmitted, according to the

    amount calculated in the way specified by us elsewhere  [14].

    In an extension to the basic IPTV streaming scheme, we also

    added redundant NAL unit packets. A discussion of alternative

    redundant packet schemes is postponed to Section 6.4.

    5. Simulation model

    This Section details how we set about modeling the WiMAX

    system for the evaluation of Section  6.

    5.1. Simulation system

    Fig. 3 shows the overall data flow of the simulation system. Raw

    video (.YUV) is encoded by H.264/AVC into compressed form. The

    compressed file is passed for later extraction of any dropped pack-

    ets. At the same time, a video trace file is generated, which will

    become an input into the ns-2 simulator [54]. The trace file con-

    tains the size of each video packet and the transmission schedule.

    However, the trace file does not contain any video data. After sim-

    ulation using the wireless channel model, the simulator outputs alist of sent and dropped packets. This is used to filter from the ori-

    ginal compressed data file any data that did not arrive at the des-

    tination. The file is subsequently decoded by the H.264/AVC

    decoder. The decoder outputs a raw video file in YUV (YCbCr) for-

    mat. This file would normally be displayed but, for the simulation,

    it is compared with the original raw input video. This allows the

    objective video quality to be calculated as Peak Signal-to-Noise

    Ratio (PSNR), through a pixel-by-pixel comparison.

    Two video clips with different source-coding characteristics

    were employed in the tests in order to judge the dependency of 

    the results upon video source-coding complexity. The first test

    sequence was  Paris, which is a studio scene with two upper body

    images of presenters and moderate motion. The background is of 

    moderate to high spatial complexity leading to larger slices. Theother test sequence was   Football, which has rapid movements

    and consequently has high temporal coding complexity. Both

    sequences were encoded at Common Intermediate Format (CIF)

    (352 288 pixel/picture). CIF resolution was used for ready com-

    parison with the prior work of others on video communication to

    mobile devices.

    5.2. Wireless channel model

    The Gilbert–Elliott channel mode employed in this work has

    been used by researchers in the wireless field   [55,56] because of 

    its ability to model error burst patterns as experienced at the recei-ver. This channel model was introduced into the ns-2 event-driven

    simulator  [54]. The Gilbert–Elliott channel model itself is a two-

    state Markov chain. It is based on: good and bad states; the prob-

    abilities of these states; and the probabilities of the transition

    states between them. In the case of the bad state, losses happen

    with higher probability while in the good state losses happen with

    lower probability. PGG refers to the probability of being in the good

    state and PGB is the probability of a transition from the good state

    to the bad state. PBB is the probability of being in the bad state and

    PBG refers to the probability of a transition from the bad to good

    state. PGG (PBB) can be interpreted as the probability of remaining

    in the good (bad) state, given that the previous state was good

    (bad). Conversely, PGB represents the probability that given that

    the previous state was good, a transition is made from the good

    to the bad state. By the law of total probability, all probabilities

    sum to one (certainty). Therefore, we have PGG + PGB = 1, resulting

    in (1). A similar argument for the bad state leads to  (2).

    P GG ¼ 1 P GB   ð1Þ

    P BB ¼ 1 P BG   ð2Þ

    For the stochastic process to remain stationary in time,

    pG pGB  ¼ pB pBG   ð3Þ

    where pG and pB are the steady state probabilities of being in a good

    or bad state respectively. Again by the law of total probability,

    pB = 1 pG. Substituting this expression for  pB  into (3)  easily leads

    to:

    pG ¼  pBG

     pBG þ pGBð4Þ

    Similarly,   pG = 1 pB. Substituting this expression for   pG   into

    (3) easily leads to:

    pB  ¼  pGB

     pBG þ pGBð5Þ

    The Gilbert–Elliott model good and bad states have their own

    error distributions that are independent of the process of arriving

    or leaving those states, i.e. forming a Hidden Markov Model. Sup-

    pose the probability of packet loss is   pG   and   pB   in the good and

    bad states respectively. Then the average packet loss rate produced

    by a Gilbert–Elliott channel is given in (6) by the usual expression

    for the expectation of a probability distribution.

     p ¼  pGpG þ pBpB   ð6Þ

    To model the effect of slow fading at the packet-level, as the Gil-

    bert–Elliott model’s parameters, the PGG was set to 0.96,

    PBB = 0.95, PG = 0.01 and PB = 0.02. Additionally, it is still possible

    for a packet not to be dropped in the channel but nonetheless to be

    corrupted through the effect of fast fading (or other sources of 

    noise and interference). This byte-level corruption was modelled

    by the second Gilbert–Elliott model, with the same parameters

    (applied at the byte level) as that of the packet-level model except

    that PB (now probability of byte loss) was increased to 0.165. Effec-

    tively, this second model emulates fast fading between good and

    bad conditions.

    The slow fading parameters in the Gilbert–Elliott model wereestablished empirically by a long series of trial simulations aimed

    Video file

    .YUV

    JM 14.2

    H.264/AVC

    encoder

    Generate

    video trace

    Received trace

    packets

    Sent trace

    packets

    Checking

    dropped

    packets

    Filtered video

    stream

    Reconstructed

    YUV file

    PSNR

    calculation

    JM 14.2

    H.264/AVCdecoder

    Sending

    node

    Receiving

    node

    Network 

    channel model

    Fig. 3.   Simulation system.

    L. Al-Jobouri et al./ J. Vis. Commun. Image R. 25 (2014) 1493–1506    1497

    http://-/?-http://-/?-http://-/?-

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    6/14

    at establishing packet loss rates (PLRs) that were known from

    experience to be critical to the acceptable reconstruction of a video

    stream by a decoder. Specifically, PLRs below 5% can be tolerated

    without protection measures, while above 10% it is difficult for

    error resilience measures and/or channel coding to reconstruct a

    video sequence. This observation can be generalized in the above

    dual Gilbert–Elliott model to achieve data loss rates in the range

    5–10%. It is only realistic to specify a data loss range because of the statistically nature of the model’s behaviour. In this paper,

    we have presented results in the data loss range that was selected

    as critical. It is perfectly possible for a link to be benign and no data

    losses to occur but clearly one should engineer a streaming solu-

    tion to cope with critical conditions. As the channel coding protec-

    tion in this paper is adaptive, the problem of transmitting

    redundant overhead in a benign channel is reduced. As extra

    redundant data cannot be transmitted more than once, the possi-

    bility of attempting video repair when no amount of protection

    will work is also avoided.   Fig. 4   is an example from preliminary

    tests in which the Gilbert–Elliott PB parameter is systematically

    varied while the other Gilbert–Elliott parameters were set as

    above. This allows the relationship to the PLR for four reference

    video clips to be found. By observation, the relationship is approx-

    imately linear.

    In   [57]   for an IEEE 802.15.4 wireless link, the authors take

    extensive measurements of a bursty link to arrive at a metric for

    burstiness, which is then found to be a generalisation of the burs-

    tiness in a Gilbert–Elliott model. However, the Gilbert–Elliott

    model differed from ours in representing the hidden bad state to

    result in the loss of all packets and likewise the good state to result

    in all packets surviving. An interesting observation of the authors

    of   [57] is that burstiness tends to occur when a mobile device is

    at the limits of the range of a radio link, so that small changes in

    the physical channel cause the signal to drop below the receiver’s

    sensitivity. In [56], the estimation of model parameters from chan-

    nel measurements is also treated for wired Internet examples.

    5.3. WiMAX simulation configuration

    The physical (PHY) layer settings selected for WiMAX simula-

    tion on ns-2 [58] are given in Table 1. The antenna was modelled

    for comparison purposes as a half-wavelength dipole. The Time

    Division Duplex (TDD) frame length was set to 5 ms, as this is

    the only value supported by the WiMAX Forum [59]. As mentioned

    under modulation, the physical-layer FEC convolutional code rate

    was ½. Video was transmitted over the downlink with UDP trans-

    port (see Section 2.1). In order to introduce sources of traffic con-

    gestion, an always available FTP (File Transfer Protocol) source

    was modelled with TCP transport to a Subscriber Station (SS) from

    the WiMAX Base Station (BS). Likewise a CBR source with packet

    size of 1000 byte and inter-packet gap of 0.03 s was downloaded

    to another SS.

    6. Evaluation of codec settings

    This Section evaluates in turn the codec configurations intro-duced in Section 1.

    6.1. Intra-Refresh Macroblocks

    As previously mentioned, in H.264/AVC data partitioning

    [16,60], motion vectors (MVs) are packed into partition-A bearing

    NAL units, allowing motion copy error concealment at the decoder

    to partially reconstruct a picture, despite missing partition-C NAL 

    units (containing quantized transform coefficient residuals). Parti-

    tion-B NAL units contain intra-coded (spatially encoded) MBs,

    which are substituted for inter-coded MBs according to encoder

    implementation (only the decoder input format is standardized

    in H.264/AVC). Therefore, when Intra-Refresh (IR) MBs are

    included alongside naturally intra-encoded MBs, partition-B slicesgrow in size. This means that data-partitioned video compression

    provides a convenient way to examine the effect of various

    amounts of IR MB provision. Once the H.264/AVC encoder has

    formed a NAL unit, it can also provide a RTP header prior to encap-

    sulation by IP/UDP network protocol headers.

    A point to note is the different way that random IR MBs are

    specified in the H.264/AVC JM 14.2 implementation compared to

    that of cyclic IR line intra update. In random IR MB, a maximum

    percentage of IR can be specified, which percentage includes

    already encoded IR MB. If the given quota of IR MB is already lar-

    gely occupied by naturally encoded MBs (those encoded to cover

    newly revealed objects or to improve the quality of the video up

    to a given bit budget or for some other encoder-dependent reason),

    then only a small amount of extra randomly inserted MBs will beadded. In contrast, if a line of IR MBs is inserted then these MBs

    0

    5

    10

    15

    20

    25

    0.03 0.06 0.09 0.12 0.15 0.18 0.21 0.24 0.27 0.3

    PB

       P  a  c   k  e   t   l  o  s   t   %

    Foreman

    Car Phone

    Highway

    Paris

    Fig. 4.   Example plot showing the relationship between Gilbert–Elliott PB parameter

    and packet loss rate for several CIF video sequences at 30 Hz, with data-partitioningapplied.

     Table 1

    IEEE 802.16 (WiMAX) parameter settings.

    Parameter Value

    PHY OFDM

    Frequency band 5 GHz

    Bandwidth capacity 10 MHz

    Duplexing mode TDD

    Frame length 5 ms

    Max. packet length 1024 BRaw data rate 10.67 Mbps

    IFFT size 1024

    Modulation 16-QAM 1/2

    Guard band ratio 1/8

    DL/UL ratio 3:1

    Channel model Gilbert–Elliott

    SS transmit power 245 mW

    BS transmit power 20 W

    Approx. range to SS 1 km

    Antenna type Omni-directional

    Antenna gains 0 dBD

    SS antenna height 1.2 m

    BS antenna height 30

    PHY = Physical layer, IFFT = Inverse Fast Fourier Transform, DL/UL = Downlink/

    Uplink, QAM = Quadrature Amplitude Modulation, SS = Subscriber Station,

    BS = Base Station, TDD = Time Division Duplex, OFDM = Orthogonal Frequency

    Division Multiplexing.

    1498   L. Al-Jobouri et al./ J. Vis. Commun. Image R. 25 (2014) 1493–1506 

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    7/14

    are added in addition to those intra-coded MBs that have already

    been included by the encoder, as shown in Fig. 5.

    Football was VBR encoded with a Group-of-Pictures (GoP) struc-

    ture of IPPP. . .  at 30 frame/s. (Refer forward to Section 6.3  to see

    the tests that led to choice of the more favourable IPPP. . .   GoP

    structure.) From Table 2, it is apparent that, as the percentage of 

    IR MBs is increased, the size in bytes of partition-B increases for

    the same QP. Because more MBs are assigned to partition-B, thesize of partition-C reduces. And because of the large amount of nat-

    urally-encoded intra MBs, this effect is gradual until 25% of random

    IR MBs are added. 25% of random IR MBs is shown in Table 2, as

    that amount approximately corresponds to the total partition-B

    size if cyclic line intra update is turned on instead (with approxi-

    mately the same number of MBs).

    Fig. 6   illustrates the range of metrics we extracted from our

    codec configuration tests. However, it should be noted that, in

    Fig. 6, increasing the provision of IR MBs from 2% to 5%, then to

    6% and finally 25% (in the case of MB line intra update), increases

    the throughput and, hence, the bandwidth requirements in respect

    to co-existing traffic. The 25% IR commitment tested is large in

    practice, due to the coding inefficiency of spatial reference coding.

    It results in larger packets and the size of packets is the most

    important factor affecting the percentage of dropped packets, as

    is also evident from the decrease in dropped packet percentages,

    Fig. 6a, when the QP increases (and video quality decreases). From

    Fig. 6a and c, higher-quality video (QP = 20, 25) benefits from ran-

    dom insertion (and this effect is reversed for low-quality video).

    From the objective video quality (PSNR) resulting,   Fig. 6c, it can

    be seen that reducing the IR MB percentage to 2% actually

    improves the PSNR at QPs 30 and 35. From this one can conclude

    that there is little if anything to be gained by raising the IR percent-

    age. This is not surprising, as at lower QPs the encoder has a greater

    bit budget and, hence, can include more naturally-encoded intra

    MBs. In addition, the main effect of reducing the percentage of IR 

    MBs is that the size of partition-B-bearing packets is reduced. In

    turn, this makes these packets less likely to be affected by channel

    conditions, especially burst errors arising from fast fading. Conse-quently, more partition-B packets survive intact, again causing a

    relative gain in video quality.

    Packet end-to-end delay,   Fig. 6d, is the mean delay of those

    packets unaffected by channel conditions. The results show that

    this is generally small in duration, though with a tendency to

    increase due to the propagation delay of larger packets at lower

    QPs. Significantly, a lower percentage of IR MBs, in addition to bet-

    ter quality video in many cases, actually results in lower delay.

    Turning to corrupted packets, there are larger percentages of 

    corrupted packets at higher QPs,  Fig. 6b. These are packets that

    have not been repaired completely by the adaptive channel coding

    scheme. Because of the additional transmission time, the mean

    end-to-end delay of corrupted packets is higher than other packets,Fig. 6e. In fact, it is the extent of the delay that is the main contri-

    bution of corrupted packets to the quality-of-service. However,

    though the percentage of corrupted packets increases at higher

    QPs, the packet sizes decrease due to the reduced coding efficiency.

    Smaller packets take less time to re-transmit compensating for the

    increase in the number of corrupted packets.

    6.2. Constrained intra prediction setting 

    In order to decode partition-B and -C, the decoder must know

    the location from which each MB was predicted, which implies

    that partitions B and C cannot be reconstructed if partition-A is

    lost. Though partition-A is independent of partitions B and C, CIP

    should be set in the codec configuration [20] to make partition-Bindependent of partition-C. (Though reference [20] refers to a pro-

    posal to add constrained inter prediction to H.264/AVC, it also

    describes constrained intra-prediction, which is already a part of 

    the codec standard.) By setting this option, partition-B MBs are

    no longer predicted from neighboring inter-coded MBs. This is

    because the prediction residuals from neighboring inter-coded

    Fig. 5.   Differences between random intra-refresh MBs (upper frames with 6%) and MB cyclic line intra update (lower frames).

     Table 2

    Mean size of different partitions in bytes for  Football  at various QPs.

    QP 2% Intra-refresh MB 5% Intra-refresh MB 6% Intra refresh MB

    A B C A B C A B C

    20 1842 2678 3889 1845 2767 3867 1846 2810 3850

    25 1687 1697 2533 1690 1763 2511 1696 1793 2502

    30 1459 1047 1496 1463 1082 1482 1467 1098 1479

    35 1117 572 688 1120 595 682 1123 604 681

    QP 25% Intra-refresh MB MB line intra update

    A B C A B C

    20 1893 3450 3669 1885 3385 3683

    25 1746 2216 2379 3683 2160 2400

    30 1505 1346 1405 1498 1312 1414

    35 1146 729 646 1143 716 652

    L. Al-Jobouri et al./ J. Vis. Commun. Image R. 25 (2014) 1493–1506    1499

    http://-/?-

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    8/14

    MBs reside in partition-C and cannot be accessed by the decoder if 

    a partition-C packet is lost. There is a by-product of increasing

    packet sizes due to a reduction in compression efficiency but the

    increase in size may be justified in error-prone environments.

    The two video clips (Paris   and  Football) were VBR encoded at

    CIF, with a GOP structure of IPPP(for choice of GoP structure again

    refer forward to Section   6.3). . .

      at 30 Hz and with 5% IR MBs(choosing a lower percentage as a result of the tests in Section 6.1).

    Table 3  presents NAL unit sizes for the  Paris   sequence with and

    without CIP, showing the increase in partition-B and -C sizes,

    which results from the loss in encoding efficiency. Notice that

    any NAL units that are above the maximum packet size of 1024

    B in Table 3 are later constrained by the encoder to the MTU when

    forming an RTP packet prior to encapsulation by network headers.

    The reason for this precaution is to avoid these larger NAL units

    being segmented at the link layer, avoiding the possible separation

    of header information from NAL data.

    At lower QPs, i.e. higher-quality video, the relative size of parti-

    tion-C NAL units means that the more important partition-A and -B

    packets are less likely to suffer channel error. However, the larger

    packet sizes mean that congestion may have more of an impact,because longer packets take longer to transmit and free the

    channel. At higher QPs, the advantage of differential packet sizes

    is lost but the generally smaller packet sizes compensate to some

    extent. There is also a small growth in partition-B and -C packet

    size when CIP is turned on.

    From Table 4 for Football one sees that though the relative rank-

    ing of sizes between the partition types is similar, the actual sizes

    are larger than those for Paris (see Table 3). The larger sizes are due

    to the temporal coding complexity of  Football. For high QP, the rel-

    atively larger size of partition-A NAL units compared to the other

    partitions NAL units may create a problem, as it does not result

    0

    6

    12

    18

    24

    30

    20 25 30 35

       D  r  o  p  p  e   d  p  a  c   k  e   t  s   %

    QP

    2% Intrarefresh MB

    5% Intra

    refresh MB

    6% Intra

    refresh MB

    MB Line Intra

    Update

    0

    6

    12

    18

    24

    30

    20 25 30 35

       C  o  r  r  u  p   t  e   d  p  a  c   k  e   t  s   %

    QP

    2% Intra

    refresh MB

    5% Intra

    refresh MB

    6% Intra

    refresh MB

    MB line Intra

    Update

    (a) (b)

    0

    8

    16

    24

    32

    40

    20 25 30 35

       M  e  a  n   P   S   N   R   (   d   B   )

    QP

    2% Intra

    refresh MB

    5% Intra

    refresh MB

    6% Intra

    refresh MB

    MB Line

    Intra Update

    0

    0.03

    0.06

    0.09

    0.12

    20 25 30 35   P  a  c   k  e   t  m  e  a  n  e  n   d  -   t  o  -  e  n   d   d  e   l  a  y   (  s   )

    QP

    2% Intra

    refresh MB

    5% Intra

    refresh MB

    6% Intra

    refresh MB

    MB Line Intra

    Update

    (c) (d)

    0

    0.03

    0.06

    0.09

    0.12

    0.15

    20 25 30 35   C  o  r  r  u  p   t  e   d  p  a  c   k  e   t  s  m  e  a  n   d  e   l  a  y   (  s   )

    QP

    2% Intra

    refresh MB

    5% Intrarefresh MB

    6% Intra

    refresh MB

    MB LineIntra Update

    (e)

    Fig. 6.   Mean performance metrics for   Football  with 2%, 5%, 6% IR MBs and MB line intra update.

     Table 3

    Paris  video sequence: Mean NAL unit size in bytes according to partition type and

    video quality.

    QP Without CIP (B) With CIP (B)

    A B C A B C

    20 740 495 4097 738 504 4154

    25 601 336 2058 601 356 2093

    30 473 210 838 473 238 857

    35 331 124 281 330 149 291

    1500   L. Al-Jobouri et al./ J. Vis. Commun. Image R. 25 (2014) 1493–1506 

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    9/14

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    10/14

    GoP structures: IPPP. . .   (i.e. one initial I-picture and all P) and

    IBBPBBP. . .   (i.e. insertion of bi-predictive B-pictures for greater

    coding efficiency but still with one initial I-picture). Before starting

    those tests, we examined the effect of the two different GoP struc-ture types on the sizes of video frames. One potential impact of the

    GoP structure is that B-pictures increase coding efficiency at a cost

    in coding complexity due to the need to reference at least two

    other pictures. Unfortunately, with the inclusion of B-pictures,

    the mean size of P-pictures increases, as a result of the increased

    reference distance between P-pictures. For example, for QP = 20,

    the IBBPBBP. . .   mean P-picture size is as high as 15 kB, which

    may well result in a series of large packets.

    The tests were again performed on   Paris   and   Football. Both

    sequences were VBR encoded video at 30 Hz, CIF, with 2% IR MBs

    randomly inserted. Recall from Section 6.1  that inserting 2% of IR 

    MBs gave the best video quality and the lowest delay, which is

    why this percentage of IR MBs was selected for these tests. CIP

    was configured and the two different GoP types (IPPP. . .

      andIBBP. . .) generated the video traces.

    The effect of adding B-pictures is evident in  Fig. 12, in which

    many more packets are dropped for  Football  at QP = 20. ‘Dropped

    packets’ includes buffer overflow and outright channel drops.

    Repeated simulations of the IBBP. . .  configuration of  Football   for

    QP = 20 confirmed the packet drop spike at this setting. The effect

    derives from the larger P-pictures that are the result of inserting B-

    pictures into the GoPs, when P-pictures are already large because

    of the lower QP setting. The result is a series of large P-picture-

    bearing packets. As previously mentioned, from  Table 5, the Foot-

    ball  mean P-picture size at QP = 20 for IBBP. . . is almost twice the

    size of that when an IPPP. . . GoP structure is configured and simi-

    larly twice the size of P-pictures for either of the  Paris GoP config-

    urations at QP = 20 in   Table 5. The IBBP. . .   GoP structure

    particularly impacts the more temporally complex sequence,

    resulting in large B-partition packets. A series of large packets

    results in more transmission errors and more packet drops if the

    send buffer becomes saturated. The general reduction in packet

    drop numbers in Fig. 12  for decreasing QP, results in an increase

    in video quality in Fig. 13. Moreover, an IPPP. . .  GoP structure is

    preferable except at QP = 35. However, at QP = 35 all GoP struc-

    tures result in a PSNR of over 25 dB.Corrupted packet levels are generally high,   Fig. 14, but in

    respect to GoP structure it seems that an IPPP. . .  structure may

    be more favorable for temporally complex content (Football) and

    vice versa   for less active sequences. As was previously remarked,

    the main consequence of higher levels of packet corruption, after

    the application of the proposed adaptive scheme, is in greater

    delay for a greater percentage of packets,  Fig. 15. Apart from the

    anomaly at QP = 20 for  Football  with an IBBP. . . GoP structure, as

    0

    0.005

    0.01

    0.015

    0.02

    0.025

    20 25 30 35

       C  o  r  r  u  p   t  e   d  p  a  c   k  e   t  s  m  e  a  n   d  e   l  a  y   (  s   )

    QP

    Paris with

    CIP

    Pariswithout CIP

    Football

    with CIP

    Football

    without CIP

    Fig. 11.   Paris   and   Football   video sequences: Protection scheme mean delay of 

    corrupted packets, with and without CIP.

    0

    4

    8

    12

    16

    20

    20 25 30 35

       D  r  o  p  p  e   d  p  a  c   k  e   t  s   %

    QP

    FootballIPPP...

    Football

    IBBP...

    Paris

    IPPP...

    ParisIBBP...

    Fig. 12.   Dropped packets for differing GoP structure and content.

     Table 5

    Mean P-picture size (bytes) according to QP for two different GoP structures.

    QP Football Paris

    IPPP. . .   IBBP. . .   IPPP. . .   IBBP. . .

    20 8905 15,520 5102 7590

    25 6301 10,311 2824 4249

    30 4185 6381 1398 2238

    35 2444 3756 647 1100

    0

    7

    14

    21

    28

    35

    20 25 30 35

       M  e  a  n   P   S   N   R   (   d   B   )

    QP

    Football

    IPPP...

    Football

    IBBP...

    Paris

    IPPP...

    Paris

    IBBP...

    Fig. 13.   Video quality (PSNR) for two different GoP structures and content.

    0

    7

    14

    21

    28

    35

    20 25 30 35

       C  o  r  r  u  p   t  e   d  p  a  c   k  e   t  s   %

    QP

    FootballIPPP...

    FootballIBBP...

    ParisIPPP...

    ParisIBBP...

    Fig. 14.   Percentage of corrupted packets for two different GoP structures andcontent.

    1502   L. Al-Jobouri et al./ J. Vis. Commun. Image R. 25 (2014) 1493–1506 

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    11/14

    a result of many more large packets each resulting in more trans-

    mission delay (refer back to the discussion of  Fig. 12 and Table 5),corrupted packet delay is approximately twice that of normal

    packet end-to-end delay,  Fig. 16, reflecting the single retransmis-

    sion of extra redundant that is permitted. Again for   Football,

    QP = 20 the mean packet delay is much higher in Fig. 16 as a result

    of many more large P-picture packets each with more transmission

    delay when using an IBBP. . . GoP structure. It was also confirmed

    that for a moderate increase in mean packet size, making parti-

    tion-B completely independent of partition-C through CIP resulted

    in a small (a few dB) improvement in video quality, whenever the

    QP setting allowed sufficient packets to be delivered.

    6.4. Redundant NAL units

    To improve video quality it is possible to provide redundantNAL unit packets. There is a variety of ways of providing redundant

    packets.

    It is possible to use redundant picture slices   [62]  or duplicate

    picture slices  [63]. Redundant picture slices employ a higher QP

    and, hence, coarser quantization than the original slices. Those

    interested in investigating the use of redundant slices further

    should notice that methods to refine the selection of redundant

    slices [64] have also been designed. However, in our scheme using

    redundant picture slices will cause additional drift between the

    encoder and the decoder, if (say) a partition-A packet was matched

    with a partition-C packet’s data with a different QP. Therefore, we

    tested the inclusion of duplicate picture slices/data partitions using

    the same QP as the original slice.

    Even so, there is an implementation issue when employingduplicate picture slices. Though both redundant/duplicate slices

    and data-partitioned slices co-exist in the Extended profile, they

    are not jointly implemented in the JM implementation of H.264/

    AVC  [65]  and, in fact, appear to not to be implemented at all in

    most other software codec implementations such as QuickTime,

    Nero, and LEAD randomly to name a few. However, when employ-

    ing data-partitioning, it is also possible through repeated runs of 

    the encoder to create an additional stream of all partition-A slice

    packets or an additional stream consisting of partition-A and par-tition-B packets or, indeed, a complete duplicate version of the ori-

    ginal stream.

    To test the use of duplicate packets, the same two video clips

    (Paris   and   Football) with different source coding characteristics

    were employed in the tests to judge content dependency. Both

    sequences were VBR encoded, with a GoP structure of IPPP. . .  at

    30 Hz. 5% IR MB data were added to each picture, increasing the

    size of partition-B packets (refer to Table 2). Again a low percent-

    age of IR MBs was added as a result of the gains in video quality

    experienced in Section 6.1’s tests from avoiding higher percentages

    and, as before, the favorable IPPP. . . GoP structure was selected as a

    result of the tests in Section 6.3.

    The duplicate NAL units do not amount to a change in bitrate

    because the packets are simply replicated. However, the end-to-

    end packet delay will obviously increase because of the interleav-

    ing of the duplicate slice packets. In the redundant/duplicate NAL 

    unit scheme, retransmission of extra redundant data was sched-

    uled for all corrupted packets, even if two packets duplicated each

    other. This is because it is not possible to know in advance whether

    the extra redundant data will arrive for any one of the two packets.

    This provision has a significant effect in improving the video qual-

    ity at higher QPs. The reason is that retransmitting extra redundant

    data by two alternative means increases the chance that a packet

    can be reconstructed.

    Fig. 17 shows the effect of the various schemes on packet drops

    when streaming Paris. ‘Data-partition’ in the Figure legend refers to

    sending no redundant/duplicate packets. ‘Redundant X’ refers to

    sending duplicate redundant packets containing data-partitions

    of partition type(s) X in addition to the rateless coded data-parti-tion packets. From   Fig. 17, the larger packet drop rates at

    QP = 20, due to the larger packets, will have a significant effect

    on the video quality.

    Fig. 18   shows the pattern of corrupted packet losses arising

    from simulated fast fading. There is actually an increase in the per-

    centage of packets corrupted if a completely redundant/duplicate

    stream is sent (partitions A, B, and C), though this percentage is

    taken from corrupted original and redundant/duplicate packets.

    However, the effect of the corrupted packets on video quality only

    occurs if a packet cannot be reconstructed after application of the

    adaptive retransmission scheme.

    0

    0.02

    0.04

    0.06

    0.08

    20 25 30 35

       C  o  r  r  u  p   t  e   d  p  a  c   k  e   t  s  m  e  a  n   d  e   l  a  y   (  s   )

    QP

    Football

    IPPP...

    Football

    IBBP...

    Paris

    IPPP...

    Paris

    IBBP...

    Fig. 15.  Corrupted packet mean delay for two different GoP structures and content.

    0

    0.015

    0.03

    0.045

    0.06

    20 25 30 35

       P  a  c   k  e   t  s   M  e  a  n  e  n   d  -   t  o  -  e  n

       d   d  e   l  a  y   (  s   )

    QP

    Football

    IPPP...

    Football

    IBBP...

    Paris

    IPPP...

    Paris

    IBBP...

    Fig. 16.   Mean end-to-end packet delay for differing GoP structure and content.

    0

    1

    2

    3

    4

    5

    20 25 30 35

       D  r  o  p  p  e   d  p  a  c   k  e   t  s   %

    QP

    Data

     partition

    Redundant

    A

    Redundant

    A,B

    Redundant

    A,B,C

    Fig. 17.   Paris sequence protection schemes’ packet drops.

    L. Al-Jobouri et al./ J. Vis. Commun. Image R. 25 (2014) 1493–1506    1503

    http://-/?-http://-/?-http://-/?-

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    12/14

    Examining  Fig. 19  for the resulting video quality (PSNR), one

    sees that data partitioning with FEC protection, when used without

    redundant/duplicate packets, is insufficient to bring the videoquality to above 31 dB, that is to a ‘good’ quality. However, it is

    important to notice that sending duplicate redundant partition-A

    packets alone (without redundant packets from other partitions)

    is also insufficient to raise the video quality to a ‘good’ rating

    (above 31 dB). Therefore, to raise the video quality to a ‘good’ level

    requires not only the application of the adaptive rateless channel

    coding scheme but also the sending of duplicate data streams.

    The impact of corrupted packets, given the inclusion of retrans-

    mitted additional redundant data, is again largely seen in addi-

    tional delay. As before, there is an approximate doubling in per

    packet delay between the total end-to-end delay for corrupted

    packets  Fig. 20  and normal packet end-to-end delay,   Fig. 21. It

    must be recalled, though, that for the redundant schemes there is

    up to twice the number of packets being sent. Therefore, delay is

    approximately further doubled, still though with end-to-end

    delays remaining in the tens of millisecond range. As previously

    noted, this type of delay range is acceptable even for interactive

    applications, but may contribute to additional delay if the WiMAX

    link forms part of a longer network path.

    The experimental results for Football are summarized in Table 6.

    Table 6 shows how packet drops and losses are reflected in video

    quality. Very large numbers of packets are dropped at QP = 20

    because of the larger packet sizes. However, there is a threshold

    effect, as the numbers of dropped packets decline quickly with

    increasing QP (as packet sizes reduce). The protection pattern for

    redundant/duplicate packets is accentuated compared to  Paris, in

    the sense that providing duplicate redundant versions of more

    than just partition-A packets is now clearly seen to be preferable.

    Given that in Quality-of-Experience subjective tests for mobile

    devices [66], news scenes rather than sport are preferred by view-

    ers, it may be advisable to favor content without rapid motion,

    especially if small footballs or similar sports’ balls need to be

    tracked by the viewer.

    6.5. Discussion

    As is normal in a research environment and in papers of this

    nature, we have made a set of simulations of IPTV video streaming

    rather than a testbed. The disadvantage of a testbed, whatever itsmerits in terms of verisimilitude is a lack of flexibility. Therefore,

    0

    6

    12

    18

    24

    30

    20 25 30 35

       C  o  r  r  u  p   t  e   d   P  a  c   k  e   t  s   %

    QP

    Data partition

    RedundantA

    Redundant

    A,B

    RedundantA,B,C

    Fig. 18.   Paris sequence protection schemes’ corrupted packets.

    0

    7

    14

    21

    28

    35

    20 25 30 35

       M  e  a  n   P   S   N   R   (   d   B   )

    QP

    Data partition

    Redundant

    A

    RedundantA,B

    RedundantA,B,C

    Fig. 19.   Paris sequence protection schemes’ video quality.

    0

    0.03

    0.06

    0.09

    0.12

    0.15

    20 25 30 35

       C  o  r  r  u  p   t  e   d

      p  a  c   k  e   t  m  e  a  n   d  e   l  a  y   (  s   )

    QP

    Data

     partition

    Redundant

    A

    Redundant

    A,B

    RedundantA,B,C

    Fig. 20.   Paris sequence protection schemes’ mean delay for corrupted packets.

    0

    0.03

    0.06

    0.09

    0.12

    20 25 30 35

       P  a  c   k  e   t  m  e  a  n  e  n   d  -   t  o  -  e  n   d   d  e   l  a  y   (  s   )

    QP

    Data

     partition

    Redundant

    A

    Redundant

    A,B

    Redundant

    A,B,C

    Fig. 21.   Paris   sequence protection schemes’ mean end-to-end delay for normal

    packets.

     Table 6

    Football redundant/duplicate NAL unit results.

    QP Data-partition Redundant A Redundant A, B Redundant A, B, C

    Dropped packets  (%)

    20 6.92 2.69 13.55 23.12

    25 4.23 2.50 1.38 4.88

    30 3.97 1.44 0.46 0.12

    35 1.66 1.44 0.38 0.00

    Corrupted packets  (%)

    20 30.76 31.37 22.72 14.51

    25 30.64 30.79 30.27 25.11

    30 27.56 30.79 27.42 26.97

    35 21.92 16.55 24.73 22.35

    PSNR (dB)

    20 19.98 23.65 23.11 19.65

    25 20.96 26.47 27.58 33.9230 21.16 29.45 29.16 34.50

    35 23.27 27.65 27.59 30.73

    1504   L. Al-Jobouri et al./ J. Vis. Commun. Image R. 25 (2014) 1493–1506 

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    13/14

    in an ideal world both types of tests should be conducted. In the

    absence of a testbed then detailed simulation can be indicative of 

    the streaming performance if not a conclusive demonstration. Both

    types of test environments may not anticipate all the physical con-

    ditions that may arise such as distances from the base station,

    types of channel fading, and multi-path effects. It should be

    noticed though that the Gilbert–Elliott channel model used in the

    tests is not a physical model of channel conditions but a modelof the burst errors experienced at the receiver. In other words, it

    models the error patterns rather than the physical conditions that

    given rise to error conditions. This distinction may explain the pop-

    ularity of this type of channel model for studies of video commu-

    nication over wireless networks.

    Certain aspects of the proposed scheme such as data-partition-

    ing, CIP, IR MB percentage, and GoP structure can be configured at

    encoding time, allowing pre-storage of TV programs within server

    banks. Unfortunately, the wireless environment is notoriously vol-

    atile, especially if mobile stations are present. It is for this reason

    that we include the rateless channel coding element that allows

    the video server, herein collocated at the WiMAX base station, to

    adapt the rateless channel coding rate to adapt to channel condi-

    tions. Compared to (say) dropping frames the adaptive channel cod-

    ing solution is more fine-grained, as it occurs at the packet level.

    Consequently, the user experience is less likely to be disrupted.

    Other adaptive solutions are possible that take advantage of 

    flexibilities within the hardware. For example, the authors have

    themselves experimented with a form of WiMAX adaptive modula-

    tion in [67]. The disadvantage of such approaches is that they are

    technology dependent in the sense that the firmware within a base

    station may need to be modified, sometimes through cross-layer

    intervention. A software adaptive approach such as the one using

    adaptive rateless coding in this paper can more easily be deployed

    and modified for differing broadband wireless technologies.

    7. Conclusion

    Both FEC and data-partitioning of IPTV video streams are a way

    of providing graceful quality degradation in a form that will work

    in good and difficult wireless channel conditions. This paper

    showed that video configuration also affect the video quality,

    dropped packets and delay. In that respect, it was shown that it

    is better to include a small percentage of IR MBs that can build

    their effect over time than employ the cyclic IR line update scheme.

    Packet size, which is determined by content, video quality, and GoP

    structure is an important determinant of packet drops. The use of 

    equal error protection is a way of taking advantage of the natural

    packet size differential, which is in inverse order of the priority

    of the data partitions. Thus, smaller packet lengths already confer

    a lower risk of channel error. However, the inverse size order of 

    data partitions (larger partition-A and -B) was seen to occur whensmaller QPs were chosen. It was also confirmed that for a moderate

    increase in mean packet size, making partition-B completely inde-

    pendent of partition-C resulted in a small but significant improve-

    ment in video quality.

    An interesting observation is that there is a need to reduce

    packet size to reduce packet loss, despite the combined effect of 

    redundant packets and application adaptive channel coding. This

    is because during ‘bursty’ error conditions (as was simulated by

    the Gilbert–Elliott channel model) it is possible that both the origi-

    nal packet and its redundant counterpart may be dropped or cor-

    rupted. However, this effect is dependent on choice of QP, as a

    low QP can lead to high packet drop rates with poor video quality.

    In general, in poor channel conditions with both slow and fast fad-

    ing, it is not sufficient to employ just application-layer FEC unlessstream replication also takes place.

    The evaluation tests performed in this paper employed two ‘ref-

    erence’ video sequences from a number of such sequences that

    video coding experts provide as a test of codec implementations.

    Another valid approach, particularly when developing a specific

    product is to select from video actually requested by members of 

    the public. For example, video requests can be selected from a real

    trace file and used to select popular videos that can subsequently

    be used in tests. In fact, this approach represents future work forthis research. Future work will also confirm the validity of the

    wireless channel model by streaming video over a live WiMAX

    link.

     Acknowledgments

    Laith Al-Jobouri, who has recently completed his doctorate at

    the University of Essex, UK, would like to thank his financial spon-

    sors at the Ministry of Science and Technology, Baghdad, Iraq.

    References

    [1] D. de Veeschauwer, Z. Avramova, S. Wittevrongel, H. Bruneel, Transportcapacity for a catch-up television service, in: Proc. of 7th European Conf. on

    Interactive Television, 2009, pp. 161–170.[2]  Y.J. Liang, J.G. Apostolopoulos, B. Girod, Analysis of packet loss for compressed

    video: effect of burst losses and correlation between error frames, IEEE Trans.Circuits Syst. Video Technol. 18 (7) (2008) 861–874.

    [3]  T. Wiegand, G.J. Sullivan, G. Bjontegaard, A. Luthra, Overview of the H.264/AVCvideo coding standard, IEEE Trans. Circuits Syst. Video Technol. 13 (7) (2003)560–576.

    [4]   T. Schierl, M.M. Hannuksela, Y.-K. Wang, S. Wenger, System layer integration of high efficiency video coding, IEEE Trans. Circuits Syst. Video Technol. 22 (12)(2012) 1871–1884.

    [5]   I. Sodagar, The MPEG-DASH standard for multimedia streaming over theInternet, IEEE Multimedia 18 (4) (2011) 62–67.

    [6]   S. Pecheron, Is HEVC ready for prime time?, Comms, Eng., Design (CED) Mag.39 (9) (2013) 22–24.

    [7] S. Lederer, S. Müller, C. Timmerer, Dynamic adaptive streaming over HTTPdataset, in: Proc. of ACM Utimedia Conf., 2012.

    [8] J. Martin, F.Y. Fu, N. Wourms, T. Shaw, Characterizing Netflix bandwidthconsumption, in: Proc. IEEE Consumer and Commun. Conf., 2013, pp. 230–235.

    [9] M. Ryu, H. Kim, U. Ramachandran, Why are state-of-the-art flash-based multi-tiered storage systems performing poorly for HTTP video streaming? in: Proc.ACM Workshop on Network and Operating Systems Support for Digital Audioand Video, 2012.

    [10]   J.G. Andrews, A. Ghosh, R. Muhammed, Fundamentals of WiMAX:Understanding Broadband Wireless Networking, Prentice Hall, Upper SaddleRiver, NJ, 2007.

    [11]  E. Dahlman, S. Parkvall, J. Sköld, P. Beming, 3G Evolution, second ed., AcademicPress, Amsterdam, 2008.

    [12] O.I. Hillestad, A. Perkis, V. Genc, S. Murphy, J. Murphy, Delivery of on-demandvideo services in rural areas via IEEE 802.16 broadband wireless accessnetworks, in: Proc. of 2nd ACM Workshop on Wireless Multimedia, 2006, pp.43–52.

    [13]   D. Niyato, E. Hossain, Integration of WiMAX and WiFi: optimal pricing forbandwidth sharing, IEEE Commun. Mag. 45 (5) (2007) 14–146.

    [14] L. Al-Jobouri, M. Fleury, M. Ghanbari, Protecting H.264/AVC data-partitionedvideo streams over broadband WiMAX, Adv. Multimedia [online journal](2012) 11 pages (article ID 129517).

    [15]   L. Al-Jobouri, M. Fleury, M. Ghanbari, Intra-refresh provision for data-

    partitioned H.264 video streaming over WiMAX, Consumer Electron. Times 2(3) (2013) 137–145.[16] T. Stockhammer, M. Bystrom, H.264/AVC data partitioning for mobile video

    communication, in: Proc. of IEEE Int. Conf. on Image Processing, Singapore,2004, pp. 545–548.

    [17]  D.J.C. MacKay, Fountain codes, IEE Proc.: Commun. 152 (6) (2005) 1062–1068 .[18]   T. Stockhammer, W. Zia, Error-resilient coding and decoding strategies for

    video communication, in: P.A. Chou, M. van der Schaar (Eds.), Multimedia in IPand Wireless Networks, Academic Press, Burlington, MA, 2007, pp. 13–58.

    [19]  I.A. Ali, S. Moiron, M. Fleury, M. Ghanbari, Packet prioritization for H.264/AVCvideo with cyclic intra-refresh line, J. Vis. Commun. Image Represent. 24 (4)(2013) 486–498.

    [20] Y. Dhondt, S. Mys, K. Vermeirsch, R. Van de Walle, Constrained interprediction: removing dependencies between different data partitions, in:Proc. of Advanced Concepts for Intelligent Visual Systems, Delft, 2007, pp.720–731.

    [21]   S. Wenger, H.264/AVC over IP, IEEE Trans. Circuits Syst. Video Technol. 13 (7)(2003) 645–655.

    [22]   B. Waggoner, Compression for Great Video and Audio: Master Tips and

    Common Sense, Focal Press, Burlington, MA, 2010.[23]   W. Simpson, Video Over IP, second ed., Focal Press, Burlington, MA, 2008 .

    L. Al-Jobouri et al./ J. Vis. Commun. Image R. 25 (2014) 1493–1506    1505

    http://refhub.elsevier.com/S1047-3203(14)00112-6/h0010http://refhub.elsevier.com/S1047-3203(14)00112-6/h0010http://refhub.elsevier.com/S1047-3203(14)00112-6/h0010http://refhub.elsevier.com/S1047-3203(14)00112-6/h0010http://refhub.elsevier.com/S1047-3203(14)00112-6/h0015http://refhub.elsevier.com/S1047-3203(14)00112-6/h0015http://refhub.elsevier.com/S1047-3203(14)00112-6/h0015http://refhub.elsevier.com/S1047-3203(14)00112-6/h0015http://refhub.elsevier.com/S1047-3203(14)00112-6/h0020http://refhub.elsevier.com/S1047-3203(14)00112-6/h0020http://refhub.elsevier.com/S1047-3203(14)00112-6/h0020http://refhub.elsevier.com/S1047-3203(14)00112-6/h0020http://refhub.elsevier.com/S1047-3203(14)00112-6/h0025http://refhub.elsevier.com/S1047-3203(14)00112-6/h0025http://refhub.elsevier.com/S1047-3203(14)00112-6/h0030http://refhub.elsevier.com/S1047-3203(14)00112-6/h0030http://refhub.elsevier.com/S1047-3203(14)00112-6/h0050http://refhub.elsevier.com/S1047-3203(14)00112-6/h0050http://refhub.elsevier.com/S1047-3203(14)00112-6/h0050http://refhub.elsevier.com/S1047-3203(14)00112-6/h0055http://refhub.elsevier.com/S1047-3203(14)00112-6/h0055http://refhub.elsevier.com/S1047-3203(14)00112-6/h0065http://refhub.elsevier.com/S1047-3203(14)00112-6/h0065http://refhub.elsevier.com/S1047-3203(14)00112-6/h0065http://refhub.elsevier.com/S1047-3203(14)00112-6/h0075http://refhub.elsevier.com/S1047-3203(14)00112-6/h0075http://refhub.elsevier.com/S1047-3203(14)00112-6/h0075http://refhub.elsevier.com/S1047-3203(14)00112-6/h0085http://refhub.elsevier.com/S1047-3203(14)00112-6/h0090http://refhub.elsevier.com/S1047-3203(14)00112-6/h0090http://refhub.elsevier.com/S1047-3203(14)00112-6/h0090http://refhub.elsevier.com/S1047-3203(14)00112-6/h0095http://refhub.elsevier.com/S1047-3203(14)00112-6/h0095http://refhub.elsevier.com/S1047-3203(14)00112-6/h0095http://refhub.elsevier.com/S1047-3203(14)00112-6/h0105http://refhub.elsevier.com/S1047-3203(14)00112-6/h0105http://refhub.elsevier.com/S1047-3203(14)00112-6/h0110http://refhub.elsevier.com/S1047-3203(14)00112-6/h0110http://refhub.elsevier.com/S1047-3203(14)00112-6/h0115http://refhub.elsevier.com/S1047-3203(14)00112-6/h0115http://refhub.elsevier.com/S1047-3203(14)00112-6/h0115http://refhub.elsevier.com/S1047-3203(14)00112-6/h0110http://refhub.elsevier.com/S1047-3203(14)00112-6/h0110http://refhub.elsevier.com/S1047-3203(14)00112-6/h0110http://refhub.elsevier.com/S1047-3203(14)00112-6/h0105http://refhub.elsevier.com/S1047-3203(14)00112-6/h0105http://refhub.elsevier.com/S1047-3203(14)00112-6/h0095http://refhub.elsevier.com/S1047-3203(14)00112-6/h0095http://refhub.elsevier.com/S1047-3203(14)00112-6/h0095http://refhub.elsevier.com/S1047-3203(14)00112-6/h0090http://refhub.elsevier.com/S1047-3203(14)00112-6/h0090http://refhub.elsevier.com/S1047-3203(14)00112-6/h0090http://refhub.elsevier.com/S1047-3203(14)00112-6/h0090http://refhub.elsevier.com/S1047-3203(14)00112-6/h0090http://refhub.elsevier.com/S1047-3203(14)00112-6/h0090http://refhub.elsevier.com/S1047-3203(14)00112-6/h0085http://refhub.elsevier.com/S1047-3203(14)00112-6/h0075http://refhub.elsevier.com/S1047-3203(14)00112-6/h0075http://refhub.elsevier.com/S1047-3203(14)00112-6/h0075http://refhub.elsevier.com/S1047-3203(14)00112-6/h0065http://refhub.elsevier.com/S1047-3203(14)00112-6/h0065http://refhub.elsevier.com/S1047-3203(14)00112-6/h0055http://refhub.elsevier.com/S1047-3203(14)00112-6/h0055http://refhub.elsevier.com/S1047-3203(14)00112-6/h0055http://refhub.elsevier.com/S1047-3203(14)00112-6/h0050http://refhub.elsevier.com/S1047-3203(14)00112-6/h0050http://refhub.elsevier.com/S1047-3203(14)00112-6/h0050http://refhub.elsevier.com/S1047-3203(14)00112-6/h0050http://refhub.elsevier.com/S1047-3203(14)00112-6/h0030http://refhub.elsevier.com/S1047-3203(14)00112-6/h0030http://refhub.elsevier.com/S1047-3203(14)00112-6/h0025http://refhub.elsevier.com/S1047-3203(14)00112-6/h0025http://refhub.elsevier.com/S1047-3203(14)00112-6/h0020http://refhub.elsevier.com/S1047-3203(14)00112-6/h0020http://refhub.elsevier.com/S1047-3203(14)00112-6/h0020http://refhub.elsevier.com/S1047-3203(14)00112-6/h0015http://refhub.elsevier.com/S1047-3203(14)00112-6/h0015http://refhub.elsevier.com/S1047-3203(14)00112-6/h0015http://refhub.elsevier.com/S1047-3203(14)00112-6/h0010http://refhub.elsevier.com/S1047-3203(14)00112-6/h0010http://refhub.elsevier.com/S1047-3203(14)00112-6/h0010http://-/?-

  • 8/16/2019 Al-Jobouri, Fleury, Ghanbari - 2014 - Engineering Wireless Broadband Access to IPTV

    14/14