sony f1 service format specification › sony › docs › 05 › docs › 4k › phase 1 ›...

40
SONY F1 Service Format Specification Version 0.94 © Sony Corporation 2013. All rights reserved. | 1 15 March 2013 CONFIDENTIAL SONY F1 Service Format Specification Version 0.94 15 March 2013

Upload: others

Post on 30-Jun-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 1 15 March 2013 CONFIDENTIAL

SONY F1 Service

Format Specification

Version 0.94

15 March 2013

Page 2: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

2 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

Disclaimer Even though Sony has thoroughly reviewed this document, it is provided "as is" with no

warranties whatsoever, express or implied, including any warranty of merchantability,

non-infringement, fitness for any particular purpose or any warranty otherwise arising out of

this document or use, implementation or attempted implementation of any information

contained in this document. Sony Corporation disclaims all liability, including liability for

infringement of any proprietary rights, relating to this document or use, implementation or

attempted implementation of any information contained herein.

The product names, brand names, service names, etc. used in this document are each

company, each organization’s trademarks, or registered trademarks.

The references to the trademarks and copyrights are omitted in this document.

Page 3: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 3 15 March 2013 CONFIDENTIAL

Table of Contents

1 General .................................................................................................................................. 6

1.1. Scope .............................................................................................................................. 6

1.2. Specification Architecture Overview ............................................................................. 6

1.3. Reference ....................................................................................................................... 6

1.4. Meaning of words .......................................................................................................... 7

1.5. Definitions ...................................................................................................................... 7

1.6. Acronyms ....................................................................................................................... 7

2. File Format ............................................................................................................................ 8

2.1. Container Header ........................................................................................................... 8

2.1.1. Container for Required Metadata .......................................................................... 8

2.2. Movie Fragments ........................................................................................................... 9

2.2.1. General ................................................................................................................... 9

2.2.2. Movie Fragments for Video .................................................................................... 9

2.2.3. Movie Fragments for Audio .................................................................................... 9

2.2.4. Movie Fragments for Subtitle ................................................................................. 9

3. Media Format ...................................................................................................................... 10

3.1. Video Format ............................................................................................................... 10

3.1.1. AVC video stream ................................................................................................. 10

3.1.1.1. User Data Unregistered SEI Message for the Extended-Gamut YCC Colour

Space 14

3.2. Audio Format ............................................................................................................... 16

3.2.1. MPEG-4 AAC LC audio stream [2-channel] ........................................................... 16

3.2.2. MPEG-4 AAC LC audio stream [5.1-channel] ........................................................ 16

3.2.3. LPCM audio stream [2-channel] ........................................................................... 17

3.2.3.1. AudioSampleEntry Box for LPCM [2-channel, 48kHz, 16bits] ....................... 17

3.2.3.2. LPCM Elementary Stream Constraints [2-channel, 48kHz, 16bits] ............... 17

3.2.4. F1 LPCM audio stream .......................................................................................... 18

3.2.4.1. Storage of F1 LPCM Elementary Streams ..................................................... 18

3.2.4.2. AudioSampleEntry Box for F1 LPCM ............................................................. 18

3.2.4.3. F1LPCMSpecific Box ...................................................................................... 19

3.2.4.4. F1 LPCM Elementary Stream Constraints ..................................................... 22

3.3. Subtitle Format (TBD) .................................................................................................. 26

3.3.1. Subtitle Format ..................................................................................................... 26

3.3.2. Hypothetical Render Model ................................................................................. 27

3.3.3. Subtitle Selection .................................................................................................. 28

Page 4: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

4 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

3.4. Metadata (TBD) ............................................................................................................ 29

3.4.1. SONY F1 Required Metadata ................................................................................ 29

3.4.2. Image Reference................................................................................................... 32

4. Content encryption ............................................................................................................. 34

Annex A. Profile ....................................................................................................................... 34

A.1. SONY F1 Phase 1 Day1 Profile ...................................................................................... 34

A.1.1. Requirements on SONY F1 Phase 1-Day1 File Format and Media Format .......... 34

A.1.2. Requirements on SONY F1 Phase 1-Day1 Subtitle processor implementation ... 37

A.1.3. Requirements on SONY F1 Phase 1-Day1 Required Metadata (TBD) .................. 40

Page 5: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 5 15 March 2013 CONFIDENTIAL

List of Figures

FIGURE 3-1 EXAMPLES OF ACCESS UNIT STRUCTURE FOR RA-I PICTURE AND NON RA-I PICTURE .................................... 13

FIGURE 3-3 24-BIT LPCM SAMPLE ..................................................................................................................... 23

FIGURE 3-4 20-BIT LPCM SAMPLE ..................................................................................................................... 23

FIGURE 3-5 GROUP OF LPCM SAMPLES .............................................................................................................. 24

FIGURE 3-6 BLOCK DIAGRAM OF HYPOTHETICAL RENDER MODEL ............................................................................ 27

FIGURE 3-7 EXAMPLES OF TRACKREFERENCE ELEMENT FOR REQUIREDIMAGES ELEMENT IN REQUIRED METADATA ...... 33

FIGURE 3-8 EXAMPLES OF TRACKREFERENCE ELEMENT FOR OPTIONALIMAGES ELEMENT IN OPTIONAL METADATA ...... 33

List of Tables

TABLE 3-1 ALLOWED PICTURE FORMATS FOR AVC VIDEO STREAM ........................................................................... 10

TABLE 3-2 SYNTAX OF USER DATA UNREGISTERED SEI MESSAGE FOR EXTENDED-GAMUT YCC COLOUR SPACE ................. 14

TABLE 3-3 COLOR_DATA() .............................................................................................................................. 14

TABLE 3-4 ALLOWED AUDIO FORMAT ................................................................................................................. 16

TABLE 3-5 LPCM AUDIO FORMAT[2-CHANNEL] .................................................................................................... 17

TABLE 3-6 F1 LPCM AUDIO FORMAT ................................................................................................................. 18

TABLE 3-7 PERMITTED AUDIO_DATA_PAYLOAD_SIZE VALUES .................................................................................. 20

TABLE 3-8 CHANNEL_ASSIGNMENT .................................................................................................................... 21

TABLE 3-9 SAMPLING_FREQUENCY .................................................................................................................... 22

TABLE 3-10 BITS_PER_SAMPLE .......................................................................................................................... 22

TABLE 3-11 TRACK ID AND ROLE OF SUBTITLE TRACK ............................................................................................. 28

TABLE 3-12 CONTAINERMOVIEMETADATA-TYPE ............................................................................................. 30

TABLE 3-13 ADDITIONALLOCALIZEDINFOLIST-TYPE ................................................................................... 31

TABLE 3-13 ADDITIONALLOCALIZEDINFO-TYPE ........................................................................................... 32

TABLE A-2 UNICODE CODE POINTS TO BE SUPPORTED........................................................................................... 37

TABLE A-3 TEXT RENDERING RATES .................................................................................................................... 39

TABLE A-4 DRAWING RATE ............................................................................................................................... 39

Page 6: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

6 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

1 General

1.1. Scope

This specification defines the file format and the media formats of audio-visual contents and

the download manifest file for the purpose of SONY F1 Service. As a file format and media

format, the specification includes container formats, elementary stream formats,

requirements on encryption of the audio-visual contents and requirements for the playback

devices. As a download manifest file, the specification includes the manifest file structure and

segment file structure and operational rules for the download system.

1.2. Specification Architecture Overview

This specification is composed of three parts. The first part, section 2, defines the file format.

The second part, section 3, defines the media format. The third part, Annex A, defines the

profile requirements of the SONY F1 service. The specification references already available

standards and specifications.

1.3. Reference

[1] DECE Common File Format & Media Formats Specification, Version 1.0.5,

31-October-2012.

[2] Common Metadata, TR-META-CM, v1.2d, September 24, 2012, Motion Picture

Laboratories, Inc.,

http://www.movielabs.com/md/md/v1.2/Common%20Metadata%20v1.2d.pdf

[3] T. e. a. Berners-Lee, RFC 3986, Uniform Resource Identifier (URI): Generic Syntax,

January 2005.

[4] ITU-T Rec. H.264 | ISO/IEC 14496-10, (2010), “Information Technology – Coding of

audio visual objects – Part 10: Advanced Video Coding.”.

[5] ITU-R Rec. BT.709-5: Parameter values for the HDTV standards for production and

international programme exchange.

[6] IEC61966-2-4 Ed. 1.0:2006, Multimedia systems and equipment, - Colour measurement

and management -, Extended-gamut YCC colour space for video applications - xvYCC.

[7] DECE Common File Format & Media Formats Specification, Version 1.0.3,

3-January-2012.

[8] EIA/CEA-708-D, Digital Television (DTV) Closed Captioning, December 1999.

[9] DECE Content Metadata Specification, Version 1.0.5, 31-October-2012.

[10] ISO/IEC 23009-1:2012 Information technology –Dynamic adaptive streaming over HTTP

(DASH) – Part 1: Media presentation description and segment formats.

Page 7: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 7 15 March 2013 CONFIDENTIAL

[11] ISO/IEC 23001-7:2012: Information technology – MPEG systems technologies – Part 7:

Common encryption in ISO base media file format files.

[12] Marlin Developer Community, “Marlin Adaptive Streaming Specification – Simple

Profile”, Version 1.0.

1.4. Meaning of words

In this document, the following words have a special meaning:

INFORMATIVE: indicates a Section or Annex describes supplemental information to aid

understanding of this specification. A compliant SONY F1 Service device is recommended but

not required to comply with informative Sections and Annexes.

MANDATORY: describes a feature that must be implemented to claim compliance to this

specification.

MAY: indicates an action or feature that is not mandatory.

NORMATIVE: indicates a Section or Annex describes a prescriptive part of this specification.

A compliant SONY F1 Service device shall comply with all normative Sections and Annexes.

OPTIONAL: describes a feature that may or may not be implemented. If implemented, the

feature shall be implemented as described.

SHALL and SHALL NOT: indicate requirements strictly to be followed in order to conform to the

document and from which no deviation is permitted.

SHOULD: indicates an action or feature that is optional, but its implementation is

recommended.

1.5. Definitions

For the purposes of this specification, the definitions in section 1.6 of DECE Common File

Format & Media Formats Specification [1] are applied.

1.6. Acronyms

For the purposes of this specification, the acronyms in section 1.6 of DECE Common File

Format & Media Formats Specification [1] are defined. In addition, the following definitions are

defined.

DECE-CFF DECE Common File Format

Page 8: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

8 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

2. File Format The SONY F1 File Format SHALL comply with file format defined in section 2 and section C.2 of

DECE-CFF [1] with additional requirements and constraints defined in this section.

2.1. Container Header

The header of the file SHALL conform to the DCC Header defined in DECE-CFF [1] with the

following additional constraints.

XML Box (‘xml ’) for Required Metadata

The xml field SHALL contain a well-formed XML document with contents that

conform to section 0.

Edit Box (‘edts’)

The ‘edts’ SHALL be present.

The ‘edts’ SHALL contain the Edit List Box (‘elst’) as defined in section 2.1.2.2

in DECE-CFF [1].

Media Header Box (‘mdhd’)

For audio tracks and subtitle tracks, the language SHALL contain the language of

the content in the track. The language SHALL NOT contain the original relase

language of the content.

2.1.1. Container for Required Metadata

Images and any other binary data referred to by the contents of the XML Box for Required

Metadata MAY NOT be in the file.

In case the images and any other binary data referred to by the contents of the XML Box

for Required Metadata are not in the file, they MAY be referenced by URN [3].

In case the images and any other binary data referred to by the contents of the XML Box

for Required Metadata are in the file, the constraints in n section 2.1.2.1 of DECE-CFF [1]

SHALL be confirmed.

Page 9: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 9 15 March 2013 CONFIDENTIAL

2.2. Movie Fragments

2.2.1. General

The movie fragments SHALL conform to the DCC Movie Fragments defined in DECE-CFF [1] with

the following additional constraints.

Track Fragment Run Box (‘trun’)

The version of ‘trun’ SHALL be set to 1.

AVC NAL unit storage Box (‘avcn’)

The ‘avcn’ SHALL NOT be present in the file.

2.2.2. Movie Fragments for Video

The movie fragments for video SHALL conform to the DCC Movie Fragment for video defined

DECE-CFF [1] with the following additional constraints.

In case the video codec is AVC [4], the Movie Fragment for video track SHALL conform to

DCC Movie Fragment defined in section 4.2 and C.4 except for AVC elementary stream

constraints defined in C.4.1 and C.4.3 of DECE-CFF [1]. The constraints on AVC elementary

stream SHALL comply with constraints defined in section 3.1.1.

2.2.3. Movie Fragments for Audio

The movie fragments for audio SHALL conform to the DCC Movie Fragment for audio defined

in DECE-CFF [1] with the following additional constraints.

The Movie Fragment for audio SHALL conform to DCC Movie Fragment defined in section

5.2 and section C.5 except for allowed combinations of audio format defined in Table C-3

of DECE-CFF [1]. The allowed combination of audio format for SONY F1 File Format is

defined in section 0.

In case the audio codec is MPEG-4 AAC, the Movie Fragment for audio track SHALL

conform to DCC Movie Fragment defined in section 5.3 or DECE-CFF [1].

2.2.4. Movie Fragments for Subtitle

The movie fragments for the subtitle SHALL conform to movie fragments defined in section 6.6

and section C.6 of DECE-CFF [1] with the following additional constraints.

Track Header Box (‘tkhd’)

The width SHALL be set to 1920.

The height SHALL be set to 1080.

Page 10: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

10 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

3. Media Format This section describes the requirements for each media format.

3.1. Video Format

3.1.1. AVC video stream

The AVC video stream for SONY F1 Service SHALL comply with section 4.3 and section C.4 of

DECE-CFF [1] with additonal constraints defined in this section. For those constraints which

overlap with the constraints defined in this section SHALL override the constraints defined in

section 4.3 and section C.4 of DECE-CFF [1].

Profile

The video stream SHALL comply with the High Profile defined in AVC [4].

The value of profile_idc in sequence parameter set (SPS) SHALL be set to 100.

Level

The content SHALL comply with the constraints specified for Level 5.1 defined in AVC.

The level_idc in SPS SHALL be set to 51.

Picture Format

The AVC video stream SHALL comply with the picture formats in Table 3 1.

Table 3-1 Allowed Picture Formats for AVC video stream

Picture Formats Sub-sample Factors Parameter Constraints

Frame size Picture

aspect

Frame

rate

Horiz. Vert. Max size

encoded

pic_width_in

_mbs_minus1

pic_height_i

n_map_minus1

aspect_

ratio_idc

3840x2160 1.778 23.976

29.97

1 1 3840x2160 239 134 1

Color Descriptions

The color space used for the AVC video stream SHALL be BT.709 [5] or xvYCC709 [6].

The following parameters Visual Usability Information (VUI) Parameters SHALL have

pre-determined values as defined and the values SHALL be the same throughout the

AVC video stream.

The video_signal_type_present_flag SHALL be set to 1.

The colour_description_present_flag SHALL be set to 1.

The colour_primaries SHALL be set to 1.

The transfer_characteristics SHALL be set to 1(for BT.709 [5]) or

11(for xvYCC709 [6]).

The matrix_coefficients SHALL be set to 1.

Page 11: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 11 15 March 2013 CONFIDENTIAL

Picture Types

I picture : A picture SHALL consists only of I slices.

P picture : A picture SHALL consists only of P slices.

B picture : A picture SHALL consists only of B slices.

Slices

Slice Type

I slice : slice_type SHALL be set to 7.

P slice : slice_type SHALL be set to 5.

B slice : slice_type SHALL be set to 6.

A slice SHALL be composed of one or more macroblock rows. A macroblock row

indicates all the macroblocks in a horizontal row of macroblocks.

In case level_idc is set to Level 5.1 (51), each picture SHALL be encoded as

multi-slice picture with 4 or more slices per picture.

HRD Parameters

nal_hrd_parameters_present_flag in VUI parameters SHALL be set to 1.

vcl_hrd_parameters_present_flag in VUI parameters SHALL be set to 1.

Maximum CPB size

In case the level_idc is set to Level 5.1 (51), the maximum CPB size (MaxCPB)

SHALL be constrained to 120000 [1250 bits/s (cpbBrVclFactor), 1500 bits/s

(cpbBrNalFactor)].

Minimum compression ratio

In case the level_idc is set to Level 5.1 (51), MinCR SHALL be constrained to 4.

Maximum DPB size

In case the level_idc is set to Level 5.1 (51), the MaxDpbMbs SHALL be less than

or equal to the constraints specified for Level 5.1 defined in AVC[4].

Access Unit Structure

The maximum number of NAL units per each access unit SHALL be less than or equal

to 32.

The Picture Timing SEI message SHALL be present for each access uint.

Data structure

The coded video sequence SHALL be less than or equal to 3.003 sec.

Sequence parameter set

The sequence parameter sets within duration of 3.003 sec in presentation time

from the first picture of the video sequence in order of presentation SHALL have

unique seq_parameter_set_id, if any of the parameters have different

values.

Page 12: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

12 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

Picture parameter set

The picture parameter sets within duration of 3.003 sec in presentation time

from the first picture of the video sequence in order of presentation SHALL have

unique pic_parameter_set_id, if any of the parameters have different

values.

- NOTE : In case more than one coded video sequence exists within duration

of 3.003 sec in presentation time from the first picture of the video

sequence in order of presentation, this constraint applies across the coded

video sequences such that pic_parameter_set_id SHALL be unique

across coded video sequences, if any of the parameters have different

values.

In case the video stream is structured as byte stream fromat All picture paremter

sets in coded video sequence SHALL be placed together with the picture

parameter set for the first access unit in the coded video sequence.

Supplemental Enhancement Information (SEI)

In case an access unit is an IDR or a Random Access I picture (RA-I picture) as defined

in section 2.2.7.2.1 of DECE-CFF [1], following SEI messages SHALL be present in the

access unit. (See Figure 3-1) Note : RA-I picture does not include an IDR picture.

Buffering period SEI message

Recovery point SEI message

In case an access unit is non-IDR or non RA-I picture, following SEI messages SHALL

NOT be present together in the access unit. i.e. Only one or none of the following SEI

messages can be present. (See Figure 3-1)

Buffering period SEI message

Recovery point SEI message

In case the color space used for the AVC video stream is xvYCC709 [6], the user data

unregistered SEI message for the Extended-Gamut YCC Colour Space defined in

section 0 SHALL be present in the video stream.

NOTE : The transfer_characteristics SHALL be set to 11 in case the

user data unregistered SEI messages for the Extended-Gamut YCC Colour Space

are present in the video stream.

Only the first decoded picture in each coded video sequence or Random Access

I-picture SHALL have exactly one user data unregistered SEI message for the

Extended-Gamut YCC Colour Space defined in section 0.

Page 13: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 13 15 March 2013 CONFIDENTIAL

RA-I picture slice data

Recovery Point SEI

Buffering Period SEI

Non RA-I picture slice data

Recovery Point SEI

Buffering Period SEI

Non RA-I pictureslice data

AU Delimiter

AU Delimiter

AU Delimiter

Other SEIs

Other SEIs

Other SEIs

except forRecovery Point SEI

except forBuffering Period SEI

Access Unit for RA-I picture

Examples of Access Unit for Non RA-I picture

Non RA-I pictureslice data

AU Delimiter Other SEIs

except forBuffering Period SEI

andRecovery Point SEI

Figure 3-1 Examples of Access Unit structure for RA-I picture and Non RA-I picture

Page 14: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

14 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

3.1.1.1. User Data Unregistered SEI Message for the Extended-Gamut YCC Colour Space

The colour information for the extended-gamut YCC colour space SHALL be carried by the the

user data unregistered SEI messge syntax and semantics indicated in this section.

Table 3-2 Syntax of User Data Unregistered SEI message for Extended-Gamut YCC Colour Space

Syntax Num of bits Mnemonic

user_data_unregistered (payload) {

uuid_iso_idc_11578 128 Uimsbf

TypeIndicator 32 Uimsbf

if ( TypeIndicator == 0x43 4c 49 44 ) {

COLOR_data() {

Format_Flag 1 bslbf

Reserved 2 bslbf

GBD_Color_Precision 2 bslbf

GBD_Color_Space 3 bslbf

Min_Red_Data 12 bslbf

Max_Red_Data 12 bslbf

Min_Green_Data 12 bslbf

Max_Green_Data 12 bslbf

Min_Blue_Data 12 bslbf

Max_Blue_Data 12 bslbf

}

Reserved 16 bslbf

}

}

uuid_iso_idc_11578 SHALL be set to“A74602BB-F8A1-4CC0-A936-48E391DCE761”.

TypeIndicator indicates the type of user data that is carried in this SEI message.

TypeIndicator SHALL be set to“0x43 4c 49 44”.

COLOR_data() indicates the syntax and semantics of COLOR_data() based on Table E-6 in

Appendix E [6].

Format_Flag, GBD_Color_Precision, GBD_Color_Space, Min_Red_Data,

Max_Red_Data, Min_Green_Data, Max_Green_Data, Min_Blue_Data and

Max_Blue_Data SHALL be set as defined in Table 3-3.

reserved SHALL be set to 0 for future usage.

Table 3-3 COLOR_data()

Page 15: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 15 15 March 2013 CONFIDENTIAL

Field Value

Format_Flag 1b

GBD_Color_Precision 10b

GBD_Color_Space 010b

Min_Red_Data 1 00 110110101b

Max_Red_Data 0 01 011110100b

Min_Green_Data 1 00 011010111b

Max_Green_Data 0 01 010010110b

Min_Blue_Data 1 00 011001100b

Max_Blue_Data 0 01 010010010b

Page 16: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

16 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

3.2. Audio Format

This section describes the requirements of audio stream in audio track of SONY F1 Service.

The allowed audio format are defined in Table 3-4.

Table 3-4 Allowed Audio Format

Audio Format Max number

of Channel

Sample Rate

[kHz]

Max Bitrate

[kbps]

Bitrate

Calculation

MPEG-4 AAC LC

[2-channel]

2 48 192 Section 5.3.2.2.2.4

of DECE-CFF [1]

MPEG-4 AAC LC

[5.1-channel]

5.1 48 960 Section 5.3.3.2.2.4

of DECE-CFF [1]

LPCM

[2-channel]

2 48, 96, 192 1536, 3072,

6144

-

F1 LPCM

max. 7.1 48, 96, 192 - Table 3-7

3.2.1. MPEG-4 AAC LC audio stream [2-channel]

MPEG-4 AAC LC [2-channel] audio stream for SONY F1 audio format SHALL comply with

MPEG-4 AAC LC [2-channel] audio stream defined in section 5.3.1, section 5.3.2 and section

C.5.2.1 of DECE-CFF [1] with additional constraints defined in this section with additional

constraints defined in this section.

Channel configuration

In case the audio format is MPEG-4 AAC LC 2-channel, the audio SHALL be encoded in

2-channel stereo.

3.2.2. MPEG-4 AAC LC audio stream [5.1-channel]

MPEG-4 AAC LC [5.1-channel] audio stream for SONY F1 audio format SHALL comply with

MPEG-4 AAC LC [5.1-channel] audio stream defined in section 5.3.1, section 5.3.3 and section

C.5.2.2 of DECE-CFF [1].

Page 17: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 17 15 March 2013 CONFIDENTIAL

3.2.3. LPCM audio stream [2-channel]

LPCM [2-channel] audio stream for SONY F1 audio format SHALL comply with this section

based on section 5.1, 5.2 and section C.5 of DECE-CFF [1] with additional constraints.

Table 3-5 LPCM audio format[2-channel]

Codingname Audio Format SampleEntry Type Section

Reference

twos LPCM [2-channel] MJ2AudioSampleEntry ISO/IEC

15444-3:2007

NOTE: AudioSampleEntry cannot be applied to higher sampling frequency audio such as

96/192kHz. For that purpose AudioSampleEntryV1 is to be newly defined in ISO/IEC

14496-12:2012/DAM2.

3.2.3.1. AudioSampleEntry Box for LPCM [2-channel, 48kHz, 16bits]

The syntax and values of the AudioSampleEntry box SHALL conform to AudioSampleEntry as

defined 5.2.1.6 of DECE-CFF [1], and the following fields SHALL be set as defined:

AudioFormat = ‘twos’

channelcount=2

samplesize=16

samplerate= BB800000h (48kHz)

3.2.3.2. LPCM Elementary Stream Constraints [2-channel, 48kHz, 16bits]

The data consists of interleaved left/right samples.

A sample has 16 bit values with the bytes in big-endian format.

16bit-values range from -32768 to 32767, with 0 being silence.

The presentation length of an audio access unit (audio frame) of the LPCM audio stream is

equal to 40 milli-second(1920 samples/ch).

class MJ2AudioSampleEntry() extends AudioSampleEntry (AudioFormat){

}

Page 18: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

18 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

3.2.4. F1 LPCM audio stream

F1 LPCM audio stream for SONY F1 audio format SHALL comply with this section based on

section 5.2 and section C.5 of DECE-CFF [1] with additional constraints.

Table 3-6 F1 LPCM audio format

Codingname Audio Format SampleEntry Type Section

Reference

fpcm F1 LPCM F1LPCMAudioSampleEntry Section 3.3.4

3.2.4.1. Storage of F1 LPCM Elementary Streams

An audio sample SHALL consist of a single access unit (audio frame).

For 16bit quantization, code values range from -32768 to 32767, with 0 being silence.

3.2.4.2. AudioSampleEntry Box for F1 LPCM

The syntax of the F1LPCMAudioSampleEntry(‘fpcm’) box SHALL conform to that of

AudioSampleEntry as defined 5.2.1.6 of DECE-CFF [1], and the following fields SHALL be set as

defined:

AudioFormat(codingname) = ‘fpcm’

channelcount=2, 4, 6, or 8

The value of channelcount should be equal to the actual number of channels

specified in F1LPCMSpecificBox. The use of this field in the SONY F1 File Format is

optional; it may be ignored on reading.

class F1LPCMAudioSampleEntry extends SampleEntry(‘fpcm’)

{

const unsigned int(32) reserved[2] = 0;

template unsigned int(16) channelcount;

template unsigned int(16) samplesize;

unsigned int(16) pre_defined = 0;

const unsigned int(16) reserved = 0;

template unsigned int(32) samplerate;

F1LPCMSpecificBox

}

Page 19: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 19 15 March 2013 CONFIDENTIAL

samplesize=16, 20, or 24

The value of samplesize should be equal to the actual bits per sample value

specified in F1LPCMSpecificBox. The use of this field in the SONY F1 File Format is

optional; it may be ignored on reading.

samplerate= BB800000h

The value of samplerate is a suitable integer division of the actual sampling

frequency specified in F1LPCMSpecificBox.

3.2.4.3. F1LPCMSpecific Box

The Syntax of the F1LPCMSpecificBox(‘fcfg’) is shown below:

class F1LPCMSpecificBox extends Box (‘fcfg’)

{

unsigned int(32) audio_data_payload_size;

unsigned int(4) channel_assignment;

unsigned int(4) sampling_frequency;

unsigned int(2) bits_per_sample;

unsigned int(6) reserved = 0;

}

Page 20: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

20 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

3.2.4.3.1. Semantics of F1LPCMSpecific Box

audio_data_payload_size - indicates the size in bytes of F1LPCMAudioDataPayload()

Table 3-7 Permitted audio_data_payload_size values

sampling frequency bits per sample number of

channels

audio_data_payload_size

[bytes]

48 kHz 16-bit 2 7680

4 15360

6 23040

8 30720

20-bit / 24-bit 2 11520

4 23040

6 34560

8 46080

96 kHz 16-bit 2 15360

4 30720

6 46080

8 61440

20-bit / 24-bit 2 23040

4 46080

6 69120

8 92160

192 kHz 16-bit 2 30720

4 61440

6 92160

20-bit / 24-bit 2 46080

4 92160

6 138240

Page 21: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 21 15 March 2013 CONFIDENTIAL

channel_assignment - specifies the channel assignment for the channel configuration in

the F1 LPCM audio stream.

Table 3-8 channel_assignment

Value number of

channels

channel configuration channel number

1 2 3 4 5 6 7 8

0 - reserved

1 2 ch mono M1 X

2 reserved

3 stereo L R

4 4 ch L, C, R (3/0) L R C X

5 L, R, S (2/1) L R S X

6 L,C,R,S (3/1) L R C S

7 L,R,LS,RS (2/2) L R LS RS

8 6 ch L, C, R, LS, RS (3/2) L R C LS RS X

9 L, C, R, LS, RS, lfe(3/2+lfe) L R C LS RS lfe

10 8 ch L, C, R, LS, Rls, Rrs, RS

(3/4)

L R C LS Rls Rrs RS X

11 L, C, R, LS, Rls, Rrs, RS, lfe

(3/4+lfe) (‘surround back’)

L R C LS Rls Rrs RS lfe

12 L, C, R, LS, RS, Vhl, Vhr, lfe

(5/2+lfe) (‘front high’)

L R C LS RS Vhl Vhr lfe

13–15 - reserved

M: Mono, L: Left, R: Right, C: Center, S: Surround,

Rls: Rear surround left, Rrs: Rear surround right,

Vhl: Vertical height left, Vhr: Vertical height right,

X: Sample values shall be set to zero.

Page 22: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

22 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

sampling_frequency - specifies the sampling frequency of the F1 LPCM audio stream as

shown in Table 3-9.

Table 3-9 sampling_frequency

Value Meaning

0 Reserved

1 48 kHz

2 reserved

3 reserved

4 96 kHz

5 192 kHz

6 – 15 reserved

bits_per_sample - specifies the sampling resolution of the audio samples for all channels in

the F1 LPCM audio stream as shown in Table 3-10.

Table 3-10 bits_per_sample

Value Meaning

0 Reserved

1 16 bits/sample

2 20 bits/sample

3 24 bits/sample

3.2.4.4. F1 LPCM Elementary Stream Constraints

This Section specifies the syntax and semantics of the F1 LPCM audio stream.

All the channels shall be sampled simultaneously on sampling phase.

The following conditions shall not change in the F1 LPCM audio stream carried in a file.

Sampling frequency

Bits per sample

The channel assignment for each channel configuration

Page 23: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 23 15 March 2013 CONFIDENTIAL

3.2.4.4.1. LPCM audio samples

Audio samples are 16, 20, or 24-bit two’s complement integers. Bit ordering for LPCM audio

samples is such that the most significant bit (msb) is the first (left-most) bit and the least

significant bit (lsb) is last.

3.2.4.4.1.1. 16-bit LPCM audio samples

A 16-bit LPCM sample is split into two bytes, as shown in Figure 3-2. The high byte represents

the eight most significant (b15..b8), and the low byte represents the eight least significant bits

(b7..b0).

b15 b0

LSB MSB

Figure 3-2 16-bit LPCM sample

3.2.4.4.1.2. 24-bit LPCM audio samples

A 24-bit LPCM sample is split into three bytes, as shown in Figure 3-3. The high byte represents

the eight most significant (b23..b16), the middle byte represents bits (b15..b8), and the low

byte represents the eight least significant bits (b7..b0).

b23 b0

LSB MSB

Figure 3-3 24-bit LPCM sample

3.2.4.4.1.3. 20-bit LPCM audio samples

Four zero-value bits shall be postfixed to a 20-bit LPCM sample to make 24-bits, as shown in

Figure 3-4. The 24-bits with trailing zeros are then packed in the same way as 24-bit LPCM

samples.

b23

b3 b2 b1 b0

LSBMSB

0 0 0 0b4

Figure 3-4 20-bit LPCM sample

Page 24: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

24 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

3.2.4.4.2. Group of LPCM samples (GOL)

Each group of LPCM samples(GOL) contains sequence of LPCM samples. The samples within

each GOL shall be in the order of their channel number.

1st group of LPCM samples 2nd group of LPCM samples

1st group of LPCM samples 2nd group of LPCM samplesMSB LSB MSB LSB

1st group of LPCM samples 2nd group of LPCM samplesMSB LSB MSB LSB

1st group of LPCM samples 2nd group of LPCM samplesMSB LSB MSB LSB

1st group of LPCM samples 2nd group of LPCM samplesMSB LSB MSB LSB

ChannelNumber 1

ChannelNumber 2

ChannelNumber 3

ChannelNumber 4

Figure 3-5 Group of LPCM samples

Page 25: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 25 15 March 2013 CONFIDENTIAL

3.2.4.4.3. Audio access unit (audio frame) of the F1 LPCM audio stream

The presentation length of an audio access unit (audio frame) of the LPCM audio stream is

equal to 40 milli-second.

If the sampling frequency of the LPCM audio stream is 48 kHz, an audio access unit (audio

frame) of the LPCM audio stream consists of 1920 GOLs.

If the sampling frequency of the LPCM audio stream is 96 kHz, an audio access unit (audio

frame) of the LPCM audio stream consists of 3840 GOLs.

If the sampling frequency of the LPCM audio stream is 192 kHz, an audio access unit

(audio frame) of the LPCM audio stream consists of 7680 GOLs.

Here each of GOL contains N samples; the N is the number of channels, and this shall be 2,

4, 6, or 8.

The Syntax of the F1LPCMAudioDataPayload is shown below:

F1LPCMAudioFrame contains one audio access unit (audio frame) of the F1 LPCM

audio stream as defined in section 0.

The size in bytes of this field is different for each value of the

audio_data_payload_size of the F1LPCMSpecificBox as specified in section

3.2.4.3.

F1LPCMAudioDataPayload() {

F1LPCMAudioFrame

}

Page 26: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

26 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

3.3. Subtitle Format (TBD)

3.3.1. Subtitle Format

The Subtitle Elementary stream in subtitle track for SONY F1 file SHALL comply with

requirements defined in section 6.2 of DECE-CFF 1.0.3 [7] with additional constraints defined in

this section.

Additional constraints

The root container SHALL be set to the size of 1920x1080 in units of pixels. The

subtitles SHALL be designed for 1920x1080 video format.

CFF-TT text subtitles in a subtitle track SHALL be authored such that their size and

position falls within the bounds of the width and height parameters of the Track

Header Box (‘tkhd’) of the subtitle track.

Nested span SHALL NOT be used.

Nested division SHALL NOT be used.

‘cff:forcedDisplayMode’ SHALL NOT be set to ‘true’.

If a region whose ‘tts:showBackground’ attribute is set to ‘always’ is specified

(including the case that ‘tts:showBackground’ attribute is not specified),

Start time of subtitle presentation SHALL be set to 0.0 sec.

End time of subtitle presentation SHALL be set to duration of the track which has

longest duration in the DECE CFF Container.

‘start’, ‘end’ and ‘duration’ attribute SHALL NOT be specified in such ‘region’

element.

Time expression SHALL be calculated in ‘dropNTSC’ mode.

Note that ‘ttp:dropMode’ element SHALL NOT be used according to section 6.2

of DECE-CFF 1.0.3 [7].

A frame count discontinuity occurs in ’dropNTSC’ mode, however, content

authors need not to be aware of frame count discontinuity.

Transformation from time to MP4 tick is, e.g. in case of 90000Hz tick and

29.97Hz video framerate, a frame duration is 3000. In this case, 1 second

duration denotes 90000 tick.

Time expression (the longest time of subtitle) SHALL NOT be ‘float’ value.

‘auto’ value SHALL NOT be specified in ‘tts:extent’ or ‘tts:origin’ attribute.

‘normal’ value SHALL NOT be specified in ‘tts:lineHeight’ attribute.

‘sign’ (“+” or “-”) value SHALL NOT be used to the ‘length’ value in any attributes.

Page 27: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 27 15 March 2013 CONFIDENTIAL

‘length’ value specified in ‘tts:fontSize’ attribute SHALL be from 8 to 144 in units of

pixels.

Minimum Font Size = 8px

Maximum Font Size = 144px

‘length’ value specified in ‘tts:lineHeight’ attribute SHALL be from 8 to 144 in units of

pixels.

Minimum Line Height = 8px

Maximum Line Height = 144px

3.3.2. Hypothetical Render Model

The hypothetical render model SHALL comply with requirements defined in section 6.5.1, 6.5.2,

6.5.4 and 6.5.5 of DECE-CFF 1.0.3 [7] with following items.

Block diagram of hypothetical render model is replaced with Figure 3-6.

The subtitle SHALL be scaled to the same size as video after the subtitle rendering to

be combined with video plane. (See Figure 3-6)

The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL match the

Subtitle Plane and Video Plane color space for subtitle overlay.

Figure 3-6 Block Diagram of Hypothetical Render Model

File Parser

Doc Buffer

Enc. ImageBuffer

Image Dec

Doc DOM 1

Doc DOM 2

Dec. ImageBuffer 1

Dec. ImageBuffer 2

GlyphRenderer

PresentationCompositor

PresentationBuffer

SubtitlePlane

(1920x1080)Scaling

OutputVideo Plane(3840x2160)

Page 28: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

28 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

3.3.3. Subtitle Selection

In this subsection, the Track IDs for subtitle tracks are defined categorized by the role of the

subtitle. The Track ID for each subtitle track SHOULD be set as defined in Table 3-11.

Table 3-11 Track ID and Role of subtitle track

Track ID Role of subtitle track

128-255 Closed caption for accessibility which

corresponds to CC1 (cc_type=002) of EIA-708

B [8]

256-383 Closed caption for accessibility which

corresponds to CC2 (cc_type=012) of EIA-708

B [8]

384-639 For other use (e.g. normal subtitle,

commentary)

Page 29: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 29 15 March 2013 CONFIDENTIAL

3.4. Metadata (TBD)

This section describes the requirements of XML document for metadata for SONY F1 Service.

3.4.1. SONY F1 Required Metadata

The XML document of Required metadata for SONY F1 Service is based on DECE Container

Required Metadata defined in section 4.1 of DECE-META [9] with additional extensions and

requirements in this section.

All types and elements are in the ‘mdf1’ namespace unless otherwise specified.

ContainerMovieMetadata-type

The ContainerMovieMetadata-type defined in section 4.1.1 of DECE-META

[9] is replaced with Table 3-12.

The Ratings element SHALL be present.

The Chapters element SHALL be present.

The value of Priority is as defined as follows:

The value 0 is reserved.

The value SHALL be in range of 1 to 255.

AdditionalLocalizedInfoList-type

AdditionalLocalizedInfoList-type is as defined in Table 3-13.

AdditionalLocalizedInfo-type

AdditionalLocalizedInfo-type is as defined in Table 3-13.

If no copy right description is available, the CopyrightLine element SHOULD be

empty.

Page 30: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

30 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

Table 3-12 ContainerMovieMetadata-type

Element Attribute Definition Type Card.

ContainerMo

vieMetadata

-type

MetadataVersi

onReference

A string that defines the

version of the metadata in this

element. If the metadata

changes, this string SHOULD

be included and unique relative

to other instances of this

attribute.

xs:string 0..1

priority The priority indicates the

priority of the file for

Auto-delete function. The

value of 0 is reserved.

The larger number indicates in

higher priority that the file is

selected as auto-deleted file.

E.g. the file with priority set to

10 is selected as auto-delete

file compare to the file with

priority set to 3.

xs:unsignedInt 1

ContentMeta

data

Mandatory descriptive

metadata regarding the media

in the Container.

mddece:Conta

inerContentM

etadata-type

RequiredIma

ges

References to Container

required images in Compliance

with Section 3.2 of

DECE-META [9].

md:DigitalAs

setIMageData

-type

1..n

TrackMetada

ta

Descriptions of each track mddece:Conta

inerTrackMet

adata-type

Ratings Content ratings for media in

the Container as defined in

Section 7.3 of Common

md:ContentRa

ting-type

0..1

Page 31: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 31 15 March 2013 CONFIDENTIAL

Metadata [2].

Chapters Chapter entry points

mddece:Conta

inerChapterL

ist-type

0..1

OptionalIma

ges

References to Container

optional images

md:DigitalAs

setIMageData

-type

0..n

TrackSelect

ions

mddece:Conta

inerTrackSel

ectionList-t

ype

0..1

Interactive

CapabilityL

evel

The Interactive Capability Level

required of Devices to use this

Content.

xs:string 0..1

ContainerVe

rsionRefere

nce

A string that defines the

version of the Container. It can

be used as a reference to

identify changes in the

Container.

xs:string

0..1

AdditionalL

ocalizedInf

oList

mdf1:Additio

nalLocalized

InfoList-typ

e

1..n

(any) An element which enables to

extend the XML document.

xs:any 0..n

Table 3-13 AdditionalLocalizedInfoList-type

Element Attribute Definition Type Card.

AdditionalLo

calizedInfoL

ist-type

AdditionalLo

calizedInfo

Additional localized information

descriptor.

mdf1:Additio

nalLocalized

Info-type

1..n

Page 32: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

32 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

Table 3-14 AdditionalLocalizedInfo-type

Element Attribute Definition Type Card.

Additional

LocalizedI

nfo-type

Genre Subject-matter

classification of the

show. See Genre

Encoding below.

xs:string

1..n

source Naming system from

which genre is

derived.

xs:anyURI 0..1

id Identifier for genre

used within source

xs:string 0..1

level Indicates precedence

of genre, with a

lower number being

high precedence.

xs:integer 0..1

CopyrightL

ine

Displayable

copyright line.

xs:string 1

(any) An element which

enables to extend

the XML document.

xs:any 0..n

3.4.2. Image Reference

The images referenced to by the metadata XML document for F1 SONY service defined in

section 3.4.1 and section 0 SHALL conform to section 4.3 of DECE-META [9] with additional

constraints defined in this section.

The image referenced to by the metadata XML document MAY NOT be in the file.

The images referenced to by the metadata XML document MAY be referenced by be

referenced by URN [3]. See Figure 3-8 for an example.

In case the images referred to by the contents of XML Box is in the file, the filname SHALL

have a unique index number, e.g. 1.png. See Figure 3-8 for an example.

Page 33: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 33 15 March 2013 CONFIDENTIAL

Figure 3-7 Examples of TrackReference element for RequiredImages element in

Required Metadata

Figure 3-8 Examples of TrackReference element for OptionalImages element in

Optional Metadata

<mddece:RequiredImages>

...

<md:TrackReference>

http://f1.sony.net/metadata/contentid/thumbnail/1.png</md:TrackReference>

</mddece:RequiredImages>

<mddece:OptionalImages>

...

<md:TrackReference>

urn:dece:conatiner:metadataimageindex:1.png</md:TrackReference>

</mddece:OptionalImages>

Page 34: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

34 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

4. Content encryption The content encryption SHALL comply with the Marlin extensions to MPEG Common

Encryption Format [11] as defined in section 2.3 of Marlin Adaptive Streaming Specification

[12].

The following requirements SHALL apply if the content is protected by Marlin.

A Protection System Specific Header Box(‘pssh’)

The ‘pssh’ as defined in section 2.3.2 of Marlin Adaptive Streaming Specification

[12] SHALL be present.

The ‘pssh’ box SHALL contain a MarlinKidMappingTable Box (‘mkid’) which

includes all the Content ID mapping information associated with the file protected by

Marlin.

Annex A. Profile

A.1. SONY F1 Phase 1 Day1 Profile

A.1.1. Requirements on SONY F1 Phase 1-Day1 File Format and Media Format

The file format and media format for SONY F1 Phase 1-Day1 SHALL conform with the following

additional requirements in Table A-1.

The requirements are applied for both Type-A and Type-B unless otherwise separate

requirements are defined.

Table A-1 Requirements on SONY F1 Phase 1-Day1 Profile

Type-A Type-B

Container Container Header Asset Information Box (‘ainf’)

- profile_version SHALL be set to ‘sfv1’.

Content

Encryption

IV_size in

‘pssh’

8

Video Encryption Same as audio track

defined in section 4

As defined in section 4

DRM N/A Marlin-BB [12]

Max number of keys

per file

48

Audio Max number of audio 1

Page 35: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 35 15 March 2013 CONFIDENTIAL

tracks per file

Allowed audio format MPEG-4 AAC LC [2

channel] as defined

in section 3.3.1

MPEG-4 AAC LC

[5.1 channel] as

defined in 3.3.2

MPEG-4 AAC LC [2 channel]

as defined in section 3.3.1

MPEG-4 AAC LC [5.1

channel] as defined in 3.3.2

LPCM [2 channel] as defined

in section 3.3.3 with

following additional

constraints (See section

3.2.3.1)

- channelcount SHALL

be set to 2.

- samplerate SHALL be

set to BB800000h

F1 LPCM [5.1 channel] as

defined in section 3.3.4 with

following additional

constraints

F1LPCMAudioSampleEntry(

‘fpcm’)

- channelcount SHALL

be set to 6

- samplesize SHALL be

set to 16

- samplerate SHALL be

set to BB800000h

F1LPCMSpecificBox(‘fcfg

’)

- channel_assignmen

t SHALL be set to 8 or 9

- sampling_frequenc

y SHALL be set to 1

- bits_per_sample

SHALL be set to 1

Subtitle Max number of

subtitle tracks per file

N/A 4

Page 36: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

36 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

Video Maximum Bitrate 100x106 bits/s

(80000 [1250 bits/s (cpbBrVclFactor), 1500 bits/s

(cpbBrNalFactor)])

Min number of slices

per picture

8

Maximum duration

of the video

86486.4 sec (24 hours)

File File extension Un-encrypted File: “.sfv”

Encrypted File: “.sev”

Page 37: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 37 15 March 2013 CONFIDENTIAL

A.1.2. Requirements on SONY F1 Phase 1-Day1 Subtitle processor implementation

The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL conform with definitions

in this section. The definitions in this section minimize the features of the subtitle to the usage

of closed captioning defined in EIA-708-B [8], and it is expected for content authors to prepare

Subtitle Elementary Stream pursuant to ‘SMPTE RP 2052-10:2012 Conversion from CEA-608

Data to SMPTE-TT’.

Subtitle Elementary Stream SHOULD NOT exceed the capability of the SONY F1 Phase-1-Day1

Subtitle processor implementation.

The SONY F1 Phase-1-Day1 Subtitle processor implementation SHALL NOT decode and

present more than one subtitle track simultaneously.

The SONY F1 Phase 1 Day 1 Subtitle processor implementation MAY NOT support to

decode image (PNG) subtitle track.

The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL support rendering

of all glyphs that correspond to the Unicode Code Points defined in Table A-2.

Table A-2 Unicode Code Points to be supported

U+0020 - U+007E (Basic Latin)

U+00A1 - U+00FF (Latin-1 Supplement)

U+0152 (LATIN CAPITAL LIGATURE OE)

U+0153 (LATIN SMALL LIGATURE OE)

U+0160 (LATIN CAPITAL LETTER S WITH CARON)

U+0161 (LATIN SMALL LETTER S WITH CARON)

U+0178 (LATIN CAPITAL LETTER Y WITH DIAERESIS)

U+2018 (Left Single Quotation Mark)

U+2019 (Right Single Quotation Mark)

U+201C (Left Double Quotation Mark)

U+201D (Right Double Quotation Mark)

U+2122 (TRADE MARK SIGN)

U+02DC (SMALL TILDE)

U+25A1 (WHITE SQUARE)

U+266A (EIGHTH NOTE)

In the case where the Unicode Code Point is not supported by the SONY F1 Phase 1 Day 1

Subtitle processor, the Glyph rendered SHALL correspond with Unicode Code Point

U+25A1 ("WHITE SQUARE") or Unicode Code Point U+005F ("LOW LINE").

Page 38: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

38 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

The SONY F1 Phase 1 Day 1 Subtitle processor SHALL ignore Control Codes Unicode Code

Points (U+0000..U+001F and U+007F..U+009F) encountered in textual content within

CFF-TT subtitles. An ignored Control Code does not affect presentation. No glyph is

rendered for a Control Code.

The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL implement

presentation of at least following nine (9) values for the ‘tts:backgroundColor‘ and

‘tts:color‘ attribute.

'blue', 'cyan', 'green', 'magenta', 'red', 'white', 'yellow', 'black' and ’transparent’

The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL implement at least

following two (2) values for the ‘tts:opacity’ attribute.

0.0 and 1.0

The SONY F1 Phase 1-Day1 Subtitle processor implementation MAY NOT give exact

typeface which specified in the ’ tts:fontFamily’ attribute.

The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL implement

presentation of ‘none’ and 'underline' in the ‘tts:textDecoration’ attribute.

The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL implement

presentation of ‘normal’ and 'italic' in the ‘tts:fontStyle’ attribute.

The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL support at least ’px’

units for ’tts:lineHeight’ attribute.

The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL support at least ’px’

units for ’tts:fontSize’ attribute.

The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL support at least ‘px’

units for ’tts:lineHeight’ attribute and MAY NOT support ‘float’ value.

The SONY F1 Phase 1 Day 1 Subtitle processor implementation MAY NOT present a

character whose font size is less than 8 pixels or more than 144 pixels.

The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL draw the character

with 60 pixels (60 = 1920/32) if any font size are not specified to any characters.

The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL support at least

'percentage' representation for the length expression in ’ tts:origin’ attribute.

The SONY F1 Phase 1-Day1 Subtitle processor implementation MAY NOT support

tts:textOutline attribute.

If ‘tts:extent’ attribute is specified on ‘tt’ element, the SONY F1 Phase 1-Day1 Subtitle

processor implementation SHALL support at least 'px' representation for the length

expression in ’tts:extent’ attribute. Otherwise, the SONY F1 Phase 1-Day1 Subtitle

processor implementation SHALL support at least 'percentage' representation for the

length expression in ’tts:extent’ attribute.

Page 39: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

© Sony Corporation 2013. All rights reserved. | 39 15 March 2013 CONFIDENTIAL

The SONY F1 Phase 1 Day 1 Subtitle processor implementation MAY NOT support

‘tts:padding’ attribute.

The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL support at

least ”lrtb” for ’tts:writingMode’ attribute.

The SONY F1 Phase 1 Day 1 Subtitle processor implementation MAY NOT

support ’tts:unicodeBidi’ and ’tts:direction’ attribute.

The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL support ”par” for

‘timeContainer’ attribute and MAY NOT support ”seq” for ’timeContainer’ attribute.

The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL support at least

following time expressions.

HH:MM:SS:FF

HH:MM:SS

The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL support time

expressions calculated in ‘dropNTSC’ mode. Note that DECE CFF [7] defines that

‘tts:dropNTSC’ element is prohibited.

The SONY F1 Phase 1 Day 1 Subtitle processor implementation MAY NOT satisfy the

decoding and drawing rates defined in Table C - 6 in DECE CFF [7].

The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL satisfy the text

rendering rates defined in Table A-3.

Table A-3 Text Rendering Rates

Font Size text rendering rate

[characters/s]

8-72 120 (TBD)

73-144 100 (TBD)

Note: As a character whose Unicode Code Point is U+0009, U+000A, U+000D, U+0020 or

U+0085 is not displayed at all, it does not deeded as an atomic unit of text.

The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL satisfy the drawing

rate defined in Table A-4.

Table A-4 Drawing Rate

Property Constraint

Background Drawing rate 20x220 [pixels/s]

Page 40: SONY F1 Service Format Specification › sony › docs › 05 › docs › 4k › Phase 1 › v0.94_rel… · Sony Corporation disclaims all liability, including liability for infringement

SONY F1 Service Format Specification Version 0.94

40 | Sony Corporation 2013. All rights reserved. 15 March 2013

CONFIDENTIAL

A.1.3. Requirements on SONY F1 Phase 1-Day1 Required Metadata (TBD)

The Required Metadata d media format for SONY F1 Phase 1-Day1 SHALL conform with the

following additional requirements.

Elements

ContainerMovieMetadata-type

The (any) element SHALL NOT be used.

The AdditionalLocalizedInfoList element SHALL be the last element.

ContainerTrackMetadata-type

SegmentSize SHALL be set to 0 in case the value is unknown.

ContainerChapterLIst-type

The maximum number of Chapter element SHALL be 128.

AdditionalLocalizedInfo-type

The (any) element SHALL NOT be used.

The CopyrightLine element SHALL be the last element.

DigitalAssetMetadata-type

Image element SHALL NOT be present.

DigitalAssetAudioData-type, when present;

TrackReference element SHALL NOT be present.

DigitalAssetVideoData-type, when present;

TrackReference element SHALL NOT be present.

DigitalAssetSubtitleData-type

TrackReference element SHALL NOT be present

Images

In case the images and any other binary data referred to by the contents of the XML

Box for Required Metadata are in the file, the image format SHALL be PNG format.

Image Size (TBD)

Image file size (TBD)

XML document

XML file size (TBD)

XML schema

The schema for SONY F1 Phase 1 Day1 Required Metadata is defined in

“mdf1-p1-d1.xsd”