sony f1 service format specification › sony › docs › 05 › docs › 4k › phase 1 ›...
TRANSCRIPT
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 1 15 March 2013 CONFIDENTIAL
SONY F1 Service
Format Specification
Version 0.94
15 March 2013
SONY F1 Service Format Specification Version 0.94
2 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
Disclaimer Even though Sony has thoroughly reviewed this document, it is provided "as is" with no
warranties whatsoever, express or implied, including any warranty of merchantability,
non-infringement, fitness for any particular purpose or any warranty otherwise arising out of
this document or use, implementation or attempted implementation of any information
contained in this document. Sony Corporation disclaims all liability, including liability for
infringement of any proprietary rights, relating to this document or use, implementation or
attempted implementation of any information contained herein.
The product names, brand names, service names, etc. used in this document are each
company, each organization’s trademarks, or registered trademarks.
The references to the trademarks and copyrights are omitted in this document.
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 3 15 March 2013 CONFIDENTIAL
Table of Contents
1 General .................................................................................................................................. 6
1.1. Scope .............................................................................................................................. 6
1.2. Specification Architecture Overview ............................................................................. 6
1.3. Reference ....................................................................................................................... 6
1.4. Meaning of words .......................................................................................................... 7
1.5. Definitions ...................................................................................................................... 7
1.6. Acronyms ....................................................................................................................... 7
2. File Format ............................................................................................................................ 8
2.1. Container Header ........................................................................................................... 8
2.1.1. Container for Required Metadata .......................................................................... 8
2.2. Movie Fragments ........................................................................................................... 9
2.2.1. General ................................................................................................................... 9
2.2.2. Movie Fragments for Video .................................................................................... 9
2.2.3. Movie Fragments for Audio .................................................................................... 9
2.2.4. Movie Fragments for Subtitle ................................................................................. 9
3. Media Format ...................................................................................................................... 10
3.1. Video Format ............................................................................................................... 10
3.1.1. AVC video stream ................................................................................................. 10
3.1.1.1. User Data Unregistered SEI Message for the Extended-Gamut YCC Colour
Space 14
3.2. Audio Format ............................................................................................................... 16
3.2.1. MPEG-4 AAC LC audio stream [2-channel] ........................................................... 16
3.2.2. MPEG-4 AAC LC audio stream [5.1-channel] ........................................................ 16
3.2.3. LPCM audio stream [2-channel] ........................................................................... 17
3.2.3.1. AudioSampleEntry Box for LPCM [2-channel, 48kHz, 16bits] ....................... 17
3.2.3.2. LPCM Elementary Stream Constraints [2-channel, 48kHz, 16bits] ............... 17
3.2.4. F1 LPCM audio stream .......................................................................................... 18
3.2.4.1. Storage of F1 LPCM Elementary Streams ..................................................... 18
3.2.4.2. AudioSampleEntry Box for F1 LPCM ............................................................. 18
3.2.4.3. F1LPCMSpecific Box ...................................................................................... 19
3.2.4.4. F1 LPCM Elementary Stream Constraints ..................................................... 22
3.3. Subtitle Format (TBD) .................................................................................................. 26
3.3.1. Subtitle Format ..................................................................................................... 26
3.3.2. Hypothetical Render Model ................................................................................. 27
3.3.3. Subtitle Selection .................................................................................................. 28
SONY F1 Service Format Specification Version 0.94
4 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
3.4. Metadata (TBD) ............................................................................................................ 29
3.4.1. SONY F1 Required Metadata ................................................................................ 29
3.4.2. Image Reference................................................................................................... 32
4. Content encryption ............................................................................................................. 34
Annex A. Profile ....................................................................................................................... 34
A.1. SONY F1 Phase 1 Day1 Profile ...................................................................................... 34
A.1.1. Requirements on SONY F1 Phase 1-Day1 File Format and Media Format .......... 34
A.1.2. Requirements on SONY F1 Phase 1-Day1 Subtitle processor implementation ... 37
A.1.3. Requirements on SONY F1 Phase 1-Day1 Required Metadata (TBD) .................. 40
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 5 15 March 2013 CONFIDENTIAL
List of Figures
FIGURE 3-1 EXAMPLES OF ACCESS UNIT STRUCTURE FOR RA-I PICTURE AND NON RA-I PICTURE .................................... 13
FIGURE 3-3 24-BIT LPCM SAMPLE ..................................................................................................................... 23
FIGURE 3-4 20-BIT LPCM SAMPLE ..................................................................................................................... 23
FIGURE 3-5 GROUP OF LPCM SAMPLES .............................................................................................................. 24
FIGURE 3-6 BLOCK DIAGRAM OF HYPOTHETICAL RENDER MODEL ............................................................................ 27
FIGURE 3-7 EXAMPLES OF TRACKREFERENCE ELEMENT FOR REQUIREDIMAGES ELEMENT IN REQUIRED METADATA ...... 33
FIGURE 3-8 EXAMPLES OF TRACKREFERENCE ELEMENT FOR OPTIONALIMAGES ELEMENT IN OPTIONAL METADATA ...... 33
List of Tables
TABLE 3-1 ALLOWED PICTURE FORMATS FOR AVC VIDEO STREAM ........................................................................... 10
TABLE 3-2 SYNTAX OF USER DATA UNREGISTERED SEI MESSAGE FOR EXTENDED-GAMUT YCC COLOUR SPACE ................. 14
TABLE 3-3 COLOR_DATA() .............................................................................................................................. 14
TABLE 3-4 ALLOWED AUDIO FORMAT ................................................................................................................. 16
TABLE 3-5 LPCM AUDIO FORMAT[2-CHANNEL] .................................................................................................... 17
TABLE 3-6 F1 LPCM AUDIO FORMAT ................................................................................................................. 18
TABLE 3-7 PERMITTED AUDIO_DATA_PAYLOAD_SIZE VALUES .................................................................................. 20
TABLE 3-8 CHANNEL_ASSIGNMENT .................................................................................................................... 21
TABLE 3-9 SAMPLING_FREQUENCY .................................................................................................................... 22
TABLE 3-10 BITS_PER_SAMPLE .......................................................................................................................... 22
TABLE 3-11 TRACK ID AND ROLE OF SUBTITLE TRACK ............................................................................................. 28
TABLE 3-12 CONTAINERMOVIEMETADATA-TYPE ............................................................................................. 30
TABLE 3-13 ADDITIONALLOCALIZEDINFOLIST-TYPE ................................................................................... 31
TABLE 3-13 ADDITIONALLOCALIZEDINFO-TYPE ........................................................................................... 32
TABLE A-2 UNICODE CODE POINTS TO BE SUPPORTED........................................................................................... 37
TABLE A-3 TEXT RENDERING RATES .................................................................................................................... 39
TABLE A-4 DRAWING RATE ............................................................................................................................... 39
SONY F1 Service Format Specification Version 0.94
6 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
1 General
1.1. Scope
This specification defines the file format and the media formats of audio-visual contents and
the download manifest file for the purpose of SONY F1 Service. As a file format and media
format, the specification includes container formats, elementary stream formats,
requirements on encryption of the audio-visual contents and requirements for the playback
devices. As a download manifest file, the specification includes the manifest file structure and
segment file structure and operational rules for the download system.
1.2. Specification Architecture Overview
This specification is composed of three parts. The first part, section 2, defines the file format.
The second part, section 3, defines the media format. The third part, Annex A, defines the
profile requirements of the SONY F1 service. The specification references already available
standards and specifications.
1.3. Reference
[1] DECE Common File Format & Media Formats Specification, Version 1.0.5,
31-October-2012.
[2] Common Metadata, TR-META-CM, v1.2d, September 24, 2012, Motion Picture
Laboratories, Inc.,
http://www.movielabs.com/md/md/v1.2/Common%20Metadata%20v1.2d.pdf
[3] T. e. a. Berners-Lee, RFC 3986, Uniform Resource Identifier (URI): Generic Syntax,
January 2005.
[4] ITU-T Rec. H.264 | ISO/IEC 14496-10, (2010), “Information Technology – Coding of
audio visual objects – Part 10: Advanced Video Coding.”.
[5] ITU-R Rec. BT.709-5: Parameter values for the HDTV standards for production and
international programme exchange.
[6] IEC61966-2-4 Ed. 1.0:2006, Multimedia systems and equipment, - Colour measurement
and management -, Extended-gamut YCC colour space for video applications - xvYCC.
[7] DECE Common File Format & Media Formats Specification, Version 1.0.3,
3-January-2012.
[8] EIA/CEA-708-D, Digital Television (DTV) Closed Captioning, December 1999.
[9] DECE Content Metadata Specification, Version 1.0.5, 31-October-2012.
[10] ISO/IEC 23009-1:2012 Information technology –Dynamic adaptive streaming over HTTP
(DASH) – Part 1: Media presentation description and segment formats.
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 7 15 March 2013 CONFIDENTIAL
[11] ISO/IEC 23001-7:2012: Information technology – MPEG systems technologies – Part 7:
Common encryption in ISO base media file format files.
[12] Marlin Developer Community, “Marlin Adaptive Streaming Specification – Simple
Profile”, Version 1.0.
1.4. Meaning of words
In this document, the following words have a special meaning:
INFORMATIVE: indicates a Section or Annex describes supplemental information to aid
understanding of this specification. A compliant SONY F1 Service device is recommended but
not required to comply with informative Sections and Annexes.
MANDATORY: describes a feature that must be implemented to claim compliance to this
specification.
MAY: indicates an action or feature that is not mandatory.
NORMATIVE: indicates a Section or Annex describes a prescriptive part of this specification.
A compliant SONY F1 Service device shall comply with all normative Sections and Annexes.
OPTIONAL: describes a feature that may or may not be implemented. If implemented, the
feature shall be implemented as described.
SHALL and SHALL NOT: indicate requirements strictly to be followed in order to conform to the
document and from which no deviation is permitted.
SHOULD: indicates an action or feature that is optional, but its implementation is
recommended.
1.5. Definitions
For the purposes of this specification, the definitions in section 1.6 of DECE Common File
Format & Media Formats Specification [1] are applied.
1.6. Acronyms
For the purposes of this specification, the acronyms in section 1.6 of DECE Common File
Format & Media Formats Specification [1] are defined. In addition, the following definitions are
defined.
DECE-CFF DECE Common File Format
SONY F1 Service Format Specification Version 0.94
8 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
2. File Format The SONY F1 File Format SHALL comply with file format defined in section 2 and section C.2 of
DECE-CFF [1] with additional requirements and constraints defined in this section.
2.1. Container Header
The header of the file SHALL conform to the DCC Header defined in DECE-CFF [1] with the
following additional constraints.
XML Box (‘xml ’) for Required Metadata
The xml field SHALL contain a well-formed XML document with contents that
conform to section 0.
Edit Box (‘edts’)
The ‘edts’ SHALL be present.
The ‘edts’ SHALL contain the Edit List Box (‘elst’) as defined in section 2.1.2.2
in DECE-CFF [1].
Media Header Box (‘mdhd’)
For audio tracks and subtitle tracks, the language SHALL contain the language of
the content in the track. The language SHALL NOT contain the original relase
language of the content.
2.1.1. Container for Required Metadata
Images and any other binary data referred to by the contents of the XML Box for Required
Metadata MAY NOT be in the file.
In case the images and any other binary data referred to by the contents of the XML Box
for Required Metadata are not in the file, they MAY be referenced by URN [3].
In case the images and any other binary data referred to by the contents of the XML Box
for Required Metadata are in the file, the constraints in n section 2.1.2.1 of DECE-CFF [1]
SHALL be confirmed.
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 9 15 March 2013 CONFIDENTIAL
2.2. Movie Fragments
2.2.1. General
The movie fragments SHALL conform to the DCC Movie Fragments defined in DECE-CFF [1] with
the following additional constraints.
Track Fragment Run Box (‘trun’)
The version of ‘trun’ SHALL be set to 1.
AVC NAL unit storage Box (‘avcn’)
The ‘avcn’ SHALL NOT be present in the file.
2.2.2. Movie Fragments for Video
The movie fragments for video SHALL conform to the DCC Movie Fragment for video defined
DECE-CFF [1] with the following additional constraints.
In case the video codec is AVC [4], the Movie Fragment for video track SHALL conform to
DCC Movie Fragment defined in section 4.2 and C.4 except for AVC elementary stream
constraints defined in C.4.1 and C.4.3 of DECE-CFF [1]. The constraints on AVC elementary
stream SHALL comply with constraints defined in section 3.1.1.
2.2.3. Movie Fragments for Audio
The movie fragments for audio SHALL conform to the DCC Movie Fragment for audio defined
in DECE-CFF [1] with the following additional constraints.
The Movie Fragment for audio SHALL conform to DCC Movie Fragment defined in section
5.2 and section C.5 except for allowed combinations of audio format defined in Table C-3
of DECE-CFF [1]. The allowed combination of audio format for SONY F1 File Format is
defined in section 0.
In case the audio codec is MPEG-4 AAC, the Movie Fragment for audio track SHALL
conform to DCC Movie Fragment defined in section 5.3 or DECE-CFF [1].
2.2.4. Movie Fragments for Subtitle
The movie fragments for the subtitle SHALL conform to movie fragments defined in section 6.6
and section C.6 of DECE-CFF [1] with the following additional constraints.
Track Header Box (‘tkhd’)
The width SHALL be set to 1920.
The height SHALL be set to 1080.
SONY F1 Service Format Specification Version 0.94
10 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
3. Media Format This section describes the requirements for each media format.
3.1. Video Format
3.1.1. AVC video stream
The AVC video stream for SONY F1 Service SHALL comply with section 4.3 and section C.4 of
DECE-CFF [1] with additonal constraints defined in this section. For those constraints which
overlap with the constraints defined in this section SHALL override the constraints defined in
section 4.3 and section C.4 of DECE-CFF [1].
Profile
The video stream SHALL comply with the High Profile defined in AVC [4].
The value of profile_idc in sequence parameter set (SPS) SHALL be set to 100.
Level
The content SHALL comply with the constraints specified for Level 5.1 defined in AVC.
The level_idc in SPS SHALL be set to 51.
Picture Format
The AVC video stream SHALL comply with the picture formats in Table 3 1.
Table 3-1 Allowed Picture Formats for AVC video stream
Picture Formats Sub-sample Factors Parameter Constraints
Frame size Picture
aspect
Frame
rate
Horiz. Vert. Max size
encoded
pic_width_in
_mbs_minus1
pic_height_i
n_map_minus1
aspect_
ratio_idc
3840x2160 1.778 23.976
29.97
1 1 3840x2160 239 134 1
Color Descriptions
The color space used for the AVC video stream SHALL be BT.709 [5] or xvYCC709 [6].
The following parameters Visual Usability Information (VUI) Parameters SHALL have
pre-determined values as defined and the values SHALL be the same throughout the
AVC video stream.
The video_signal_type_present_flag SHALL be set to 1.
The colour_description_present_flag SHALL be set to 1.
The colour_primaries SHALL be set to 1.
The transfer_characteristics SHALL be set to 1(for BT.709 [5]) or
11(for xvYCC709 [6]).
The matrix_coefficients SHALL be set to 1.
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 11 15 March 2013 CONFIDENTIAL
Picture Types
I picture : A picture SHALL consists only of I slices.
P picture : A picture SHALL consists only of P slices.
B picture : A picture SHALL consists only of B slices.
Slices
Slice Type
I slice : slice_type SHALL be set to 7.
P slice : slice_type SHALL be set to 5.
B slice : slice_type SHALL be set to 6.
A slice SHALL be composed of one or more macroblock rows. A macroblock row
indicates all the macroblocks in a horizontal row of macroblocks.
In case level_idc is set to Level 5.1 (51), each picture SHALL be encoded as
multi-slice picture with 4 or more slices per picture.
HRD Parameters
nal_hrd_parameters_present_flag in VUI parameters SHALL be set to 1.
vcl_hrd_parameters_present_flag in VUI parameters SHALL be set to 1.
Maximum CPB size
In case the level_idc is set to Level 5.1 (51), the maximum CPB size (MaxCPB)
SHALL be constrained to 120000 [1250 bits/s (cpbBrVclFactor), 1500 bits/s
(cpbBrNalFactor)].
Minimum compression ratio
In case the level_idc is set to Level 5.1 (51), MinCR SHALL be constrained to 4.
Maximum DPB size
In case the level_idc is set to Level 5.1 (51), the MaxDpbMbs SHALL be less than
or equal to the constraints specified for Level 5.1 defined in AVC[4].
Access Unit Structure
The maximum number of NAL units per each access unit SHALL be less than or equal
to 32.
The Picture Timing SEI message SHALL be present for each access uint.
Data structure
The coded video sequence SHALL be less than or equal to 3.003 sec.
Sequence parameter set
The sequence parameter sets within duration of 3.003 sec in presentation time
from the first picture of the video sequence in order of presentation SHALL have
unique seq_parameter_set_id, if any of the parameters have different
values.
SONY F1 Service Format Specification Version 0.94
12 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
Picture parameter set
The picture parameter sets within duration of 3.003 sec in presentation time
from the first picture of the video sequence in order of presentation SHALL have
unique pic_parameter_set_id, if any of the parameters have different
values.
- NOTE : In case more than one coded video sequence exists within duration
of 3.003 sec in presentation time from the first picture of the video
sequence in order of presentation, this constraint applies across the coded
video sequences such that pic_parameter_set_id SHALL be unique
across coded video sequences, if any of the parameters have different
values.
In case the video stream is structured as byte stream fromat All picture paremter
sets in coded video sequence SHALL be placed together with the picture
parameter set for the first access unit in the coded video sequence.
Supplemental Enhancement Information (SEI)
In case an access unit is an IDR or a Random Access I picture (RA-I picture) as defined
in section 2.2.7.2.1 of DECE-CFF [1], following SEI messages SHALL be present in the
access unit. (See Figure 3-1) Note : RA-I picture does not include an IDR picture.
Buffering period SEI message
Recovery point SEI message
In case an access unit is non-IDR or non RA-I picture, following SEI messages SHALL
NOT be present together in the access unit. i.e. Only one or none of the following SEI
messages can be present. (See Figure 3-1)
Buffering period SEI message
Recovery point SEI message
In case the color space used for the AVC video stream is xvYCC709 [6], the user data
unregistered SEI message for the Extended-Gamut YCC Colour Space defined in
section 0 SHALL be present in the video stream.
NOTE : The transfer_characteristics SHALL be set to 11 in case the
user data unregistered SEI messages for the Extended-Gamut YCC Colour Space
are present in the video stream.
Only the first decoded picture in each coded video sequence or Random Access
I-picture SHALL have exactly one user data unregistered SEI message for the
Extended-Gamut YCC Colour Space defined in section 0.
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 13 15 March 2013 CONFIDENTIAL
RA-I picture slice data
Recovery Point SEI
Buffering Period SEI
Non RA-I picture slice data
Recovery Point SEI
Buffering Period SEI
Non RA-I pictureslice data
AU Delimiter
AU Delimiter
AU Delimiter
Other SEIs
Other SEIs
Other SEIs
except forRecovery Point SEI
except forBuffering Period SEI
Access Unit for RA-I picture
Examples of Access Unit for Non RA-I picture
Non RA-I pictureslice data
AU Delimiter Other SEIs
except forBuffering Period SEI
andRecovery Point SEI
Figure 3-1 Examples of Access Unit structure for RA-I picture and Non RA-I picture
SONY F1 Service Format Specification Version 0.94
14 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
3.1.1.1. User Data Unregistered SEI Message for the Extended-Gamut YCC Colour Space
The colour information for the extended-gamut YCC colour space SHALL be carried by the the
user data unregistered SEI messge syntax and semantics indicated in this section.
Table 3-2 Syntax of User Data Unregistered SEI message for Extended-Gamut YCC Colour Space
Syntax Num of bits Mnemonic
user_data_unregistered (payload) {
uuid_iso_idc_11578 128 Uimsbf
TypeIndicator 32 Uimsbf
if ( TypeIndicator == 0x43 4c 49 44 ) {
COLOR_data() {
Format_Flag 1 bslbf
Reserved 2 bslbf
GBD_Color_Precision 2 bslbf
GBD_Color_Space 3 bslbf
Min_Red_Data 12 bslbf
Max_Red_Data 12 bslbf
Min_Green_Data 12 bslbf
Max_Green_Data 12 bslbf
Min_Blue_Data 12 bslbf
Max_Blue_Data 12 bslbf
}
Reserved 16 bslbf
}
}
uuid_iso_idc_11578 SHALL be set to“A74602BB-F8A1-4CC0-A936-48E391DCE761”.
TypeIndicator indicates the type of user data that is carried in this SEI message.
TypeIndicator SHALL be set to“0x43 4c 49 44”.
COLOR_data() indicates the syntax and semantics of COLOR_data() based on Table E-6 in
Appendix E [6].
Format_Flag, GBD_Color_Precision, GBD_Color_Space, Min_Red_Data,
Max_Red_Data, Min_Green_Data, Max_Green_Data, Min_Blue_Data and
Max_Blue_Data SHALL be set as defined in Table 3-3.
reserved SHALL be set to 0 for future usage.
Table 3-3 COLOR_data()
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 15 15 March 2013 CONFIDENTIAL
Field Value
Format_Flag 1b
GBD_Color_Precision 10b
GBD_Color_Space 010b
Min_Red_Data 1 00 110110101b
Max_Red_Data 0 01 011110100b
Min_Green_Data 1 00 011010111b
Max_Green_Data 0 01 010010110b
Min_Blue_Data 1 00 011001100b
Max_Blue_Data 0 01 010010010b
SONY F1 Service Format Specification Version 0.94
16 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
3.2. Audio Format
This section describes the requirements of audio stream in audio track of SONY F1 Service.
The allowed audio format are defined in Table 3-4.
Table 3-4 Allowed Audio Format
Audio Format Max number
of Channel
Sample Rate
[kHz]
Max Bitrate
[kbps]
Bitrate
Calculation
MPEG-4 AAC LC
[2-channel]
2 48 192 Section 5.3.2.2.2.4
of DECE-CFF [1]
MPEG-4 AAC LC
[5.1-channel]
5.1 48 960 Section 5.3.3.2.2.4
of DECE-CFF [1]
LPCM
[2-channel]
2 48, 96, 192 1536, 3072,
6144
-
F1 LPCM
max. 7.1 48, 96, 192 - Table 3-7
3.2.1. MPEG-4 AAC LC audio stream [2-channel]
MPEG-4 AAC LC [2-channel] audio stream for SONY F1 audio format SHALL comply with
MPEG-4 AAC LC [2-channel] audio stream defined in section 5.3.1, section 5.3.2 and section
C.5.2.1 of DECE-CFF [1] with additional constraints defined in this section with additional
constraints defined in this section.
Channel configuration
In case the audio format is MPEG-4 AAC LC 2-channel, the audio SHALL be encoded in
2-channel stereo.
3.2.2. MPEG-4 AAC LC audio stream [5.1-channel]
MPEG-4 AAC LC [5.1-channel] audio stream for SONY F1 audio format SHALL comply with
MPEG-4 AAC LC [5.1-channel] audio stream defined in section 5.3.1, section 5.3.3 and section
C.5.2.2 of DECE-CFF [1].
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 17 15 March 2013 CONFIDENTIAL
3.2.3. LPCM audio stream [2-channel]
LPCM [2-channel] audio stream for SONY F1 audio format SHALL comply with this section
based on section 5.1, 5.2 and section C.5 of DECE-CFF [1] with additional constraints.
Table 3-5 LPCM audio format[2-channel]
Codingname Audio Format SampleEntry Type Section
Reference
twos LPCM [2-channel] MJ2AudioSampleEntry ISO/IEC
15444-3:2007
NOTE: AudioSampleEntry cannot be applied to higher sampling frequency audio such as
96/192kHz. For that purpose AudioSampleEntryV1 is to be newly defined in ISO/IEC
14496-12:2012/DAM2.
3.2.3.1. AudioSampleEntry Box for LPCM [2-channel, 48kHz, 16bits]
The syntax and values of the AudioSampleEntry box SHALL conform to AudioSampleEntry as
defined 5.2.1.6 of DECE-CFF [1], and the following fields SHALL be set as defined:
AudioFormat = ‘twos’
channelcount=2
samplesize=16
samplerate= BB800000h (48kHz)
3.2.3.2. LPCM Elementary Stream Constraints [2-channel, 48kHz, 16bits]
The data consists of interleaved left/right samples.
A sample has 16 bit values with the bytes in big-endian format.
16bit-values range from -32768 to 32767, with 0 being silence.
The presentation length of an audio access unit (audio frame) of the LPCM audio stream is
equal to 40 milli-second(1920 samples/ch).
class MJ2AudioSampleEntry() extends AudioSampleEntry (AudioFormat){
}
SONY F1 Service Format Specification Version 0.94
18 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
3.2.4. F1 LPCM audio stream
F1 LPCM audio stream for SONY F1 audio format SHALL comply with this section based on
section 5.2 and section C.5 of DECE-CFF [1] with additional constraints.
Table 3-6 F1 LPCM audio format
Codingname Audio Format SampleEntry Type Section
Reference
fpcm F1 LPCM F1LPCMAudioSampleEntry Section 3.3.4
3.2.4.1. Storage of F1 LPCM Elementary Streams
An audio sample SHALL consist of a single access unit (audio frame).
For 16bit quantization, code values range from -32768 to 32767, with 0 being silence.
3.2.4.2. AudioSampleEntry Box for F1 LPCM
The syntax of the F1LPCMAudioSampleEntry(‘fpcm’) box SHALL conform to that of
AudioSampleEntry as defined 5.2.1.6 of DECE-CFF [1], and the following fields SHALL be set as
defined:
AudioFormat(codingname) = ‘fpcm’
channelcount=2, 4, 6, or 8
The value of channelcount should be equal to the actual number of channels
specified in F1LPCMSpecificBox. The use of this field in the SONY F1 File Format is
optional; it may be ignored on reading.
class F1LPCMAudioSampleEntry extends SampleEntry(‘fpcm’)
{
const unsigned int(32) reserved[2] = 0;
template unsigned int(16) channelcount;
template unsigned int(16) samplesize;
unsigned int(16) pre_defined = 0;
const unsigned int(16) reserved = 0;
template unsigned int(32) samplerate;
F1LPCMSpecificBox
}
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 19 15 March 2013 CONFIDENTIAL
samplesize=16, 20, or 24
The value of samplesize should be equal to the actual bits per sample value
specified in F1LPCMSpecificBox. The use of this field in the SONY F1 File Format is
optional; it may be ignored on reading.
samplerate= BB800000h
The value of samplerate is a suitable integer division of the actual sampling
frequency specified in F1LPCMSpecificBox.
3.2.4.3. F1LPCMSpecific Box
The Syntax of the F1LPCMSpecificBox(‘fcfg’) is shown below:
class F1LPCMSpecificBox extends Box (‘fcfg’)
{
unsigned int(32) audio_data_payload_size;
unsigned int(4) channel_assignment;
unsigned int(4) sampling_frequency;
unsigned int(2) bits_per_sample;
unsigned int(6) reserved = 0;
}
SONY F1 Service Format Specification Version 0.94
20 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
3.2.4.3.1. Semantics of F1LPCMSpecific Box
audio_data_payload_size - indicates the size in bytes of F1LPCMAudioDataPayload()
Table 3-7 Permitted audio_data_payload_size values
sampling frequency bits per sample number of
channels
audio_data_payload_size
[bytes]
48 kHz 16-bit 2 7680
4 15360
6 23040
8 30720
20-bit / 24-bit 2 11520
4 23040
6 34560
8 46080
96 kHz 16-bit 2 15360
4 30720
6 46080
8 61440
20-bit / 24-bit 2 23040
4 46080
6 69120
8 92160
192 kHz 16-bit 2 30720
4 61440
6 92160
20-bit / 24-bit 2 46080
4 92160
6 138240
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 21 15 March 2013 CONFIDENTIAL
channel_assignment - specifies the channel assignment for the channel configuration in
the F1 LPCM audio stream.
Table 3-8 channel_assignment
Value number of
channels
channel configuration channel number
1 2 3 4 5 6 7 8
0 - reserved
1 2 ch mono M1 X
2 reserved
3 stereo L R
4 4 ch L, C, R (3/0) L R C X
5 L, R, S (2/1) L R S X
6 L,C,R,S (3/1) L R C S
7 L,R,LS,RS (2/2) L R LS RS
8 6 ch L, C, R, LS, RS (3/2) L R C LS RS X
9 L, C, R, LS, RS, lfe(3/2+lfe) L R C LS RS lfe
10 8 ch L, C, R, LS, Rls, Rrs, RS
(3/4)
L R C LS Rls Rrs RS X
11 L, C, R, LS, Rls, Rrs, RS, lfe
(3/4+lfe) (‘surround back’)
L R C LS Rls Rrs RS lfe
12 L, C, R, LS, RS, Vhl, Vhr, lfe
(5/2+lfe) (‘front high’)
L R C LS RS Vhl Vhr lfe
13–15 - reserved
M: Mono, L: Left, R: Right, C: Center, S: Surround,
Rls: Rear surround left, Rrs: Rear surround right,
Vhl: Vertical height left, Vhr: Vertical height right,
X: Sample values shall be set to zero.
SONY F1 Service Format Specification Version 0.94
22 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
sampling_frequency - specifies the sampling frequency of the F1 LPCM audio stream as
shown in Table 3-9.
Table 3-9 sampling_frequency
Value Meaning
0 Reserved
1 48 kHz
2 reserved
3 reserved
4 96 kHz
5 192 kHz
6 – 15 reserved
bits_per_sample - specifies the sampling resolution of the audio samples for all channels in
the F1 LPCM audio stream as shown in Table 3-10.
Table 3-10 bits_per_sample
Value Meaning
0 Reserved
1 16 bits/sample
2 20 bits/sample
3 24 bits/sample
3.2.4.4. F1 LPCM Elementary Stream Constraints
This Section specifies the syntax and semantics of the F1 LPCM audio stream.
All the channels shall be sampled simultaneously on sampling phase.
The following conditions shall not change in the F1 LPCM audio stream carried in a file.
Sampling frequency
Bits per sample
The channel assignment for each channel configuration
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 23 15 March 2013 CONFIDENTIAL
3.2.4.4.1. LPCM audio samples
Audio samples are 16, 20, or 24-bit two’s complement integers. Bit ordering for LPCM audio
samples is such that the most significant bit (msb) is the first (left-most) bit and the least
significant bit (lsb) is last.
3.2.4.4.1.1. 16-bit LPCM audio samples
A 16-bit LPCM sample is split into two bytes, as shown in Figure 3-2. The high byte represents
the eight most significant (b15..b8), and the low byte represents the eight least significant bits
(b7..b0).
b15 b0
LSB MSB
Figure 3-2 16-bit LPCM sample
3.2.4.4.1.2. 24-bit LPCM audio samples
A 24-bit LPCM sample is split into three bytes, as shown in Figure 3-3. The high byte represents
the eight most significant (b23..b16), the middle byte represents bits (b15..b8), and the low
byte represents the eight least significant bits (b7..b0).
b23 b0
LSB MSB
Figure 3-3 24-bit LPCM sample
3.2.4.4.1.3. 20-bit LPCM audio samples
Four zero-value bits shall be postfixed to a 20-bit LPCM sample to make 24-bits, as shown in
Figure 3-4. The 24-bits with trailing zeros are then packed in the same way as 24-bit LPCM
samples.
b23
b3 b2 b1 b0
LSBMSB
0 0 0 0b4
Figure 3-4 20-bit LPCM sample
SONY F1 Service Format Specification Version 0.94
24 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
3.2.4.4.2. Group of LPCM samples (GOL)
Each group of LPCM samples(GOL) contains sequence of LPCM samples. The samples within
each GOL shall be in the order of their channel number.
1st group of LPCM samples 2nd group of LPCM samples
1st group of LPCM samples 2nd group of LPCM samplesMSB LSB MSB LSB
1st group of LPCM samples 2nd group of LPCM samplesMSB LSB MSB LSB
1st group of LPCM samples 2nd group of LPCM samplesMSB LSB MSB LSB
1st group of LPCM samples 2nd group of LPCM samplesMSB LSB MSB LSB
ChannelNumber 1
ChannelNumber 2
ChannelNumber 3
ChannelNumber 4
Figure 3-5 Group of LPCM samples
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 25 15 March 2013 CONFIDENTIAL
3.2.4.4.3. Audio access unit (audio frame) of the F1 LPCM audio stream
The presentation length of an audio access unit (audio frame) of the LPCM audio stream is
equal to 40 milli-second.
If the sampling frequency of the LPCM audio stream is 48 kHz, an audio access unit (audio
frame) of the LPCM audio stream consists of 1920 GOLs.
If the sampling frequency of the LPCM audio stream is 96 kHz, an audio access unit (audio
frame) of the LPCM audio stream consists of 3840 GOLs.
If the sampling frequency of the LPCM audio stream is 192 kHz, an audio access unit
(audio frame) of the LPCM audio stream consists of 7680 GOLs.
Here each of GOL contains N samples; the N is the number of channels, and this shall be 2,
4, 6, or 8.
The Syntax of the F1LPCMAudioDataPayload is shown below:
F1LPCMAudioFrame contains one audio access unit (audio frame) of the F1 LPCM
audio stream as defined in section 0.
The size in bytes of this field is different for each value of the
audio_data_payload_size of the F1LPCMSpecificBox as specified in section
3.2.4.3.
F1LPCMAudioDataPayload() {
F1LPCMAudioFrame
}
SONY F1 Service Format Specification Version 0.94
26 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
3.3. Subtitle Format (TBD)
3.3.1. Subtitle Format
The Subtitle Elementary stream in subtitle track for SONY F1 file SHALL comply with
requirements defined in section 6.2 of DECE-CFF 1.0.3 [7] with additional constraints defined in
this section.
Additional constraints
The root container SHALL be set to the size of 1920x1080 in units of pixels. The
subtitles SHALL be designed for 1920x1080 video format.
CFF-TT text subtitles in a subtitle track SHALL be authored such that their size and
position falls within the bounds of the width and height parameters of the Track
Header Box (‘tkhd’) of the subtitle track.
Nested span SHALL NOT be used.
Nested division SHALL NOT be used.
‘cff:forcedDisplayMode’ SHALL NOT be set to ‘true’.
If a region whose ‘tts:showBackground’ attribute is set to ‘always’ is specified
(including the case that ‘tts:showBackground’ attribute is not specified),
Start time of subtitle presentation SHALL be set to 0.0 sec.
End time of subtitle presentation SHALL be set to duration of the track which has
longest duration in the DECE CFF Container.
‘start’, ‘end’ and ‘duration’ attribute SHALL NOT be specified in such ‘region’
element.
Time expression SHALL be calculated in ‘dropNTSC’ mode.
Note that ‘ttp:dropMode’ element SHALL NOT be used according to section 6.2
of DECE-CFF 1.0.3 [7].
A frame count discontinuity occurs in ’dropNTSC’ mode, however, content
authors need not to be aware of frame count discontinuity.
Transformation from time to MP4 tick is, e.g. in case of 90000Hz tick and
29.97Hz video framerate, a frame duration is 3000. In this case, 1 second
duration denotes 90000 tick.
Time expression (the longest time of subtitle) SHALL NOT be ‘float’ value.
‘auto’ value SHALL NOT be specified in ‘tts:extent’ or ‘tts:origin’ attribute.
‘normal’ value SHALL NOT be specified in ‘tts:lineHeight’ attribute.
‘sign’ (“+” or “-”) value SHALL NOT be used to the ‘length’ value in any attributes.
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 27 15 March 2013 CONFIDENTIAL
‘length’ value specified in ‘tts:fontSize’ attribute SHALL be from 8 to 144 in units of
pixels.
Minimum Font Size = 8px
Maximum Font Size = 144px
‘length’ value specified in ‘tts:lineHeight’ attribute SHALL be from 8 to 144 in units of
pixels.
Minimum Line Height = 8px
Maximum Line Height = 144px
3.3.2. Hypothetical Render Model
The hypothetical render model SHALL comply with requirements defined in section 6.5.1, 6.5.2,
6.5.4 and 6.5.5 of DECE-CFF 1.0.3 [7] with following items.
Block diagram of hypothetical render model is replaced with Figure 3-6.
The subtitle SHALL be scaled to the same size as video after the subtitle rendering to
be combined with video plane. (See Figure 3-6)
The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL match the
Subtitle Plane and Video Plane color space for subtitle overlay.
Figure 3-6 Block Diagram of Hypothetical Render Model
File Parser
Doc Buffer
Enc. ImageBuffer
Image Dec
Doc DOM 1
Doc DOM 2
Dec. ImageBuffer 1
Dec. ImageBuffer 2
GlyphRenderer
PresentationCompositor
PresentationBuffer
SubtitlePlane
(1920x1080)Scaling
OutputVideo Plane(3840x2160)
SONY F1 Service Format Specification Version 0.94
28 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
3.3.3. Subtitle Selection
In this subsection, the Track IDs for subtitle tracks are defined categorized by the role of the
subtitle. The Track ID for each subtitle track SHOULD be set as defined in Table 3-11.
Table 3-11 Track ID and Role of subtitle track
Track ID Role of subtitle track
128-255 Closed caption for accessibility which
corresponds to CC1 (cc_type=002) of EIA-708
B [8]
256-383 Closed caption for accessibility which
corresponds to CC2 (cc_type=012) of EIA-708
B [8]
384-639 For other use (e.g. normal subtitle,
commentary)
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 29 15 March 2013 CONFIDENTIAL
3.4. Metadata (TBD)
This section describes the requirements of XML document for metadata for SONY F1 Service.
3.4.1. SONY F1 Required Metadata
The XML document of Required metadata for SONY F1 Service is based on DECE Container
Required Metadata defined in section 4.1 of DECE-META [9] with additional extensions and
requirements in this section.
All types and elements are in the ‘mdf1’ namespace unless otherwise specified.
ContainerMovieMetadata-type
The ContainerMovieMetadata-type defined in section 4.1.1 of DECE-META
[9] is replaced with Table 3-12.
The Ratings element SHALL be present.
The Chapters element SHALL be present.
The value of Priority is as defined as follows:
The value 0 is reserved.
The value SHALL be in range of 1 to 255.
AdditionalLocalizedInfoList-type
AdditionalLocalizedInfoList-type is as defined in Table 3-13.
AdditionalLocalizedInfo-type
AdditionalLocalizedInfo-type is as defined in Table 3-13.
If no copy right description is available, the CopyrightLine element SHOULD be
empty.
SONY F1 Service Format Specification Version 0.94
30 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
Table 3-12 ContainerMovieMetadata-type
Element Attribute Definition Type Card.
ContainerMo
vieMetadata
-type
MetadataVersi
onReference
A string that defines the
version of the metadata in this
element. If the metadata
changes, this string SHOULD
be included and unique relative
to other instances of this
attribute.
xs:string 0..1
priority The priority indicates the
priority of the file for
Auto-delete function. The
value of 0 is reserved.
The larger number indicates in
higher priority that the file is
selected as auto-deleted file.
E.g. the file with priority set to
10 is selected as auto-delete
file compare to the file with
priority set to 3.
xs:unsignedInt 1
ContentMeta
data
Mandatory descriptive
metadata regarding the media
in the Container.
mddece:Conta
inerContentM
etadata-type
RequiredIma
ges
References to Container
required images in Compliance
with Section 3.2 of
DECE-META [9].
md:DigitalAs
setIMageData
-type
1..n
TrackMetada
ta
Descriptions of each track mddece:Conta
inerTrackMet
adata-type
Ratings Content ratings for media in
the Container as defined in
Section 7.3 of Common
md:ContentRa
ting-type
0..1
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 31 15 March 2013 CONFIDENTIAL
Metadata [2].
Chapters Chapter entry points
mddece:Conta
inerChapterL
ist-type
0..1
OptionalIma
ges
References to Container
optional images
md:DigitalAs
setIMageData
-type
0..n
TrackSelect
ions
mddece:Conta
inerTrackSel
ectionList-t
ype
0..1
Interactive
CapabilityL
evel
The Interactive Capability Level
required of Devices to use this
Content.
xs:string 0..1
ContainerVe
rsionRefere
nce
A string that defines the
version of the Container. It can
be used as a reference to
identify changes in the
Container.
xs:string
0..1
AdditionalL
ocalizedInf
oList
mdf1:Additio
nalLocalized
InfoList-typ
e
1..n
(any) An element which enables to
extend the XML document.
xs:any 0..n
Table 3-13 AdditionalLocalizedInfoList-type
Element Attribute Definition Type Card.
AdditionalLo
calizedInfoL
ist-type
AdditionalLo
calizedInfo
Additional localized information
descriptor.
mdf1:Additio
nalLocalized
Info-type
1..n
SONY F1 Service Format Specification Version 0.94
32 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
Table 3-14 AdditionalLocalizedInfo-type
Element Attribute Definition Type Card.
Additional
LocalizedI
nfo-type
Genre Subject-matter
classification of the
show. See Genre
Encoding below.
xs:string
1..n
source Naming system from
which genre is
derived.
xs:anyURI 0..1
id Identifier for genre
used within source
xs:string 0..1
level Indicates precedence
of genre, with a
lower number being
high precedence.
xs:integer 0..1
CopyrightL
ine
Displayable
copyright line.
xs:string 1
(any) An element which
enables to extend
the XML document.
xs:any 0..n
3.4.2. Image Reference
The images referenced to by the metadata XML document for F1 SONY service defined in
section 3.4.1 and section 0 SHALL conform to section 4.3 of DECE-META [9] with additional
constraints defined in this section.
The image referenced to by the metadata XML document MAY NOT be in the file.
The images referenced to by the metadata XML document MAY be referenced by be
referenced by URN [3]. See Figure 3-8 for an example.
In case the images referred to by the contents of XML Box is in the file, the filname SHALL
have a unique index number, e.g. 1.png. See Figure 3-8 for an example.
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 33 15 March 2013 CONFIDENTIAL
Figure 3-7 Examples of TrackReference element for RequiredImages element in
Required Metadata
Figure 3-8 Examples of TrackReference element for OptionalImages element in
Optional Metadata
<mddece:RequiredImages>
...
<md:TrackReference>
http://f1.sony.net/metadata/contentid/thumbnail/1.png</md:TrackReference>
</mddece:RequiredImages>
<mddece:OptionalImages>
...
<md:TrackReference>
urn:dece:conatiner:metadataimageindex:1.png</md:TrackReference>
</mddece:OptionalImages>
SONY F1 Service Format Specification Version 0.94
34 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
4. Content encryption The content encryption SHALL comply with the Marlin extensions to MPEG Common
Encryption Format [11] as defined in section 2.3 of Marlin Adaptive Streaming Specification
[12].
The following requirements SHALL apply if the content is protected by Marlin.
A Protection System Specific Header Box(‘pssh’)
The ‘pssh’ as defined in section 2.3.2 of Marlin Adaptive Streaming Specification
[12] SHALL be present.
The ‘pssh’ box SHALL contain a MarlinKidMappingTable Box (‘mkid’) which
includes all the Content ID mapping information associated with the file protected by
Marlin.
Annex A. Profile
A.1. SONY F1 Phase 1 Day1 Profile
A.1.1. Requirements on SONY F1 Phase 1-Day1 File Format and Media Format
The file format and media format for SONY F1 Phase 1-Day1 SHALL conform with the following
additional requirements in Table A-1.
The requirements are applied for both Type-A and Type-B unless otherwise separate
requirements are defined.
Table A-1 Requirements on SONY F1 Phase 1-Day1 Profile
Type-A Type-B
Container Container Header Asset Information Box (‘ainf’)
- profile_version SHALL be set to ‘sfv1’.
Content
Encryption
IV_size in
‘pssh’
8
Video Encryption Same as audio track
defined in section 4
As defined in section 4
DRM N/A Marlin-BB [12]
Max number of keys
per file
48
Audio Max number of audio 1
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 35 15 March 2013 CONFIDENTIAL
tracks per file
Allowed audio format MPEG-4 AAC LC [2
channel] as defined
in section 3.3.1
MPEG-4 AAC LC
[5.1 channel] as
defined in 3.3.2
MPEG-4 AAC LC [2 channel]
as defined in section 3.3.1
MPEG-4 AAC LC [5.1
channel] as defined in 3.3.2
LPCM [2 channel] as defined
in section 3.3.3 with
following additional
constraints (See section
3.2.3.1)
- channelcount SHALL
be set to 2.
- samplerate SHALL be
set to BB800000h
F1 LPCM [5.1 channel] as
defined in section 3.3.4 with
following additional
constraints
F1LPCMAudioSampleEntry(
‘fpcm’)
- channelcount SHALL
be set to 6
- samplesize SHALL be
set to 16
- samplerate SHALL be
set to BB800000h
F1LPCMSpecificBox(‘fcfg
’)
- channel_assignmen
t SHALL be set to 8 or 9
- sampling_frequenc
y SHALL be set to 1
- bits_per_sample
SHALL be set to 1
Subtitle Max number of
subtitle tracks per file
N/A 4
SONY F1 Service Format Specification Version 0.94
36 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
Video Maximum Bitrate 100x106 bits/s
(80000 [1250 bits/s (cpbBrVclFactor), 1500 bits/s
(cpbBrNalFactor)])
Min number of slices
per picture
8
Maximum duration
of the video
86486.4 sec (24 hours)
File File extension Un-encrypted File: “.sfv”
Encrypted File: “.sev”
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 37 15 March 2013 CONFIDENTIAL
A.1.2. Requirements on SONY F1 Phase 1-Day1 Subtitle processor implementation
The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL conform with definitions
in this section. The definitions in this section minimize the features of the subtitle to the usage
of closed captioning defined in EIA-708-B [8], and it is expected for content authors to prepare
Subtitle Elementary Stream pursuant to ‘SMPTE RP 2052-10:2012 Conversion from CEA-608
Data to SMPTE-TT’.
Subtitle Elementary Stream SHOULD NOT exceed the capability of the SONY F1 Phase-1-Day1
Subtitle processor implementation.
The SONY F1 Phase-1-Day1 Subtitle processor implementation SHALL NOT decode and
present more than one subtitle track simultaneously.
The SONY F1 Phase 1 Day 1 Subtitle processor implementation MAY NOT support to
decode image (PNG) subtitle track.
The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL support rendering
of all glyphs that correspond to the Unicode Code Points defined in Table A-2.
Table A-2 Unicode Code Points to be supported
U+0020 - U+007E (Basic Latin)
U+00A1 - U+00FF (Latin-1 Supplement)
U+0152 (LATIN CAPITAL LIGATURE OE)
U+0153 (LATIN SMALL LIGATURE OE)
U+0160 (LATIN CAPITAL LETTER S WITH CARON)
U+0161 (LATIN SMALL LETTER S WITH CARON)
U+0178 (LATIN CAPITAL LETTER Y WITH DIAERESIS)
U+2018 (Left Single Quotation Mark)
U+2019 (Right Single Quotation Mark)
U+201C (Left Double Quotation Mark)
U+201D (Right Double Quotation Mark)
U+2122 (TRADE MARK SIGN)
U+02DC (SMALL TILDE)
U+25A1 (WHITE SQUARE)
U+266A (EIGHTH NOTE)
In the case where the Unicode Code Point is not supported by the SONY F1 Phase 1 Day 1
Subtitle processor, the Glyph rendered SHALL correspond with Unicode Code Point
U+25A1 ("WHITE SQUARE") or Unicode Code Point U+005F ("LOW LINE").
SONY F1 Service Format Specification Version 0.94
38 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
The SONY F1 Phase 1 Day 1 Subtitle processor SHALL ignore Control Codes Unicode Code
Points (U+0000..U+001F and U+007F..U+009F) encountered in textual content within
CFF-TT subtitles. An ignored Control Code does not affect presentation. No glyph is
rendered for a Control Code.
The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL implement
presentation of at least following nine (9) values for the ‘tts:backgroundColor‘ and
‘tts:color‘ attribute.
'blue', 'cyan', 'green', 'magenta', 'red', 'white', 'yellow', 'black' and ’transparent’
The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL implement at least
following two (2) values for the ‘tts:opacity’ attribute.
0.0 and 1.0
The SONY F1 Phase 1-Day1 Subtitle processor implementation MAY NOT give exact
typeface which specified in the ’ tts:fontFamily’ attribute.
The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL implement
presentation of ‘none’ and 'underline' in the ‘tts:textDecoration’ attribute.
The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL implement
presentation of ‘normal’ and 'italic' in the ‘tts:fontStyle’ attribute.
The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL support at least ’px’
units for ’tts:lineHeight’ attribute.
The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL support at least ’px’
units for ’tts:fontSize’ attribute.
The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL support at least ‘px’
units for ’tts:lineHeight’ attribute and MAY NOT support ‘float’ value.
The SONY F1 Phase 1 Day 1 Subtitle processor implementation MAY NOT present a
character whose font size is less than 8 pixels or more than 144 pixels.
The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL draw the character
with 60 pixels (60 = 1920/32) if any font size are not specified to any characters.
The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL support at least
'percentage' representation for the length expression in ’ tts:origin’ attribute.
The SONY F1 Phase 1-Day1 Subtitle processor implementation MAY NOT support
tts:textOutline attribute.
If ‘tts:extent’ attribute is specified on ‘tt’ element, the SONY F1 Phase 1-Day1 Subtitle
processor implementation SHALL support at least 'px' representation for the length
expression in ’tts:extent’ attribute. Otherwise, the SONY F1 Phase 1-Day1 Subtitle
processor implementation SHALL support at least 'percentage' representation for the
length expression in ’tts:extent’ attribute.
SONY F1 Service Format Specification Version 0.94
© Sony Corporation 2013. All rights reserved. | 39 15 March 2013 CONFIDENTIAL
The SONY F1 Phase 1 Day 1 Subtitle processor implementation MAY NOT support
‘tts:padding’ attribute.
The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL support at
least ”lrtb” for ’tts:writingMode’ attribute.
The SONY F1 Phase 1 Day 1 Subtitle processor implementation MAY NOT
support ’tts:unicodeBidi’ and ’tts:direction’ attribute.
The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL support ”par” for
‘timeContainer’ attribute and MAY NOT support ”seq” for ’timeContainer’ attribute.
The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL support at least
following time expressions.
HH:MM:SS:FF
HH:MM:SS
The SONY F1 Phase 1-Day1 Subtitle processor implementation SHALL support time
expressions calculated in ‘dropNTSC’ mode. Note that DECE CFF [7] defines that
‘tts:dropNTSC’ element is prohibited.
The SONY F1 Phase 1 Day 1 Subtitle processor implementation MAY NOT satisfy the
decoding and drawing rates defined in Table C - 6 in DECE CFF [7].
The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL satisfy the text
rendering rates defined in Table A-3.
Table A-3 Text Rendering Rates
Font Size text rendering rate
[characters/s]
8-72 120 (TBD)
73-144 100 (TBD)
Note: As a character whose Unicode Code Point is U+0009, U+000A, U+000D, U+0020 or
U+0085 is not displayed at all, it does not deeded as an atomic unit of text.
The SONY F1 Phase 1 Day 1 Subtitle processor implementation SHALL satisfy the drawing
rate defined in Table A-4.
Table A-4 Drawing Rate
Property Constraint
Background Drawing rate 20x220 [pixels/s]
SONY F1 Service Format Specification Version 0.94
40 | Sony Corporation 2013. All rights reserved. 15 March 2013
CONFIDENTIAL
A.1.3. Requirements on SONY F1 Phase 1-Day1 Required Metadata (TBD)
The Required Metadata d media format for SONY F1 Phase 1-Day1 SHALL conform with the
following additional requirements.
Elements
ContainerMovieMetadata-type
The (any) element SHALL NOT be used.
The AdditionalLocalizedInfoList element SHALL be the last element.
ContainerTrackMetadata-type
SegmentSize SHALL be set to 0 in case the value is unknown.
ContainerChapterLIst-type
The maximum number of Chapter element SHALL be 128.
AdditionalLocalizedInfo-type
The (any) element SHALL NOT be used.
The CopyrightLine element SHALL be the last element.
DigitalAssetMetadata-type
Image element SHALL NOT be present.
DigitalAssetAudioData-type, when present;
TrackReference element SHALL NOT be present.
DigitalAssetVideoData-type, when present;
TrackReference element SHALL NOT be present.
DigitalAssetSubtitleData-type
TrackReference element SHALL NOT be present
Images
In case the images and any other binary data referred to by the contents of the XML
Box for Required Metadata are in the file, the image format SHALL be PNG format.
Image Size (TBD)
Image file size (TBD)
XML document
XML file size (TBD)
XML schema
The schema for SONY F1 Phase 1 Day1 Required Metadata is defined in
“mdf1-p1-d1.xsd”