Open IPTV Forum
Release 2 Specification

Volume 2 - Media Formats

[V2.3] - [2014-01-24]

Postal Address
Open IPTV Forum support office
650 Route des Lucioles - Sophia Antipolis
Valbonne - FRANCE
Tel.: +33 4 92 94 43 83
Fax: +33 4 92 38 52 90

Internet
http://www.oipf.tv

Disclaimer
The Open IPTV Forum accepts no liability whatsoever for any use of this document.

Copyright Notification
No part may be reproduced except as authorized by written permission.
Any form of reproduction and/or distribution of these works is prohibited.
Copyright 2014 © Open IPTV Forum e.V.
All rights reserved.


Abstract

This Technical Specification (TS) has been produced by the Open IPTV Forum.

This specification provides multiple options for some features. The Open IPTV Forum Profiles specification complements the Release 2 specifications by defining the Open IPTV Forum implementation and deployment profiles. Any implementation based on Open IPTV Forum specifications that does not follow the Profiles specification cannot claim Open IPTV Forum compliance.

Introduction

The Open IPTV Forum Release 2 Specification consists of ten volumes:

The present volume defines the set of media formats and their usage, available for the implementation of Release 2 Open IPTV Forum compliant services and devices.

The set of media formats comprises:

For each of these it is described how they apply to the IPTV solution and to the various Release 2 services (described in [OIPF_OVIEW2]), and the implications for interoperability are discussed.

Figure 1 summarises the set of media formats specified by the present document in the form of a media formats stack. Media formats are specified at the content (audio, video, etc.) layers and for the systems layer. Transport protocols below the systems layer are specified in Volume 4 [OIPF_PROT2].

Figure 1: Media formats stack

This volume specifies formats for the A/V content provided by IPTV services using fixed line access networks or mobile access networks and voice and video telephony services. It does not apply to the broadcast channel input of hybrid devices except where explicitly specified.

This specification defines formats for the delivery of 3D video. At the present time, delivery to fixed terminals is targeted. No special provision is made for mobile or portable devices.

This specification defines the media formats utilised on the UNI Reference Point UNIT-17 of the Open IPTV Forum Functional Architecture [OIPF_ARCH2].

1. References

1.1 Normative references

[3GPPAMR] 3GPP, TS 26.071, "Mandatory speech CODEC speech processing functions; AMR speech Codec; General description".
    3GPP, TS 26.090, "Mandatory Speech Codec speech processing functions; Adaptive Multi-Rate (AMR) speech codec; Transcoding functions".
[3GPPAMRWB] 3GPP, TS 26.171, "Speech codec speech processing functions; Adaptive Multi-Rate - Wideband (AMR WB) speech codec; General description".
    3GPP, TS 26.190, "Speech codec speech processing functions; Adaptive Multi-Rate - Wideband (AMR WB) speech codec; Transcoding functions".
[3GPPAMRWB+] 3GPP, TS 26.290, "Audio codec processing functions; Extended Adaptive Multi-Rate - Wideband (AMR WB+) codec; Transcoding functions".
[3GPPEAAC+] 3GPP, TS 26.401, "General audio codec audio processing functions; Enhanced aacPlus general audio codec; General description".
[3GPPTS26234] 3GPP, TS 26.234, "Transparent end-to-end Packet-switched; Streaming Service (PSS); Protocols and codecs (Release 9)".
[3GPPTS26245] 3GPP, TS 26.245, "Transparent end-to-end Packet switched Streaming Service (PSS); Timed text format".
[AAC] ISO/IEC, 14496-3:2009, "Information Technology - Coding of audio-visual objects - Part 3: Audio".
[AC3] ETSI, TS 102 366 V1.2.1 (2008-08), "Digital Audio Compression (AC-3, Enhanced AC-3) Standard".
[AVCFF] ISO/IEC, 14496-15:2004, "Information Technology - Coding of Audio-Visual Objects - Part 15: Advanced Video Coding (AVC) file format", International Standards Organization.
[CEA-2014-A] Consumer Electronics Association, CEA-2014-A, "Web-based Protocol Framework for Remote User Interface on UPnP Networks and the Internet (Web4CE)", (including the August 2008 Errata).
[CEACC] Consumer Electronics Association, CEA-708-C (2006), "Digital Television (DTV) Closed Captioning".
[CENC] ISO/IEC, FDIS 23001-7:2011, "Information technology - MPEG systems technologies - Part 7: Common encryption in ISO base media file format files".
[DLNAMEDIA] IEC, 62481-2, Digital living network alliance (DLNA) home networked device interoperability guidelines – Part 2: Media Formats, ed1.0 (2007-08).
[DTS] ETSI, TS 102 114 V1.4.1 (2012-09), "DTS Coherent Acoustics; Core and Extensions".
[DVB3D] ETSI, TS 101 547 V1.1.1, "DVB Frame Compatible Plano-Stereoscopic 3DTV".
[DVBSI] ETSI, EN 300 468 V1.13.1 (2012-08), "Digital Video Broadcasting (DVB); Specification for Service Information (SI) in DVB systems".
[DVBSUBT] ETSI, EN 300 743 V1.4.1 (2011-10), "Digital Video Broadcasting (DVB); Subtitling systems".
[DVBTTXT] ETSI, EN 300 472 V1.3.1 (2003-05), "Digital Video Broadcasting (DVB); Specification for conveying ITU-R System B Teletext in DVB bitstreams".
[G711] ITU-T, Recommendation G.711, "Pulse code modulation (PCM) of voice frequencies".
[G719] ITU-T, Recommendation G.719, "Low-complexity, full-band audio coding for high-quality, conversational applications".
[G722] ITU-T, Recommendation G.722, "7 kHz Audio Coding within 64 Kbit/s".
[G729] ITU-T, Recommendation G.729, "Coding of speech at 8 kbit/s using conjugate-structure algebraic-code-excited linear prediction (CS-ACELP)".
[G729-1] ITU-T, Recommendation G.729.1, "G.729 based Embedded Variable bit-rate coder: An 8-32 Kbit/s scalable wideband coder bitstream interoperable with G.729".
[GEM] ETSI, TS 102 728 V1.2.1 (2011-09), "Digital Video Broadcasting (DVB); Globally Executable MHP (GEM) Specification 1.3 (including OTT and hybrid broadcast/broadband)".
[GIF] Graphics Interchange Format version 89a, © 1987, 1988, 1989, 1990, CompuServe Incorporated, Columbus, Ohio.
[H262] ITU-T, Recommendation H.262 / ISO/IEC 13818-2, "Information Technology – Generic Coding of moving pictures and associated audio information: Video".
[H263] ITU-T, Recommendation H.263 (2005), "Video coding for low bitrate communication".
[H264] ITU-T, Recommendation H.264 / ISO/IEC 14496-10:2005, "Information technology - Coding of audio-visual objects - Part 10: Advanced Video Coding".
[ISOFF] ISO/IEC, 14496-12:2012, "Information Technology - Coding of Audio-Visual Objects - Part 12: ISO Base Media file format", International Standards Organization.
[JFIF] Hamilton, Eric. JPEG File Interchange Format. Sep 1992. URL: http://www.w3.org/Graphics/JPEG/jfif3.pdf C-Cube Microsystems. Milpitas, CA, USA.
[MP4FF] ISO/IEC, 14496-14:2003, "Information Technology - Coding of Audio-Visual Objects - Part 14: MP4 file format", International Standards Organization.
[MP4V] ISO/IEC, 14496-2:2004, "Information technology - Coding of audio-visual objects - Part 2: Visual".
[MPEG1] ISO/IEC, 11172-3:1993/Cor 1:1996, "Information Technology - Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s - Part 3: Audio".
[MPEG2TS] ISO/IEC, 13818-1:2000/Amd.3:2004, "Generic coding of moving pictures and associated audio information: Systems".
[MPS] ISO/IEC, 23003-1:2007, "Information Technology - MPEG audio technologies - Part 1: MPEG Surround", including ISO/IEC 23003-1:2007/Cor:2008, "Information Technology - MPEG audio technologies - Part 1: MPEG Surround, TECHNICAL CORRIGENDUM 1".
[MRL-BBTS] Marlin Developer Community, "Marlin Broadband Transport Stream Specification", Version 1.0, July 2008.
[MRL-DMZ] Marlin Developer Community, "Marlin Dynamic Media Zones", Version 1.1.
[MRL-FF] Marlin Developer Community, "Marlin - File Formats Specification", Version 1.1, July 2008, and latest version of "Marlin Errata: Marlin - File Formats Specification V1.1".
[OMARLIN] Marlin Developer Community, "OMArlin Specification", Version 1.0.1, July 2008.
[PNG] ISO/IEC, 15948:2004, "Information technology - Computer graphics and image processing - Portable Network Graphics (PNG): Functional specification".
[TS101154] ETSI, TS 101 154 V1.11.1 (2012-11), "Digital Video Broadcasting (DVB); Specification for the use of Video and Audio Coding in Broadcasting Applications based on the MPEG-2 Transport Stream".
    Also available as DVB Bluebook A157 (06/2012).
[TS102034] ETSI, TS 102 034 V1.4.1 (2007-10), "Digital Video Broadcasting (DVB); Transport of MPEG-2 TS Based DVB Services over IP Networks".
[TS102809] ETSI, TS 102 809 V1.1.1 (2010-01), "Digital Video Broadcasting (DVB); Signalling and carriage of interactive applications and services in hybrid broadcast / broadband environments".
[TS126114] ETSI, TS 126 114 V10.0.0 (2011-04), "IMS Multimedia Telephony; media handling and interaction".
[TS181005] ETSI, TS 181 005 V3.3.1 (2009-12), "TISPAN Service and Capability Requirements".

1.2 OIPF references

[OIPF_ARCH2] Open IPTV Forum, "Functional Architecture - V2.3", January 2014.
[OIPF_CSP2] Open IPTV Forum, "Release 2 Specification, Volume 7 - Authentication, Content Protection and Service Protection", V2.3, January 2014.
[OIPF_DAE2] Open IPTV Forum, "Release 2 Specification, Volume 5 - Declarative Application Environment", V2.3, January 2014.
[OIPF_HAS2] Open IPTV Forum, "Release 2 Specification, Volume 2a - HTTP Adaptive Streaming", V2.3, January 2014.
[OIPF_META2] Open IPTV Forum, "Release 2 Specification, Volume 3 - Content Metadata", V2.3, January 2014.
[OIPF_OVIEW2] Open IPTV Forum, "Release 2 Specification, Volume 1 - Overview", V2.3, January 2014.
[OIPF_PAE2] Open IPTV Forum, "Release 2 Specification, Volume 6 - Procedural Application Environment", V2.3, January 2014.
[OIPF_PROT2] Open IPTV Forum, "Release 2 Specification, Volume 4 - Protocols", V2.3, January 2014.
[OIPF_PROT_EX2] Open IPTV Forum, "Release 2 Specification, Volume 4a - Examples of IPTV Protocol Sequences", V2.3, January 2014.
[OIPF_WSTVP2] Open IPTV Forum, "Release 2 Specification, Volume 5a - Web Standards TV Profile", V2.3, January 2014.

1.3 Informative references

[RFC2119] S. Bradner. Key words for use in RFCs to Indicate Requirement Levels. March 1997. Internet RFC 2119. URL: http://www.ietf.org/rfc/rfc2119.txt

2. Conventions and Terminology

2.1 Conventions

The key words "must", "must not", "required", "shall", "shall not", "should", "should not", "recommended", "may", and "optional" in this document are to be interpreted as described in [RFC2119].

All sections and appendixes, except "Introduction", are normative, unless they are explicitly indicated to be informative.

2.2 Terminology

2.2.1 Definitions

In addition to the Definitions provided in Volume 1, the following definitions are used in this Volume.

Term | Definition
Mobile A/V Service | An IPTV service delivered using mobile access networks and protocols.

2.2.2 Abbreviations

In addition to the Abbreviations provided in Volume 1, the following abbreviations are used in this Volume.

Acronym | Explanation
AAC | Advanced Audio Coding
AAC LC | AAC Low Complexity
ADPCM | Adaptive Differential Pulse-Code Modulation
AIT | Application Information Table
AMR | Adaptive Multi-Rate
AMR-WB | Adaptive Multi-Rate Wideband
ATSC | Advanced Television Systems Committee
DSM-CC | Digital Storage Media - Command and Control
DVB | Digital Video Broadcasting
DVB-SI | DVB Service Information
EBU | European Broadcasting Union
EIT | Event Information Table
ETSI | European Telecommunications Standards Institute
fps | Frames per Second
GIF | Graphics Interchange Format
GOP | Group Of Pictures
HE-AAC | High Efficiency-AAC
JPEG | Joint Photographic Experts Group
MPEG | Moving Picture Experts Group
MPS | MPEG Surround
PCM | Pulse-Code Modulation
PID | Packet Identifier
PMT | Program Map Table
PNG | Portable Network Graphics
PS | Parametric Stereo
PSI | Programme Specific Information
SBR | Spectral Band Replication
SI | Service Information
S/PDIF | Sony/Philips Digital Interconnect Format
UDP | User Datagram Protocol

3. A/V Media Formats

A set of A/V media formats is defined, being combinations of audio, video and systems layer formats defined in the following sections.

The TS and TTS systems layer formats are specified in section 4.1. The protection layers BBTS and PF are specified in Volume 7 [OIPF_CSP2].

The MP4 systems layer format is specified in section 4.2. The protection layers PDCF, MIPMP, CENC and DCF are specified in Volume 7 [OIPF_CSP2].

Video Formats are defined in section 5.1 and Audio Formats in section 8.1.

Volume 3 [OIPF_META2] describes how the media format of content is signalled in the metadata.

For A/V content in 25Hz systems the following A/V media format combinations are defined:

Table 1: A/V Media Formats for 25Hz video system
System Format | Video Format | Audio Format | Mime Type
TS | AVC_HD_25, AVC_SD_25, AVC_SP_25, AVC_3D_25 | HEAAC, HEAAC2, HEAAC_MPS, MPEG1_L2, MPEG1_L2_MPS, AC3, E-AC3, DTS | video/mpeg or video/mp2t
TTS | AVC_HD_25, AVC_SD_25, AVC_SP_25, AVC_3D_25 | HEAAC, HEAAC2, HEAAC_MPS, MPEG1_L2, MPEG1_L2_MPS, AC3, E-AC3, DTS | video/vnd.dlna.mpeg-tts
MP4 | AVC_HD_25, AVC_SD_25, AVC_SP_25, AVC_3D_25 | HEAAC, HEAAC2, HEAAC_MPS, MPEG1_L2, MPEG1_L2_MPS, AC3, E-AC3, DTS | video/mp4
TS | MPEG2_HD_25, MPEG2_SD_25, MPEG2_SP_25 | AC3, E-AC3, MPEG1_L2, MPEG1_L2_MPS | video/mpeg or video/mp2t
TTS | MPEG2_HD_25, MPEG2_SD_25, MPEG2_SP_25 | AC3, E-AC3, MPEG1_L2, MPEG1_L2_MPS | video/vnd.dlna.mpeg-tts

For A/V content in 30Hz systems the following A/V media format combinations are defined:

Table 2: A/V Media Formats for 30Hz video system
System Format | Video Format | Audio Format | Mime Type
TS | AVC_HD_30, AVC_SD_30, AVC_SP_30, AVC_3D_30 | HEAAC, HEAAC2, HEAAC_MPS, AC3, E-AC3, DTS | video/mpeg or video/mp2t
TTS | AVC_HD_30, AVC_SD_30, AVC_SP_30, AVC_3D_30 | HEAAC, HEAAC2, HEAAC_MPS, AC3, E-AC3, DTS | video/vnd.dlna.mpeg-tts
MP4 | AVC_HD_30, AVC_SD_30, AVC_SP_30, AVC_3D_30 | HEAAC, HEAAC2, HEAAC_MPS, AC3, E-AC3, DTS | video/mp4
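
The combinations listed in Table 1 and Table 2 can be expressed as a small lookup. The following informative sketch is one possible encoding, not a normative definition: the names SYSTEMS, PROFILES, AUDIO, MIME_TYPES and is_defined_combination are hypothetical, and the assumption that every video format label has the form codec_profile_rate is drawn only from the labels that appear in the two tables.

```python
# Informative sketch: data transcribed from Table 1 and Table 2.
# All names below are illustrative and not defined by the specification.

MIME_TYPES = {
    "TS": ("video/mpeg", "video/mp2t"),
    "TTS": ("video/vnd.dlna.mpeg-tts",),
    "MP4": ("video/mp4",),
}

# System formats in which each video codec appears in the tables.
SYSTEMS = {"AVC": {"TS", "TTS", "MP4"}, "MPEG2": {"TS", "TTS"}}

# Profile part of the video format label (HD, SD, sub-picture, 3D).
PROFILES = {"AVC": {"HD", "SD", "SP", "3D"}, "MPEG2": {"HD", "SD", "SP"}}

# Audio formats listed per (codec, frame-rate system).
AUDIO = {
    ("AVC", "25"): {"HEAAC", "HEAAC2", "HEAAC_MPS", "MPEG1_L2",
                    "MPEG1_L2_MPS", "AC3", "E-AC3", "DTS"},
    ("AVC", "30"): {"HEAAC", "HEAAC2", "HEAAC_MPS", "AC3", "E-AC3", "DTS"},
    ("MPEG2", "25"): {"AC3", "E-AC3", "MPEG1_L2", "MPEG1_L2_MPS"},
}


def is_defined_combination(system: str, video: str, audio: str) -> bool:
    """Return True if the combination appears in Table 1 or Table 2."""
    parts = video.split("_")
    if len(parts) != 3:
        return False
    codec, profile, rate = parts
    return (
        system in SYSTEMS.get(codec, ())
        and profile in PROFILES.get(codec, ())
        and audio in AUDIO.get((codec, rate), ())
    )
```

For example, is_defined_combination("MP4", "AVC_HD_25", "HEAAC") is true, whereas MPEG-2 video in the MP4 system format and MPEG1_L2 audio with 30Hz AVC video are not listed and are therefore rejected.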

For protected A/V contents, the following protected A/V media format combinations are defined:

Table 3: Protected A/V media formats
System Format | Protection Format | Video Format / Audio Format | Mime Type
TS | BBTS, PF | (a combination of video format and audio format used for TS system, as defined by Table 1 and Table 2) | video/mpeg or video/mp2t
TTS | BBTS, PF | (a combination of video format and audio format used for TTS system, as defined by Table 1 and Table 2) | video/vnd.dlna.mpeg-tts
MP4 | PDCF, MIPMP | (a combination of video format and audio format used for MP4 system, as defined by Table 1 and Table 2) | video/mp4
MP4 | DCF | (a combination of video format and audio format used for MP4 system, as defined by Table 1 and Table 2) | application/vnd.oma.drm.dcf

The following audio media formats are defined that are independent of the video system:

Table 4: Pure audio media formats
Audio Format | Mime Type
MPEG1_L3 | audio/mpeg
HEAAC | audio/mp4 or audio/3gpp
WAV | audio/x-wav
DTS | audio/vnd.dts.hd
AMR | audio/amr
AMR-WB | audio/amr-wb
AMR-WB+ | audio/amr-wb+
HEAAC2 | audio/mp4 or audio/3gpp
AC3 | audio/ac3
E-AC3 | audio/eac3

NOTE: The HEAAC and HEAAC2 pure audio media formats imply carriage of the respective audio content inside the MP4 system format container.

The following graphics formats are defined for usage as specified in section 9:

Table 5: Graphics media formats
Image Format | Mime Type
JPEG | image/jpeg
GIF | image/gif
PNG | image/png

The following video media formats are defined for video telephony services:

Table 6: A/V Media Formats for video telephony
Video Format | Mime Type
H263 | video/H263, video/H263-1998, video/H263-2000
MP4V | video/MP4V-ES
AVC_VDC | video/H264

The following audio media formats are defined for Narrow-Band voice and video telephony services:

Table 7: Audio Formats for voice and video telephony (Narrow-Band)
Audio Format | Mime Type
G711 | audio/PCMA, audio/PCMU
AMR | audio/AMR
G729A | audio/G729

The following audio media formats are defined for Wide-Band voice and video telephony services:

Table 8: Audio Formats for voice and video telephony (Wide-Band)
Audio Format | Mime Type
G722 | audio/G722
AMRWB | audio/AMR-WB
G7291 | audio/G7291

The following audio media formats are defined for Super-Wideband voice and video telephony services:

Table 9: Audio Formats for voice and video telephony (Super-Wideband)
Audio Format | Mime Type
AACLD | audio/mpeg4-generic
AACELD | audio/mpeg4-generic
G719 | audio/G719

The following text format is defined for subtitles provided for Mobile A/V Services:

Table 10: Subtitle Format for Mobile A/V Services
Subtitle Format | Mime Type
3GPP-TT | video/3gpp-tt

4. Systems Layer

At the systems layer, two formats for the carriage of A/V content are defined, namely MPEG-2 Transport Stream and MP4 File Format.

A/V content protection is performed at the systems layer, as defined in [OIPF_CSP2]. The present volume of the specification describes the protected formats in relation to the total set of media format definitions.

4.1 MPEG-2 Transport Stream

The carriage of A/V content and related information (e.g. subtitles) in an MPEG-2 transport stream shall be in compliance with [TS101154] clause 4, with the following additional constraints:

The preceding specification of the MPEG-2 transport stream format is referred to as the TS systems layer format.

An additional variant of the TS format is defined, namely the time stamped MPEG-2 transport stream, as defined in [DLNAMEDIA] section 9.3.4.4, applied to the TS systems layer format.

The time stamped MPEG-2 transport stream format is referred to as the TTS systems layer format.
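
As an informative illustration of the relationship between the TTS and TS systems layer formats, the sketch below converts a TTS byte sequence back to plain TS packets. It assumes the commonly used layout in which each 188-byte transport packet is preceded by a 4-byte timestamp field, giving 192-byte units; the exact layout is defined in [DLNAMEDIA] and should be checked there. The function name tts_to_ts and the constants are hypothetical.

```python
TS_PACKET_SIZE = 188   # size of an MPEG-2 transport stream packet
TTS_HEADER_SIZE = 4    # assumed size of the timestamp prefix per packet
SYNC_BYTE = 0x47       # first byte of every transport stream packet


def tts_to_ts(data: bytes) -> bytes:
    """Strip the assumed 4-byte timestamp prefix from each TTS packet."""
    unit = TTS_HEADER_SIZE + TS_PACKET_SIZE
    if len(data) % unit != 0:
        raise ValueError("input is not a whole number of 192-byte TTS packets")
    out = bytearray()
    for offset in range(0, len(data), unit):
        packet = data[offset + TTS_HEADER_SIZE:offset + unit]
        if packet[0] != SYNC_BYTE:
            raise ValueError(f"missing sync byte in packet at offset {offset}")
        out += packet
    return bytes(out)
```

The sync byte check is only a sanity test on the assumed framing; it does not validate the timestamp values themselves.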

The methods to protect (encrypt) MPEG-2 transport streams are specified in Volume 7 [OIPF_CSP2]. Volume 7 specifies two approaches for content and service protection, namely the terminal-centric approach and the gateway-centric approach.

For the terminal-centric approach and for the output of the CSP gateway in the gateway-centric approach, the protected MPEG-2 transport stream shall comply with protection system signalling as specified in [MPEG2TS] and may use the Conditional Access Table (CAT) as defined therein. This protected format is referred to generically as PF.
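
The signalling referred to above is carried in fields of the transport packet header and in tables such as the CAT. The informative sketch below reads the PID and the transport_scrambling_control bits from a single 188-byte packet as laid out in [MPEG2TS], and flags the PID value 0x0001 on which the CAT is carried. The function name parse_ts_header and the returned dictionary keys are hypothetical; this is not a description of how an OITF is required to detect protected content.

```python
CAT_PID = 0x0001  # PID on which the Conditional Access Table is carried


def parse_ts_header(packet: bytes) -> dict:
    """Extract PID and scrambling information from one transport packet."""
    if len(packet) != 188 or packet[0] != 0x47:
        raise ValueError("not a 188-byte transport stream packet")
    # The PID is the low 5 bits of byte 1 followed by all 8 bits of byte 2.
    pid = ((packet[1] & 0x1F) << 8) | packet[2]
    # transport_scrambling_control is the top 2 bits of byte 3;
    # the value 0 means the payload is not scrambled.
    scrambling_control = (packet[3] >> 6) & 0x03
    return {
        "pid": pid,
        "scrambling_control": scrambling_control,
        "scrambled": scrambling_control != 0,
        "is_cat": pid == CAT_PID,
    }
```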

For the gateway-centric approach, the input stream to the CSP gateway is not specified, except in the case of the CI+ gateway-centric approach, where the input stream shall comply with the PF format. PF applies to both the TS and TTS systems layer formats.

The protected MPEG-2 transport stream format for the terminal-centric approach is further defined in [MRL-BBTS] and is referred to as BBTS. BBTS applies to both the TS and TTS systems layer formats.

The OITF shall support the application signaling and in-band delivery of DAE applications via the IP channel, as defined above. In environments where the broadcast channel is based on DVB network technologies and uses DVB-SI as specified in [DVBSI], the OITF shall also support the application signaling and in-band delivery of DAE applications via the broadcast channel.

4.2 MP4 File Format

The carriage of A/V content and related information (e.g. subtitles) in file-based formats (systems layer format: MP4) shall use the MP4 File Format [MP4FF] and ISO Base Media File Format [ISOFF] standards with the constraints defined in section 9.4.4.3 of [DLNAMEDIA], except for 9.4.4.3.3 and 9.4.4.3.10. This is the preferred format for MP4 based unprotected content.

For services that allow the real-time playback of downloaded content before the download has been completed (e.g. Progressive Download), the following additional constraints apply:

A service may apply the additional constraint on moov box size according to section 9.4.4.3.11 of [DLNAMEDIA], in order to provide content compliant to the DLNA specification.

In addition, carriage of H.264/AVC content in the MP4 systems layer shall be conformant to the AVC File Format standard [AVCFF].

In addition, carriage of MPEG-4 AAC/HE-AAC content in the MP4 systems layer shall be conformant to the MP4 File Format standard [MP4FF].

The storage of AC-3 and Enhanced AC-3 content in the MP4 file format shall be conformant to Annex F of [AC3].

MP4 files may contain media zone information (zone map), possibly including navigation constraints, using the signalling mechanisms specified in [MRL-DMZ]. Rules about the handling of Marlin media zone information by the OITF, for both unprotected and protected content, are contained in section 6 of [OIPF_CSP2]. Note: this means an MP4 file may contain one or more mDMZ boxes containing zone parameters and zone properties (i.e., navigation constraints) according to [MRL-DMZ] section 7.1.

The methods to protect (encrypt) MP4-based file formats are specified in [OIPF_CSP2]. Four protection methods are specified and they are allocated the protection format labels as follows:

MP4 files containing 3D content shall comply with section 8.15 of [ISOFF], i.e. the 3D video sample shall be signalled as restricted video (‘resv’) with stereoscopic video scheme type (‘stvi’).
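
The following informative sketch shows one way to check for this signalling by walking the box structure of a file held in memory. It assumes the usual box nesting from [ISOFF] (moov, trak, mdia, minf, stbl, stsd), a 78-byte run of fixed fields in a visual sample entry before its child boxes, and a scheme type box ('schm') nested in a restricted scheme information box ('rinf'). The names iter_boxes and stereo_signalled are hypothetical, and the code is a simplified reader rather than a complete parser.

```python
import struct

# Container boxes descended into on the way to the scheme type box.
CONTAINERS = {b"moov", b"trak", b"mdia", b"minf", b"stbl", b"rinf"}
# Fixed fields between a visual sample entry header and its child boxes.
VISUAL_SAMPLE_ENTRY_FIELDS = 78


def iter_boxes(buf, start, end):
    """Yield (type, payload_start, box_end) for each box in buf[start:end]."""
    pos = start
    while pos + 8 <= end:
        size, box_type = struct.unpack_from(">I4s", buf, pos)
        header = 8
        if size == 1:      # a 64-bit largesize follows the type field
            if pos + 16 > end:
                break
            size = struct.unpack_from(">Q", buf, pos + 8)[0]
            header = 16
        elif size == 0:    # box extends to the end of its container
            size = end - pos
        if size < header or pos + size > end:
            break          # malformed box: stop rather than guess
        yield box_type, pos + header, pos + size
        pos += size


def stereo_signalled(buf, start=0, end=None):
    """Return True if a 'resv' sample entry carries scheme type 'stvi'."""
    end = len(buf) if end is None else end
    for box_type, payload, box_end in iter_boxes(buf, start, end):
        if box_type in CONTAINERS:
            if stereo_signalled(buf, payload, box_end):
                return True
        elif box_type == b"stsd":
            # Skip version/flags (4 bytes) and entry_count (4 bytes).
            for entry, entry_payload, entry_end in iter_boxes(buf, payload + 8, box_end):
                if entry == b"resv" and stereo_signalled(
                    buf, entry_payload + VISUAL_SAMPLE_ENTRY_FIELDS, entry_end
                ):
                    return True
        elif box_type == b"schm":
            # Skip version/flags (4 bytes); scheme_type is the next 4 bytes.
            if buf[payload + 4:payload + 8] == b"stvi":
                return True
    return False
```

Only sample entries of type 'resv' are inspected, so a scheme type box belonging to a protection scheme in another kind of sample entry is not examined.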

4.3 Service Usage

Multicast IPTV services (Scheduled Content) shall use either the TS or the TTS systems layer format.

Unicast streamed IPTV services (Scheduled Content, CoD) using the Direct UDP or RTP/UDP transport protocols shall use either the TS or the TTS systems layer format.

Unicast streamed IPTV services (Scheduled Content, CoD) using the HTTP transport protocol (including adaptive streaming) shall use either the TS, the TTS, or the MP4 systems layer format.

Download IPTV services (CoD) shall use either the TS, the TTS, or the MP4 systems layer format.

Voice and video telephony services using the RTP/UDP transport protocol do not require any specific systems layer format.

Mobile A/V services do not require any specific systems layer format.

The systems layer formats used for content services are summarised in the following table.

Table 11: Systems layer formats for content services
Service | Transport Protocol | Systems layer format
Scheduled content | Multicast Direct UDP or RTP/UDP | TS, TTS
Scheduled content | Unicast Direct UDP or RTP/UDP | TS, TTS
Scheduled content | HTTP (including adaptive streaming) | TS, TTS
Streamed CoD | Unicast Direct UDP or RTP/UDP | TS, TTS
Streamed CoD | HTTP (including adaptive streaming) | TS, TTS, MP4
Download CoD | HTTP | TS, TTS, MP4
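
Table 11 lends itself to a direct lookup keyed on service and transport protocol. The informative sketch below transcribes the table rows as they are printed; the names SYSTEMS_LAYER_FORMATS and allowed_systems_formats are hypothetical, and the key strings are simply the cell texts of the table.

```python
# Informative sketch: rows transcribed from Table 11.
SYSTEMS_LAYER_FORMATS = {
    ("Scheduled content", "Multicast Direct UDP or RTP/UDP"): ("TS", "TTS"),
    ("Scheduled content", "Unicast Direct UDP or RTP/UDP"): ("TS", "TTS"),
    ("Scheduled content", "HTTP (including adaptive streaming)"): ("TS", "TTS"),
    ("Streamed CoD", "Unicast Direct UDP or RTP/UDP"): ("TS", "TTS"),
    ("Streamed CoD", "HTTP (including adaptive streaming)"): ("TS", "TTS", "MP4"),
    ("Download CoD", "HTTP"): ("TS", "TTS", "MP4"),
}


def allowed_systems_formats(service: str, transport: str) -> tuple:
    """Return the systems layer formats listed for a service and transport.

    An empty tuple means the pair does not appear in Table 11.
    """
    return SYSTEMS_LAYER_FORMATS.get((service, transport), ())
```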

5. Video

The specification of video formats and codec profiles is based upon the DVB A/V codec usage specification for applications based on MPEG-2 transport streams [TS101154]. The present specification further profiles the DVB specification by mandating certain codec choices and video formats.

H.264/AVC [H264] (video format label: AVC) is the preferred video codec for both standard definition and high definition content and is the mandatory video content format. Decoding support for AVC is a mandatory minimum OITF capability with regard to A/V media formats.

MPEG-2 video [H262] (video format label: MPEG2) may be used when appropriate, for example when legacy equipment or content in that format has already been deployed, or due to regulatory or contractual considerations.

H.264/AVC [H264] (video format label: AVC_VDC), MPEG-4 Part 2 Visual [MP4V] (video format label: MP4V) and H.263 [H263] (video format label: H263) may be used for video telephony services. If video telephony services are supported on the OITF, encoding and decoding of H.264 (video format label AVC_VDC) shall be supported and encoding and decoding of H.263 (video format label: H263) is recommended.

H.264/AVC [H264] (video format label: AVC_VDC), MPEG-4 Part 2 Visual [MP4V] (video format label: MP4V) and H.263 [H263] (video format label: H263) may be used for Mobile A/V services.

Plano-stereoscopic 3D TV is supported using H.264/AVC, using side-by-side and top-bottom formats.

5.1 Formats

Five profiles of video content are defined and described in the following sub-sections:

Additionally, Video formats for Mobile A/V Services are defined.

5.1.1 High Definition Profile

5.1.1.1 H.264/AVC

H.264/AVC HD video content shall comply with [TS101154] clauses 5.5 and 5.7.

This format corresponds to video format label AVC_HD_25 in 25Hz systems, and AVC_HD_30 in 30Hz systems.

5.1.1.2 MPEG-2

MPEG-2 HD video content in 25Hz systems shall comply with [TS101154] clause 5.2 with the following exceptions:

  • The 2.21:1 Aspect Ratio shall not be used.

This format corresponds to video format label MPEG2_HD_25.

MPEG-2 HD video content in 30Hz systems shall comply with [TS101154] clause 5.4 with the following exceptions:

  • The 2.21:1 Aspect Ratio shall not be used.

This format corresponds to video format label MPEG2_HD_30.

5.1.2 Standard Definition Profile

5.1.2.1 H.264/AVC

H.264/AVC SD video content shall comply with [TS101154] clauses 5.5 and 5.6.

This format corresponds to video format label AVC_SD_25 in 25Hz systems, and AVC_SD_30 in 30Hz systems.

5.1.2.2 MPEG-2

MPEG-2 SD video content in 25Hz systems shall comply with [TS101154] clause 5.1 with the following exceptions:

  • The 2.21:1 Aspect Ratio shall not be used.

This format corresponds to video format label MPEG2_SD_25.

MPEG-2 SD video content in 30Hz systems shall comply with [TS101154] clause 5.3 with the following exceptions:

  • The 2.21:1 Aspect Ratio shall not be used.

This format corresponds to video format label MPEG2_SD_30.

5.1.3 Video Telephony Profile

5.1.3.1 H.264/AVC

H.264/AVC video content shall comply with [TS126114] clause 5.2.2.

This format corresponds to video format label AVC_VDC.

5.1.3.2 MPEG-4 Part-2 Visual

MPEG-4 Part-2 Visual video content shall comply with [TS126114] clause 5.2.2.

This format corresponds to video format label MP4V.

5.1.3.3 H.263

H.263 video content shall comply with [TS126114] clause 5.2.2.

This format corresponds to video format label H263.

5.1.4 Sub-Picture Profile

The following table lists the supported coding parameters for sub-picture video.

Table 12: Sub-Picture formats
Horizontal Resolution (pixels) | Vertical Resolution (lines) | Scan Type | Frame Rate (fps) | Aspect Ratio | System
192 | 192 | p | 23.976, 24, 59.94 | 16:9 | 30Hz
192 | 144 | p | 23.976, 24, 59.94 | 16:9 | 30Hz
128 | 96 | p | 23.976, 24, 59.94 | 16:9 | 30Hz
192 | 192 | p | 29.97 | 16:9, 4:3 | 30Hz
192 | 144 | p | 29.97 | 16:9, 4:3 | 30Hz
128 | 96 | p | 29.97 | 16:9, 4:3 | 30Hz
192 | 192 | p | 25 | 16:9, 4:3 | 25Hz
192 | 144 | p | 25 | 16:9, 4:3 | 25Hz
128 | 96 | p | 25 | 16:9, 4:3 | 25Hz
192 | 192 | p | 50 | 16:9 | 25Hz
192 | 144 | p | 50 | 16:9 | 25Hz
128 | 96 | p | 50 | 16:9 | 25Hz

5.1.4.1 H.264/AVC

The IPTV solution shall utilize the following encoded video media profile for content used in Sub Picture streams.

  • H.264/AVC Main Profile @ Level 1.3

This format corresponds to video format label AVC_SP_25 in 25Hz systems, and AVC_SP_30 in 30Hz systems.

5.1.4.2 MPEG-2

The MPEG-2 Main Profile @ Low Level as defined in [H262] shall be used for Sub-Picture video streams. The DVB codec toolbox [TS101154] does not provide constraints applicable to Sub-Picture formats.

This format corresponds to video format label MPEG2_SP_25 in 25Hz systems, and MPEG2_SP_30 in 30Hz systems.

5.1.5 Video formats for Mobile A/V Services

5.1.5.1 H.263

H.263 video content shall comply with [3GPPTS26234] section 7.4.

5.1.5.2 H.264/AVC

H.264/AVC video content shall comply with [3GPPTS26234] section 7.4.

5.1.5.3 MPEG-4 Part-2 Visual

MPEG-4 Part-2 Visual video content shall comply with [3GPPTS26234] section 7.4.

5.1.6 H.264/AVC GOP Structure

All AVC format content provided in IPTV services shall conform to the following constraints in GOP structure:

  • I picture: A picture with slice_type=7 or slice_type=2 for all the slices composing that picture or IDR picture
  • P picture: A picture with slice_type=5 or slice_type=0 for all the slices composing that picture.
  • B picture: A picture with slice_type=6 or slice_type=1 for all the slices composing that picture.
  • Decoding order among I or P pictures shall be kept in their display order.
  • P picture shall not refer to B pictures.
  • Complementary reference field pair that includes I/P field shall not include B field.
  • Reference B picture shall refer to the following.
    • I or P frames or complementary reference field pairs of I or P pictures that immediately precedes/follows in display order.
  • Non-reference B picture shall refer to the following.
    • I or P frames or complementary reference field pairs of I or P pictures that immediately precedes/follows in display order.
    • A reference B frame or a complementary reference field pair of reference B pictures that immediately precedes/follows in display order and is present between "pic1" and "pic2" in display order. Here, "pic1" is immediately preceding I or P picture and "pic2" is immediately following I or P picture.
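
The picture type definitions and the ordering rule for I and P pictures in the list above can be sketched as follows. This is an informative illustration only: the function names classify_picture and anchors_in_display_order are hypothetical, the input is assumed to be already-parsed slice_type values and display positions, and the reference constraints on B pictures are not checked.

```python
# slice_type values per picture type, taken from the list above.
I_TYPES = {2, 7}
P_TYPES = {0, 5}
B_TYPES = {1, 6}


def classify_picture(slice_types, is_idr=False):
    """Return "I", "P" or "B", or None if the slices do not all match one type."""
    types = set(slice_types)
    if not types:
        raise ValueError("a picture needs at least one slice")
    if is_idr or types <= I_TYPES:
        return "I"
    if types <= P_TYPES:
        return "P"
    if types <= B_TYPES:
        return "B"
    return None


def anchors_in_display_order(pictures):
    """Check that I and P pictures are decoded in their display order.

    pictures is a list of (picture_type, display_index) in decoding order.
    """
    anchors = [index for kind, index in pictures if kind in ("I", "P")]
    return anchors == sorted(anchors)
```

A picture whose slices mix, for example, P and B slice types falls outside all three definitions, which is why classify_picture returns None for it.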

5.1.7 3D

5.1.7.1 H.264/AVC

3D content shall comply with [DVB3D] section 5.1. The coded video stream shall apply the frame packing arrangement supplemental enhancement information (SEI) message according to [DVB3D] section 6.4.

This format corresponds to video format label AVC_3D_25 in 25Hz systems, and AVC_3D_30 in 30Hz systems.

The following formats are specified by [DVB3D]:

  • 720p @ 50Hz Top-and-Bottom (TaB);
  • 720p @ 50Hz Side-by-Side (SbS) (*);
  • 1080i @ 25Hz Side-by-Side (SbS);
  • 720p @ 59.94 / 60 Hz Top-and-Bottom (TaB);
  • 720p @ 59.94 / 60 Hz Side-by-Side (SbS) (*);
  • 1080i @ 29.97 / 30 Hz Side-by-Side (SbS);
  • 1080p @ 23.98 / 24 Hz Top-and-Bottom (TaB);
  • 1080p @ 23.98 / 24 Hz Side-by-Side (SbS) (*).

(*): these formats are optional for compliance with HDMI. Some 3DTV capable display devices might not support these 3DTV video formats.
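
In frame compatible formats both views share one coded frame, so each view has half the resolution along one axis. The informative sketch below derives the per-view resolution from the coded frame size and the arrangement type, assuming the frame_packing_arrangement_type values 3 (side-by-side) and 4 (top-bottom) of the frame packing arrangement SEI message in [H264]. The names FRAME_PACKING_TYPES and per_view_resolution are hypothetical.

```python
# Assumed frame_packing_arrangement_type values from the H.264 SEI message.
FRAME_PACKING_TYPES = {3: "SbS", 4: "TaB"}


def per_view_resolution(width: int, height: int, arrangement_type: int):
    """Return (width, height) of one view inside a frame compatible picture."""
    layout = FRAME_PACKING_TYPES.get(arrangement_type)
    if layout == "SbS":
        # Left and right views sit next to each other: half the width each.
        return (width // 2, height)
    if layout == "TaB":
        # Views are stacked vertically: half the height each.
        return (width, height // 2)
    raise ValueError(f"unsupported arrangement type {arrangement_type}")
```

For a 1920x1080 Side-by-Side frame each view is 960x1080, and for a 1280x720 Top-and-Bottom frame each view is 1280x360.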

5.2 Service Usage

The video formats specified in the Standard Definition Profile and in the High Definition Profile are applicable to A/V content provided within any of the Release 2 IPTV services. The video formats for Mobile A/V Services are applicable to any of the Release 2 IPTV services. The video formats specified in the Video Telephony Profile are applicable to content provided within Video Telephony services. The video formats specified in the Sub Picture Profile are applicable for use with Picture-in-Picture function.

6. Subtitles

This section defines the formats of subtitle streams for the purpose of providing alternative language subtitles and closed captions for A/V services. The decision on the use and format of subtitle streams is made by the service provider or content provider. Subtitle content may be provided with any IPTV service.

6.1 Formats

For an IPTV service delivered using the TS or TTS system formats, any of the following subtitle formats shall be used:

If other subtitle formats are used, e.g. for market specific or regulatory reasons, their usage is outside the scope of the present specification.

For Mobile A/V Services, the Timed Text [3GPPTS26245] format shall be used. This format corresponds to subtitle format label 3GPP-TT.

6.2 Service Usage

Subtitle streams within an IPTV service may be used for the provision of:

7. Teletext

This section defines the formats of teletext for the purpose of providing an information service together with the A/V stream. Teletext is a legacy sub-service of Scheduled Content Services utilised in some parts of the European market.

Teletext information may be supported by the Scheduled Content Service.

It is expected that in the future such information services will be provided by the Declarative Application Environment [OIPF_DAE2].

7.1 Formats

For an IPTV scheduled content service delivered using the TS or TTS system formats, the Teletext information shall be based on EBU Teletext [DVBTTXT]. This format corresponds to the format label EBU-TTXT.

7.2 Service Usage

The Scheduled Content service may include teletext information.

Teletext information shall not be provided with content delivered by the Content on Demand services.

8. Audio

The specification of audio formats and codec profiles is based upon the DVB A/V codec usage specification for applications based on MPEG-2 transport streams [TS101154]. The present specification further profiles the DVB specification by mandating certain codec choices and audio formats.

MPEG-4 AAC or HE-AAC [AAC] (audio format label: HEAAC) is the preferred audio codec for A/V content and is the mandatory audio content format. Decoding support for HE-AAC is a mandatory minimum OITF capability with regard to A/V media formats.

MPEG-4 HE-AAC v2 [AAC] (audio format label: HEAAC2) may be used when appropriate, as designated by systems requirements.

MPEG-1 Audio Layer II [MPEG1] (audio format label: MPEG1_L2) or AC-3 (Dolby Digital) [AC3] (audio format label: AC3) may be used when appropriate, for example when legacy equipment or content in that format has already been deployed, or due to regulatory or contractual considerations.

DTS-HD [DTS] (audio format label: DTS) may be used when appropriate, as designated by systems requirements.

Enhanced AC-3 (Dolby Digital Plus) [AC3] (audio format label: E-AC3) may be used when appropriate, as designated by systems requirements.

MPEG Surround [MPS] (audio format label: MPS) may be used in combination with MPEG-4 AAC or HE-AAC or MPEG-1 Layer II. This combination implements scalability from a stereo (or mono) core bitstream to multichannel and will thus play at least in stereo (or mono) on MPEG-4 AAC-only (respectively MPEG-4 HE-AAC or MPEG-1 Layer II) decoding devices.

AMR [3GPPAMR] (audio format label: AMR) and AMR-WB [3GPPAMRWB] (audio format label: AMR-WB) may be used for Mobile A/V Services. MPEG-4 AAC or HE-AAC [AAC] (audio format label: HEAAC), enhanced aacPlus [3GPPEAAC+] (audio format label: HEAAC2) and Extended AMR-WB [3GPPAMRWB+] (audio format label: AMR-WB+) may be used for Mobile A/V Services.

For audio-only services, the MPEG-1 Audio Layer III (MP3) codec [MPEG1] may also be used.

Profiles of audio are also used to provide audible notifications and audio clips within the Declarative [OIPF_DAE2] and Procedural Application Environments [OIPF_PAE2], as specified in section 8.2.1.

For voice and video telephony services the following audio media formats may be used:

If voice or video telephony services are supported on the OITF, encoding and decoding of G.711 and AMR for narrow-band and of G.722 and AMR-WB/G.722.2 for wide-band shall be supported.
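The requirement above can be illustrated with a minimal capability check. This is a sketch only: the function name and the set-based representation are assumptions made for illustration, and the labels are those given for these codecs in section 8.1.9.

```python
# Labels as listed in section 8.1.9; grouping them in a set is illustrative only.
MANDATORY_TELEPHONY_CODECS = frozenset({"G711", "AMR", "G722", "AMRWB"})


def missing_telephony_codecs(supported):
    """Return the mandatory telephony codec labels absent from `supported`.

    `supported` is any iterable of audio format labels for which the device
    offers both encoding and decoding (a hypothetical input format).
    """
    return sorted(MANDATORY_TELEPHONY_CODECS - set(supported))
```

An empty result indicates that all four mandatory codecs are present; optional codecs in the input are ignored.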

8.1 Formats

8.1.1 HE-AAC and AAC

AAC, HE-AAC and HE-AAC v2 audio coding shall be in accordance with [AAC], which contains the audio object types AAC LC, SBR and PS. Its use is constrained according to [TS101154] clause 6.4.

AAC and HE-AAC formats correspond to the audio format label HEAAC.

The HE-AAC v2 format corresponds to the audio format label HEAAC2.

8.1.1.1 A/V Content

HEAAC format audio for A/V content shall utilise Level 4 encoding as specified in [AAC].

If used in combination with MPEG Surround, HE-AAC format audio for A/V content shall utilise Level 2 encoding or Level 4 encoding as specified in [AAC].

8.1.1.2 Audio clips

HEAAC format audio for audible notifications and audio clip content shall utilise Level 2 encoding as specified in [AAC], consisting of a sequence of one or more audio frames, where each audio frame consists of an ADTS header paired with its audio frame data.
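The frame structure described above can be walked with a short parser. The following is a minimal sketch, not a conformance checker: the bit layout is that of the ADTS header as defined in the MPEG AAC specifications (it is not reproduced in this volume), and the function name and returned field names are illustrative assumptions.

```python
def iter_adts_frames(data: bytes):
    """Yield (offset, frame_length, fields) for each ADTS frame in `data`.

    Raises ValueError on a missing syncword, an implausible frame length,
    or trailing bytes that do not form a complete frame.
    """
    pos = 0
    while pos + 7 <= len(data):
        h = data[pos:pos + 7]
        # 12-bit syncword: all ones.
        if h[0] != 0xFF or (h[1] & 0xF0) != 0xF0:
            raise ValueError(f"no ADTS syncword at offset {pos}")
        protection_absent = h[1] & 0x01
        fields = {
            "profile": (h[2] >> 6) & 0x03,  # audio object type minus 1
            "sampling_frequency_index": (h[2] >> 2) & 0x0F,
            "channel_configuration": ((h[2] & 0x01) << 2) | (h[3] >> 6),
        }
        # 13-bit frame length, counting the header as well as the payload.
        frame_length = ((h[3] & 0x03) << 11) | (h[4] << 3) | (h[5] >> 5)
        header_size = 7 if protection_absent else 9  # 2 extra bytes of CRC
        if frame_length < header_size or pos + frame_length > len(data):
            raise ValueError(f"bad frame length at offset {pos}")
        yield pos, frame_length, fields
        pos += frame_length
    if pos != len(data):
        raise ValueError("trailing bytes after last complete frame")
```

Counting the yielded frames gives the length of the sequence; the check of the Level 2 constraints themselves is outside the scope of this sketch.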

8.1.1.3 HE-AAC Metadata

HEAAC format audio may contain metadata as specified in [AAC] or [TS101154], specifically:

  • Dynamic Range Control parameters as defined in [AAC] section 4.5.2.7 or [TS101154] section 6.4.3 and Annex C.5
  • Down-mix parameters as defined in [AAC] section 4.5.1.2.2 or [TS101154] Annex C.5.

The Dynamic Range Control metadata shall be used, if present in the encoded audio data.

For stereo output of 5.1 surround audio streams, the down-mix parameters shall be used, if present in the encoded audio data.
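To make the role of the down-mix parameters concrete, the sketch below applies a centre gain and a surround gain to one 5.1 sample frame. It is an assumption-laden illustration: the function and parameter names are hypothetical, the default gains of 1/sqrt(2) are placeholders standing in for whatever values the stream metadata carries, the LFE channel is simply omitted, and no normalisation or clipping protection is applied.

```python
import math


def downmix_51_to_stereo(frame, centre_gain=1 / math.sqrt(2),
                         surround_gain=1 / math.sqrt(2)):
    """Down-mix one 5.1 sample frame to a (left, right) stereo pair.

    `frame` is a dict with float samples under the keys
    "L", "R", "C", "LFE", "Ls" and "Rs" (a hypothetical layout).
    The gains would be taken from the down-mix metadata when present.
    """
    left = frame["L"] + centre_gain * frame["C"] + surround_gain * frame["Ls"]
    right = frame["R"] + centre_gain * frame["C"] + surround_gain * frame["Rs"]
    return left, right
```

A decoder following the rule above would pass the gains signalled in the encoded audio data instead of relying on the defaults.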

8.1.2 AC3

AC-3 audio coding shall be compliant with [AC3], constrained according to [TS101154] clause 6.2, with the following additional constraints:

  • AC-3 audio streams shall be encoded at a sample rate of 48 kHz

This format corresponds to the audio format label AC3.

8.1.3 Enhanced AC-3

Enhanced AC-3 audio coding shall be compliant with [AC3], constrained according to [TS101154] clause 6.2, with the following additional constraints:

  • Enhanced AC-3 audio streams shall be encoded at a sample rate of 48 kHz

This format corresponds to the audio format label E-AC3.

8.1.4 MPEG-1 Layer II

MPEG-1 Layer II audio coding shall be compliant with [MPEG1] constrained according to [TS101154] clause 6.1.

This format corresponds to the audio format label MPEG1_L2.

8.1.5 MPEG-1 Layer III

MPEG-1 Layer III audio coding shall only be used for audio-only services. It shall not be used in conjunction with a video stream to form an A/V service.

MPEG-1 Layer III encoding shall be compliant with [MPEG1], constrained according to [DLNAMEDIA]. Either of the MP3 and MP3X profiles from [DLNAMEDIA] may be used.

This format corresponds to the audio format label MPEG1_L3.

8.1.6 WAVE

Wave format (Audio Format: WAV) audio coding may be used for audible notifications and audio clips within the Declarative Application Environment [OIPF_DAE2]. The following characteristics shall be supported by the OITF.

  • Sampling Frequency: From 12 kHz up to 16 kHz
  • Codec(s): Uncompressed (PCM), ADPCM
  • Quantisation Bit Rate: 16 bits
  • Channels: From mono up to 5.1 channels

This format corresponds to the audio format label WAV.
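The characteristics listed above can be checked against a clip with the Python standard library. This is a minimal sketch under stated assumptions: the function name is hypothetical, "5.1 channels" is read as six channels, and because the `wave` module reads only uncompressed PCM, an ADPCM clip would raise `wave.Error` rather than be inspected.

```python
import io
import wave


def check_wav_clip(data: bytes):
    """Return a list of deviations from the WAV characteristics listed above.

    Only uncompressed PCM can be opened here; ADPCM input raises wave.Error.
    """
    with wave.open(io.BytesIO(data), "rb") as w:
        rate = w.getframerate()
        bits = 8 * w.getsampwidth()
        channels = w.getnchannels()

    problems = []
    if not 12_000 <= rate <= 16_000:
        problems.append(f"sampling frequency {rate} Hz outside 12-16 kHz")
    if bits != 16:
        problems.append(f"{bits}-bit samples, expected 16-bit")
    if not 1 <= channels <= 6:  # assumption: 5.1 counted as six channels
        problems.append(f"{channels} channels outside mono to 5.1")
    return problems
```

An empty list means the clip falls within the listed ranges.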

8.1.7 DTS-HD

DTS-HD is an expansion on the original DTS Coherent Acoustics definition. DTS-HD maintains support for Coherent Acoustics and extends the range of capabilities, which have been commercialized as:

  • DTS Express®
  • DTS-HD High Resolution Audio®
  • DTS-HD Master Audio®

This is in addition to the original DTS family, which was commercialized as:

  • DTS®
  • DTS-ES®
  • DTS 96/24®

Using the DTS-HD audio descriptor, as defined in [DVBSI], and transport requirements as defined in [TS101154], support for DTS-HD is seamless across the variations.

DTS-HD format (Audio Format: DTS) audio coding shall be compliant with [DTS] and according to [TS101154] section 6.3.

  • Usage of DTS-HD in ISOBMFF is defined in Annex E of [DTS].

8.1.8 MPEG Surround

MPEG Surround shall be compliant with [MPS] and shall be used in combination with MPEG-4 AAC or HE-AAC constrained according to section 8.1.1 or in combination with MPEG-1 Layer II constrained according to section 8.1.4. Its use is further constrained according to [TS101154] clause 6.1 and clause 6.4 and the following:

  • Sampling frequency
    • Encoding: For audio encoded using MPEG Surround, the sampling frequency of the MPEG Surround data shall be equal to the sampling frequency of the core audio stream.

The combination of MPEG Surround and MPEG-4 AAC or HE-AAC corresponds to the audio format label HEAAC_MPS.

The combination of MPEG Surround and MPEG-1 Layer II corresponds to the audio format label MPEG1_L2_MPS.
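The label assignment and the sampling frequency constraint of this section can be summarised in a few lines. The sketch below is illustrative only; the function name and its arguments are assumptions, while the labels are those given above.

```python
# Core audio format label -> label of the combination with MPEG Surround.
_MPS_LABELS = {"HEAAC": "HEAAC_MPS", "MPEG1_L2": "MPEG1_L2_MPS"}


def mps_format_label(core_label, core_rate_hz, mps_rate_hz):
    """Return the audio format label for a core stream carrying MPEG Surround.

    Raises ValueError if the core format cannot be combined with MPEG
    Surround, or if the two sampling frequencies differ.
    """
    if core_label not in _MPS_LABELS:
        raise ValueError(f"{core_label} cannot carry MPEG Surround")
    if core_rate_hz != mps_rate_hz:
        raise ValueError("MPEG Surround and core sampling frequencies differ")
    return _MPS_LABELS[core_label]
```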

8.1.9 Audio Formats for voice and video telephony

G.711 audio coding shall be compliant with [G711] according to [TS181005] clause 6.2; this format corresponds to the audio format label G711.

AMR audio coding shall be compliant with [3GPPAMR] according to [TS126114] clause 5.2.1 and [TS181005] clause 6.2; this format corresponds to the audio format label AMR.

G.729 audio coding shall be compliant with [G729] according to [TS181005] clause 6.2; this format corresponds to the audio format label G729A.

G.722 audio coding shall be compliant with [G722] according to [TS181005] clause 6.3.2; this format corresponds to the audio format label G722.

AMR-WB/G.722.2 audio coding shall be compliant with [3GPPAMRWB] according to [TS181005] clause 6.3.2; this format corresponds to the audio format label AMRWB.

G.729.1 audio coding shall be compliant with [G729-1] according to [TS181005] clause 6.3.2; this format corresponds to the audio format label G7291.

G.719 audio coding shall be compliant with [G719]; this format corresponds to the audio format label G719.

8.1.9.1 MPEG-4 AAC LD and ELD

MPEG-4 AAC LD audio coding shall be compliant with the Low Delay AAC Profile as defined in clause 1.5.2.1 of [AAC] (profile-level-id=52 as defined in clause 1.5.2.4, table 1.4 of [AAC]); this format corresponds to the audio format label AACLD.

MPEG-4 AAC ELD audio coding shall be compliant with AAC Enhanced Low Delay as defined in clause 1.5.1, Table 1.1 of [AAC] and described in clause 1.5.1.2.37 of [AAC] (audio object type = "ER AAC ELD", object type ID = 39); this format corresponds to the audio format label AACELD.

8.1.10 Audio Formats for Mobile A/V Services

For Mobile A/V Services the following applies:

  • AMR content shall be compliant with [3GPPAMR]. This format corresponds to label AMR.
  • AMR-WB content shall be compliant with [3GPPAMRWB]. This format corresponds to label AMR-WB.
  • AAC and HE-AAC content shall be compliant with [3GPPEAAC+]. This format corresponds to label HEAAC.
  • Enhanced aacPlus content shall be compliant with [3GPPEAAC+]. This format corresponds to label HEAAC2.
  • Extended AMR-WB shall be compliant with [3GPPAMRWB+]. This format corresponds to label AMR-WB+.

8.2 Platform Usage

8.2.1 Audible Notifications and Audio Clips

IPTV Service Providers may utilize the following audio formats for audible notifications and audio clips within either declarative or procedural applications used to provide services, as specified in [OIPF_DAE2] and [OIPF_PAE2]:

  • AAC formatted files with a maximum file size of 512KB identified with the MIME type "audio/mp4",
  • WAV formatted files with a maximum file size of 512KB identified with the MIME type "audio/x-wav" (DAE only),
  • MPEG1_L3 formatted files identified with the MIME type "audio/mpeg" (PAE only).
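The list above amounts to a small rule table keyed by MIME type. The following sketch shows one way an application host might apply it; the names are hypothetical, and "512KB" is read here as 512 × 1024 bytes, which is an assumption since the unit is not defined more precisely.

```python
MAX_CLIP_BYTES = 512 * 1024  # assumption: "512KB" interpreted as 512 KiB

# MIME type -> (environments in which it may be used, size limit or None)
CLIP_RULES = {
    "audio/mp4": ({"DAE", "PAE"}, MAX_CLIP_BYTES),
    "audio/x-wav": ({"DAE"}, MAX_CLIP_BYTES),
    "audio/mpeg": ({"PAE"}, None),  # no maximum file size is stated
}


def clip_allowed(mime_type, size_bytes, environment):
    """Return True if a clip of this type and size may be used in `environment`."""
    rule = CLIP_RULES.get(mime_type)
    if rule is None:
        return False
    environments, limit = rule
    if environment not in environments:
        return False
    return limit is None or size_bytes <= limit
```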

8.2.2 Audio Description

If audio description is provided for the service, it shall be provided either as a pre-mixed combination of audio description and the main audio in a suitably signalled HE-AAC stream, or according to Annex E of [TS101154]. In either case, the HE-AAC, MPEG-1 Audio Layer II or Enhanced AC-3 audio codec may be used (the latter two codecs only when supported).

However, if the optional MPEG-1 Audio Layer II codec is supported, then the method for Audio Description defined in Annex E of [TS101154] may be applied.

If the service platform requires the deployment of any other of the optional audio codecs for A/V services, then that optional codec may also be used to provide audio description as a pre-mixed combination of audio description and the main audio as a suitably signalled stream.

8.2.3 Clean Audio

Clean Audio is a supplementary audio service that enhances the listening experience for the hearing impaired. If Clean Audio is provided for the IPTV service then it shall be provided as specified in [TS101154] Annex E.4.

8.2.4 Audio output interfaces

For stereo output interfaces, 5.1 surround audio streams shall be down-mixed to stereo.

For digital outputs (e.g. S/PDIF or HDMI) one of the following conversions may be used:

  • Conversion of the received Enhanced AC-3 audio streams to AC-3 [AC3]
  • Transcoding of the received HEAAC, HEAAC_MPS or MPEG1_L2_MPS audio streams to the AC3 [AC3] or DTS-HD [DTS] formats
  • Decoding of the received DTS, HEAAC, HEAAC_MPS or MPEG1_L2_MPS audio streams and output of PCM multi-channel over HDMI
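The conversions listed above can be expressed as a lookup from the received audio format label to the permitted options. This is a sketch for illustration: the table and function names are assumptions, and the option descriptions are paraphrases of the list items.

```python
# Conversion option -> audio format labels to which it applies.
DIGITAL_OUTPUT_CONVERSIONS = {
    "convert to AC3": {"E-AC3"},
    "transcode to AC3 or DTS": {"HEAAC", "HEAAC_MPS", "MPEG1_L2_MPS"},
    "decode to multi-channel PCM over HDMI": {
        "DTS", "HEAAC", "HEAAC_MPS", "MPEG1_L2_MPS",
    },
}


def digital_output_options(received_label):
    """Return the conversion options listed for a received audio format."""
    return [
        option
        for option, labels in DIGITAL_OUTPUT_CONVERSIONS.items()
        if received_label in labels
    ]
```

An empty list means only that none of the listed conversions names that format; it does not imply that the stream cannot be output.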

9. Still Pictures and Graphics

9.1 Formats

Still pictures and graphics content are used within both the Declarative (DAE) and the Procedural Application Environments (PAE).

The usage of still pictures and graphics formats within declarative applications is specified in [OIPF_DAE2]. The formats adopted in the DAE are defined in [CEA-2014-A].

The usage of still pictures and graphics formats within procedural applications is specified in [OIPF_PAE2]. The formats adopted in the PAE are defined in [GEM].

The present volume only notes the labels applied to the formats used: JPEG [JFIF], GIF [GIF] and PNG [PNG].

9.1.1 JPEG

This format corresponds to the graphics format label JPEG.

The MIME type "image/jpeg" shall be used for compliant JPEG images.

9.1.2 GIF

This format corresponds to the graphics format label GIF.

The MIME type "image/gif" shall be used for compliant GIF images.

9.1.3 PNG

This format corresponds to the graphics format label PNG.

The MIME type "image/png" shall be used for compliant PNG images.
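The three graphics format labels and their MIME types can be kept in a single table. The sketch below is illustrative; the dictionary and function names are assumptions.

```python
# Graphics format label -> MIME type, as stated in sections 9.1.1 to 9.1.3.
GRAPHICS_MIME_TYPES = {
    "JPEG": "image/jpeg",
    "GIF": "image/gif",
    "PNG": "image/png",
}


def graphics_label_for(mime_type):
    """Return the graphics format label for a MIME type, or None if unlisted."""
    for label, listed_type in GRAPHICS_MIME_TYPES.items():
        if listed_type == mime_type.strip().lower():
            return label
    return None
```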