HEVC (H.265) WebCodecs Registration

W3C Group Draft Note,

More details about this document
This version:
https://www.w3.org/TR/2023/DNOTE-webcodecs-hevc-codec-registration-20230104/
Latest published version:
https://www.w3.org/TR/webcodecs-hevc-codec-registration/
Editor's Draft:
https://w3c.github.io/webcodecs/hevc_codec_registration.html
Previous Versions:
History:
https://www.w3.org/standards/history/webcodecs-hevc-codec-registration
Feedback:
GitHub
Editors:
Paul Adenot (Mozilla)
Bernard Aboba (Microsoft Corporation)
Former Editor:
Chris Cunningham (Google Inc.)
Participate:
Git Repository.
File an issue.
Version History:
https://github.com/w3c/webcodecs/commits

Abstract

This registration is entered into the [webcodecs-codec-registry]. It describes, for HEVC (H.265), the (1) fully qualified codec strings, (2) the codec-specific EncodedVideoChunk [[internal data]] bytes, (3) the VideoDecoderConfig.description bytes, (4) the values of EncodedVideoChunk [[type]], and (5) the codec-specific extensions to VideoEncoderConfig

The registration is not intended to include any information on whether a codec format is encumbered by intellectual property claims. Implementers and authors are advised to seek appropriate legal counsel in this matter if they intend to implement or use a specific codec format. Implementers of WebCodecs are not required to support the HEVC / H.265 codec.

This registration is non-normative.

Status of this document

This section describes the status of this document at the time of its publication. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at https://www.w3.org/TR/.

Feedback and comments on this specification are welcome. GitHub Issues are preferred for discussion on this specification. Alternatively, you can send comments to the Media Working Group’s mailing-list, public-media-wg@w3.org (archives). This draft highlights some of the pending issues that are still to be discussed in the working group. No decision has been taken on the outcome of these issues including whether they are valid.

This document was published by the Media Working Group as a Group Draft Note using the Note track.

Group Draft Notes are not endorsed by W3C nor its Members.

This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.

The W3C Patent Policy does not carry any licensing requirements or commitments on this document.

This document is governed by the 2 November 2021 W3C Process Document.

1. Fully qualified codec strings

The codec string begins with the prefix "hev1." or "hvc1.", with a suffix of four dot-separated fields as described in Section E.3 of [iso14496-15].

2. EncodedVideoChunk data

EncodedVideoChunk [[internal data]] is expected to be an access unit as defined in [ITU-T-REC-H.265] section 7.4.2.4.

NOTE: An access unit contains exactly one base layer coded picture.

If the bitstream is in hevc format, [[internal data]] is assumed to be in canonical format, as defined in [iso14496-15] section 8.3.2.

If the bitstream is in annexb format, [[internal data]] is assumed to be in Annex B format, as defined in [ITU-T-REC-H.265] Annex B.

NOTE: Since [[internal data]] is inherently byte-aligned, implementations are not required to recover byte-alignment.

3. VideoDecoderConfig description

If the VideoDecoderConfig.description is present, it is assumed to be an HEVCDecoderConfigurationRecord, as defined by [iso14496-15], section 8.3.3.1, and the bitstream is assumed to be in hevc format.

NOTE: This format is commonly used in .MP4 files, where the player generally has random access to the media data.

If the VideoDecoderConfig.description is not present, the bitstream is assumed to be in annexb format.

NOTE: "annexb" format is described in greater detail by [ITU-T-REC-H.265], Annex B. This format is commonly used in live-streaming applications, where including the VPS, SPS, and PPS data periodically allows users to easily start from the middle of the stream.

4. EncodedVideoChunk type

If an EncodedVideoChunk's [[type]] is key, and the bitstream is in hevc format, then the EncodedVideoChunk is expected to contain a base layer primary coded picture that is an instantaneous decoding refresh (IDR), clean random access (CRA), or broken link access (BLA) picture.

NOTE: If the bitstream is in hevc format, parameter sets necessary for decoding are included in VideoDecoderConfig.description.

If an EncodedVideoChunk's [[type]] is key, and the bitstream is in annexb format, then the EncodedVideoChunk is expected to contain both a base layer coded picture that is an instantaneous decoding refresh (IDR), clean random access (CRA), or broken link access (BLA) picture, and all parameter sets necessary to decode all video data NAL units in the EncodedVideoChunk.

5. VideoEncoderConfig extensions

partial dictionary VideoEncoderConfig {
  HevcEncoderConfig hevc;
};

hevc, of type HevcEncoderConfig
Contains codec specific configuration options for the HEVC (H.265) codec.

5.1. HevcEncoderConfig

dictionary HevcEncoderConfig {
  HevcBitstreamFormat format = "hevc";
};

format, of type HevcBitstreamFormat, defaulting to "hevc"
Configures the format of output EncodedVideoChunks. See HevcBitstreamFormat.

5.2. HevcBitstreamFormat

enum HevcBitstreamFormat {
  "annexb",
  "hevc",
};

The HevcBitstreamFormat determines the location of HEVC parameter sets, and mechanisms for packaging the bitstream.

annexb
Parameter sets are included periodically throughout the bitstream.

NOTE: This format is described in greater detail by [ITU-T-REC-H.265], Annex B. This format is commonly used in live-streaming applications, where including the parameter set data periodically allows users to easily start from the middle of the stream.

hevc
Parameter sets are not included in the bitstream and are instead emitted via the [[output callback]] as the VideoDecoderConfig.description of the EncodedVideoChunkMetadata.decoderConfig.

NOTE: This format is described in greater detail by [iso14496-15], section 8.3. This format is commonly used in .MP4 files, where the player generally has random access to the media data.

6. Privacy Considerations

Please refer to the section Privacy Considerations in [WEBCODECS].

7. Security Considerations

Please refer to the section Security Considerations in [WEBCODECS].

Conformance

Document conventions

Conformance requirements are expressed with a combination of descriptive assertions and RFC 2119 terminology. The key words “MUST”, “MUST NOT”, “REQUIRED”, “SHALL”, “SHALL NOT”, “SHOULD”, “SHOULD NOT”, “RECOMMENDED”, “MAY”, and “OPTIONAL” in the normative parts of this document are to be interpreted as described in RFC 2119. However, for readability, these words do not appear in all uppercase letters in this specification.

All of the text of this specification is normative except sections explicitly marked as non-normative, examples, and notes. [RFC2119]

Examples in this specification are introduced with the words “for example” or are set apart from the normative text with class="example", like this:

This is an example of an informative example.

Informative notes begin with the word “Note” and are set apart from the normative text with class="note", like this:

Note, this is an informative note.

Conformant Algorithms

Requirements phrased in the imperative as part of algorithms (such as "strip any leading space characters" or "return false and abort these steps") are to be interpreted with the meaning of the key word ("must", "should", "may", etc) used in introducing the algorithm.

Conformance requirements phrased as algorithms or specific steps can be implemented in any manner, so long as the end result is equivalent. In particular, the algorithms defined in this specification are intended to be easy to understand and are not intended to be performant. Implementers are encouraged to optimize.

Index

Terms defined by this specification

Terms defined by reference

References

Normative References

[RFC2119]
S. Bradner. Key words for use in RFCs to Indicate Requirement Levels. March 1997. Best Current Practice. URL: https://datatracker.ietf.org/doc/html/rfc2119
[WEBCODECS]
Paul Adenot; Bernard Aboba. WebCodecs. 12 December 2022. WD. URL: https://www.w3.org/TR/webcodecs/

Informative References

[ISO14496-15]
ISO/IEC 14496-15:2019 Information technology — Coding of audio-visual objects — Part 15: Carriage of network abstraction layer (NAL) unit structured video in the ISO base media file format. September 2019. URL: https://www.iso.org/standard/74429.html
[ITU-T-REC-H.265]
H.265 : High efficiency video coding. August 2021. URL: https://www.itu.int/rec/T-REC-H.265
[WEBCODECS-CODEC-REGISTRY]
Chris Cunningham; Paul Adenot; Bernard Aboba. WebCodecs Codec Registry. 10 October 2022. NOTE. URL: https://www.w3.org/TR/webcodecs-codec-registry/

IDL Index

partial dictionary VideoEncoderConfig {
  HevcEncoderConfig hevc;
};


dictionary HevcEncoderConfig {
  HevcBitstreamFormat format = "hevc";
};


enum HevcBitstreamFormat {
  "annexb",
  "hevc",
};