W3C Candidate Recommendation Draft
Copyright © 2022 W3C® (MIT, ERCIM, Keio, Beihang). W3C liability, trademark and permissive document license rules apply.
        This specification defines an API extending the
        HTMLMediaElement
This section describes the status of this document at the time of its publication. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at https://www.w3.org/TR/.
This document was published by the Second Screen Working Group as a Candidate Recommendation Draft using the Recommendation track.
This document builds on the group's experience on presenting web content on external presentation-type displays, and re-uses patterns and design considerations from the Presentation API specification whenever appropriate [PRESENTATION-API].
Although this document is still a work in progress and is subject to change, the Working Group believes that the API surface is stable. Most of the remaining issues listed on the issue tracker are considered minor at this stage except for Issue #41.
Issue #41 discusses the set of media playback features that remote playback devices are expected to support. The group will seek further developer feedback and implementation experience to identify any interoperability issues around these features when used during remote playback, and will further clarify the specification based on feedback received.
For other issues or concerns, it is possible to file a bug or send an email to the mailing list. For small editorial changes like typos, sending a pull request is appreciated.
The Working Group invites everyone to review this document, and will work with relevant groups at W3C to conduct horizontal reviews on accessibility, internationalization, privacy, security and technical architecture principles.
No feature has been identified as being at risk.
The Second Screen Working Group will develop a test suite for the Remote Playback API during the Candidate Recommendation period and prepare an implementation report. For this specification to advance to Proposed Recommendation, two independent, interoperable implementations of each feature must be demonstrated, as detailed in the Candidate Recommendation exit criteria section.
Publication as a Candidate Recommendation does not imply endorsement by W3C and its Members. A Candidate Recommendation Draft integrates changes from the previous Candidate Recommendation that the Working Group intends to include in a subsequent Candidate Recommendation Snapshot.
This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.
This document was produced by a group operating under the W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
This document is governed by the 2 November 2021 W3C Process Document.
As well as sections marked as non-normative, all authoring guidelines, diagrams, examples, and notes in this specification are non-normative. Everything else in this specification is normative.
The key words MAY, MUST, MUST NOT, RECOMMENDED, and SHOULD in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.
This specification defines conformance criteria that apply to a single product: the user agent that implements the interfaces that it contains.
Implementations that use ECMAScript to expose the APIs defined in this specification MUST implement them in a manner consistent with the ECMAScript Bindings defined in the Web IDL specification [WEBIDL].
This section is non-normative.
This specification aims to make remote playback devices such as connected TVs, projectors or audio-only speakers, available to the Web and takes into account playback devices that are attached using wired (HDMI, DVI, or similar) and wireless technologies (Miracast, Chromecast, DLNA, AirPlay, or similar).
Devices with limited screen size or quiet speakers lack the ability to playback media content to a larger audience, for example, a group of colleagues in a conference room, or friends and family at home. Playing media content on an external larger and/or louder remote playback device helps to improve the perceived quality and impact of the played media.
At its core, this specification enables a page that acts as the browsing context to initiate and control remote playback of a particular media element on a selected remote playback device. How the remoting is initiated and controlled is left to the UA in order to allow the use of remote playback devices that can be attached in a wide variety of ways. For example, when a remote playback device is attached using HDMI or Miracast, the same UA that acts as the browsing context renders the remote media. Instead of playing the media on the same device, however, it can use whatever means the operating system provides for using the external remote playback device. In such a case, both the browsing context and the media player run on the same UA and the operating system is used to route the player output to the remote playback device. This is commonly referred to as the media mirroring case. This specification imposes no requirements on the remote playback devices connected in such a manner.
If the remote playback device is able to play the media and communicate with the browsing context but is unable to fetch the media, the browsing context needs to fetch the media data and pass it on to the remote playback device for rendering. This is commonly referred to as media remoting case.
If the remote playback device is able to fetch and play the media and communicate with the browsing context, the browsing context does not need to fetch or render the remoted media. In this case, the UA acts as a proxy that requests the remote playback device to play the media itself by passing the necessary data like the media source. This is commonly referred to as the media flinging case. This way of attaching to displays could be enhanced in the future by defining a standard protocol for delivering these types of messages that remote playback devices could choose to implement.
The API defined here is intended to be used with UAs that attach to remote playback device devices through any of the above means.
This section is non-normative.
The use cases and requirements of this specification are captured in a separate document available here.
        This section shows code examples that highlight the usage of the main
        features of the Remote Playback API. In these examples,
        player.html implements the player page controlling the
        remote playback and media.ext is the media file to be
        played remotely. Both the page and the media are served from the domain
        https://example.org. Please refer to the comments in the
        code examples for further details.
      
<!-- player.html -->
<!-- The video element with custom controls that supports remote playback. -->
<video id="videoElement" src="https://example.org/media.ext" />
<button id="deviceBtn" style="display: none;">Pick device</button>
<script>
  // The "Pick device" button is visible if at least one remote playback device is available.
  const deviceBtn = document.getElementById("deviceBtn");
  const videoElem = document.getElementById("videoElement");
  function availabilityCallback(available) {
    // Show or hide the device picker button depending on device availability.
    deviceBtn.style.display = available ? "inline" : "none";
  }
  videoElem.remote.watchAvailability(availabilityCallback).catch(() => {
    // Availability monitoring is not supported by the platform, so discovery of
    // remote playback devices will happen only after remote.prompt() is called.
    // Pretend the devices are available for simplicity; or, one could implement
    // a third state for the button.
    deviceBtn.style.display = "inline";
  });
</script><!-- player.html -->
<script>
  devicesBtn.onclick = () => {
    // Request the user to select a remote playback device.
    videoElem.remote.prompt()
      // Update the UI and monitor the connected state.
      .then(updateRemotePlaybackState);
      // Otherwise, the user cancelled the selection UI or no screens were found.
  };
<script><!-- player.html -->
<script>
  // The remote playback may be initiated by the user agent,
  // so check the initial state to sync the UI with it.
  if (videoElem.remote.state == "disconnected")
    switchToLocalUI();
  else
    switchToRemoteUI();
  videoElem.remote.onconnecting = switchToRemoteUI;
  videoElem.remote.onconnect = swithToRemoteUI;
  videoElem.remote.ondisconnect = switchToLocalUI;
  // Handles both 'connecting' and 'connected' state. Calling more than once
  // is a no-op.
  function switchToRemoteUI() {
    // Indicate that the state is 'connecting' or 'connected' to the user.
    // For example, hide the video element as only controls are needed.
    videoElem.style.display = "none";
    // Stop monitoring the availability of remote playback devices.
    videoElem.remote.cancelWatchAvailability();
  };
  function switchToLocalUI() {
    // Show the video element.
    videoElem.style.display = "inline";
    // Start watching the device availability again.
    videoElem.remote.watchAvailability(availabilityCallback);
  };
<script>A local playback device is the device the browsing context is running on along with the default video/audio outputs the device has.
A remote playback device is any other device but the local playback device that the browsing context can use to play media on.
A media element state is the set of all single media element properties observable by the page and/or the user via the user agent implementation. The new properties introduced by this spec are not considered part of the media element state for convenience.
paused attribute or the pause/resume
          button reflecting that state on the default controls of the media
          element would be a part of media element state.
        A local playback state is the user agent implementation of media element state for the particular media element for playback on the local playback device.
A remote playback state is the user agent implementation of media element state for the particular media element for playback on the certain remote playback device.
          For a good user experience it is important that the media element
          state doesn't change unexpectedly when the
          state changes. It
          is also important that remote playback state is in sync with
          the media element state so when the media is paused on the
          remote playback device it looks paused to both the user and
          the page.
        
The task source for the tasks mentioned in this specification is the media element event task source.
RemotePlayback interface
        WebIDL[Exposed=Window]
interface RemotePlayback : EventTarget {
  Promise<long> watchAvailability(RemotePlaybackAvailabilityCallback callback);
  Promise<undefined> cancelWatchAvailability(optional long id);
  readonly attribute RemotePlaybackState state;
  attribute EventHandler onconnecting;
  attribute EventHandler onconnect;
  attribute EventHandler ondisconnect;
  Promise<undefined> prompt();
};
enum RemotePlaybackState {
  "connecting",
  "connected",
  "disconnected"
};
callback RemotePlaybackAvailabilityCallback = undefined(boolean available);
        
          A RemotePlayback object is an interface allowing the page
          to detect availability of, connect to and control playback on
          remote playback devices.
        
          A RemotePlaybackState enumeration represents the state of
          connection to some remote playback device.
        
          A RemotePlaybackAvailabilityCallback represents the
          remote playback device availability information.
        
            A RemotePlaybackAvailabilityCallback is the way
            for the page to obtain the remote playback device
            availability for the corresponding media element. If
            the user agent can monitor the list of available remote playback
            devices in the background (without a pending request to
            prompt()), the RemotePlaybackAvailabilityCallback behavior
            defined below MUST be implemented by the user agent. Otherwise, the
            promise returned by watchAvailability() MUST be rejected
            with NotSupportedError.
          
              The user agent MUST keep track of the set of
              availability callbacks registered with each media
              element through the watchAvailability() method. The
              set of availability callbacks for each
              RemotePlayback object is represented as a set of tuples
              (callbackId, callback),
              initially empty, where:
            
watchAvailability()
                in a given browsing context;
              RemotePlaybackAvailabilityCallback object.
              
              Since there's one and only one RemotePlayback object per
              each media element, set of availability callbacks
              of a media element is the same set as the set of
              availability callbacks of the RemotePlayback object
              referred to by the element's remote property.
            
              The combined set of all sets of availability
              callbacks of all RemotePlayback objects known to the
              browsing context is referred to as global set of
              availability callbacks.
            
The user agent MUST keep a list of available remote playback devices. This list contains remote playback devices and is populated based on an implementation specific discovery mechanism. It is set to the most recent result of the algorithm to monitor the list of available remote playback devices or an empty list if the algorithm hasn't been run yet.
              The user agent MAY not support running the algorithm to
              monitor the list of available remote playback devices
              continuously, for example, because of platform or power
              consumption restrictions. In this case the promise returned by
              watchAvailability() MUST be rejected
              with NotSupportedError, the global set of availability
              callbacks will be empty and the algorithm to monitor the
              list of available remote playback devices will only run as
              part of the initiate remote playback algorithm.
            
When the global set of availability callbacks is not empty, the user agent MUST monitor the list of available remote playback devices continuously, so that pages can keep track of the last value received via the registered callbacks to offer remote playback only when there are available devices.
The user agent is expected not to monitor the list of available remote playback devices when possible, to satisfy the power saving non-functional requirement. For example, the user agent might choose not to run the monitoring algorithm when the global set of availability callbacks is empty, when every page that has media elements with non-empty set of availability callbacks is in the background.
Some remote playback devices may only be able to play a subset of media resources because of functional, security or hardware limitations. Examples are set-top boxes, smart TVs or networked speakers capable of rendering only certain formats of video and/or audio. We say that such a device is a compatible remote playback device for a media resource if the user agent can reasonably guarantee that the remote playback of the media specified by the resource will succeed on that device.
              The user agent can use the srclangtrack
The media resources of a media element, that were considered by the user agent to find a compatible remote playback device, are called the availability sources set.
The media resource of a media element that is used to initiate remote playback on the selected remote playback device is called the remote playback source. A remote playback source MUST belong to the media element's availability sources set.
The mechanism to choose the remote playback source from the availability sources set is implementation-specific, but the user agent SHOULD consider every resource in the availability sources set as a potential remote playback source.
HTMLMediaElement's 
              resource selection algorithm.  However, if media
              remoting or media flinging is used, the best media
              source can depend on the selected remote playback device
              fetch and playback capabilities.
            If the user agent cannot determine a remote playback source appropriate for the remote playback device, it is RECOMMENDED that the user agent send metadata (for example, the extended MIME type) about all resources in the availability sources set to the remote playback device so it can run its own resource selection algorithm and choose the remote playback source.
              Remote playback is said to be unavailable for the
              media element if the list of available remote playback
              devices is empty or none of them is compatible with any source
              from availability sources set for the media
              element. The remote playback is said to be
              available otherwise. A boolean set to
              false if the remote playback is unavailable
              for the media element or true if it is
              available is called availability for the
              media element.
            
              If the user agent stops
              
              monitoring the list of available remote playback devices
              (for example by a user control or for power saving), it SHOULD
              invoke all callbacks in the global set of availability
              callbacks with false so that pages can update
              their user experience appropriately.  It SHOULD also set
              the availability value for all media elements
              to false so that availability information can
              be propagated correctly if the user agent later resumes
              
              monitoring the list of available remote playback devices.
            
              When the watchAvailability()
              method is called, the user agent MUST run the following steps:
            
Promise.
              Promise.
              disableRemotePlayback
                attribute is present for the media element, reject the promise
                with InvalidStateError and abort all the remaining steps.
              false as its argument.
                  NotSupportedError exception.
                  A simple algorithm for assigning callbackId values is to keep a counter for each browsing context and incrementing it in step 6.
If the set of availability callbacks is non-empty, or there is a pending request to initiate remote playback, the user agent MUST monitor the list of available remote playback devices by running the following steps:
disableRemotePlayback
                    attribute is present for mediaElement, abort all
                    the remaining steps for this tuple and continue to the next one.
                  
              When a cancelWatchAvailability
              () method is called, the user agent MUST run the
              following steps:
            
Promise.
              Promise.
              disableRemotePlayback attribute is present
                for the media element, reject promise with
                InvalidStateError and abort all the remaining steps.
              undefined,
                clear the set of availability callbacks.
              NotFoundError and abort all the remaining steps.
              
            When the prompt() method is called, the
            user agent MUST run the following steps:
          
Promise.
            Promise.
            disableRemotePlayback attribute is present
              for the media element, reject the promise with
              InvalidStateError and abort all the remaining steps.
            prompt for the same media element or even for
              the same browsing context, the user agent MAY reject
              promise with an OperationError exception and abort all
              remaining steps.
              prompt() would not be able to show any UI.
              InvalidAccessError exception and abort these steps.
            state or the
              list of available remote playback devices),
              reject promise with a NotSupportedError and abort
              all remaining steps.
              URL that can be passed to a
                remote playback device.
              NotFoundError exception and
              abort all remaining steps.
            state of the remote object
                  to connecting.
                connecting at the remote property of the media
                  element. The event must not bubble, must not be
                  cancelable, and has no default action.
                By picking a remote playback device the user grants permission to use the device.
NotAllowedError
              exception and hide the UI shown by the user agent.
            state attribute
          
            The state
            attribute represents the RemotePlayback connection's current
            state. It can take one of the values of RemotePlaybackState
            depending on the connection state:
          
connecting means that the user agent is attempting to
              initiate remote playback with the selected remote
              playback device. This is the initial state when the
              promise returned by prompt() is fulfilled. The local
              playback of the media element continues in this state and media
              commands still take effect on the local playback state.
            connected means that the transition from local to
              remote playback has finished and all media commands now take
              effect on the remote playback state.
            disconnected means that the remote playback has not
              been initiated, has
              failed to initiate or has been stopped. All media commands will
              take effect on the local playback state. The remote
              playback can be initiated through a call to prompt().
            When the user agent is to establish a connection with the remote playback device, it MUST run the following steps:
RemotePlayback object that is to be
              connected.
            state of remote is not equal to
              connecting,
              abort all the remaining steps.
            state of remote to
                  connected.
                connect at remote.
                disconnected.
                disconnect at remote.
                
            The user agent SHOULD pause local audio and video output of the
            media element while the remote playback state
            is connected.
          
            If the user agent is 
            exposing a user interface to the user for the media element
            (i.e., using default controls), the user agent SHOULD convey the
            fact that the remote playback state is connected through an icon or other means.
          
Accept-Language header it sends to fetch media resources.
          connected
            state and its content is being rendered on a remote playback
              device.
          A user agent MAY support connecting to a remote playback device from the browser. This can be done by adding appropriate media controls to the user interface that is exposed to the user, or when the user activates system-wide mirroring of the display. This feature is known as browser initiated remote playback. A user agent that supports browser initiated remote playback SHOULD initiate the remote playback only when the user has expressed an intention to do so via a user gesture, for example by clicking a button in the browser.
            If the user agent supports browser initiated remote playback,
            the state attribute MUST reflect the current state of the
            connection to the remote playback device.  When the browser
            initiates or terminates remote playback, it MUST fire the corresponding
            events by following the algorithms to establish a connection
            with the remote playback device and
            disconnect from a remote playback device.
          
            If the browser will
            initiate remote playback on a newly created media element,
            it SHOULD initialize the value of its state attribute to
            connecting
            and then follow the steps to establish a connection
            with the remote playback device.
          
            The HTMLMediaElement interface interacts with the remotely
            played media as soon as the connection with the remote playback
            device is established.
          
            When the state of
            a RemotePlayback object
            is connected, the
            following conditions relate the local playback state,
            the media element state, and the remote playback
            state:
          
HTMLMediaElement object to the remote playback
              device in order to change its remote playback state;
            If sending any command fails, the user agent MAY disconnect from a remote playback device.
              The remote playback device can implement a subset
              of the capabilities of the playback engine of the user
              agent, and some HTMLMediaElement APIs do not always
              make sense to use during remote playback. In this case, the
              local playback state is expected to reflect as closely
              as possible the actual remote playback state after a
              media command that is not supported during remote playback.
            
              For example, after calling fastSeek()seekingHTMLMediaElement is expected to remain false and no
              seeking event is to be fired.
            
When the user agent is to disconnect from a remote playback device, it MUST do the following:
RemotePlayback object
              representing the playback to be stopped.
            state
              of remote is disconnected, abort all remaining steps.
            state to disconnected.
                disconnect at remote.
                
            If the remote playback device is abruptly disconnected during
            playback (for example, by power loss or a network disconnection),
            the user agent SHOULD run the steps to monitor the list of
            available remote playback devices before the steps to
            disconnect from a remote playback device.  This allows
            callbacks in the set of availability callbacks to be invoked
            before the disconnect event is fired, so the page can update
            itself to show resumption of playback is not possible.
          
disconnected state.
          
            The following are the event handlers (and their corresponding
            event handler event types) that must be supported, as
            event handler IDL attributes, by objects implementing the
            RemotePlayback interface:
          
| Event handler | Event handler event type | 
|---|---|
| onconnecting | connecting | 
| onconnect | connect | 
| ondisconnect | disconnect | 
HTMLMediaElementWebIDLpartial interface HTMLMediaElement {
  [SameObject] readonly attribute RemotePlayback remote;
  [CEReactions] attribute boolean disableRemotePlayback;
};
        
          The remote attribute MUST
          return the RemotePlayback instance associated with the
          media element.
        
            Some pages may wish to disable remote playback of a media element;
            for example, they may prefer to use a PresentationRequest
            to present a complete document on a presentation screen. To support
            this use case, a new disableRemotePlayback attribute is added to
            the list of content attributes for audio and video elements.
          
            A corresponding disableRemotePlayback IDL attribute which
            reflects the value of each element's disableRemotePlayback
            content attribute is added to the HTMLMediaElement interface.
            The disableRemotePlayback IDL attribute MUST
            
            reflect the content attribute of the same name.
          
            If the disableRemotePlayback attribute is present
            on the media element, the user agent MUST NOT play
            the media element remotely or present any UI to do so.
          
            When the disableRemotePlayback attribute is added to
            the media element, the user agent MUST run the steps
            to disable remote playback:
          
RemotePlayback methods with InvalidStateError.
            state is not
            disconnected, run the disconnect from a remote
            playback device algorithm for the remote playback device
            the media element is connected or connecting to.
            This section is non-normative.
          Firing the callback provided via the watchAvailability() method reveals one
          bit of information about the presence (or non-presence) of a
          remote playback device typically discovered through the local
          area network. This could be used in conjunction with other
          information for fingerprinting the user. However, this information is
          also dependent on the user's local network context, so the risk is
          minimized.  Also, by design, the human readable name of a remote
          playback device is not revealed to the page.
        
The API enables monitoring the list of available remote playback devices. How the user agent determines the compatibility and availability of a remote playback device with a media element's resource is an implementation detail. If a user agent matches a media resource to a particular type of device to determine its availability, this feature can be used to probe information about which remote playback device the user has without user consent.
The user agent should not monitor the list of available remote playback devices if the user disables background monitoring through a browser setting.
When the user is asked permission to use a remote playback device during the steps to change remote playback state, the user agent should make it clear what origin the request is coming from.
Display of the origin requesting remote playback will help the user understand what content is making the request, especially when the request is initiated from a nested browsing context. For example, embedded content may try to convince the user to click to trigger a request to start an unwanted remote playback.
              Showing the origin that will be presented will help the user know
              if that content is from a potentially trustworthy origin
              (e.g., https:), and corresponds to a known or expected site.
            
The Remote Playback API abstracts away what "local" means for displays, meaning that it exposes network-accessible displays as though they were local displays. The Remote Playback API requires user permission for a page to access any display to mitigate issues that could arise, such as showing unwanted content on a display viewable by others.
This specification will not mandate communication protocols between the local playback device and the remote playback device, but the user agent should set some guarantees of message confidentiality and authenticity between them.
The Remote Playback API is not limited to [SECURE-CONTEXTS] because it exposes a feature to web applications that user agents usually offer natively on all media regardless of the browsing context. A user agent can limit the API to [SECURE-CONTEXTS] by always returning an empty list as part of the monitor the list of available remote playback devices algorithm in contexts that are not secure.
For this specification to be advanced to Proposed Recommendation, there must be at least two independent, interoperable implementations of each feature. Each feature may be implemented by a different set of products, there is no requirement that all features be implemented by a single product. Additionally, implementations must demonstrate support for the media remoting and media flinging cases, either within the same product or within different products.
For the purposes of these criteria, we define the following terms:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in: