MediaStream Image Capture

Abstract

This document specifies the takePhoto() and grabFrame() methods, and corresponding camera settings, for use with MediaStreamTracks (as defined in Media Capture and Streams [GETUSERMEDIA]).

2. Image Capture API

The User Agent must support Promises in order to implement the Image Capture API. Any Promise object is assumed to have a resolver object, with resolve() and reject() methods associated with it.

[Constructor(MediaStreamTrack track)]
interface ImageCapture {
    readonly attribute MediaStreamTrack videoStreamTrack;
    readonly attribute MediaStream      previewStream;
    Promise<PhotoCapabilities> getPhotoCapabilities();
    Promise<void>              setOptions(PhotoSettings? photoSettings);
    Promise<Blob>              takePhoto(PhotoSettings? photoSettings);
    Promise<ImageBitmap>       grabFrame();
};

2.1 Constructors

ImageCapture

Parameter	Type	Nullable	Optional	Description
track	`MediaStreamTrack`	✘	✘	The MediaStreamTrack to be used as source of data. This will be the value of the `videoStreamTrack` attribute. The `MediaStreamTrack` passed to the constructor MUST have its `kind` attribute set to "`video`" otherwise a `DOMException` of type `NotSupportedError` will be thrown.

2.2 Attributes

videoStreamTrack of type MediaStreamTrack, readonly: The MediaStreamTrack passed into the constructor.
previewStream of type MediaStream, readonly: The MediaStream that provides a camera preview.

2.3 Methods

getPhotoCapabilities

When the getPhotoCapabilities() method of an ImageCapture object is invoked, a new Promise is returned. If the UA is unable to execute the getPhotoCapabilities() method for any reason (for example, the MediaStreamTrack being ended asynchronously), then the UA MUST return a promise rejected with a newly created ImageCaptureError with the appropriate errorDescription set. Otherwise it MUST queue a task, using the DOM manipulation task source, that runs the following steps:

Gather data from the MediaStreamTrack into a PhotoCapabilities object containing the available capabilities of the device, including ranges where appropriate. The resolved PhotoCapabilities will also include the current conditions in which the capabilities of the device are found. The method of doing this will depend on the underlying device.
Return a resolved promise with the PhotoCapabilities object.
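
As an illustration of the steps above, the following sketch reads a few capabilities from the resolved PhotoCapabilities object and formats their ranges as text. It assumes that MediaSettingsRange (defined in section 6, not reproduced here) exposes min, max and current attributes. The names describeRange and summarizeCapabilities are hypothetical helpers and are not part of this API.

```javascript
// Hypothetical helper: format a MediaSettingsRange as text.
// Assumption: the range exposes min, max and current attributes.
function describeRange(range) {
    return range.min + '..' + range.max + ' (current ' + range.current + ')';
}

// Hypothetical helper: resolve with a plain-object summary of selected capabilities.
// captureDevice is expected to be an ImageCapture object.
function summarizeCapabilities(captureDevice) {
    return captureDevice.getPhotoCapabilities().then(function (capabilities) {
        return {
            redEyeReduction: capabilities.redEyeReduction,
            zoom: describeRange(capabilities.zoom),
            iso: describeRange(capabilities.iso)
        };
    });
}
```

A page holding a live video track could call summarizeCapabilities(new ImageCapture(track)) and log the resolved summary.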

setOptions

When the setOptions() method of an ImageCapture object is invoked, a valid PhotoSettings object MUST be passed to the method, and a new Promise object is returned. If the UA can successfully apply the settings, then the UA MUST return a resolved promise. If the UA cannot successfully apply the settings, then the UA MUST return a promise rejected with a newly created ImageCaptureError whose errorDescription is set to OPTIONS_ERROR. When the settings are applied successfully, the effect MAY be reflected, if visible at all, in previewStream.

Parameter	Type	Nullable	Optional	Description
type	`PhotoSettings`	✔	✘	The `PhotoSettings` dictionary to be applied.
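
Since setOptions() rejects with OPTIONS_ERROR when the settings cannot be applied, a page may choose to clamp numeric settings to the ranges reported by getPhotoCapabilities() before calling it. The sketch below shows one possible approach. It assumes that each MediaSettingsRange exposes numeric min and max attributes, and clampSettings is a hypothetical helper name. Whether a UA rejects or adjusts an out-of-range value is not stated in this section, so this is a precaution rather than a requirement.

```javascript
// Hypothetical helper: clamp each numeric setting to the range reported in capabilities.
// Assumption: each MediaSettingsRange exposes numeric min and max attributes.
function clampSettings(requested, capabilities) {
    var clamped = {};
    Object.keys(requested).forEach(function (name) {
        var value = requested[name];
        var range = capabilities[name];
        var hasRange = Boolean(range) &&
            typeof range.min === 'number' && typeof range.max === 'number';
        if (typeof value === 'number' && hasRange) {
            clamped[name] = Math.min(range.max, Math.max(range.min, value));
        } else {
            // Booleans, modes and settings without a known range pass through unchanged.
            clamped[name] = value;
        }
    });
    return clamped;
}
```

The result can then be passed on, for example captureDevice.setOptions(clampSettings({zoom: 10}, capabilities)).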

takePhoto

When the takePhoto() method of an ImageCapture object is invoked, a new Promise object is returned. If the readyState of the MediaStreamTrack provided in the constructor is not "live", the UA MUST return a promise rejected with a newly created ImageCaptureError object whose errorDescription is set to INVALID_TRACK. If the UA is unable to execute the takePhoto() method for any other reason (for example, upon invocation of multiple takePhoto() method calls in rapid succession), then the UA MUST return a promise rejected with a newly created ImageCaptureError object whose errorDescription is set to PHOTO_ERROR. Otherwise it MUST queue a task, using the DOM manipulation task source, that runs the following steps:

Let photoSettings be the method's first argument, if provided, or undefined.
If photoSettings is not undefined, the UA MUST try to apply these settings immediately before gathering the data (i.e. the still image is captured using this photoSettings as if setOptions() had been called immediately before takePhoto()). If the UA cannot successfully apply the settings, then the UA MUST return a promise rejected with a newly created ImageCaptureError whose errorDescription is set to OPTIONS_ERROR.
Gather data from the MediaStreamTrack into a Blob containing a single still image. The method of doing this will depend on the underlying device. Devices may temporarily stop streaming data, reconfigure themselves with the appropriate photo settings, take the photo, and then resume streaming. In this case, the stopping and restarting of streaming SHOULD cause mute and unmute events to fire on the Track in question.
Return a resolved promise with the Blob object.

Parameter	Type	Nullable	Optional	Description
type	`PhotoSettings`	✔	✔	The `PhotoSettings` dictionary to be applied.
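
Because the constructor throws for tracks whose kind is not "video", and takePhoto() rejects with INVALID_TRACK when the track is not "live", a page may want to check both conditions before capturing. The following is a minimal sketch of such a check. isUsableVideoTrack is a hypothetical helper name and is not part of this API.

```javascript
// Hypothetical helper: true when the track is a video track whose readyState is "live".
function isUsableVideoTrack(track) {
    return Boolean(track) && track.kind === 'video' && track.readyState === 'live';
}
```

The check is only advisory: a track can still end between the check and the call, so the rejection handler for takePhoto() remains necessary.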

grabFrame

When the grabFrame() method of an ImageCapture object is invoked, a new Promise object is returned. If the readyState of the MediaStreamTrack provided in the constructor is not "live", the UA MUST return a promise rejected with a newly created ImageCaptureError object whose errorDescription is set to INVALID_TRACK. If the UA is unable to execute the grabFrame() method for any other reason, then the UA MUST return a promise rejected with a newly created ImageCaptureError object whose errorDescription is set to FRAME_ERROR. Otherwise it MUST queue a task, using the DOM manipulation task source, that runs the following steps:

Gather data from the MediaStreamTrack into an ImageBitmap object (as defined in [HTML51]). The width and height of the ImageBitmap object are derived from the constraints of the MediaStreamTrack.
Return a resolved promise with the newly created ImageBitmap object. (Note: grabFrame() returns data only once upon being invoked.)
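
The method descriptions above name four errorDescription values: INVALID_TRACK, PHOTO_ERROR, FRAME_ERROR and OPTIONS_ERROR. The sketch below shows one way a rejection handler might map them to readable messages. The message strings and the names CAPTURE_ERROR_MESSAGES and explainCaptureError are illustrative assumptions and are not defined by this document.

```javascript
// errorDescription values named in the method descriptions; the messages are illustrative.
var CAPTURE_ERROR_MESSAGES = {
    INVALID_TRACK: 'The video track is not live.',
    PHOTO_ERROR: 'The photo could not be taken.',
    FRAME_ERROR: 'The frame could not be grabbed.',
    OPTIONS_ERROR: 'The requested settings could not be applied.'
};

// Hypothetical helper: turn a rejection reason into a readable message.
function explainCaptureError(error) {
    var code = error ? error.errorDescription : undefined;
    if (Object.prototype.hasOwnProperty.call(CAPTURE_ERROR_MESSAGES, code)) {
        return CAPTURE_ERROR_MESSAGES[code];
    }
    return 'Image capture failed: ' + String(code);
}
```

Such a helper could be used as the second argument to then(), for example captureDevice.grabFrame().then(processFrame, function (error) { console.log(explainCaptureError(error)); }).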

4. `PhotoCapabilities`

interface PhotoCapabilities {
    readonly attribute boolean            autoWhiteBalanceMode;
    readonly attribute MediaSettingsRange whiteBalanceMode;
    readonly attribute MeteringMode       exposureMode;
    readonly attribute MediaSettingsRange exposureCompensation;
    readonly attribute MediaSettingsRange iso;
    readonly attribute boolean            redEyeReduction;
    readonly attribute MeteringMode       focusMode;
    readonly attribute MediaSettingsRange brightness;
    readonly attribute MediaSettingsRange contrast;
    readonly attribute MediaSettingsRange saturation;
    readonly attribute MediaSettingsRange sharpness;
    readonly attribute MediaSettingsRange imageHeight;
    readonly attribute MediaSettingsRange imageWidth;
    readonly attribute MediaSettingsRange zoom;
    readonly attribute FillLightMode      fillLightMode;
};

4.1 Attributes

autoWhiteBalanceMode of type boolean: This reflects whether automated White Balance Mode selection is on or off, and is boolean - on is true.
whiteBalanceMode of type MediaSettingsRange: This reflects the current white balance mode setting. Values are of type WhiteBalanceModeEnum.
exposureMode of type MeteringMode: This reflects the current exposure mode setting.
exposureCompensation of type MediaSettingsRange: This reflects the current Exposure compensation setting and permitted range. Values are signed integers multiplied by 100 (to avoid using floating point). The supported range can be, and usually is, centered around 0 EV.
iso of type MediaSettingsRange: This reflects the current camera ISO setting and permitted range. Values are numeric.
redEyeReduction of type boolean: This reflects whether camera red eye reduction is on or off, and is boolean - on is true.
focusMode of type MeteringMode: This reflects the current focus mode setting.
brightness of type MediaSettingsRange: This reflects the current brightness setting of the camera and permitted range. Values are numeric.
contrast of type MediaSettingsRange: This reflects the current contrast setting of the camera and permitted range. Values are numeric.
saturation of type MediaSettingsRange: This reflects the current saturation setting of the camera and permitted range. Values are numeric.
sharpness of type MediaSettingsRange: This reflects the current sharpness setting of the camera and permitted range. Values are numeric.
imageHeight of type MediaSettingsRange: This reflects the image height range supported by the UA and the current height setting.
imageWidth of type MediaSettingsRange: This reflects the image width range supported by the UA and the current width setting.
zoom of type MediaSettingsRange: This reflects the zoom value range supported by the UA and the current zoom setting.
fillLightMode of type FillLightMode: This reflects the current fill light (flash) mode setting. Values are of type FillLightMode.

Note

The supported resolutions are presented as segregated imageWidth and imageHeight ranges to prevent increasing the fingerprinting surface and to allow the UA to make a best-effort decision with regards to actual hardware configuration.
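
The exposureCompensation attribute above is expressed as an integer equal to the EV value multiplied by 100. The following sketch converts between the two forms. evToSetting and settingToEv are hypothetical helper names, and rounding to the nearest integer is an assumption rather than a requirement of this document.

```javascript
// Convert an exposure compensation in EV to the integer form (EV multiplied by 100).
// Assumption: values are rounded to the nearest integer.
function evToSetting(ev) {
    return Math.round(ev * 100);
}

// Convert the integer form back to EV.
function settingToEv(setting) {
    return setting / 100;
}
```

For example, a compensation of -1.5 EV corresponds to the value -150, and the value 0 corresponds to no compensation.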

4.2 Discussion

This section is non-normative.

The PhotoCapabilities interface provides the photo-specific settings options and current settings values. The following definitions are assumed for individual settings and are provided for information purposes:

White balance mode is a setting that cameras use to adjust for different color temperatures. Color temperature is the temperature of background light (normally measured in Kelvin). This setting can also be automatically determined by the implementation. If 'automatic' mode is selected, then the Kelvin setting for White Balance Mode may be overridden. Typical temperature ranges for different modes are provided below:

Mode	Kelvin range
incandescent	2500-3500
fluorescent	4000-5000
warm-fluorescent	5000-5500
daylight	5500-6500
cloudy-daylight	6500-8000
twilight	8000-9000
shade	9000-10000
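
The table above can be expressed as a lookup, sketched below for illustration. The names WHITE_BALANCE_RANGES and whiteBalanceModeForKelvin are hypothetical. Because adjacent ranges share their boundary values and there is a gap between 3500 and 4000, the sketch assumes inclusive bounds where the first matching row wins, and returns null when no row matches.

```javascript
// Typical Kelvin ranges, copied from the table in this section.
var WHITE_BALANCE_RANGES = [
    {mode: 'incandescent', min: 2500, max: 3500},
    {mode: 'fluorescent', min: 4000, max: 5000},
    {mode: 'warm-fluorescent', min: 5000, max: 5500},
    {mode: 'daylight', min: 5500, max: 6500},
    {mode: 'cloudy-daylight', min: 6500, max: 8000},
    {mode: 'twilight', min: 8000, max: 9000},
    {mode: 'shade', min: 9000, max: 10000}
];

// Hypothetical helper: name of the first mode whose range contains kelvin, or null.
// Assumption: bounds are inclusive and the first matching row wins on shared boundaries.
function whiteBalanceModeForKelvin(kelvin) {
    for (var i = 0; i < WHITE_BALANCE_RANGES.length; i++) {
        var range = WHITE_BALANCE_RANGES[i];
        if (kelvin >= range.min && kelvin <= range.max) {
            return range.mode;
        }
    }
    return null;
}
```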

Exposure is the amount of light allowed to fall on the photographic medium. Auto-exposure mode is a camera setting where the exposure levels are automatically adjusted by the implementation based on the subject of the photo.
Exposure Compensation is a numeric camera setting that adjusts the exposure level from the current value used by the implementation. This value can be used to bias the exposure level enabled by auto-exposure, and usually is a symmetric range around 0 EV (the no-compensation value).
The ISO setting of a camera describes the sensitivity of the camera to light. It is a numeric value, where the higher the value the greater the sensitivity. This setting in most implementations relates to shutter speed, and is sometimes known as the ASA setting.
Red Eye Reduction is a feature in cameras that is designed to limit or prevent the appearance of red pupils ("Red Eye") in photography subjects due to prolonged exposure to a camera's flash.
Focus mode describes the focus setting of the capture device (e.g. auto or manual).
Brightness refers to the numeric camera setting that adjusts the perceived amount of light emitting from the photo object. A higher brightness setting increases the intensity of darker areas in a scene while compressing the intensity of brighter parts of the scene.
Contrast is the numeric camera setting that controls the difference in brightness between light and dark areas in a scene. A higher contrast setting reflects an expansion in the difference in brightness.
Saturation is a numeric camera setting that controls the intensity of color in a scene (i.e. the amount of gray in the scene). Very low saturation levels will result in photos closer to black-and-white.
Sharpness is a numeric camera setting that controls the intensity of edges in a scene. Higher sharpness settings result in higher edge intensity, while lower settings result in less contrast and blurrier edges (i.e. soft focus).
Zoom is a numeric camera setting that controls the focal length of the lens. The setting usually represents a ratio, e.g. 4 is a zoom ratio of 4:1. The minimum value is usually 1, to represent a 1:1 ratio (i.e. no zoom).
Fill light mode describes the flash setting of the capture device (e.g. auto, off, on).

5. `PhotoSettings`

The PhotoSettings object is passed into the setOptions() method, and optionally into the takePhoto() method, in order to modify capture device settings specific to still imagery. Each of the members of this dictionary is optional.

dictionary PhotoSettings {
    boolean           autoWhiteBalanceMode;
    unsigned long     whiteBalanceMode;
    MeteringMode      exposureMode;
    unsigned long     exposureCompensation;
    unsigned long     iso;
    boolean           redEyeReduction;
    MeteringMode      focusMode;
    sequence<Point2D> pointsOfInterest;
    unsigned long     brightness;
    unsigned long     contrast;
    unsigned long     saturation;
    unsigned long     sharpness;
    unsigned long     zoom;
    unsigned long     imageHeight;
    unsigned long     imageWidth;
    FillLightMode     fillLightMode;
};

5.1 Members

autoWhiteBalanceMode of type boolean: This reflects whether automatic White Balance Mode selection is desired.
whiteBalanceMode of type unsigned long: This reflects the desired white balance mode setting.
exposureMode of type MeteringMode: This reflects the desired exposure mode setting. Acceptable values are of type MeteringMode.
exposureCompensation of type unsigned long: This reflects the desired exposure compensation setting, multiplied by 100 (to avoid using floating point). A value of 0 EV is interpreted as no exposure compensation.
iso of type unsigned long: This reflects the desired camera ISO setting.
redEyeReduction of type boolean: This reflects whether camera red eye reduction is desired.
focusMode of type MeteringMode: This reflects the desired focus mode setting. Acceptable values are of type MeteringMode.
pointsOfInterest of type sequence<Point2D>: A sequence of Point2Ds to be used as metering area centers for other settings, e.g. Focus and Exposure.
brightness of type unsigned long: This reflects the desired brightness setting of the camera.
contrast of type unsigned long: This reflects the desired contrast setting of the camera.
saturation of type unsigned long: This reflects the desired saturation setting of the camera.
sharpness of type unsigned long: This reflects the desired sharpness setting of the camera.
zoom of type unsigned long: This reflects the desired zoom setting of the camera.
imageHeight of type unsigned long: This reflects the desired image height. The UA MUST select the closest height value to this setting if it supports a discrete set of height options.
imageWidth of type unsigned long: This reflects the desired image width. The UA MUST select the closest width value to this setting if it supports a discrete set of width options.
fillLightMode of type FillLightMode: This reflects the desired fill light (flash) mode setting. Acceptable values are of type FillLightMode.
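
The imageHeight and imageWidth members require the UA to select the closest value when only a discrete set of options is supported. The sketch below illustrates that selection. closestSupported is a hypothetical helper name, and the tie-breaking rule (the earlier entry in the list wins) is an assumption, since this document does not specify one.

```javascript
// Hypothetical helper: pick the supported value nearest to the desired one.
// Assumption: when two values are equally close, the earlier list entry wins.
function closestSupported(desired, supported) {
    if (supported.length === 0) {
        throw new RangeError('No supported values to choose from');
    }
    return supported.reduce(function (best, candidate) {
        var isCloser = Math.abs(candidate - desired) < Math.abs(best - desired);
        return isCloser ? candidate : best;
    });
}
```

For instance, with supported heights of 480, 720 and 1080, a desired height of 700 selects 720.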

10. Examples

This section is non-normative.

10.1 Grabbing a Frame for Post-Processing

Example 1

navigator.mediaDevices.getUserMedia({video: true}).then(gotMedia, failedToGetMedia);

function gotMedia(mediastream) {
    // Extract video track.
    var videoTrack = mediastream.getVideoTracks()[0];
    // The constructor throws if the track is not a video track.
    var captureDevice = new ImageCapture(videoTrack);
    // Pass the function itself; it is called with the resolved ImageBitmap.
    captureDevice.grabFrame().then(processFrame).catch(function(error) {
        console.log('grabFrame() failed: ' + error);
    });
}

function processFrame(imageBitmap) {
    // An ImageBitmap exposes no pixel data, so draw it to a canvas first.
    var canvas = document.createElement('canvas');
    canvas.width = imageBitmap.width;
    canvas.height = imageBitmap.height;
    var ctx = canvas.getContext('2d');
    ctx.drawImage(imageBitmap, 0, 0);
    var imgData = ctx.getImageData(0, 0, canvas.width, canvas.height);

    // Set all alpha values to medium opacity (every fourth byte of the RGBA data).
    for (var j = 3; j < imgData.data.length; j += 4) {
        imgData.data[j] = 128;
    }

    // Create a new ImageData object with the modified pixel values.
    var newImg = ctx.createImageData(imgData.width, imgData.height);
    newImg.data.set(imgData.data);

    // ... and do something with the modified image ...
}

function failedToGetMedia(e) {
    console.log('Stream failure: ' + e);
}

10.2 Taking a picture with Red Eye Reduction supported and used

Example 2

navigator.mediaDevices.getUserMedia({video: true}).then(gotMedia, failedToGetMedia);

function gotMedia(mediastream) {
    // Extract video track.
    var videoDevice = mediastream.getVideoTracks()[0];
    var captureDevice = new ImageCapture(videoDevice);
    // Check if this device reports red eye reduction...
    captureDevice.getPhotoCapabilities().then(function(capabilities) {
        if (!capabilities.redEyeReduction) {
            console.log('No red eye reduction');
            return;
        }
        // ... and if so, request it before taking the photo.
        return captureDevice.setOptions({redEyeReduction: true})
            .then(function() { return captureDevice.takePhoto(); })
            .then(showPicture);
    }).catch(function(error) {
        alert('Failed to take photo');
    });
}

function showPicture(blob) {
    // takePhoto() resolves with the Blob itself.
    var img = document.querySelector('img');
    img.src = URL.createObjectURL(blob);
}

function failedToGetMedia(e) {
    console.log('Stream failure: ' + e);
}

10.3 Repeated grabbing of a frame

Example 3

<html>
<body>
<p><canvas id="frame"></canvas></p>
<button onclick="stopFunction()">Stop frame grab</button>
<script>
  var canvas = document.getElementById('frame');
  // Interval handle, shared between gotMedia() and stopFunction().
  var frameVar;
  navigator.mediaDevices.getUserMedia({video: true}).then(gotMedia, failedToGetMedia);

  function gotMedia(mediastream) {
    // Extract video track.
    var videoDevice = mediastream.getVideoTracks()[0];
    var captureDevice = new ImageCapture(videoDevice);
    // Grab one frame per second until stopFunction() is called.
    frameVar = setInterval(function() {
        captureDevice.grabFrame().then(processFrame);
    }, 1000);
  }

  function processFrame(imageBitmap) {
    // grabFrame() resolves with the ImageBitmap itself.
    canvas.width = imageBitmap.width;
    canvas.height = imageBitmap.height;
    canvas.getContext('2d').drawImage(imageBitmap, 0, 0, imageBitmap.width, imageBitmap.height);
  }

  function stopFunction(e) {
    clearInterval(frameVar);
  }

  function failedToGetMedia(e) {
    console.log('Stream failure: ' + e);
  }
</script>
</body>
</html>

MediaStream Image Capture

W3C Working Draft 30 August 2016

Abstract

Status of This Document

1. Introduction

2. Image Capture API

2.1 Constructors

2.2 Attributes

2.3 Methods

3. `ImageCaptureError`

3.1 Attributes

4. `PhotoCapabilities`

4.1 Attributes

4.2 Discussion

5. `PhotoSettings`

5.1 Members

6. `MediaSettingsRange`

6.1 Attributes

7. `FillLightMode`

7.1 Values

8. `MeteringMode`

8.1 Values

9. `Point2D`

9.1 Attributes

10. Examples

10.1 Grabbing a Frame for Post-Processing

10.2 Taking a picture with Red Eye Reduction supported and used

10.3 Repeated grabbing of a frame

A. References

A.1 Normative references
