1. Introduction
The API defined in this document captures images from a photographic device referenced through a valid MediaStreamTrack
[GETUSERMEDIA]. The produced image can be in the form of a Blob
(see takePhoto()
method) or as a ImageBitmap
(see grabFrame()
).
Reading capabilities and settings and applying constraints is done in one of two ways depending on whether it impacts the video MediaStreamTrack
or not. Photo-specific capabilities and current settings can be retrieved via getPhotoCapabilities()
/getPhotoSettings()
and configured via takePhoto()
's PhotoSettings
argument. Manipulating video-related capabilities, current settings and constraints is done via the MediaStreamTrack extension mechanism.
2. Security and Privacy Considerations
The privacy and security considerations discussed in [GETUSERMEDIA] apply to this extension specification.
Moreover, implementors should take care to prevent additional leakage of privacy-sensitive data from captured images. For instance, embedding the user’s location in the metadata of the digitzed image (e.g. EXIF) might transmit more private data than the user is expecting.
3. Image Capture API
The User Agent must support Promises in order to implement the Image Capture API. Any Promise
object is assumed to have a resolver object, with resolve()
and reject()
methods associated with it.
[Exposed =Window ]interface {
ImageCapture constructor (MediaStreamTrack );
videoTrack Promise <Blob >takePhoto (optional PhotoSettings = {});
photoSettings Promise <PhotoCapabilities >getPhotoCapabilities ();Promise <PhotoSettings >getPhotoSettings ();Promise <ImageBitmap >grabFrame ();readonly attribute MediaStreamTrack track ; };
takePhoto()
returns a captured image encoded in the form of a Blob
, whereas grabFrame()
returns a snapshot of the track
video feed in the form of a non-encoded ImageBitmap
. 3.1. Attributes
track
, of type MediaStreamTrack, readonly- The
MediaStreamTrack
passed into the constructor.
3.2. Methods
ImageCapture(MediaStreamTrack videoTrack)
-
Parameter Type Nullable Optional Description videoTrack MediaStreamTrack
✘ ✘ The MediaStreamTrack
to be used as source of data. This will be the value of thetrack
attribute. TheMediaStreamTrack
passed to the constructor MUST have itskind
attribute set to"video"
otherwise aDOMException
of typeNotSupportedError
will be thrown. takePhoto(optional PhotoSettings photoSettings)
-
takePhoto()
produces the result of a single photographic exposure using the video capture device sourcing thetrack
and including anyPhotoSettings
configured, returning an encoded image in the form of aBlob
if successful. When this method is invoked, the user agent MUST run the following steps:- If the
readyState
oftrack
provided in the constructor is notlive
, return a promise rejected with a newDOMException
whose name isInvalidStateError
, and abort these steps. - Let p be a new promise.
-
Run the following steps in parallel:
-
Gather data from the
track
underlying source with the definedphotoSettings
and into aBlob
containing a single still image. The method of doing this will depend on the underlying device. - If the operation cannot be completed for any reason (for example, upon invocation of multiple
takePhoto()
method calls in rapid succession), then reject p with a newDOMException
whose name isUnknownError
, and abort these steps. - Resolve p with the Blob object.
-
Gather data from the
- Return p.
Parameter Type Nullable Optional Description settings PhotoSettings
✔ ✘ The PhotoSettings
dictionary to be applied. - If the
getPhotoCapabilities()
-
getPhotoCapabilities()
is used to retrieve the ranges of available configuration options, if any. When this method is invoked, the user agent MUST run the following steps:- If the
readyState
oftrack
provided in the constructor is notlive
, return a promise rejected with a newDOMException
whose name isInvalidStateError
, and abort these steps. - Let p be a new promise.
-
Run the following steps in parallel:
- Gather data from
track
into aPhotoCapabilities
dictionary containing the available capabilities of the device, including ranges where appropriate. The method of doing this will depend on the underlying device. - If the data cannot be gathered for any reason (for example, the
MediaStreamTrack
being ended asynchronously), then reject p with a newDOMException
whose name isOperationError
, and abort these steps. - Resolve p with the
PhotoCapabilities
dictionary.
- Gather data from
- Return p.
- If the
getPhotoSettings()
-
getPhotoSettings()
is used to retrieve the current configuration settings values, if any. When this method is invoked, the user agent MUST run the following steps:- If the
readyState
oftrack
provided in the constructor is notlive
, return a promise rejected with a newDOMException
whose name isInvalidStateError
, and abort these steps. - Let p be a new promise.
-
Run the following steps in parallel:
- Gather data from
track
into aPhotoSettings
dictionary containing the current conditions in which the device is found. The method of doing this will depend on the underlying device. - If the data cannot be gathered for any reason (for example, the
MediaStreamTrack
being ended asynchronously), then reject p with a newDOMException
whose name isOperationError
, and abort these steps. - Resolve p with the
PhotoSettings
dictionary.
- Gather data from
- Return p.
- If the
grabFrame()
-
grabFrame()
takes a snapshot of the live video being held intrack
, returning anImageBitmap
if successful.grabFrame()
returns data only once upon being invoked. When this method is invoked, the user agent MUST run the following steps:- If the
readyState
oftrack
provided in the constructor is notlive
, return a promise rejected with a newDOMException
whose name isInvalidStateError
, and abort these steps. - Let p be a new promise.
-
Run the following steps in parallel:
- Gather data from
track
into anImageBitmap
object. Thewidth
andheight
of theImageBitmap
object are derived from the constraints oftrack
. - If the operation cannot be completed for any reason (for example, upon invocation of multiple
grabFrame()
/takePhoto()
method calls in rapid succession), then reject p with a newDOMException
whose name isUnknownError
, and abort these steps. - Resolve p with the
ImageBitmap
object.
- Gather data from
- Return p.
- If the
4. PhotoCapabilities
dictionary {
PhotoCapabilities RedEyeReduction redEyeReduction ;MediaSettingsRange imageHeight ;MediaSettingsRange imageWidth ;sequence <FillLightMode >fillLightMode ; };
4.1. Members
redEyeReduction
, of type RedEyeReduction- The red eye reduction capacity of the source.
imageHeight
, of type MediaSettingsRange- This reflects the image height range supported by the UA.
imageWidth
, of type MediaSettingsRange- This reflects the image width range supported by the UA.
fillLightMode
, of type sequence<FillLightMode>- This reflects the supported fill light mode (flash) settings, if any.
imageWidth
and imageHeight
ranges to prevent increasing the fingerprinting surface and to allow the UA to make a best-effort decision with regards to actual hardware configuration. 5. PhotoSettings
dictionary {
PhotoSettings FillLightMode fillLightMode ;double imageHeight ;double imageWidth ;boolean redEyeReduction ; };
5.1. Members
redEyeReduction
, of type boolean- This reflects whether camera red eye reduction is desired
imageHeight
, of type double- This reflects the desired image height. The UA MUST select the closest height value to this setting if it supports a discrete set of height options.
imageWidth
, of type double- This reflects the desired image width. The UA MUST select the closest width value to this setting if it supports a discrete set of width options.
fillLightMode
, of type FillLightMode- This reflects the desired fill light mode (flash) setting.
6. MediaSettingsRange
dictionary {
MediaSettingsRange double max ;double min ;double step ; };
6.1. Members
max
, of type double- The maximum value of this setting
min
, of type double- The minimum value of this setting
step
, of type double- The minimum difference between consecutive values of this setting.
7. RedEyeReduction
enum {
RedEyeReduction "never" ,"always" ,"controllable" };
7.1. Values
never
- Red eye reduction is not available in the device.
always
- Red eye reduction is available in the device and it is always configured to true.
controllable
- Red eye reduction is available in the device and it is controllable by the user via
redEyeReduction
.
8. FillLightMode
enum {
FillLightMode "auto" ,"off" ,"flash" };
8.1. Values
auto
- The video device’s fill light will be enabled when required (typically low light conditions). Otherwise it will be off. Note that auto does not guarantee that a flash will fire when
takePhoto()
is called. Useflash
to guarantee firing of the flash fortakePhoto()
method. off
- The source’s fill light and/or flash will not be used.
flash
- This value will always cause the flash to fire for
takePhoto()
method.
9. Extensions
This Section defines a number of new set of Constrainable Properties for MediaStreamTrack
that can be applied in order to make its behavior more suitable for taking pictures. Use of these constraints via MediaStreamTrack
's methods getCapabilities()
, getSettings()
, getConstraints()
and applyConstraints()
will modify the behavior of the ImageCapture
object’s track
.
9.1. MediaTrackSupportedConstraints
dictionary
MediaTrackSupportedConstraints
is extended here with the list of constraints that a User Agent recognizes for controlling the photo capabilities. This dictionary can be retrieved using MediaDevices
getSupportedConstraints()
method.
partial dictionary MediaTrackSupportedConstraints {boolean whiteBalanceMode =true ;boolean exposureMode =true ;boolean focusMode =true ;boolean pointsOfInterest =true ;boolean exposureCompensation =true ;boolean exposureTime =true ;boolean colorTemperature =true ;boolean iso =true ;boolean brightness =true ;boolean contrast =true ;boolean pan =true ;boolean saturation =true ;boolean sharpness =true ;boolean focusDistance =true ;boolean tilt =true ;boolean zoom =true ;boolean torch =true ; };
9.1.1. Members
whiteBalanceMode
, of type boolean, defaulting totrue
- Whether white balance mode constraining is recognized.
colorTemperature
, of type boolean, defaulting totrue
- Whether color temperature constraining is recognized.
exposureMode
, of type boolean, defaulting totrue
- Whether exposure constraining is recognized.
exposureCompensation
, of type boolean, defaulting totrue
- Whether exposure compensation constraining is recognized.
exposureTime
, of type boolean, defaulting totrue
- Whether exposure time constraining is recognized.
iso
, of type boolean, defaulting totrue
- Whether ISO constraining is recognized.
focusMode
, of type boolean, defaulting totrue
- Whether focus mode constraining is recognized.
pointsOfInterest
, of type boolean, defaulting totrue
- Whether points of interest are supported.
brightness
, of type boolean, defaulting totrue
- Whether brightness constraining is recognized.
contrast
, of type boolean, defaulting totrue
- Whether contrast constraining is recognized.
pan
, of type boolean, defaulting totrue
- Whether pan constraining is recognized.
saturation
, of type boolean, defaulting totrue
- Whether saturation constraining is recognized.
sharpness
, of type boolean, defaulting totrue
- Whether sharpness constraining is recognized.
focusDistance
, of type boolean, defaulting totrue
- Whether focus distance constraining is recognized.
tilt
, of type boolean, defaulting totrue
- Whether tilt constraining is recognized.
zoom
, of type boolean, defaulting totrue
- Whether configuration of the zoom level is recognized.
torch
, of type boolean, defaulting totrue
- Whether configuration of torch is recognized.
9.2. MediaTrackCapabilities
dictionary
MediaTrackCapabilities
is extended here with the capabilities specific to image capture. This dictionary is produced by the UA via getCapabilities()
and represents the supported ranges and enumerations of the supported constraints.
partial dictionary MediaTrackCapabilities {sequence <DOMString >whiteBalanceMode ;sequence <DOMString >exposureMode ;sequence <DOMString >focusMode ;MediaSettingsRange exposureCompensation ;MediaSettingsRange exposureTime ;MediaSettingsRange colorTemperature ;MediaSettingsRange iso ;MediaSettingsRange brightness ;MediaSettingsRange contrast ;MediaSettingsRange saturation ;MediaSettingsRange sharpness ;MediaSettingsRange focusDistance ;MediaSettingsRange pan ;MediaSettingsRange tilt ;MediaSettingsRange zoom ;boolean torch ; };
9.2.1. Members
whiteBalanceMode
, of type sequence<DOMString>- A sequence of supported white balance modes. Each string MUST be one of the members of
MeteringMode
. colorTemperature
, of type MediaSettingsRange- This range reflects the supported correlated color temperatures to be used for the scene white balance calculation.
exposureMode
, of type sequence<DOMString>- A sequence of supported exposure modes. Each string MUST be the members of
MeteringMode
. exposureCompensation
, of type MediaSettingsRange- This reflects the supported range of exposure compensation. The supported range can be, and usually is, centered around 0 EV.
exposureTime
, of type MediaSettingsRange- This reflects the supported range of exposure time. Values are numeric. Increasing values indicate increasing exposure time.
iso
, of type MediaSettingsRange- This reflects the permitted range of ISO values.
focusMode
, of type sequence<DOMString>- A sequence of supported focus modes. Each string MUST be one of the members of
MeteringMode
. brightness
, of type MediaSettingsRange- This reflects the supported range of brightness setting of the camera. Values are numeric. Increasing values indicate increasing brightness.
contrast
, of type MediaSettingsRange- This reflects the supported range of contrast. Values are numeric. Increasing values indicate increasing contrast.
pan
, of type MediaSettingsRange-
This reflects the pan value range supported by the UA and by the track.
If the track has been created without requesting permission to use (as defined in [permissions]) a PermissionDescriptor with its name member set to
camera
and itspanTiltZoom
member set to true or if that permission request is denied, the track does not support pan. In that case the UA MUST NOT expose the pan value range. saturation
, of type MediaSettingsRange- This reflects the permitted range of saturation setting. Values are numeric. Increasing values indicate increasing saturation.
sharpness
, of type MediaSettingsRange- This reflects the permitted sharpness range of the camera. Values are numeric. Increasing values indicate increasing sharpness, and the minimum value always implies no sharpness enhancement or processing.
focusDistance
, of type MediaSettingsRange- This reflects the focus distance value range supported by the UA.
tilt
, of type MediaSettingsRange-
This reflects the tilt value range supported by the UA and by the track.
If the track has been created without requesting permission to use (as defined in [permissions]) a PermissionDescriptor with its name member set to
camera
and itspanTiltZoom
member set to true or if that permission request is denied, the track does not support tilt. In that case the UA MUST NOT expose the tilt value range. zoom
, of type MediaSettingsRange-
This reflects the zoom value range supported by the UA and by the track.
If the track has been created without requesting permission to use (as defined in [permissions]) a PermissionDescriptor with its name member set to
camera
and itspanTiltZoom
member set to true or if that permission request is denied, the track does not support zoom. In that case the UA MUST NOT expose the zoom value range. torch
, of type boolean- A boolean indicating whether camera supports torch mode- on meaning supported.
9.3. MediaTrackConstraintSet
dictionary
MediaTrackConstraintSet
[GETUSERMEDIA] dictionary is used for both reading the current status with getConstraints()
and for applying a set of constraints with applyConstraints()
.
MediaTrackSettings
can be retrieved to verify the effect of the application by the user agent of the requested MediaTrackConstraints
. Some constraints such as, e.g. zoom, might not be immediately applicable. partial dictionary MediaTrackConstraintSet {ConstrainDOMString whiteBalanceMode ;ConstrainDOMString exposureMode ;ConstrainDOMString focusMode ;ConstrainPoint2D pointsOfInterest ;ConstrainDouble exposureCompensation ;ConstrainDouble exposureTime ;ConstrainDouble colorTemperature ;ConstrainDouble iso ;ConstrainDouble brightness ;ConstrainDouble contrast ;ConstrainDouble saturation ;ConstrainDouble sharpness ;ConstrainDouble focusDistance ; (boolean or ConstrainDouble )pan ; (boolean or ConstrainDouble )tilt ; (boolean or ConstrainDouble )zoom ;ConstrainBoolean torch ; };
9.3.1. Members
whiteBalanceMode
, of type ConstrainDOMString- This string MUST be one of the members of
MeteringMode
. See white balance mode constrainable property. exposureMode
, of type ConstrainDOMString- This string MUST be one of the members of
MeteringMode
. See exposure constrainable property. focusMode
, of type ConstrainDOMString- This string MUST be one of the members of
MeteringMode
. See focus mode constrainable property. colorTemperature
, of type ConstrainDouble- See color temperature constrainable property.
exposureCompensation
, of type ConstrainDouble- See exposure compensation constrainable property.
exposureTime
, of type ConstrainDouble- See exposure time constrainable property.
iso
, of type ConstrainDouble- See iso constrainable property.
pointsOfInterest
, of type ConstrainPoint2D- See points of interest constrainable property.
brightness
, of type ConstrainDouble- See brightness constrainable property.
contrast
, of type ConstrainDouble- See contrast constrainable property.
pan
, of type(boolean or ConstrainDouble)
- A value of true is normalized to a value of empty
ConstrainDoubleRange
. A value of false is normalized to a value of undefined. See pan constrainable property. saturation
, of type ConstrainDouble- See saturation constrainable property.
sharpness
, of type ConstrainDouble- See sharpness constrainable property.
focusDistance
, of type ConstrainDouble- See focus distance constrainable property.
tilt
, of type(boolean or ConstrainDouble)
- A value of true is normalized to a value of empty
ConstrainDoubleRange
. A value of false is normalized to a value of undefined. See tilt constrainable property. zoom
, of type(boolean or ConstrainDouble)
- A value of true is normalized to a value of empty
ConstrainDoubleRange
. A value of false is normalized to a value of undefined. See zoom constrainable property. torch
, of type ConstrainBoolean- See torch constrainable property.
9.4. MediaTrackSettings
dictionary
When the getSettings()
method is invoked on a video stream track, the user agent must return the extended MediaTrackSettings
dictionary, representing the current status of the underlying user agent.
partial dictionary MediaTrackSettings {DOMString whiteBalanceMode ;DOMString exposureMode ;DOMString focusMode ;sequence <Point2D >pointsOfInterest ;double exposureCompensation ;double exposureTime ;double colorTemperature ;double iso ;double brightness ;double contrast ;double saturation ;double sharpness ;double focusDistance ;double pan ;double tilt ;double zoom ;boolean torch ; };
9.4.1. Members
whiteBalanceMode
, of type DOMString- Current white balance mode setting. The string MUST be one of the members of
MeteringMode
. exposureMode
, of type DOMString- Current exposure mode setting. The string MUST be one of the members of
MeteringMode
. colorTemperature
, of type double- Color temperature in use for the white balance calculation of the scene. This field is only significant if
whiteBalanceMode
ismanual
. exposureCompensation
, of type double- Current exposure compensation setting. A value of 0 EV is interpreted as no exposure compensation. This field is only significant if
exposureMode
iscontinuous
orsingle-shot
exposureTime
, of type double- Current exposure time setting. This field is only significant if
exposureMode
ismanual
. iso
, of type double- Current camera ISO setting.
focusMode
, of type DOMString- Current focus mode setting. The string MUST be one of the members of
MeteringMode
. pointsOfInterest
, of type sequence<Point2D>- A sequence of
Point2D
s in use as points of interest for other settings, e.g. Focus, Exposure and Auto White Balance. brightness
, of type double- This reflects the current brightness setting of the camera.
contrast
, of type double- This reflects the current contrast setting of the camera.
pan
, of type double-
This reflects the current pan setting of the camera.
If the track has been created without requesting permission to use (as defined in [permissions]) a PermissionDescriptor with its name member set to
camera
and itspanTiltZoom
member set to true or if that permission request is denied, the track does not support pan. In that case the UA MUST NOT expose the pan setting. saturation
, of type double- This reflects the current saturation setting of the camera.
sharpness
, of type double- This reflects the current sharpness setting of the camera.
focusDistance
, of type double- This reflects the current focus distance setting of the camera.
tilt
, of type double-
This reflects the current tilt setting of the camera.
If the track has been created without requesting permission to use (as defined in [permissions]) a PermissionDescriptor with its name member set to
camera
and itspanTiltZoom
member set to true or if that permission request is denied, the track does not support tilt. In that case the UA MUST NOT expose the tilt setting. zoom
, of type double-
This reflects the current zoom setting of the camera.
If the track has been created without requesting permission to use (as defined in [permissions]) a PermissionDescriptor with its name member set to
camera
and itspanTiltZoom
member set to true or if that permission request is denied, the track does not support zoom. In that case the UA MUST NOT expose the zoom setting. torch
, of type boolean- Current camera torch configuration setting.
9.5. Additional Constrainable Properties
dictionary {
ConstrainPoint2DParameters sequence <Point2D >exact ;sequence <Point2D >ideal ; };typedef (sequence <Point2D >or ConstrainPoint2DParameters );
ConstrainPoint2D
9.5.1. Members
exact
, of type sequence<Point2D>- The exact required value of points of interest.
ideal
, of type sequence<Point2D>- The ideal (target) value of points of interest.
10. Photo Capabilities and Constrainable Properties
-
White balance mode is a setting that cameras use to adjust for different color temperatures. Color temperature is the temperature of background light (usually measured in Kelvin). This setting can usually be automatically and continuously determined by the implementation, but it’s also common to offer a
manual
mode in which the estimated temperature of the scene illumination is hinted to the implementation. Typical temperature ranges for popular modes are provided below:Mode Kelvin range incandescent 2500-3500 fluorescent 4000-5000 warm-fluorescent 5000-5500 daylight 5500-6500 cloudy-daylight 6500-8000 twilight 8000-9000 shade 9000-10000 - Exposure is the amount of light that is allowed to fall on the photosensitive device. In auto-exposure modes (
single-shot
orcontinuous
exposureMode
), the exposure time and/or camera aperture are automatically adjusted by the implementation based on the subject of the photo. Inmanual
exposureMode
, these parameters are set to fixed absolute values. - Focus mode describes the focus setting of the capture device (e.g.
auto
ormanual
). -
Points of interest describe the metering area centers used in other settings, e.g. exposure, white balance mode and focus mode each one being a
Point2D
(usually these three controls are modified simultaneously by the so-called3A
algorithm: auto-focus, auto-exposure, auto-white-balance).A
Point2D
Point of Interest is interpreted to represent a pixel position in a normalized square space ({x,y} ∈ [0.0, 1.0]
). The origin of coordinates{x,y} = {0.0, 0.0}
represents the upper leftmost corner whereas the{x,y} = {1.0, 1.0}
represents the lower rightmost corner: thex
coordinate (columns) increases rightwards and they
coordinate (rows) increases downwards. Values beyond the mentioned limits will be clamped to the closest allowed value. - Exposure Compensation is a numeric camera setting that adjusts the exposure level from the current value used by the implementation. This value can be used to bias the exposure level enabled by auto-exposure, and usually is a symmetric range around 0 EV (the no-compensation value). This value is only used in
single-shot
andcontinuous
exposureMode
. - Exposure Time is a numeric camera setting that controls the length of time during which light is allowed to fall on the photosensitive device. This value is used in
manual
exposureMode
to control exposure. The value is in 100 microsecond units. That is, a value of 1.0 means an exposure time of 1/10000th of a second and a value of 10000.0 means an exposure time of 1 second. - The ISO setting of a camera describes the sensitivity of the camera to light. It is a numeric value, where the lower the value the greater the sensitivity. This value should follow the [iso12232] standard.
- Red Eye Reduction is a feature in cameras that is designed to limit or prevent the appearance of red pupils ("Red Eye") in photography subjects due prolonged exposure to a camera’s flash.
- [LIGHTING-VOCABULARY] defines brightness as "the attribute of a visual sensation according to which an area appears to emit more or less light" and in the context of the present API, it refers to the numeric camera setting that adjusts the perceived amount of light emitting from the photo object. A higher brightness setting increases the intensity of darker areas in a scene while compressing the intensity of brighter parts of the scene. The range and effect of this setting is implementation dependent but in general it translates into a numerical value that is added to each pixel with saturation.
- Contrast is the numeric camera setting that controls the difference in brightness between light and dark areas in a scene. A higher contrast setting reflects an expansion in the difference in brightness. The range and effect of this setting is implementation dependent but it can be understood as a transformation of the pixel values so that the luma range in the histogram becomes larger; the transformation is sometimes as simple as a gain factor.
- [LIGHTING-VOCABULARY] defines saturation as "the colourfulness of an area judged in proportion to its brightness" and in the current context it refers to a numeric camera setting that controls the intensity of color in a scene (i.e. the amount of gray in the scene). Very low saturation levels will result in photos closer to black-and-white. Saturation is similar to contrast but referring to colors, so its implementation, albeit being platform dependent, can be understood as a gain factor applied to the chroma components of a given image.
- Sharpness is a numeric camera setting that controls the intensity of edges in a scene. Higher sharpness settings result in higher contrast along the edges, while lower settings result in less contrast and blurrier edges (i.e. soft focus). The implementation is platform dependent, but it can be understood as the linear combination of an edge detection operation applied on the original image and the original image itself; the relative weights being controlled by this sharpness.
-
Image width and image height represent the supported/desired resolution of the resulting photographic image after any potential sensor corrections and other algorithms are run.
The supported resolutions are managed segregated e.g.
imageWidth
andimageHeight
values/ranges to prevent increasing the fingerprinting surface and to allow the UA to make a best-effort decision with regards to actual hardware configuration vis-a-vis requested constraints. - Focus distance is a numeric camera setting that controls the focus distance of the lens. The setting usually represents distance in meters to the optimal focus distance.
-
Pan is a numeric camera setting that controls the pan of the camera. The setting represents pan in arc seconds, which are 1/3600th of a degree. Values are in the range from -180*3600 arc seconds to +180*3600 arc seconds. Positive values pan the camera clockwise as viewed from above, and negative values pan the camera counter clockwise as viewed from above.
Any algorithm which uses a
MediaTrackConstraintSet
object and itspan
dictionary member which exists after a possible normalization MUST request permission to use (as defined in [permissions]) a PermissionDescriptor with its name member set tocamera
and itspanTiltZoom
member set to true, and, optionally, consider itsdeviceId
member set to any appropriate device’s deviceId.If the
visibilityState
of the top-level browsing context value is "hidden", theapplyConstraints()
algorithm MUST throw aSecurityError
ifpan
dictionary member exists after a possible normalization.Thepan
,tilt
andzoom
dictionary members exists after a possible normalization if the normalized value is a double value or aConstrainDoubleRange
value (whether empty or not). A boolean value of true is normalized to an emptyConstrainDoubleRange
value thus the dictionary member exists after a normalization. A boolean value of false is normalized to no value thus the dictionary member does not exist after a normalization. An emptyConstrainDoubleRange
value implies no constraints but only a permission and capability request. -
Tilt is a numeric camera setting that controls the tilt of the camera. The setting represents tilt in arc seconds, which are 1/3600th of a degree. Values are in the range from -180*3600 arc seconds to +180*3600 arc seconds. Positive values tilt the camera upward when viewed from the front, and negative values tilt the camera downward as viewed from the front.
Any algorithm which uses a
MediaTrackConstraintSet
object and itstilt
dictionary member which exists after a possible normalization MUST request permission to use (as defined in [permissions]) a PermissionDescriptor with its name member set tocamera
and itspanTiltZoom
member set to true, and, optionally, consider itsdeviceId
member set to any appropriate device’s deviceId.If the
visibilityState
of the top-level browsing context value is "hidden", theapplyConstraints()
algorithm MUST throw aSecurityError
iftilt
dictionary member exists after a possible normalization.There is no defined order when applying pan and tilt, the UA is allowed to apply them in any order. In practice this should not matter since these values are absolute, so order will not affect the final position. However, if applying pan and tilt is slow enough, the order in which they are applied may be visually noticeable. -
Zoom is a numeric camera setting that controls the focal length of the lens. The setting usually represents a ratio, e.g. 4 is a zoom ratio of 4:1. The minimum value is usually 1, to represent a 1:1 ratio (i.e. no zoom).
Any algorithm which uses a
MediaTrackConstraintSet
object and itszoom
dictionary member which exists after a possible normalization MUST request permission to use (as defined in [permissions]) a PermissionDescriptor with its name member set tocamera
and itspanTiltZoom
member set to true, and, optionally, consider itsdeviceId
member set to any appropriate device’s deviceId.If the
visibilityState
of the top-level browsing context value is "hidden", theapplyConstraints()
algorithm MUST throw aSecurityError
ifzoom
dictionary member exists after a possible normalization. - Fill light mode describes the flash setting of the capture device (e.g.
auto
,off
,on
). Torch describes the setting of the source’s fill light as continuously connected, staying on as long astrack
is active.
11. MeteringMode
enum {
MeteringMode "none" ,"manual" ,"single-shot" ,"continuous" };
11.1. Values
none
- This source does not offer focus/exposure/white balance mode. For setting, this is interpreted as a command to turn off the feature.
manual
- The capture device is set to manually control the lens position/exposure time/white balance, or such a mode is requested to be configured.
single-shot
- The capture device is configured for single-sweep autofocus/one-shot exposure/white balance calculation, or such a mode is requested.
continuous
- The capture device is configured for continuous focusing for near-zero shutter-lag/continuous auto exposure/white balance calculation, or such continuous focus hunting/exposure/white balance calculation mode is requested.
12. Point2D
A Point2D
represents a location in a two dimensional space. The origin of coordinates is situated in the upper leftmost corner of the space.
dictionary {
Point2D double x = 0.0;double y = 0.0; };
12.1. Members
x
, of type double, defaulting to0.0
- Value of the horizontal (abscissa) coordinate.
y
, of type double, defaulting to0.0
- Value of the vertical (ordinate) coordinate.
13. Examples
13.1. Update camera pan, tilt and zoom and takePhoto()
< html> < body> < video autoplay>< /video>< img> < div> < input id= "pan" title= "Pan" type= "range" disabled/> < labelfor = "pan" > Pan< /label>< /div>< div> < input id= "tilt" title= "Tilt" type= "range" disabled/> < labelfor = "tilt" > Tilt< /label>< /div>< div> < input id= "zoom" title= "Zoom" type= "range" disabled/> < labelfor = "zoom" > Zoom< /label>< /div>< script> let imageCapture; asyncfunction getMedia() { try { const stream= await navigator. mediaDevices. getUserMedia({ video: { pan: true , tilt: true , zoom: true }, }); const video= document. querySelector( 'video' ); video. srcObject= stream; const [ track] = stream. getVideoTracks(); imageCapture= new ImageCapture( track); const capabilities= track. getCapabilities(); const settings= track. getSettings(); for ( const ptzof [ 'pan' , 'tilt' , 'zoom' ]) { // Check whether pan/tilt/zoom is available or not. if ( ! ( ptzin settings)) continue ; // Map it to a slider element. const input= document. getElementById( ptz); input. min= capabilities[ ptz]. min; input. max= capabilities[ ptz]. max; input. step= capabilities[ ptz]. step; input. value= settings[ ptz]; input. disabled= false ; input. oninput= async event=> { try { // Warning: Chrome requires advanced constraints. await track. applyConstraints({[ ptz] : input. value}); } catch ( err) { console. error( "applyConstraints() failed: " , err); } }; } } catch ( err) { console. error( err); } } asyncfunction takePhoto() { try { const blob= await imageCapture. takePhoto(); console. log( "Photo taken: " + blob. type+ ", " + blob. size+ "B" ); const image= document. querySelector( 'img' ); image. src= URL. createObjectURL( blob); } catch ( err) { console. error( "takePhoto() failed: " , err); } } < /script>< /body>< /html>
13.2. Repeated grabbing of a frame with grabFrame()
< html> < body> < canvas>< /canvas>< button id= "stopButton" > Stop frame grab< /button>< script> asyncfunction grabFrames() { try { const canvas= document. querySelector( 'canvas' ); const video= document. querySelector( 'video' ); const stream= await navigator. mediaDevices. getUserMedia({ video: true }); video. srcObject= stream; const [ track] = stream. getVideoTracks(); try { const imageCapture= new ImageCapture( track); stopButton. onclick= () => track. stop(); while ( track. readyState== 'live' ) { const imgData= await imageCapture. grabFrame(); canvas. width= imgData. width; canvas. height= imgData. height; canvas. getContext( '2d' ). drawImage( imgData, 0 , 0 ); awaitnew Promise( r=> setTimeout( r, 1000 )); } } finally { track. stop(); } } catch ( err) { console. error( err); } } < /script>< /body>< /html>
13.3. Grabbing a Frame and Post-Processing
< html> < body> < canvas>< /canvas>< script> asyncfunction grabFrames() { try { const canvas= document. querySelector( 'canvas' ); const video= document. querySelector( 'video' ); const stream= await navigator. mediaDevices. getUserMedia({ video: true }); video. srcObject= stream; const [ track] = stream. getVideoTracks(); try { const imageCapture= new ImageCapture( track); const imageBitmap= await imageCapture. grabFrame(); // |imageBitmap| pixels are not directly accessible: we need to paint // the grabbed frame onto a <canvas>, then getImageData() from it. const ctx= canvas. getContext( '2d' ); canvas. width= imageBitmap. width; canvas. height= imageBitmap. height; ctx. drawImage( imageBitmap, 0 , 0 ); // Read back the pixels from the <canvas>, and invert the colors. const imageData= ctx. getImageData( 0 , 0 , canvas. width, canvas. height); const data= imageData. data; for ( let i= 0 ; i< data. length; i+= 4 ) { data[ i] ^= 255 ; // red data[ i+ 1 ] ^= 255 ; // green data[ i+ 2 ] ^= 255 ; // blue } // Finally, draw the inverted image to the <canvas> ctx. putImageData( imageData, 0 , 0 ); } finally { track. stop(); } } catch ( err) { console. error( err); } } < /script>< /body>< /html>
13.4. Update camera focus distance and takePhoto()
< html> < body> < video autoplay>< /video>< img> < input type= "range" hidden> < script> let imageCapture; asyncfunction getMedia() { try { const stream= await navigator. mediaDevices. getUserMedia({ video: true }); const video= document. querySelector( 'video' ); video. srcObject= stream; const [ track] = stream. getVideoTracks(); imageCapture= new ImageCapture( track); const capabilities= track. getCapabilities(); const settings= track. getSettings(); // Check whether focus distance is available or not. if ( ! capabilities. focusDistance) { return ; } // Map focus distance to a slider element. const input= document. querySelector( 'input[type="range"]' ); input. min= capabilities. focusDistance. min; input. max= capabilities. focusDistance. max; input. step= capabilities. focusDistance. step; input. value= settings. focusDistance; input. oninput= async event=> { try { await track. applyConstraints({ focusMode: "manual" , focusDistance: input. value}); } catch ( err) { console. error( "applyConstraints() failed: " , err); } }; input. parentElement. hidden= false ; } catch ( err) { console. error( err); } } asyncfunction takePhoto() { try { const blob= await imageCapture. takePhoto(); console. log( "Photo taken: " + blob. type+ ", " + blob. size+ "B" ); const image= document. querySelector( 'img' ); image. src= URL. createObjectURL( blob); } catch ( err) { console. error( "takePhoto() failed: " , err); } } < /script>< /body>< /html>