Tracking Compliance and Scope

Abstract

This specification defines the meaning of a Do Not Track (DNT) preference and sets out practices for websites to comply with this preference.

2. Definitions

2.1 User

A user is an individual human. When user agent software accesses online resources, whether or not the user understands or has specific knowledge of a particular request, that request is "made by the user."

2.2 User Agent

The term user agent refers to any of the various client programs capable of initiating HTTP requests, including but not limited to browsers, spiders (web-based robots), command-line tools, native applications, and mobile apps [RFC7230].

Issue 227: User Agent requirements in UA Compliance vs. Scope section

There is a proposal to move a sentence about user agents from the Introduction/Scope section to this section. We might also include a reference here to the conformance requirements on user agents in the companion TPE recommendation.

2.3 Network Interaction

A network interaction is a single HTTP request and its corresponding response(s): zero or more interim (1xx) responses and a single final (2xx-5xx) response.

2.4 User Action

A user action is a deliberate action by the user, via configuration, invocation, or selection, to initiate a network interaction. Selection of a link, submission of a form, and reloading a page are examples of user actions.

2.5 Subrequest

A subrequest is any network interaction that is not directly initiated by user action. For example, an initial response in a hypermedia format that contains embedded references to stylesheets, images, frame sources, and onload actions will cause a browser, depending on its capabilities and configuration, to perform a corresponding set of automated subrequests to fetch those references using additional network interactions.

2.6 Party

A party is a natural person, a legal entity, or a set of legal entities that share common owner(s), common controller(s), and a group identity that is easily discoverable by a user. Common branding or providing a list of affiliates that is available via a link from a resource where a party describes DNT practices are examples of ways to provide this discoverability.

2.7 Service Provider

Access to Web resources often involves multiple parties that might process the data received in a network interaction. For example, domain name services, network access points, content distribution networks, load balancing services, security filters, cloud platforms, and software-as-a-service providers might be a party to a given network interaction because they are contracted by either the user or the resource owner to provide the mechanisms for communication. Likewise, additional parties might be engaged after a network interaction, such as when services or contractors are used to perform specialized data analysis or records retention.

For the data received in a given network interaction, a service provider is considered to be the same party as its contractee if the service provider:

processes the data on behalf of the contractee;
ensures that the data is only retained, accessed, and used as directed by the contractee;
has no independent right to use the data other than in a permanently deidentified form (e.g., for monitoring service integrity, load balancing, capacity planning, or billing); and,
has a contract in place with the contractee which is consistent with the above limitations.

2.8 First Party

With respect to a given user action, a first party is a party with which the user intends to interact, via one or more network interactions, as a result of making that action. Merely hovering over, muting, pausing, or closing a given piece of content does not constitute a user's intent to interact with another party.

In some cases, a resource on the Web will be jointly controlled by two or more distinct parties. Each of those parties is considered a first party if a user would reasonably expect to communicate with all of them when accessing that resource. For example, prominent co-branding on the resource might lead a user to expect that multiple parties are responsible for the content or functionality.

Network interactions and subrequests related to a given user action may not constitute intentional interaction when, for example, the user is unaware or only transiently informed of redirection or framed content.

2.9 Third Party

For any data collected as a result of one or more network interactions resulting from a user's action, a third party is any party other than that user, a first party for that user action, or a service provider acting on behalf of either that user or that first party.

2.10 Deidentification

Data is permanently deidentified when there exists a high level of confidence that no human subject of the data can be identified, directly or indirectly (e.g., via association with an identifier, user agent, or device), by that data alone or in combination with other retained or available information.

2.10.1 Deidentification Considerations

This section is non-normative.

In this specification the term permanently deidentified is used for data that has passed out of the scope of this specification and can not, and will never, come back into scope. The organization that performs the deidentification needs to be confident that the data can never again identify the human subjects whose activity contributed to the data. That confidence may result from ensuring or demonstrating that it is no longer possible to:

isolate some or all records which correspond to a device or user;
link two or more records (either from the same database or different databases), concerning the same device or user;
deduce, with significant probability, information about a device or user.

Regardless of the deidentification approach, unique keys can be used to correlate records within the deidentified dataset, provided the keys do not exist and cannot be derived outside the deidentified dataset and have no meaning outside the deidentified dataset (i.e. no mapping table can exist that links the original identifiers to the keys in the deidentified dataset).

In the case of records in such data that relate to a single user or a small number of users, usage and/or distribution restrictions are advisable; experience has shown that such records can, in fact, sometimes be used to identify the user or users despite technical measures taken to prevent reidentification. It is also a good practice to disclose (e.g. in the privacy policy) the process by which deidentification of these records is done, as this can both raise the level of confidence in the process, and allow for for feedback on the process. The restrictions might include, for example:

technical safeguards that prohibit reidentification of deidentified data and/or merging of the original tracking data and deidentified data;
business processes that specifically prohibit reidentification of deidentified data and/or merging of the original tracking data and deidentified data;
business processes that prevent inadvertent release of either the original tracking data or deidentified data;
administrative controls that limit access to both the original tracking data and deidentified data.

Geolocation data (of a certain precision or over a period of time) may itself identify otherwise deidentified data.

2.11 Tracking

Tracking is the collection of data regarding a particular user's activity across multiple distinct contexts and the retention, use, or sharing of data derived from that activity outside the context in which it occurred. A context is a set of resources that are controlled by the same party or jointly controlled by a set of parties.

Tracking data is any data that could be combined with other data to engage in tracking a user across different contexts.

2.12 Collect, Use, Share, Facilitate

A party collects data received in a network interaction if that data remains within the party’s control after the network interaction is complete.

A party uses data if the party processes the data for any purpose other than storage or merely forwarding it to another party.

A party shares data if it transfers or provides a copy of data to any other party.

A party facilitates any other party’s collection of data if it enables such party to collect data and engage in tracking.

3. Server Compliance

It is outside the scope of this specification to control short-term, transient collection and use of data, so long as the data is not shared with a third party and is not used to build a profile about a user or otherwise alter an individual user’s user experience outside the current network interaction. For example, the contextual customization of ads shown as part of the same network interaction is not restricted by a DNT:1 signal.

Issue 134: Would we additionally permit logs that are retained for a short enough period?

3.1 Indicating Compliance and Non-Compliance

In order to communicate compliance with a user's expressed tracking preference as described in this recommendation, a party MUST indicate compliance using the tracking status resource defined in the [TRACKING-DNT] recommendation. A party MUST use the following URI (in the compliance property array) to indicate compliance with this version of the recommendation:

http://www.w3.org/TR/2014/WD-tracking-compliance-20141125/

A party to a given user action that is tracking that action MUST indicate so to the user agent. A party that is tracking a user with that user's consent MUST use the corresponding C or P tracking status values. A party that is tracking a user for reasons allowable under this recommendation (for example, for one of the permitted uses described below) MUST use the T value. A party to a given user action that is not engaged in tracking SHOULD use the N value (a T value is also conformant but not as informative).

A party to a given user action that disregards a DNT signal MUST indicate so to the user agent, using the response mechanism defined in the [TRACKING-DNT] recommendation. The party MUST provide information in its privacy policy listing the specific reasons for not honoring the user's expressed preference. The party's representation MUST be clear and easily discoverable.

In the interest of transparency, especially where multiple reasons are listed, a server might use the [TRACKING-DNT] qualifiers or config properties to indicate a particular reason for disregarding or steps to address the issue. A user agent can parse this response to communicate the reason to the user or direct the user to the relevant section of a privacy policy. This document does not define specific qualifiers for different reasons servers might have for disregarding signals.

3.2 First Party Compliance

With respect to a given user action, a first party to that action which receives a DNT:1 signal MAY collect, retain and use data received from those network interactions. This includes customizing content, services and advertising with respect to those user actions.

A first party to a given user action MUST NOT share data about those network interactions with third parties to that action who are prohibited from collecting data from those network interactions under this recommendation. Data about the interaction MAY be shared with service providers acting on behalf of the first party.

Compliance rules in this section apply where a party determines that it is a first party to a given user action — either because network resources are intended only for use as a first party to a user action or because the status is dynamically discerned. For cases where a party later determines that data was unknowingly collected as a third party to a user action, see Section 6. Unknowing Collection.

A first party to a given user action MAY elect to follow the rules defined under this recommendation for third parties.

Note

Given WG decision on ISSUE-241, how should a first party to an action indicate to the user that it is electing to follow third-party rules? Should we suggest using "N" or some other tracking status code?

3.3 Third Party Compliance

Issue 203: Use of 'tracking' in third-party compliance

When a third party to a given user action receives a DNT:1 signal in a related network interaction:

that party MUST NOT collect, share, or use tracking data related to that interaction;
that party MUST NOT use data about previous network interactions in which it was a third party to the user action.

A third party to a given user action MAY nevertheless collect and use such data when:

a user has explicitly-granted an exception, as described below;
data is collected for the set of permitted uses described below;
or, the data is permanently deidentified as defined in this specification.

Outside the permitted uses and explicitly-granted exceptions listed below, a third party to a given user action MUST NOT collect, share, or associate with related network interactions any identifiers that identify a specific user, user agent, or device. For example, a third party that does not require unique user identifiers for one of the permitted uses MUST NOT place a unique identifier in cookies or other browser-based local storage mechanisms.

3.3.1 General Requirements for Permitted Uses

Some collection and use of data by third parties to a given user action is permitted, notwithstanding receipt of DNT:1 in a network interaction, as enumerated below. Different permitted uses may differ in their permitted items of data collection, retention times, and consequences. In all cases, collection and use of data must be reasonably necessary and proportionate to achieve the purpose for which it is specifically permitted; unreasonable or disproportionate collection, retention, or use are not “permitted uses”.

Note

The requirements in the following sub-sections apply to a party that collects data for a permitted use and that would otherwise be prohibited from collecting, retaining or using that data under the third-party compliance requirements above. Where a first party to a given user action, for example, collects some data for a purpose listed among the permitted uses (e.g. security of network services), these requirements do not apply.

3.3.1.1 No Secondary Uses

A party MUST NOT use data collected for permitted uses for purposes other than the permitted uses for which each datum was permitted to be collected.

3.3.1.2 Data Minimization, Retention and Transparency

Data collected by a party for permitted uses MUST be minimized to the data reasonably necessary for such permitted uses. Such data MUST NOT be retained any longer than is proportionate to, and reasonably necessary for, such permitted uses. A party MUST NOT rely on unique identifiers if alternative solutions are reasonably available.

A party MUST provide public transparency of the time periods for which data collected for permitted uses are retained. The party MAY enumerate different retention periods for different permitted uses. Data MUST NOT be used for a permitted use once the data retention period for that permitted use has expired. After there are no remaining permitted uses for given data, the data MUST be deleted or permanently deidentified.

Issue 199: Limitations on the use of unique identifiers

3.3.1.3 No Personalization

A party that collects data for a permitted use MUST NOT use that data to alter a specific user's online experience based on multi-site activity, except as specifically permitted below.

3.3.1.4 Reasonable Security

A party that collects data for a permitted use MUST use reasonable technical and organizational safeguards to prevent further processing of data retained for permitted uses. While physical separation of data maintained for permitted uses is not required, best practices SHOULD be in place to ensure technical controls ensure access limitations and information security. That party SHOULD ensure that the access and use of data retained for permitted uses is auditable.

3.3.2 Permitted Uses

Issue 211: Should we specify retention periods (extended with transparency) for permitted uses?

3.3.2.1 Frequency Capping

Regardless of the tracking preference expressed, data MAY be collected, retained and used to limit the number of times that a user sees a particular advertisement, often called frequency capping, as long as the data retained do not reveal the user’s browsing history. A party MUST NOT construct profiles of users or user behaviors based on their ad frequency history, or otherwise alter the user’s experience.

3.3.2.2 Financial Logging

Regardless of the tracking preference expressed, data MAY be collected and used for billing and auditing related to the current network interaction and concurrent transactions. This may include counting ad impressions to unique visitors, verifying positioning and quality of ad impressions and auditing compliance with this and other standards.

3.3.2.3 Security

Regardless of the tracking preference expressed, data MAY be collected and used to the extent reasonably necessary to detect security incidents, protect the service against malicious, deceptive, fraudulent, or illegal activity, and prosecute those responsible for such activity, provided that such data is not used for operational behavior beyond what is reasonably necessary to protect the service or institute a graduated response.

When feasible, a graduated response to a detected security incident is preferred over widespread data collection. In this recommendation, a graduated response is a data minimization methodology where actions taken are proportional to the problem or risk being mitigated.

3.3.2.4 Debugging

Regardless of the tracking preference expressed, data MAY be collected, retained and used for debugging purposes to identify and repair errors that impair existing intended functionality.

3.3.3 Qualifiers for Permitted Uses

A party MAY indicate which of the listed permitted uses apply to tracking of a user with the qualifiers mechanism defined in the [TRACKING-DNT] document. While providing qualifiers is OPTIONAL, a party that wishes to indicate particular permitted uses MUST use the corresponding characters as indicated in the table below.

qualifier	permitted use
c	frequency capping
f	financial logging
s	security
d	debugging

A party MAY use multiple qualifiers to indicate that multiple permitted uses of tracking might be ongoing and that each such use conforms to any corresponding requirements. Where qualifiers are present, a party MUST indicate all claimed permitted uses.

Note

The qualifiers in this table correspond directly to the permitted uses described in the previous section. This list, the characters and the names may change depending on the resolution of open issues regarding the permitted uses.

Abstract

Status of This Document

Table of Contents

1. Scope

2. Definitions

2.1 User

2.2 User Agent

2.3 Network Interaction

2.4 User Action

2.5 Subrequest

2.6 Party

2.7 Service Provider

2.8 First Party

2.9 Third Party

2.10 Deidentification

2.10.1 Deidentification Considerations

2.11 Tracking

2.12 Collect, Use, Share, Facilitate

3. Server Compliance

3.1 Indicating Compliance and Non-Compliance

3.2 First Party Compliance

3.3 Third Party Compliance

3.3.1 General Requirements for Permitted Uses

3.3.1.1 No Secondary Uses

3.3.1.2 Data Minimization, Retention and Transparency

3.3.1.3 No Personalization

3.3.1.4 Reasonable Security

3.3.2 Permitted Uses

3.3.2.1 Frequency Capping

3.3.2.2 Financial Logging

3.3.2.3 Security

3.3.2.4 Debugging

3.3.3 Qualifiers for Permitted Uses

4. User-Granted Exceptions

5. Interaction with Existing User Privacy Controls

6. Unknowing Collection

7. Legal Compliance

A. Acknowledgements

B. References

B.1 Normative references