Date	Name	Commitments
2024-08-09	TDM Reservation Protocol, version 3	Licensing commitments
2024-04-16	TDM Reservation Protocol, version 2	Licensing commitments
2022-02-16	TDM Reservation Protocol, version 1	Licensing commitments

Notes, September 30th, 2025

Laurent Le Meur | Posted on: October 1, 2025

The chair described the status of the IETF AI-Pref initiative, which is starting a 3-day meeting in Zurich. Links to the current public drafts had been circulated before; they are:

– https://datatracker.ietf.org/doc/html/draft-ietf-aipref-vocab-03
– https://datatracker.ietf.org/doc/html/draft-ietf-aipref-attach-03

Although there is no assurance that the IETF group will succeed in gaining consensus on such a vocabulary, a favorable outcome would be beneficial for TDMRep, as our specification can easily evolve to include such signals. As a reminder, the `tdm-reservation` signal indicates that the copyright owner reserves his rights for any processing of “his” content via TDM techniques. It is not a NO-TDM signal, but a RIGHTS-RESERVED signal. The complementary `tdm-policy` URL directs users to an ODRL file, which outlines the procedures for obtaining mining rights and provides contact information for the copyright owner.

The chair outlined a proposal for the evolution of TDMRep, incorporating a fine-grained vocabulary. It is based solely on the evolution of the TDM Policy document, hosted on a Web Server. The proposal will soon be made available on the TDMRep W3C CG Page (https://www.w3.org/community/tdmrep/). The significant advantage of this solution is that the large volume of publications already released with embedded `tdm-reservation` and `tdm-policy` properties do not need to be modified in any way. Additionally, no modification of TDMRep implementations is required in HTTP headers or the server-hosted tdmrep.json file. Only the ODRL file, which is referenced by `tdm-policy`, needs to be modified if the provider wants to provide fine-grained signals; this file is typically unique for a given provider.

One reason for adopting fine-grained signals may be for Web Search management. Search actors firmly state that modern Search engines rely on AI in their processing pipelines. As AI pipelines include TDM techniques (tokenization is considered by most people to be a TDM technique), reserving all TDM Rights could imply that Search engines need to obtain an agreement from content providers before indexing their content. TDMRep CG members advocate that robots.txt is an effective way to signal web crawlers an opt-in for Search, and therefore, constitutes a complementary solution for the TDM opt-out provided by TDMRep. However, this complementarity is not specified in the TDMRep document and remains subject to interpretation. Therefore, an explicit signal, free from specific bot names, could be more secure for both copyright owners and search actors. In this case, display constraints like ‘no more than 100 characters” — as proposed by Microsoft in the IETF initiative — would be a good addition to the Search opt-in.

The present members of the TDMRep CG decided during the call to work on a proposal for a vocabulary that covers the inference side of the problem currently being discussed at IETF. It may be proposed as an RFC.

The group had previously decided to examine the outcome of the IETF initiative before determining a path for completing and standardizing TDMRep. The IETF initiative is taking more time than initially expected. Still, we’ll follow it until the end of 2025 before deciding whether to move forward with our own set of fine-grained signals or rely on an AI-Pref consensus.

TDMRep CG meeting during the W3C TPAC 2025

Laurent Le Meur | Posted on: July 25, 2025

A TDMRep meeting is planned during the new W3C annual conference, on Friday 14 November 2025 at 7:30 am UTC, duration 1:30h.
The W3C TPAC is an hybrid event. Some participants will be in Kobe, Japan, from November 1O to 14th. Other participants will be online. Registration is open at https://www.w3.org/register/tpac2025.

Some news, July 2025

Laurent Le Meur | Posted on: July 7, 2025

Since the last CG meeting, some participants have been following the ongoing AI Pref project at the IETF. Intensive discussions are still ongoing there (email archive: https://mailarchive.ietf.org/arch/browse/ai-control/); currently, the Vocabulary for Expressing AI Usage Preferences, initially drafted within the Open Future initiative, is gaining traction and has been expanded to also address AI inference, a complex matter to be further evaluated within the CG, especially in relation of AI search (current draft: https://ietf-wg-aipref.github.io/drafts/draft-ietf-aipref-vocab.html). Members of this group with a technical expertise are invited to check the public exchanges on AI Pref at the IETF and participate if they wish.

The CG is monitoring IETF developments to evaluate the possibility of aligning the respective solutions for a more granular definition of AI usage, at least in part. This process, as well as the continuation of the discussion on the standardisaton of the protocol, is likely to take place in September.

Notes, April 15th, 2025

Laurent Le Meur | Posted on: April 22, 2025

Quick recap

The CG discussed the evolution of the opt-out landscape, open issues related to inference, search, and content discovery, and the need for standardization, considering the possibility of integrating the work done at IETF on robots.txt. They also discussed two paths to standardization, via the W3C or ISO.

Next steps

Giulia, Laurent, and Leonard to explore and detail the ISO standardization path for TDMRep.
Upon notification by the people involved in IETF AIPREF discussions, group members to review the IETF mailing list discussions on robots.txt and provide input if interested.
The co-chairs to organize a last call for feedback on the standardization path options before making a final decision.

Multiple initiatives

Laurent made a quick summary of the landscape:

TDMRep: location and asset-based specification (EPUB, XMP; the notion of tdm policy is specific to this effort)
C2PA: asset-based specification
Spawning AI: location-based specification
IETF AIPREF: location-based. The new kid in town, focusing on an evolution of robots.txt.
TDMAI: registry-based specification. It is evolving to support the Open Future recommended vocabulary.
Open Future initiative: specification of a common vocabulary for opt-out refinements. This vocabulary seems to get traction, including in the IETF AIPREF initiative.

Inference, RAG, search & discovery.

The group discussed the need to address the inference use cases with the opt-out. Inference is a broad term; RAG is a technique contributing to inference. Search engines evolve into answering engines. They provide – or don’t – links to reference resources on the Web. The role of Search vs TDM and the applicability of opt-out to search-related functionalities (especially if AI-boosted) is an open question in the IETF discussion.

For Chris (BBC), inference is critical. For Quentin (FEP), the intent behind inference is more crucial than the technique used.

TDMRep vs robots.txt

The group discussed the evolution of the landscape and the need for standardization. They considered the possibility of integrating the work started at IETF on robots.txt, depending on how it evolves. Discussion on AIPREF is public. The TDMRep technique relative to tdmrep.json (a well-known file on the web server where content files are stored) is similar to using a modified robots.txt. An evolution of the latter could satisfy users of the former, and a reference to the IETF specification would then replace the tdmrep.json technique. This could also apply to HTTP headers.

The group agreed to follow the evolution of the IETF initiative before finalizing the specifications to be proposed for formal standardization. They also discussed the potential risks of interference between TDM opt-out and search and discovery, which is also a requirement of the ongoing work at the IETF.

W3C Path for Standardization Discussion

Laurent presented the steps required to achieve W3C Recommendation status. The prominent TDMRep CG members could be invited to the Working Group if they are not W3C members.

The group expressed concern that any member of the W3C can raise a Formal Objection to a Working Group Charter, which blocks the process for some time.

ISO Path for Standardization Discussion

The specification can be worked on by the PDF Association and moved to ISO/C 171/SC2 as Fast Track. The TDMRep CG would be closed, and the W3C TDMRep Report frozen.

Leonard clarified that he is not pushing for any specific direction and is willing to facilitate the process if the group decides to go the ISO path. He also mentioned that the PDF Association has a special arrangement with ISO, allowing for the publication of ISO documents at no charge.

Current CG members have different options for joining the ISO process: through the ISO national standard body and ISO liaison organizations (a fee is required), or by joining the PDF Association (a fee is also required).

In the meeting, the team discussed adopting ISO standards and the potential for opposition from national standards bodies. Leonard clarified that the voting process is not country-based but representative-based, with each representative forming the country’s vote. He also mentioned that it’s rare for a standard to be rejected due to the vote of a single national standards body. The team also discussed the importance of having experts from different backgrounds to ensure a diverse perspective. The conversation ended with a discussion of the potential challenges and the need for consensus in the ISO process.

Notes, December 17th, 2024

Laurent Le Meur | Posted on: December 19, 2024

Update on the ongoing effort for the development of a common vocabulary on TDM

The Open Future Foundation is facilitating discussion among representatives of opt-out solutions, rightsholders organisations, and AI platforms, towards developing a common vocabulary on TDM usages. The work in progress included some meetings in person and online exchanges.
The discussion aims to reach a common understanding of the scope of the TDM opt-out as defined by the European Directive on copyright in the digital single market (DSM Directive) and identify some granular TDM usage.

So far, consensus has emerged on three levels of the vocabulary:
1) a broad “TDM” category as per the DSM Directive, including, among possible TDM usages:
2) General Purpose AI training (where “General Purpose AI” is the term used in the EU AI Act), including:
3) Generative AI training (where Generative AI would fall within the General Purpose AI)

There is no consensus so far on whether search and discovery (either traditional indexing or new AI-based search) should be in the scope of the opt-out. The interpretation adopted by the TDMRep W3C CG and supported by rightsholders is that search and discovery are out of the scope of the TDM opt-out.

Representatives of opt-out solutions involved in the discussion facilitated by the Open Future Foundation are currently working on definitions for the three vocabulary terms defined above (TDM, General purpose AI training, and Generative AI Training). Members of the TDM Rep are welcome to contact the chair for clarification and share views on such definitions.

It is important to note that representatives of opt-out solutions (some of them are also members of the TDM Rep) are willing to work together and make their solutions converge toward the vocabulary/model which reaches consensus.

Possible evolution of TDM Rep specifications

To allow the expression of more granular opt-outs as defined in the common vocabulary, the TDM Rep specifications should evolve into a new version. Some options have been briefly discussed: one could be to extend the “tdm-rep” values to include either a broad or more granular rights reservation, and another could be to convey such granularity within the “tdm-policy”. Different solutions shall be evaluated by the TDM Rep CG, considering several aspects such as the need to ensure backward compatibility, the consensus on various technical solutions, the balancing between clarity, ease of expressing opt-out and ease of discovering information while crawling.

Evaluation of possible standardization paths

At the same time, the community group will start exploring possible standardization paths for the TDM Rep that could be initiated once the specifications are updated. Different standardization bodies could host this work, including W3C, ISO, IETF. It was agreed that it would be wiser to choose a standardization body that can handle the location-based (HTTP, tdmrep.json) and content-based (HTML, EPUB, PDF) parts of the TDMRep specifications. The possibility for interested stakeholders to take part in the work is another key decision element for the choice of the standardization body.

Notes, July 26th, 2023

Laurent Le Meur | Posted on: August 3, 2023

Update on meetings and presentations of the TDM protocol

On 5th June a webinar on the protocol and how to implement it was organized by FEP and EDRLab; more than 70 publishers attended, and positive feedback was received.

On July 11th, in Bruxelles, the TDM protocol was presented by AIE at the “Seminar on best practices for opting-out of generative ML training”, organized by Open Future. AIE and FEP attended the event, which was an occasion to exchange with organizations representing other rightsholders in the content industry, the EC Commission, AI experts, and other projects/initiatives offering solutions for machine-readable opt-out, namely the C2PA coalition and Spawning. The latter integrates different opt-out methods in order to provide a service to AI companies that, given a URL in input, can check if there is an opt-out associated with the resource that AI players intend to use.

Collaboration with Spawning AI

After some exchanges, Spawning AI has already integrated partially the opt-out solution developed by the TDM Rep CG in their service, and they are open to collaborating further with the CG.

Discussion on possible developments of the protocol

EDRLab presented an overview of the different opt-out initiatives that are in touch with our CG. Some of them are media-specific (like the ones by IPTC and C2PA) and provide solutions at the content metadata level, other like Spawning AI (and the TDM Rep protocol) are applicable to any content type, at the URL level. Even though different solutions (content specific and not-content-specific) are complementary and can coexist in line with the different standards and practices in the content industry, there are significant differences in the semantic approach adopted by IPTC and C2PA on one hand, and the TDM Rep on the other: in particular, the different solutions reflect different views on whether the TDM concept would cover all/some AI usages, and whether indexing by search engines could be part of the opt-out. Such discrepancies are partly due to the different legal frameworks (US vs. EU) where such initiatives were developed.

Considering the rapid evolution of AI applications, and the ongoing discussion in the creative industries on rights reservation and licensing for AI, the CG agreed to continue to monitor the situation and exchange with the other initiatives in this field before taking any decision on the possible refinement of the protocol with new properties or values.

In the short term, it was agreed that:

the CG will check if the semantics of the protocol can be further clarified at the level of the specifications, to prevent any ambiguity and facilitate interoperability among different solutions.
the CG will work at a FAQ for non-techies that will further clarify the meaning of the TDM opt-out in light of the EU legal framework and will provide practical insight to the adopters on how to implement it in the context of AI.

Implementation in EPUB files

Given the increasing interest by the publishing sector – including, among GC members, Mondadori, Penguin Random House, and the STM association – for the integration of the TDM protocol in EPUB files, it was agreed that the CG will liaise with the W3C Publishing Community Group and the Publishing Business Group, which follow EPUB related developments, via EDRLab (who is member of both groups).

Particularly, it was agreed that:

On behalf of the CG, EDRLab will send to the W3C Publishing Business Group a proposal to be discussed during their next meeting in September;
Should CG members have views or suggestions on the integration of TDM Rep in EPUB, they are requested to share them within the CG mailing list at their earliest convenience, so that they can be taken into account in the framework of the collaboration with the W3C Publishing Business and Community Groups

Other activities

A FAQ for non-tech users: the group agreed to work on a FAQ; for more details see above;
Keeping track of early adopters: group members are invited to share on the CG mailing list information about new adopters of the protocol. The list of the early adopted will be publicized on the website of the CG, in order to give visibility to it. Early adopters are also encouraged to publicize the adoption of the protocol on their own websites.

Notes, April 4th, 2023

Laurent Le Meur | Posted on: April 6, 2023

The TDMRep Community Group had its first 2023 call on April 4th. Several new members of the CG joined the call, from the International Association of STM Publishers, the CCC (Copyright Clearance Center) and Taylor & Francis Group.

It was the opportunity to remind members about some useful links:

Several threads of discussion will be developed during the coming months:

a) about the relationship between our work and AI/ML training, especially in the scope of Generative AI.

We will ask the EU Commission if they have a view about the applicability of Article 4 of the DSM Directive to AI/ML training. We will also try to get direct information about this issue from the publishers community and legal experts in the field of digital publishing technologies.

b) about a possible addition of usage details to the TDMRep specification.

We will study if we there is a requirement from publishers/rightholders to differentiate between different usages of their content in the framework of AI, e.g. allow AI/ML training or allow all but Generative AI training. Other working groups – namely C2PA and the IPTC – are taking this approach.

c) about the inclusion of “opt-out” information inside content items.

Several publishers have expressed a requirement to attach their “opt-out” decision to the content they publish. Several working groups – especially C2PA and the IPTC – are working on adding usage rights to PDF, JPEG and other types of files.

We will study if we should extend our specifications to support this use case, or if we should rather keep a liaison with other working groups and make so that our respective specifications are compatible (and therefore complementary).

In the short term, we will ask representatives of C2PA and IPTC to present their current work to our community group during the next two calls. These calls will be announced to the group members on the mailing list.

Next steps, let’s talk

Laurent Le Meur | Posted on: March 22, 2023

Giulia and I had a quick call, to discussed a practical path to the next steps of our work. Here is our proposal:

1/ Study if “the use of data for the purposes of training an AI/ML model” is the same thing as “the use of data for the purposes of data mining”, especially in the scope of the EU DSM Directive. For that we need access to document formalizing the position of experts in this field, we need lawyers around the table, and the opinion of EU Commission people would be welcome.

2/ Discuss if we should apply evolutions to the specification in order to embed TDMRep properties in different types of resources (images, PDF and EPUB come to minds). One important question is what should be found in such resources: TDMRep properties (as we defined them) or detailed usage rights information (as the IPTC is defining them)?

3/ If the answer to issue 2 is “yes we should embed TDMRep properties in resources”, then how should we embed them? (XMP comes to minds there).

Could we start with issue 1? AI and LLMs are the big thing currently, and there is traction for protecting artistic content against free commercial use in AI products.

We therefore propose to held a Zoom call on this topic on April 4th, 14 UTC (16:00 CEST), using this URL: https://us06web.zoom.us/j/84639131689?pwd=QXl1T1R1UldjdG1XR00rWG5ZQUlZQT09

Please indicate if you’re willing to join, and if you have a specific expertise in the AI vs TDM field.

Best regards, Laurent LE MEUR

Next steps for the group

Laurent Le Meur | Posted on: March 14, 2023

After the release of our final report, in February 2022, we had the opportunity to present the project to several publishers and publishers associations in European countries and start discussing possible pilots in order to gather feedback on the first release of the protocol.

We found that the interest for the “TDM opt-out” solution is growing fast, in particular since ChatGPT was released. GPT-3, the LLM (Large Language Model) behind ChatGPT, constitutes an impressive example of the potentiality of AI agents when processing digital publications, and rightsholders from different sectors of the content industries are looking for an effective machine readable solution for managing TDM rights on their content.

The TDMRep effort is especially interesting for the IPTC (International Press Telecommunications Council), which develops standards for the News Industry and has started exchanging ideas with us.

This group will therefore continue its work on refining the specification and study possible extensions to meet the requirements of specific sectors of the content industry and to facilitate the integration of our work with other metadata standards (such as the IPTC standard for photos) and content formats (e.g. to embed TDMRep metadata in PDF resources).

The final report has been released

Laurent Le Meur | Posted on: February 16, 2022

Good news, we have just published the final report of the TDM Representation Protocol. You’ll find it at the URL https://www.w3.org/2022/tdmrep/. It is referenced from the TDMRep Community home page.

As multiple EU countries are moving towards the implementation of the DSM directive, publishers are looking for solutions. The Community Group will now focus on communication, its targets being the European Commission, TDM Actors and publishers.

In order to communicate effectively, the Community Group will have to work on its governance and define some tools (maybe a presentation website, recorded demos, a brochure, a logo …). We’ll therefore organize multiple calls in 2022.

If you are aware of events which could be good supports for presentations, please contact the CG chairs. We’re currently thinking about the IPTC Spring Meeting 2022 (May) and the Digital Publishing Summit 2022 (June).

Community & Business Groups

Text and Data Mining Reservation Protocol Community Group

Final reports / licensing info